issue_comments
22 rows where issue = 180451196 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: created_at (date), updated_at (date)
issue 1
- Disable automatic cache with dask · 22 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
261047576 | https://github.com/pydata/xarray/pull/1024#issuecomment-261047576 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI2MTA0NzU3Ng== | kynan 346079 | 2016-11-16T19:33:08Z | 2016-11-16T19:33:08Z | NONE | @shoyer Great, thanks, I'll give that a try. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
261046945 | https://github.com/pydata/xarray/pull/1024#issuecomment-261046945 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI2MTA0Njk0NQ== | shoyer 1217238 | 2016-11-16T19:30:53Z | 2016-11-16T19:30:53Z | MEMBER | @kynan I think this is fixed in #1128, which has a slightly more robust solution. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
260818311 | https://github.com/pydata/xarray/pull/1024#issuecomment-260818311 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI2MDgxODMxMQ== | kynan 346079 | 2016-11-16T00:45:51Z | 2016-11-16T00:45:51Z | NONE | @crusaderky @shoyer There are still cases where dask arrays are converted to ndarrays where I think they shouldn't be: if you create a |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
260436147 | https://github.com/pydata/xarray/pull/1024#issuecomment-260436147 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI2MDQzNjE0Nw== | crusaderky 6213168 | 2016-11-14T19:28:38Z | 2016-11-14T19:28:38Z | MEMBER | Happy to contribute! On 14 Nov 2016 16:58, "Stephan Hoyer" notifications@github.com wrote:
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
260393416 | https://github.com/pydata/xarray/pull/1024#issuecomment-260393416 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI2MDM5MzQxNg== | shoyer 1217238 | 2016-11-14T16:57:59Z | 2016-11-14T16:57:59Z | MEMBER | Thanks for your patience! This is a nice improvement. I have an idea for a variation that might make for a cleaner (less dask specific) way to handle the remaining caching logic -- I'll add you a reviewer on that PR. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
260202467 | https://github.com/pydata/xarray/pull/1024#issuecomment-260202467 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI2MDIwMjQ2Nw== | crusaderky 6213168 | 2016-11-13T18:21:26Z | 2016-11-13T18:21:26Z | MEMBER | Changed to cache IndexVariable._data on init. Please review... |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
260137073 | https://github.com/pydata/xarray/pull/1024#issuecomment-260137073 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI2MDEzNzA3Mw== | crusaderky 6213168 | 2016-11-12T17:47:10Z | 2016-11-12T17:47:10Z | MEMBER | Finished and waiting for code review |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
258679497 | https://github.com/pydata/xarray/pull/1024#issuecomment-258679497 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1ODY3OTQ5Nw== | shoyer 1217238 | 2016-11-06T13:01:50Z | 2016-11-06T13:01:50Z | MEMBER | Awesome, thanks for your help! On Sat, Nov 5, 2016 at 6:56 PM crusaderky notifications@github.com wrote:
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
258647829 | https://github.com/pydata/xarray/pull/1024#issuecomment-258647829 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1ODY0NzgyOQ== | crusaderky 6213168 | 2016-11-05T22:56:27Z | 2016-11-05T22:56:27Z | MEMBER | roger that, getting to work :) |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
258620328 | https://github.com/pydata/xarray/pull/1024#issuecomment-258620328 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1ODYyMDMyOA== | shoyer 1217238 | 2016-11-05T15:53:06Z | 2016-11-05T15:53:06Z | MEMBER |
Yes, please do! @crusaderky I think we are OK going ahead here with Option D. If we do eventually extend xarray with out of core indexes, that will be done with a separate layer (not in |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
258619392 | https://github.com/pydata/xarray/pull/1024#issuecomment-258619392 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1ODYxOTM5Mg== | benbovy 4160723 | 2016-11-05T15:37:46Z | 2016-11-05T15:37:46Z | MEMBER |
Oh yes, true!
Indeed, that doesn't look very nice.
From what I intend to do next with xarray, I'd say that extending its support for out-of-core operations to big indexes would be a great feature! I haven't seen yet how |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
258524115 | https://github.com/pydata/xarray/pull/1024#issuecomment-258524115 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1ODUyNDExNQ== | shoyer 1217238 | 2016-11-04T19:19:00Z | 2016-11-04T19:19:00Z | MEMBER |
Right now, xarray is not going to be great fit for such cases, because we already cache an index in memory for any labeled indexing operations. So at best, you could do something like I doubt very many people are relying on this, though indeed, this would include some users of an array database we wrote at my former employer, which did not support different chunking schemes for different variables (which could make coordinate lookup very slow). CC @ToddSmall in case he has opinions here. For out-of-core operations with labels on big unstructured meshes, you really need a generalization of the |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
258496425 | https://github.com/pydata/xarray/pull/1024#issuecomment-258496425 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1ODQ5NjQyNQ== | benbovy 4160723 | 2016-11-04T17:29:54Z | 2016-11-04T17:29:54Z | MEMBER | Option D seems indeed the cleanest and safest option, but
I can see use cases where this might happen. For example, It is common for 1, 2 or higher-dimension unstructured meshes that the coordinates x, y, z are arranged as 1-d arrays of length that equals the number of nodes (which can be very high!). See for example ugrid conventions. I admit that currently xarray is perhaps not very suited for handling unstructured meshes, but IMO it has great potential (especially considering multi-index support) and I'd love to use it here. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
256125722 | https://github.com/pydata/xarray/pull/1024#issuecomment-256125722 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1NjEyNTcyMg== | shoyer 1217238 | 2016-10-25T18:25:30Z | 2016-10-25T18:25:30Z | MEMBER | I'm going to ping the mailing list for input, but I think it would be pretty safe. On Tue, Oct 25, 2016 at 11:11 AM, crusaderky notifications@github.com wrote:
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
256114879 | https://github.com/pydata/xarray/pull/1024#issuecomment-256114879 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1NjExNDg3OQ== | crusaderky 6213168 | 2016-10-25T18:11:35Z | 2016-10-25T18:11:35Z | MEMBER | Hi Stephen, Thank you for your thinking. IMHO option D is the cleanest and safest. Could you come up with any example where it may be problematic? On 21 Oct 2016 4:36 am, "Stephan Hoyer" notifications@github.com wrote:
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
255286001 | https://github.com/pydata/xarray/pull/1024#issuecomment-255286001 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1NTI4NjAwMQ== | shoyer 1217238 | 2016-10-21T03:36:01Z | 2016-10-21T03:36:01Z | MEMBER |
I'm nervous about eager loading, especially for non-index coordinates. They can have more than one dimension, and thus can contain a lot of data. So potentially eagerly loading non-index coordinates could break existing use cases. On the other hand, non-index coordinates indeed checked for equality in most xarray operations (e.g., for the coordinate merge in align). So it is indeed useful not to have to recompute them all the time. Even eagerly loading indexes is potentially problematic, if loading the index values is expensive. So I'm conflicted:
- I like the current caching behavior for I'm going to start throwing out ideas for how to deal with this: Option AAdd two new (public?) methods, something like Hypothetically, we could even have options for turning this caching systematically on/off (e.g., Your proposal is basically an extreme version of this, where we call Advantages:
- It's fairly predictable when caching happens (especially if we opt for calling Downsides: - Caching is more aggressive than necessary -- we cache indexes even if that coord isn't actually indexed. Option BLike Option A, but someone infer the full set of variables that need to be cached (e.g., in a This solves the downside of A, but diminishes the predictability. We're basically back to how things work now. Option CCache dask.array in Advantages: - Much simpler and easier to implement than the alternatives. - Implicit conversions are greatly diminished. Downsides:
- Non-index coordinates get thrown away after being evaluated once. If you're doing lots of operations of the form Option DLoad the contents of an This has the most predictable performance, but might cause trouble for some edge use cases? I need to think about this a little more, but right now I am leaning towards Option C or D. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
255066352 | https://github.com/pydata/xarray/pull/1024#issuecomment-255066352 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1NTA2NjM1Mg== | crusaderky 6213168 | 2016-10-20T10:14:30Z | 2016-10-20T10:14:30Z | MEMBER | ping - how do you prefer me to proceed? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
253133290 | https://github.com/pydata/xarray/pull/1024#issuecomment-253133290 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1MzEzMzI5MA== | crusaderky 6213168 | 2016-10-12T06:49:23Z | 2016-10-12T06:49:23Z | MEMBER | I've been thinking about this... Maybe the simple, clean solution is to simply invoke compute() on all coords as soon as they are assigned to the DataArray / Dataset? On 12 Oct 2016 02:18, "Stephan Hoyer" notifications@github.com wrote:
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
252033741 | https://github.com/pydata/xarray/pull/1024#issuecomment-252033741 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1MjAzMzc0MQ== | crusaderky 6213168 | 2016-10-06T17:34:01Z | 2016-10-06T17:34:01Z | MEMBER | I can't reproduce the above failure test.test_conventions.TestEncodeCFVariable.testMethod=test_missing_fillvalue. I suspect it might be a random failure, because - it used to succeed until my latest commit, which eclusively changed the readme - it suceeds on python 3 |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
251983000 | https://github.com/pydata/xarray/pull/1024#issuecomment-251983000 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1MTk4MzAwMA== | crusaderky 6213168 | 2016-10-06T14:39:47Z | 2016-10-06T14:39:47Z | MEMBER | I added the disclaimer in the release notes. Is there any other outstanding issue? Thanks |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
251209654 | https://github.com/pydata/xarray/pull/1024#issuecomment-251209654 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1MTIwOTY1NA== | crusaderky 6213168 | 2016-10-03T19:58:53Z | 2016-10-03T20:31:48Z | MEMBER | What happened before this PR was that all coords were blindly converted to dask on chunk(). Then, the first time anything invoked the values property, e.g. Something as simple as If you deliberately use a dask array as a coord, it won't be converted to numpy. However I can't think of any reason why anybody would want to do it in practice. I'll add it to the breaking changes as if somebody did do the above, the performance of his program will degrade with this release as his coord will risk being evaluated multiple times. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 | |
250922373 | https://github.com/pydata/xarray/pull/1024#issuecomment-250922373 | https://api.github.com/repos/pydata/xarray/issues/1024 | MDEyOklzc3VlQ29tbWVudDI1MDkyMjM3Mw== | crusaderky 6213168 | 2016-10-01T16:41:04Z | 2016-10-01T17:37:39Z | MEMBER | Well, crud. This introduces a regression where DataArray.chunk() converts the data and the coords to dask. This becomes enormously problematic later on as pretty much nothing expects a dask-based coord. [edit] fixed below |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Disable automatic cache with dask 180451196 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 4