issue_comments
8 rows where issue = 462859457 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: reactions, created_at (date), updated_at (date)
issue 1
- Multidimensional dask coordinates unexpectedly computed · 8 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
508862961 | https://github.com/pydata/xarray/issues/3068#issuecomment-508862961 | https://api.github.com/repos/pydata/xarray/issues/3068 | MDEyOklzc3VlQ29tbWVudDUwODg2Mjk2MQ== | djhoese 1828519 | 2019-07-05T21:10:50Z | 2019-07-05T21:10:50Z | CONTRIBUTOR | Ah, good call. The |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Multidimensional dask coordinates unexpectedly computed 462859457 | |
508853564 | https://github.com/pydata/xarray/issues/3068#issuecomment-508853564 | https://api.github.com/repos/pydata/xarray/issues/3068 | MDEyOklzc3VlQ29tbWVudDUwODg1MzU2NA== | shoyer 1217238 | 2019-07-05T20:15:27Z | 2019-07-05T20:15:27Z | MEMBER |
Dask actually already does this canonicalization. If two arrays have the same In [6]: y = da.zeros((10, 10), chunks=2) + 5 In [7]: x.name Out[7]: 'add-f7441a0f46f5cf40458391cd08406c23' In [8]: y.name Out[8]: 'add-f7441a0f46f5cf40458391cd08406c23' ``` So xarray could safely look at |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Multidimensional dask coordinates unexpectedly computed 462859457 | |
507656176 | https://github.com/pydata/xarray/issues/3068#issuecomment-507656176 | https://api.github.com/repos/pydata/xarray/issues/3068 | MDEyOklzc3VlQ29tbWVudDUwNzY1NjE3Ng== | djhoese 1828519 | 2019-07-02T12:31:54Z | 2019-07-02T12:33:15Z | CONTRIBUTOR | @shoyer Understood. That explains why something like this wasn't caught before, but what would be the best solution for a short term fix? For the long term, I also understand that there isn't really a good way to check equality of two dask arrays. I wonder if dask's graph optimization could be used to "simplify" two dask arrays' graph separately and check the graph equality. For example, two dask arrays created by doing Edit: "short term fix": What is the best way to avoid the unnecessary transpose? Or is this not even the right way to approach this? Change dask to avoid the unnecessary transpose or change xarray to not do the tranpose or something else? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Multidimensional dask coordinates unexpectedly computed 462859457 | |
507480052 | https://github.com/pydata/xarray/issues/3068#issuecomment-507480052 | https://api.github.com/repos/pydata/xarray/issues/3068 | MDEyOklzc3VlQ29tbWVudDUwNzQ4MDA1Mg== | shoyer 1217238 | 2019-07-02T01:15:36Z | 2019-07-02T01:15:36Z | MEMBER | The source of the problem here is that when combining objects, xarray needs to decide what coordinates should remain. Our current heuristic, which pre-dates dask support, was really designed for array in memory: we keep around coordinates if they are equal on both arguments, and remove them otherwise. In some cases we can avoid the computation, if we know that the coordinates are the same object. I am open to ideas on how to make this work better. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Multidimensional dask coordinates unexpectedly computed 462859457 | |
507455821 | https://github.com/pydata/xarray/issues/3068#issuecomment-507455821 | https://api.github.com/repos/pydata/xarray/issues/3068 | MDEyOklzc3VlQ29tbWVudDUwNzQ1NTgyMQ== | dhirschfeld 881019 | 2019-07-01T23:03:12Z | 2019-07-01T23:03:12Z | NONE | FYI: @djhoese, you can inline code snippits using the permanent link to the source: https://github.com/pydata/xarray/blob/e5bb647637063153a7feb750793d6fd8fb58dda8/xarray/core/variable.py#L1223 |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Multidimensional dask coordinates unexpectedly computed 462859457 | |
507410467 | https://github.com/pydata/xarray/issues/3068#issuecomment-507410467 | https://api.github.com/repos/pydata/xarray/issues/3068 | MDEyOklzc3VlQ29tbWVudDUwNzQxMDQ2Nw== | djhoese 1828519 | 2019-07-01T20:20:05Z | 2019-07-01T20:20:05Z | CONTRIBUTOR | Modifying this line to be:
Then this issue is avoided for at least the |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Multidimensional dask coordinates unexpectedly computed 462859457 | |
507405717 | https://github.com/pydata/xarray/issues/3068#issuecomment-507405717 | https://api.github.com/repos/pydata/xarray/issues/3068 | MDEyOklzc3VlQ29tbWVudDUwNzQwNTcxNw== | djhoese 1828519 | 2019-07-01T20:05:51Z | 2019-07-01T20:05:51Z | CONTRIBUTOR | Ok another update. In the previous example I accidentally added the However, if I fix this redundancy and do:
I do get a progress bar again (lons2 is being computed). I've tracked it down to this https://github.com/pydata/xarray/blob/master/xarray/core/variable.py#L1223 I'm not sure if this would be considered a bug in dask or xarray. Also, not sure why the redundant version of the example worked. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Multidimensional dask coordinates unexpectedly computed 462859457 | |
507396912 | https://github.com/pydata/xarray/issues/3068#issuecomment-507396912 | https://api.github.com/repos/pydata/xarray/issues/3068 | MDEyOklzc3VlQ29tbWVudDUwNzM5NjkxMg== | djhoese 1828519 | 2019-07-01T19:38:06Z | 2019-07-01T19:38:06Z | CONTRIBUTOR | Ok I'm getting a little more of an understanding on this. The main issue is that the dask array is not literally considered the same object because I'm creating the object twice. If I create a single dask array and pass it:
I still get the progress bar because xarray is creating two new If I make a single DataArray that becomes the coordinate variable then it seems to work:
I get no progress bar. ``` |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Multidimensional dask coordinates unexpectedly computed 462859457 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 3