issue_comments
4 rows where issue = 252541496 and user = 1217238 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: created_at (date), updated_at (date)
issue 1
- open_mfdataset reads coords from disk multiple times · 4 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
327338750 | https://github.com/pydata/xarray/issues/1521#issuecomment-327338750 | https://api.github.com/repos/pydata/xarray/issues/1521 | MDEyOklzc3VlQ29tbWVudDMyNzMzODc1MA== | shoyer 1217238 | 2017-09-06T00:20:49Z | 2017-09-06T00:20:49Z | MEMBER | Enjoy your holiday! On Tue, Sep 5, 2017 at 5:01 PM crusaderky notifications@github.com wrote:
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
open_mfdataset reads coords from disk multiple times 252541496 | |
326867685 | https://github.com/pydata/xarray/issues/1521#issuecomment-326867685 | https://api.github.com/repos/pydata/xarray/issues/1521 | MDEyOklzc3VlQ29tbWVudDMyNjg2NzY4NQ== | shoyer 1217238 | 2017-09-04T05:13:59Z | 2017-09-04T05:20:50Z | MEMBER | The problem is these lines in We inspect compare coordinates for equality in order to decide whether to ignore redundant coordinates or stack them up. This happens if |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
open_mfdataset reads coords from disk multiple times 252541496 | |
326868217 | https://github.com/pydata/xarray/issues/1521#issuecomment-326868217 | https://api.github.com/repos/pydata/xarray/issues/1521 | MDEyOklzc3VlQ29tbWVudDMyNjg2ODIxNw== | shoyer 1217238 | 2017-09-04T05:18:55Z | 2017-09-04T05:18:55Z | MEMBER | So, to be more precise, I think the problem is that the first variable is computed many times over (once per comparison), inside the A very simple fix, slightly more conservative than loading every coordinate into memory, is to simply compute these first coordinates on the first variable, e.g., |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
open_mfdataset reads coords from disk multiple times 252541496 | |
324708622 | https://github.com/pydata/xarray/issues/1521#issuecomment-324708622 | https://api.github.com/repos/pydata/xarray/issues/1521 | MDEyOklzc3VlQ29tbWVudDMyNDcwODYyMg== | shoyer 1217238 | 2017-08-24T17:51:42Z | 2017-08-24T17:51:42Z | MEMBER |
In principle, coords can have the same shape as data variables. In those cases, you probably want to use the same chunking scheme.
@rabernat is interested in this use case. See https://github.com/pydata/xarray/issues/1385 and https://github.com/pydata/xarray/pull/1413 for discussion.
Yes, I think you're correct here as well. This is also an annoying inefficiency, but the API design is a little tricky. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
open_mfdataset reads coords from disk multiple times 252541496 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 1