issue_comments
4 rows where author_association = "MEMBER", issue = 252541496 and user = 1217238 sorted by updated_at descending
This data as json, CSV (advanced)
issue 1
- open_mfdataset reads coords from disk multiple times · 4 ✖
| id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 327338750 | https://github.com/pydata/xarray/issues/1521#issuecomment-327338750 | https://api.github.com/repos/pydata/xarray/issues/1521 | MDEyOklzc3VlQ29tbWVudDMyNzMzODc1MA== | shoyer 1217238 | 2017-09-06T00:20:49Z | 2017-09-06T00:20:49Z | MEMBER | Enjoy your holiday! On Tue, Sep 5, 2017 at 5:01 PM crusaderky notifications@github.com wrote:
|
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
open_mfdataset reads coords from disk multiple times 252541496 | |
| 326867685 | https://github.com/pydata/xarray/issues/1521#issuecomment-326867685 | https://api.github.com/repos/pydata/xarray/issues/1521 | MDEyOklzc3VlQ29tbWVudDMyNjg2NzY4NQ== | shoyer 1217238 | 2017-09-04T05:13:59Z | 2017-09-04T05:20:50Z | MEMBER | The problem is these lines in We inspect compare coordinates for equality in order to decide whether to ignore redundant coordinates or stack them up. This happens if |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
open_mfdataset reads coords from disk multiple times 252541496 | |
| 326868217 | https://github.com/pydata/xarray/issues/1521#issuecomment-326868217 | https://api.github.com/repos/pydata/xarray/issues/1521 | MDEyOklzc3VlQ29tbWVudDMyNjg2ODIxNw== | shoyer 1217238 | 2017-09-04T05:18:55Z | 2017-09-04T05:18:55Z | MEMBER | So, to be more precise, I think the problem is that the first variable is computed many times over (once per comparison), inside the A very simple fix, slightly more conservative than loading every coordinate into memory, is to simply compute these first coordinates on the first variable, e.g., |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
open_mfdataset reads coords from disk multiple times 252541496 | |
| 324708622 | https://github.com/pydata/xarray/issues/1521#issuecomment-324708622 | https://api.github.com/repos/pydata/xarray/issues/1521 | MDEyOklzc3VlQ29tbWVudDMyNDcwODYyMg== | shoyer 1217238 | 2017-08-24T17:51:42Z | 2017-08-24T17:51:42Z | MEMBER |
In principle, coords can have the same shape as data variables. In those cases, you probably want to use the same chunking scheme.
@rabernat is interested in this use case. See https://github.com/pydata/xarray/issues/1385 and https://github.com/pydata/xarray/pull/1413 for discussion.
Yes, I think you're correct here as well. This is also an annoying inefficiency, but the API design is a little tricky. |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
open_mfdataset reads coords from disk multiple times 252541496 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] (
[html_url] TEXT,
[issue_url] TEXT,
[id] INTEGER PRIMARY KEY,
[node_id] TEXT,
[user] INTEGER REFERENCES [users]([id]),
[created_at] TEXT,
[updated_at] TEXT,
[author_association] TEXT,
[body] TEXT,
[reactions] TEXT,
[performed_via_github_app] TEXT,
[issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
ON [issue_comments] ([user]);
user 1