issue_comments
9 rows where issue = 712189206 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: reactions, created_at (date), updated_at (date)
issue 1
- Preprocess function for save_mfdataset · 9 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
702307334 | https://github.com/pydata/xarray/issues/4475#issuecomment-702307334 | https://api.github.com/repos/pydata/xarray/issues/4475 | MDEyOklzc3VlQ29tbWVudDcwMjMwNzMzNA== | heerad 2560426 | 2020-10-01T18:07:55Z | 2020-10-01T18:07:55Z | NONE | Sounds good, I'll do this in the meantime. Still quite interested in |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Preprocess function for save_mfdataset 712189206 | |
702276824 | https://github.com/pydata/xarray/issues/4475#issuecomment-702276824 | https://api.github.com/repos/pydata/xarray/issues/4475 | MDEyOklzc3VlQ29tbWVudDcwMjI3NjgyNA== | dcherian 2448579 | 2020-10-01T17:13:16Z | 2020-10-01T17:13:16Z | MEMBER |
I think so. I would try multiple processes and see if that is fast enough for what you want to do. Or else, write to zarr. This will be parallelized and is a lot easier than dealing with HDF5 |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Preprocess function for save_mfdataset 712189206 | |
702265883 | https://github.com/pydata/xarray/issues/4475#issuecomment-702265883 | https://api.github.com/repos/pydata/xarray/issues/4475 | MDEyOklzc3VlQ29tbWVudDcwMjI2NTg4Mw== | heerad 2560426 | 2020-10-01T16:52:59Z | 2020-10-01T16:52:59Z | NONE | Multiple threads (the default), because it's recommended "for numeric code that releases the GIL (like NumPy, Pandas, Scikit-Learn, Numba, …)" according to the dask docs. I guess I could do multi-threaded for the compute part (everything up to the definition of |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Preprocess function for save_mfdataset 712189206 | |
702226256 | https://github.com/pydata/xarray/issues/4475#issuecomment-702226256 | https://api.github.com/repos/pydata/xarray/issues/4475 | MDEyOklzc3VlQ29tbWVudDcwMjIyNjI1Ng== | dcherian 2448579 | 2020-10-01T15:46:45Z | 2020-10-01T15:46:45Z | MEMBER | Are you using multiple threads or multiple processes? IIUC you should be using multiple processes for max writing efficiency. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Preprocess function for save_mfdataset 712189206 | |
702178407 | https://github.com/pydata/xarray/issues/4475#issuecomment-702178407 | https://api.github.com/repos/pydata/xarray/issues/4475 | MDEyOklzc3VlQ29tbWVudDcwMjE3ODQwNw== | heerad 2560426 | 2020-10-01T14:34:28Z | 2020-10-01T14:34:28Z | NONE | Thank you, this works for me. However, it's quite slow and seems to scale faster than linearly as the length of Could it be connected to https://github.com/pydata/xarray/issues/2912#issuecomment-485497398 where they suggest to use Appreciate the help! |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Preprocess function for save_mfdataset 712189206 | |
701694586 | https://github.com/pydata/xarray/issues/4475#issuecomment-701694586 | https://api.github.com/repos/pydata/xarray/issues/4475 | MDEyOklzc3VlQ29tbWVudDcwMTY5NDU4Ng== | shoyer 1217238 | 2020-09-30T23:13:33Z | 2020-09-30T23:13:33Z | MEMBER | I think we could support delayed objects in result = [dask.delayed(write_dataset)(ds, path) for ds, path in zip(datasets, path)] dask.compute(result) ``` |
{ "total_count": 3, "+1": 3, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Preprocess function for save_mfdataset 712189206 | |
701688956 | https://github.com/pydata/xarray/issues/4475#issuecomment-701688956 | https://api.github.com/repos/pydata/xarray/issues/4475 | MDEyOklzc3VlQ29tbWVudDcwMTY4ODk1Ng== | dcherian 2448579 | 2020-09-30T22:55:28Z | 2020-09-30T22:55:28Z | MEMBER | You could write to netCDF in I guess this is a good argument for adding a |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Preprocess function for save_mfdataset 712189206 | |
701676076 | https://github.com/pydata/xarray/issues/4475#issuecomment-701676076 | https://api.github.com/repos/pydata/xarray/issues/4475 | MDEyOklzc3VlQ29tbWVudDcwMTY3NjA3Ng== | heerad 2560426 | 2020-09-30T22:17:24Z | 2020-09-30T22:17:24Z | NONE | Unfortunately that doesn't work:
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Preprocess function for save_mfdataset 712189206 | |
701577652 | https://github.com/pydata/xarray/issues/4475#issuecomment-701577652 | https://api.github.com/repos/pydata/xarray/issues/4475 | MDEyOklzc3VlQ29tbWVudDcwMTU3NzY1Mg== | dcherian 2448579 | 2020-09-30T18:51:25Z | 2020-09-30T18:51:25Z | MEMBER | you could use
I think this will work, but I've never used |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Preprocess function for save_mfdataset 712189206 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 3