issue_comments
3 rows where issue = 327064908 and user = 1217238 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: created_at (date), updated_at (date)
issue 1
- Parallel non-locked read using dask.Client crashes · 3 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
392672562 | https://github.com/pydata/xarray/issues/2190#issuecomment-392672562 | https://api.github.com/repos/pydata/xarray/issues/2190 | MDEyOklzc3VlQ29tbWVudDM5MjY3MjU2Mg== | shoyer 1217238 | 2018-05-29T06:59:32Z | 2018-05-29T06:59:32Z | MEMBER | Indeed, HDF5 supports parallel IO, but only with MPI. Unfortunately that didn't work with Dask, at least not yet. Zarr is certainly worth a try for performance. The motivation for zarr (rather than HDF5) was performance with distributed reads/writes, especially with cloud storage. On Mon, May 28, 2018 at 11:27 PM Karel van de Plassche notifications@github.com wrote:
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Parallel non-locked read using dask.Client crashes 327064908 | |
392649160 | https://github.com/pydata/xarray/issues/2190#issuecomment-392649160 | https://api.github.com/repos/pydata/xarray/issues/2190 | MDEyOklzc3VlQ29tbWVudDM5MjY0OTE2MA== | shoyer 1217238 | 2018-05-29T04:24:58Z | 2018-05-29T04:24:58Z | MEMBER | Maybe there's some place we could document this more clearly?
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Parallel non-locked read using dask.Client crashes 327064908 | |
392647556 | https://github.com/pydata/xarray/issues/2190#issuecomment-392647556 | https://api.github.com/repos/pydata/xarray/issues/2190 | MDEyOklzc3VlQ29tbWVudDM5MjY0NzU1Ng== | shoyer 1217238 | 2018-05-29T04:11:55Z | 2018-05-29T04:11:55Z | MEMBER | Unfortunately HDF5 doesn't support reading or writing files (even different files) in parallel via the same process, which is why xarray by default adds a lock around all read/write operations from NetCDF4/HDF5 files. So I'm afraid this is expected behavior. You might have better luck using dask-distributed multiple processes, but then you'll encounter other bottlenecks with data transfer. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Parallel non-locked read using dask.Client crashes 327064908 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 1