issue_comments
13 rows where author_association = "CONTRIBUTOR" and issue = 140291221 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: created_at (date), updated_at (date)
issue 1
- dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf · 13 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
200629173 | https://github.com/pydata/xarray/issues/793#issuecomment-200629173 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDIwMDYyOTE3Mw== | pwolfram 4295853 | 2016-03-24T02:49:13Z | 2016-03-24T02:49:26Z | CONTRIBUTOR | I'm going to close this for now but will reopen it if the issue arises again following the dask release. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
199879233 | https://github.com/pydata/xarray/issues/793#issuecomment-199879233 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5OTg3OTIzMw== | pwolfram 4295853 | 2016-03-22T15:56:42Z | 2016-03-22T15:56:42Z | CONTRIBUTOR | Note, also waiting on |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
199855592 | https://github.com/pydata/xarray/issues/793#issuecomment-199855592 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5OTg1NTU5Mg== | pwolfram 4295853 | 2016-03-22T15:02:54Z | 2016-03-22T15:03:18Z | CONTRIBUTOR | Thanks @shoyer! I ran into this problem again with this morning and as you note I had multiple arrays in the file that were being written. PR https://github.com/pydata/xarray/pull/800 implements your suggestion and should hopefully resolve the issue, although it is not clear to me how to build a reproducible test case-- perhaps write a file with a ton of random arrays to crash it out on the write? Any thoughts or suggestions you have on this would be very helpful. Note that the PR is preliminary until I can verify that it resolves the issue via testing. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
199544425 | https://github.com/pydata/xarray/issues/793#issuecomment-199544425 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5OTU0NDQyNQ== | pwolfram 4295853 | 2016-03-21T23:56:45Z | 2016-03-21T23:56:45Z | CONTRIBUTOR | @shoyer, I'm assuming there needs to be an xarray PR corresponding to Matt's merged PR, is that correct? Do you think this will be a difficult xarray change? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
196926696 | https://github.com/pydata/xarray/issues/793#issuecomment-196926696 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5NjkyNjY5Ng== | pwolfram 4295853 | 2016-03-15T17:08:22Z | 2016-03-15T17:08:22Z | CONTRIBUTOR | @thanks @shoyer for looking into this further and for figuring out the cause of the problem. @mrocklin, does this mean that I should submit a dask issue? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
195811187 | https://github.com/pydata/xarray/issues/793#issuecomment-195811187 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5NTgxMTE4Nw== | pwolfram 4295853 | 2016-03-12T21:30:14Z | 2016-03-12T21:30:14Z | CONTRIBUTOR | I can't fully confirm that the above scripts works with synchronous execution because the job ran out of its 16hr run time. However, it does appear to be the case that forcing synchronous execution resolves potential issues because previous runs of the script crashed and this one did not. I'll have to try more cases with synchronous execution, especially over the next half week, to see if I encounter more issues but am suspicious this is the problem. @mrocklin and I noted that the netCDF reader has problems when threading is on when we were using distributed, so this appears to be a likely candidate. We got the same I'm suspicious that the netCDF reader is not thread safe and may not have been compiled as such (http://hdf-forum.184993.n3.nabble.com/Activate-thread-safe-and-enable-cxx-in-HDF5-td2993951.html) but there appear other potential issues that could be part of the problem, e.g., https://github.com/Unidata/netcdf4-python/issues/279 because I am doing so many reads. It may also be possible, as you note @shoyer, that the tread locks aren't aggressive enough. It would probably be good to come up with some type of testing strategy to better isolate the problem... I'll have to give this more thought. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
195572996 | https://github.com/pydata/xarray/issues/793#issuecomment-195572996 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5NTU3Mjk5Ng== | pwolfram 4295853 | 2016-03-11T22:11:54Z | 2016-03-11T22:12:12Z | CONTRIBUTOR | @mrocklin, For option 1, should the command be |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
195565307 | https://github.com/pydata/xarray/issues/793#issuecomment-195565307 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5NTU2NTMwNw== | pwolfram 4295853 | 2016-03-11T21:40:11Z | 2016-03-11T21:40:11Z | CONTRIBUTOR | Test 2 passed, so it doesn't appear to be due to too many open file handles. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
195563852 | https://github.com/pydata/xarray/issues/793#issuecomment-195563852 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5NTU2Mzg1Mg== | pwolfram 4295853 | 2016-03-11T21:33:54Z | 2016-03-11T21:33:54Z | CONTRIBUTOR | Agreed. I'll let you know what I find out. Thanks @mrocklin. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
195562125 | https://github.com/pydata/xarray/issues/793#issuecomment-195562125 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5NTU2MjEyNQ== | pwolfram 4295853 | 2016-03-11T21:27:19Z | 2016-03-11T21:27:33Z | CONTRIBUTOR | Quick question @mrocklin, for 2, are you proposing a script that just opens all the files, e.g., something like this ``` get full xr datasetdslist = []
nfiles = len(glob.glob('dispersion_calcs_rlzn0layerrange_0000-0000.nc'))
for i in np.arange(nfiles): do an operation spanning Nr space and Nb spaceprint dstotal.dtdays.values ``` where |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
195552065 | https://github.com/pydata/xarray/issues/793#issuecomment-195552065 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5NTU1MjA2NQ== | pwolfram 4295853 | 2016-03-11T21:08:00Z | 2016-03-11T21:08:00Z | CONTRIBUTOR | There are a large number of files (1320) where |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
195550711 | https://github.com/pydata/xarray/issues/793#issuecomment-195550711 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5NTU1MDcxMQ== | pwolfram 4295853 | 2016-03-11T21:05:44Z | 2016-03-11T21:05:44Z | CONTRIBUTOR | I should note that serialization also does not appear to be robust under reshaping the data via |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 | |
195550183 | https://github.com/pydata/xarray/issues/793#issuecomment-195550183 | https://api.github.com/repos/pydata/xarray/issues/793 | MDEyOklzc3VlQ29tbWVudDE5NTU1MDE4Mw== | pwolfram 4295853 | 2016-03-11T21:04:45Z | 2016-03-11T21:04:45Z | CONTRIBUTOR | cc @mrocklin |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
dask.async.RuntimeError: NetCDF: HDF error on xarray to_netcdf 140291221 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 1