issues: 371906566
This data as json
id | node_id | number | title | user | state | locked | assignee | milestone | comments | created_at | updated_at | closed_at | author_association | active_lock_reason | draft | pull_request | body | reactions | performed_via_github_app | state_reason | repo | type |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
371906566 | MDU6SXNzdWUzNzE5MDY1NjY= | 2494 | Concurrent acces with multiple processes using open_mfdataset | 32069530 | closed | 0 | 4 | 2018-10-19T10:52:46Z | 2018-10-26T12:37:30Z | 2018-10-26T12:37:30Z | NONE | Hi everyone, First: thanks to the developers for this amazing xarray library ! Great piece of work ! Here comes my troubles: I run several (about 500) independant processes (dask distributed) that need simultaneous reading (only) access to a same (group of) netcdf files. I only pass the files-path strings to the processes to avoid pickling a netcdf python-object (issue). In each process, I run
but it leads to typical errors for many concurrent access that fail... : Invalid id or Exception: CancelledError("('mul-484a58bf5830233021e08456b45eb60d', 0, 0)",), ... I was using netCDF4 module with parallel option set to True, when playing with a single netcdf file and it was running fine:
Output of
|
{ "url": "https://api.github.com/repos/pydata/xarray/issues/2494/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
completed | 13221727 | issue |