issues: 371906566
| id | node_id | number | title | user | state | locked | assignee | milestone | comments | created_at | updated_at | closed_at | author_association | active_lock_reason | draft | pull_request | body | reactions | performed_via_github_app | state_reason | repo | type |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 371906566 | MDU6SXNzdWUzNzE5MDY1NjY= | 2494 | Concurrent acces with multiple processes using open_mfdataset | 32069530 | closed | 0 | 4 | 2018-10-19T10:52:46Z | 2018-10-26T12:37:30Z | 2018-10-26T12:37:30Z | NONE | Hi everyone, First: thanks to the developers for this amazing xarray library! Great piece of work! Here are my troubles: I run several (about 500) independent processes (dask distributed) that need simultaneous read-only access to the same (group of) netcdf files. I only pass the file-path strings to the processes to avoid pickling a netcdf python-object (issue). In each process, I run
but it leads to the typical errors of many concurrent accesses that fail: Invalid id or Exception: CancelledError("('mul-484a58bf5830233021e08456b45eb60d', 0, 0)",), ... I was using the netCDF4 module with the parallel option set to True when working with a single netcdf file, and it was running fine:
Output of
|
{
"url": "https://api.github.com/repos/pydata/xarray/issues/2494/reactions",
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
completed | 13221727 | issue |
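
The access pattern described in the issue body (hand each process only a file-path string and let it open the file read-only itself) can be sketched with the standard library alone. This is a minimal illustration under stated assumptions, not the reporter's code: an arbitrary file stands in for a netcdf file, and `WORKER_CODE` and `read_concurrently` are hypothetical names introduced here. In the xarray setting, the worker would presumably call `xarray.open_mfdataset` on the paths it receives instead of the plain `open`.

```python
import subprocess
import sys

# Code run by every reader process (hypothetical stand-in for the real work):
# open the path given on the command line read-only and print the byte count.
WORKER_CODE = (
    "import sys\n"
    "with open(sys.argv[1], 'rb') as handle:\n"
    "    print(len(handle.read()))\n"
)


def read_concurrently(path, n_processes=4):
    """Start n_processes independent readers of one file; return their byte counts."""
    # Only the path string crosses the process boundary, never an open handle.
    workers = [
        subprocess.Popen(
            [sys.executable, "-c", WORKER_CODE, path],
            stdout=subprocess.PIPE,
            text=True,
        )
        for _ in range(n_processes)
    ]
    counts = []
    for worker in workers:
        out, _ = worker.communicate()  # wait for this reader to finish
        if worker.returncode != 0:
            raise RuntimeError(f"reader exited with code {worker.returncode}")
        counts.append(int(out))
    return counts
```

All readers are started before any result is collected, so they hold the file open at the same time. Each one owns its file handle, which is the property the path-passing approach is meant to guarantee.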