html_url,issue_url,id,node_id,user,created_at,updated_at,author_association,body,reactions,performed_via_github_app,issue
https://github.com/pydata/xarray/issues/2494#issuecomment-431439592,https://api.github.com/repos/pydata/xarray/issues/2494,431439592,MDEyOklzc3VlQ29tbWVudDQzMTQzOTU5Mg==,2443309,2018-10-19T17:34:25Z,2018-10-19T17:34:25Z,MEMBER,"To clear a few things up, the parallel option in `netCDF4.Dataset` is not the same as the parallel option in `xarray.opne_mfdataset`. In xarray, that option is meant to help speed up the time it takes to open *many* files at once. If you are using dask distributed, this should be done using that scheduler.
If you are only seeing thread parallelism in the `open_mfdataset(..., parallel=True)` call, I would start by looking at your dask distributed setup.
Can you try this workflow with and without the parallel option and report back:
```Python
client = Client(...)
ds = xr.open_mfdataset(myfiles_path, concat_dim='t', engine='h5netcdf', paralel=...)
x = ds['x'].load().data
y = ds['y'].load().data
ds.close()
```
Provided that you are setting up distributed to use multiple processes, you should get parallelism from multiple processes in this case.
","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,371906566