html_url,issue_url,id,node_id,user,created_at,updated_at,author_association,body,reactions,performed_via_github_app,issue
https://github.com/pydata/xarray/issues/3386#issuecomment-540477057,https://api.github.com/repos/pydata/xarray/issues/3386,540477057,MDEyOklzc3VlQ29tbWVudDU0MDQ3NzA1Nw==,42270910,2019-10-10T09:11:31Z,2019-10-10T09:11:31Z,NONE,"@dcherian a dump of a single file:
```
ncdump -hs era5_mean_sea_level_pressure_2002.nc
netcdf era5_mean_sea_level_pressure_2002 {
dimensions:
longitude = 1440 ;
latitude = 721 ;
time = 8760 ;
variables:
float longitude(longitude) ;
longitude:units = ""degrees_east"" ;
longitude:long_name = ""longitude"" ;
float latitude(latitude) ;
latitude:units = ""degrees_north"" ;
latitude:long_name = ""latitude"" ;
int time(time) ;
time:units = ""hours since 1900-01-01 00:00:00.0"" ;
time:long_name = ""time"" ;
time:calendar = ""gregorian"" ;
short msl(time, latitude, longitude) ;
msl:scale_factor = 0.23025422306319 ;
msl:add_offset = 99003.8223728885 ;
msl:_FillValue = -32767s ;
msl:missing_value = -32767s ;
msl:units = ""Pa"" ;
msl:long_name = ""Mean sea level pressure"" ;
msl:standard_name = ""air_pressure_at_mean_sea_level"" ;
// global attributes:
:Conventions = ""CF-1.6"" ;
:history = ""2019-10-03 16:05:54 GMT by grib_to_netcdf-2.10.0: /opt/ecmwf/eccodes/bin/grib_to_netcdf -o /cache/data5/adaptor.mars.internal-1570117777.9045198-23871-11-c8564b6f-4db5-48d8-beab-ba9fef91d4e8.nc /cache/tmp/c8564b6f-4db5-48d8-beab-ba9fef91d4e8-adaptor.mars.internal-1570117777.905033-23871-3-tmp.grib"" ;
:_Format = ""64-bit offset"" ;
}
```
@shoyer : thanks for the tip, I think that it indeed simply adding more data-loading threads is the best solution.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,504497403
https://github.com/pydata/xarray/issues/3386#issuecomment-539916279,https://api.github.com/repos/pydata/xarray/issues/3386,539916279,MDEyOklzc3VlQ29tbWVudDUzOTkxNjI3OQ==,42270910,2019-10-09T09:20:06Z,2019-10-09T09:20:06Z,NONE,"setting
```dask.config.set(scheduler=""synchronous"")```
globally indeed resolved the threading issues, thanks.
However, loading and preprocessing a single timeslice of data is ~40 % slower with dask and open_mfdataset (with chunks={'time':1}) compared to netCDF4.MFDataset . Is this is expected/a known issue? If not, I can try to create a minimal reproducible example.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,504497403