html_url,issue_url,id,node_id,user,created_at,updated_at,author_association,body,reactions,performed_via_github_app,issue
https://github.com/pydata/xarray/issues/3386#issuecomment-540477057,https://api.github.com/repos/pydata/xarray/issues/3386,540477057,MDEyOklzc3VlQ29tbWVudDU0MDQ3NzA1Nw==,42270910,2019-10-10T09:11:31Z,2019-10-10T09:11:31Z,NONE,"@dcherian a dump of a single file: ``` ncdump -hs era5_mean_sea_level_pressure_2002.nc netcdf era5_mean_sea_level_pressure_2002 { dimensions: longitude = 1440 ; latitude = 721 ; time = 8760 ; variables: float longitude(longitude) ; longitude:units = ""degrees_east"" ; longitude:long_name = ""longitude"" ; float latitude(latitude) ; latitude:units = ""degrees_north"" ; latitude:long_name = ""latitude"" ; int time(time) ; time:units = ""hours since 1900-01-01 00:00:00.0"" ; time:long_name = ""time"" ; time:calendar = ""gregorian"" ; short msl(time, latitude, longitude) ; msl:scale_factor = 0.23025422306319 ; msl:add_offset = 99003.8223728885 ; msl:_FillValue = -32767s ; msl:missing_value = -32767s ; msl:units = ""Pa"" ; msl:long_name = ""Mean sea level pressure"" ; msl:standard_name = ""air_pressure_at_mean_sea_level"" ; // global attributes: :Conventions = ""CF-1.6"" ; :history = ""2019-10-03 16:05:54 GMT by grib_to_netcdf-2.10.0: /opt/ecmwf/eccodes/bin/grib_to_netcdf -o /cache/data5/adaptor.mars.internal-1570117777.9045198-23871-11-c8564b6f-4db5-48d8-beab-ba9fef91d4e8.nc /cache/tmp/c8564b6f-4db5-48d8-beab-ba9fef91d4e8-adaptor.mars.internal-1570117777.905033-23871-3-tmp.grib"" ; :_Format = ""64-bit offset"" ; } ``` @shoyer: thanks for the tip, I think that simply adding more data-loading threads is indeed the best solution.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,504497403
https://github.com/pydata/xarray/issues/3386#issuecomment-539916279,https://api.github.com/repos/pydata/xarray/issues/3386,539916279,MDEyOklzc3VlQ29tbWVudDUzOTkxNjI3OQ==,42270910,2019-10-09T09:20:06Z,2019-10-09T09:20:06Z,NONE,"Setting ```dask.config.set(scheduler=""synchronous"")``` globally indeed resolved the threading issues, thanks. However, loading and preprocessing a single timeslice of data is ~40% slower with dask and open_mfdataset (with chunks={'time':1}) than with netCDF4.MFDataset. Is this expected or a known issue? If not, I can try to create a minimal reproducible example.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,504497403