html_url,issue_url,id,node_id,user,created_at,updated_at,author_association,body,reactions,performed_via_github_app,issue
https://github.com/pydata/xarray/issues/1854#issuecomment-365925282,https://api.github.com/repos/pydata/xarray/issues/1854,365925282,MDEyOklzc3VlQ29tbWVudDM2NTkyNTI4Mg==,1797906,2018-02-15T13:21:33Z,2018-02-15T13:24:46Z,NONE,"@rabernat Still seem to get a SIGKILL 9 (exit code 137) when trying to run with that pre-processor as well. Maybe my expectations of how it lazy loads files are too high. The machine I'm running on has 8GB of RAM and the files in total are just under 1TB.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,291332965
https://github.com/pydata/xarray/issues/1854#issuecomment-365896646,https://api.github.com/repos/pydata/xarray/issues/1854,365896646,MDEyOklzc3VlQ29tbWVudDM2NTg5NjY0Ng==,1797906,2018-02-15T11:12:48Z,2018-02-15T11:12:48Z,NONE,"@jhamman Here's the `ncdump` of one of the resource files: ```bash netcdf \34.128_1900_01_05_05 { dimensions: longitude = 720 ; latitude = 361 ; time = UNLIMITED ; // (124 currently) variables: float longitude(longitude) ; longitude:units = ""degrees_east"" ; longitude:long_name = ""longitude"" ; float latitude(latitude) ; latitude:units = ""degrees_north"" ; latitude:long_name = ""latitude"" ; int time(time) ; time:units = ""hours since 1900-01-01 00:00:0.0"" ; time:long_name = ""time"" ; time:calendar = ""gregorian"" ; short sst(time, latitude, longitude) ; sst:scale_factor = 0.000552094668668839 ; sst:add_offset = 285.983000319853 ; sst:_FillValue = -32767s ; sst:missing_value = -32767s ; sst:units = ""K"" ; sst:long_name = ""Sea surface temperature"" ; // global attributes: :Conventions = ""CF-1.6"" ; :history = ""2017-08-04 06:17:58 GMT by grib_to_netcdf-2.4.0: grib_to_netcdf /data/data05/scratch/_mars-atls09-95e2cf679cd58ee9b4db4dd119a05a8d-gF5gxN.grib -o
/data/data04/scratch/_grib2netcdf-atls01-a562cefde8a29a7288fa0b8b7f9413f7-VvH7PP.nc -utime"" ; :_Format = ""64-bit offset"" ; } ``` Unfortunately removing the chunks didn't seem to help. I'm running with the pre-process workaround this morning to see if that completes. Sorry for the late response on this - been pretty busy.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,291332965
https://github.com/pydata/xarray/issues/1854#issuecomment-364492783,https://api.github.com/repos/pydata/xarray/issues/1854,364492783,MDEyOklzc3VlQ29tbWVudDM2NDQ5Mjc4Mw==,1797906,2018-02-09T16:58:42Z,2018-02-09T16:58:42Z,NONE,"I'll give both of those a shot. For hosting, the files are currently on a local drive and they sum to about 1TB. I can probably host a couple of examples though. Thanks again for the support.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,291332965
https://github.com/pydata/xarray/issues/1854#issuecomment-364488847,https://api.github.com/repos/pydata/xarray/issues/1854,364488847,MDEyOklzc3VlQ29tbWVudDM2NDQ4ODg0Nw==,1797906,2018-02-09T16:45:51Z,2018-02-09T16:45:51Z,NONE,"That run was killed with the output ```bash ~/.pyenv/versions/3.4.6/lib/python3.4/site-packages/xarray/core/dtypes.py:23: FutureWarning: Conversion of the second argument of issubdtype from `float` to `np.floating` is deprecated. In future, it will be treated as `np.float64 == np.dtype(float).type`. if np.issubdtype(dtype, float): Process finished with exit code 137 (interrupted by signal 9: SIGKILL) ``` I wasn't watching the machine at the time but I assume that's it falling over due to memory pressure. Hi @jhamman, I'm using `0.10.0` of `xarray` with `dask` `0.16.1` and `distributed` `1.18.0`. I realise that last one is out of date, I will update and retry.
I'm just using whatever the default scheduler is as that's pretty much all the code I've got written above. I'm unsure how to do a performance check as the dataset can't even be fully loaded currently. I've tried different chunk sizes in the past hoping to stumble on a magic size, but have been unsuccessful with that.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,291332965
https://github.com/pydata/xarray/issues/1854#issuecomment-364463855,https://api.github.com/repos/pydata/xarray/issues/1854,364463855,MDEyOklzc3VlQ29tbWVudDM2NDQ2Mzg1NQ==,1797906,2018-02-09T15:22:38Z,2018-02-09T15:22:38Z,NONE,"Sure, I'm running that now. I'll reply once/if it finishes. Though watching my system monitor memory usage, it does not appear to be growing. I seem to remember the open function continually allocating itself more RAM until it was killed. I'll take a read through that issue while I wait.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,291332965
https://github.com/pydata/xarray/issues/1854#issuecomment-364459162,https://api.github.com/repos/pydata/xarray/issues/1854,364459162,MDEyOklzc3VlQ29tbWVudDM2NDQ1OTE2Mg==,1797906,2018-02-09T15:06:37Z,2018-02-09T15:09:02Z,NONE,"That's true, maybe I misread last time or it's month dependent. Hopefully this is what you're after - let me know if not. I used 3 `*.nc` files to make this, with the snippet you posted above. ```bash Dimensions: (time: 728) Coordinates: longitude float32 10.0 latitude float32 10.0 * time (time) datetime64[ns] 1992-01-01 1992-01-01T03:00:00 ... Data variables: mwp (time) float64 dask.array Attributes: Conventions: CF-1.6 history: 2017-08-10 04:58:48 GMT by grib_to_netcdf-2.4.0: grib_to_ne...
``` If you're after the entire dataset, I should be able to get that but it may take some time.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,291332965
https://github.com/pydata/xarray/issues/1854#issuecomment-364451782,https://api.github.com/repos/pydata/xarray/issues/1854,364451782,MDEyOklzc3VlQ29tbWVudDM2NDQ1MTc4Mg==,1797906,2018-02-09T14:40:20Z,2018-02-09T14:40:20Z,NONE,"Sure, this is the repr of a single file: ```bash Dimensions: (time: 248) Coordinates: longitude float32 10.0 latitude float32 10.0 * time (time) datetime64[ns] 2004-12-01 2004-12-01T03:00:00 ... Data variables: mwd (time) float64 dask.array Attributes: Conventions: CF-1.6 history: 2017-08-09 16:22:56 GMT by grib_to_netcdf-2.4.0: grib_to_ne... ``` Thanks","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,291332965
https://github.com/pydata/xarray/issues/1854#issuecomment-364399084,https://api.github.com/repos/pydata/xarray/issues/1854,364399084,MDEyOklzc3VlQ29tbWVudDM2NDM5OTA4NA==,1797906,2018-02-09T10:41:28Z,2018-02-09T10:41:28Z,NONE,Sorry to bump this. Still looking for a solution to this problem if anyone has had a similar experience. Thanks.,"{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,291332965
https://github.com/pydata/xarray/issues/1854#issuecomment-361576685,https://api.github.com/repos/pydata/xarray/issues/1854,361576685,MDEyOklzc3VlQ29tbWVudDM2MTU3NjY4NQ==,1797906,2018-01-30T12:19:12Z,2018-01-30T12:19:12Z,NONE,"Hi @rabernat, thanks for the response. Sorry it's taken me a few days to get back to you.
Here's the info dump of one of the files: ``` xarray.Dataset { dimensions: latitude = 361 ; longitude = 720 ; time = 248 ; variables: float32 longitude(longitude) ; longitude:units = degrees_east ; longitude:long_name = longitude ; float32 latitude(latitude) ; latitude:units = degrees_north ; latitude:long_name = latitude ; datetime64[ns] time(time) ; time:long_name = time ; float64 mwd(time, latitude, longitude) ; mwd:units = Degree true ; mwd:long_name = Mean wave direction ; // global attributes: :Conventions = CF-1.6 ; :history = 2017-08-09 18:15:34 GMT by grib_to_netcdf-2.4.0: grib_to_netcdf /data/data05/scratch/_mars-atls02-70e05f9f8ba4e9d19932f1c45a7be8d8-Pwy6jZ.grib -o /data/data01/scratch/_grib2netcdf-atls02-95e2cf679cd58ee9b4db4dd119a05a8d-v4TKah.nc -utime ; ```","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,291332965