issue_comments: 520136482

html_url issue_url id node_id user created_at updated_at author_association body reactions performed_via_github_app issue
https://github.com/pydata/xarray/issues/3200#issuecomment-520136482 https://api.github.com/repos/pydata/xarray/issues/3200 520136482 MDEyOklzc3VlQ29tbWVudDUyMDEzNjQ4Mg== 6213168 2019-08-10T10:06:07Z 2019-08-10T10:06:07Z MEMBER

Hi,

As far as I know, xarray doesn't have any global objects that could cause the leak. I'm willing to bet it is in one of the underlying libraries.

  1. Given your installed packages, open_mfdataset should be defaulting to netCDF4. Please repeat your measurement after setting it explicitly: open_mfdataset(..., engine='netcdf4')
  2. See if the problem disappears if you pass engine='h5netcdf'
  3. Once you have confirmed the actual underlying library, try using it directly without xarray in your ReadFiles test: for every file returned by glob, open it with the netCDF4 package and load into memory all coords (not the data).
  4. Once netCDF4 is confirmed to be the culprit, it would be great if you could rewrite the test (only the read part) in C using the NetCDF C library, to figure out whether the leak is in the C library or in the Python wrapper.
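
Steps 1 to 3 could be sketched roughly as below. This is only an illustration, not your actual ReadFiles test: the helper names (peak_rss, read_with_xarray, read_with_netcdf4, growth_per_round) and the "data/*.nc" pattern are made up, "coords" is assumed to mean the variables that share a name with a dimension, and the memory probe relies on the Unix-only resource module, which reports a high-water mark (KiB on Linux, bytes on macOS).

```python
import gc
import glob
import resource  # Unix-only standard library module


def peak_rss():
    """Peak resident set size of this process so far (a high-water mark)."""
    return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss


def read_with_xarray(paths, engine):
    """Steps 1 and 2: go through xarray, but name the engine explicitly."""
    import xarray as xr  # lazy import: only needed when this path is used

    with xr.open_mfdataset(paths, engine=engine) as ds:
        for name in ds.coords:
            ds.coords[name].load()  # load coordinates only, not the data


def read_with_netcdf4(paths):
    """Step 3: bypass xarray and read each file with netCDF4 directly."""
    import netCDF4  # lazy import: only needed when this path is used

    for path in paths:
        with netCDF4.Dataset(path) as nc:
            # Assumption: a coordinate is a variable named after a dimension.
            for name in nc.dimensions:
                if name in nc.variables:
                    nc.variables[name][:]  # read the values into memory


def growth_per_round(read, rounds=5):
    """Call `read` repeatedly; return the peak-RSS growth seen in each round."""
    growth = []
    for _ in range(rounds):
        before = peak_rss()
        read()
        gc.collect()  # rule out garbage that Python could still free
        growth.append(peak_rss() - before)
    return growth


# Example use (hypothetical file pattern):
# paths = sorted(glob.glob("data/*.nc"))
# print(growth_per_round(lambda: read_with_xarray(paths, "netcdf4")))
# print(growth_per_round(lambda: read_with_xarray(paths, "h5netcdf")))
# print(growth_per_round(lambda: read_with_netcdf4(paths)))
```

If the growth keeps being positive round after round for one variant and flattens out to zero for another, that points at the library used by the first one. Because the probe is a high-water mark, the first round or two will normally show growth even without a leak.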
{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  479190812