home / github / issue_comments

Menu
  • Search all tables
  • GraphQL API

issue_comments: 540208420

This data as json

html_url issue_url id node_id user created_at updated_at author_association body reactions performed_via_github_app issue
https://github.com/pydata/xarray/issues/3386#issuecomment-540208420 https://api.github.com/repos/pydata/xarray/issues/3386 540208420 MDEyOklzc3VlQ29tbWVudDU0MDIwODQyMA== 1217238 2019-10-09T21:28:48Z 2019-10-09T21:28:48Z MEMBER

netCDF4.MFDataset works on a much more restricted set of netCDF files than xarray.open_mfdataset. I'm not surprised it's a little bit faster, but I'm not sure it's worth the maintenance burden of supporting this separate code path. Making a fully featured version of open_mfdataset with dask would be challenging.

Can you simply add more threads in TensorFlow/Keras for loading the data? My other suggestion is to pre-shuffle the data on disk, so you don't need random access inside your training loop.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  504497403
Powered by Datasette · Queries took 0.786ms · About: xarray-datasette