home / github / issue_comments

Menu
  • GraphQL API
  • Search all tables

issue_comments: 1450727551

This data as json

html_url issue_url id node_id user created_at updated_at author_association body reactions performed_via_github_app issue
https://github.com/pydata/xarray/issues/7522#issuecomment-1450727551 https://api.github.com/repos/pydata/xarray/issues/7522 1450727551 IC_kwDOAMm_X85WeFh_ 6042212 2023-03-01T19:22:54Z 2023-03-01T19:22:54Z CONTRIBUTOR

I do generally recommend cache_type="first" for reading HDF5 files, because they tend to have most of the metadata in the header area of the file, with short pieces of metadata "elsewhere"; so the default readahead doesn't perform very well.

As to what the two writers might be doing differently, I only have guesses. I imagine xarray leaves it entirely to HDF to make whatever choices it likes. Dask does not write in parallel, since HDF does not support that, but it may order the writes more logically. It does set up the whole set of variables as a initialisation stage before writing any data - I don't know if xarray does this.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  1581046647
Powered by Datasette · Queries took 2.041ms · About: xarray-datasette