home / github / issue_comments

Menu
  • Search all tables
  • GraphQL API

issue_comments: 1428872842

This data as json

html_url issue_url id node_id user created_at updated_at author_association body reactions performed_via_github_app issue
https://github.com/pydata/xarray/issues/7522#issuecomment-1428872842 https://api.github.com/repos/pydata/xarray/issues/7522 1428872842 IC_kwDOAMm_X85VKt6K 39069044 2023-02-13T23:49:31Z 2023-02-13T23:49:31Z CONTRIBUTOR

I did try many loops and different order of operations to make sure this isn't a caching or auth issue. You can see the std dev of the timeit calls above is pretty consistent.

For my actual use case, the difference is very apparent, with open_dataset taking about 9 seconds on the numpy-saved file and <1 second on the dask-saved one. I can also clearly see when monitoring network traffic that the slow version has to read in hundreds of MB of data to open the dataset, while the fast one only reads the tiny headers.

I also inspected the actual header bytes of these two files and see they are indeed different.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  1581046647
Powered by Datasette · Queries took 0.744ms · About: xarray-datasette