home / github / issues

Menu
  • Search all tables
  • GraphQL API

issues: 1952739368

This data as json

id node_id number title user state locked assignee milestone comments created_at updated_at closed_at author_association active_lock_reason draft pull_request body reactions performed_via_github_app state_reason repo type
1952739368 PR_kwDOAMm_X85dTMKv 8339 Reduce dask tokenization time 6042212 closed 0     6 2023-10-19T17:22:06Z 2023-10-20T23:13:44Z 2023-10-20T23:13:43Z CONTRIBUTOR   0 pydata/xarray/pulls/8339

When using dask (e.g., chunks={} with a zarr dataset), each dask.array gets a token. Calculating this token currently hits a recursive path within dask and is relatively slow (~10ms), which adds up for many variables. This PR makes a simpler but still unique token.

An example profile of open_dataset before:

and after

{
    "url": "https://api.github.com/repos/pydata/xarray/issues/8339/reactions",
    "total_count": 2,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 2,
    "eyes": 0
}
    13221727 pull

Links from other tables

  • 1 row from issues_id in issues_labels
  • 0 rows from issue in issue_comments
Powered by Datasette · Queries took 0.588ms · About: xarray-datasette