home / github

Menu
  • GraphQL API
  • Search all tables

issue_comments

Table actions
  • GraphQL API for issue_comments

3 rows where author_association = "MEMBER" and issue = 1151751524 sorted by updated_at descending

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: reactions, created_at (date), updated_at (date)

user 3

  • rabernat 1
  • benbovy 1
  • max-sixty 1

issue 1

  • xr.doctor(): diagnostics on a Dataset / DataArray ? · 3 ✖

author_association 1

  • MEMBER · 3 ✖
id html_url issue_url node_id user created_at updated_at ▲ author_association body reactions performed_via_github_app issue
1305780610 https://github.com/pydata/xarray/issues/6308#issuecomment-1305780610 https://api.github.com/repos/pydata/xarray/issues/6308 IC_kwDOAMm_X85N1KGC benbovy 4160723 2022-11-07T15:28:35Z 2022-11-07T15:28:35Z MEMBER

The kind of data wrapped in an Xarray Dataset (e.g., a Numpy array, a Dask array or any other array #5648) is already something useful that xr.doctor or xr.describe may tell!

From my experience of introducing Xarray to new users, they often completely ignore what is under the hood until something or someone makes them aware, likely after they experience some weird behavior or performance issue that is hard to figure out by themselves. Xarray objects are flexible container wrappers connected to a wide range of other Python libraries, such that it is hard to give a short introduction that covers all the important aspects (lazy / non-lazy, chunked / non-chunked, etc.). For example, it may be possible that someone who has never heard of Dask nor Zarr follows an Xarray tutorial that starts by opening a chunked dataset from a zarr store. In this case the rich repr of the Xarray Dataset doesn't even help.

Rather than a performance report or a profiling tool, the proposal here (still very elusive) is to provide a helper function that returns some information and explanation in plain english (why not with some hyperlinks, pretty printing, etc.) that would help users making sense of an Xarray object and its wrapped data/metadata. Some kind of interactive documentation very specific to the actual Xarray object. Some kind of smart tool that would partially "replace" custom (though very basic) user support.

{
    "total_count": 2,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 2,
    "rocket": 0,
    "eyes": 0
}
  xr.doctor(): diagnostics on a Dataset / DataArray ? 1151751524
1300863799 https://github.com/pydata/xarray/issues/6308#issuecomment-1300863799 https://api.github.com/repos/pydata/xarray/issues/6308 IC_kwDOAMm_X85NiZs3 rabernat 1197350 2022-11-02T16:39:53Z 2022-11-02T16:39:53Z MEMBER

Just found this issue! I agree that this would be helpful. But isn't it fundamentally a Dask issue? Vanilla Xarray + Numpy has none of these problems because everything is in memory.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  xr.doctor(): diagnostics on a Dataset / DataArray ? 1151751524
1052626478 https://github.com/pydata/xarray/issues/6308#issuecomment-1052626478 https://api.github.com/repos/pydata/xarray/issues/6308 IC_kwDOAMm_X84-vc4u max-sixty 5635139 2022-02-26T21:16:05Z 2022-02-26T21:16:05Z MEMBER

Very much agree with the goal!

I wonder whether there's a broader approach with something like xr.describe — i.e. give lots of useful info about the metadata of the array, including any warnings. It's not that performance sensitive, so it would be fine to throw lots of things in there.

Either way, I'm a +1

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  xr.doctor(): diagnostics on a Dataset / DataArray ? 1151751524

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE [issue_comments] (
   [html_url] TEXT,
   [issue_url] TEXT,
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [created_at] TEXT,
   [updated_at] TEXT,
   [author_association] TEXT,
   [body] TEXT,
   [reactions] TEXT,
   [performed_via_github_app] TEXT,
   [issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
    ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
    ON [issue_comments] ([user]);
Powered by Datasette · Queries took 12.538ms · About: xarray-datasette