html_url,issue_url,id,node_id,user,created_at,updated_at,author_association,body,reactions,performed_via_github_app,issue
https://github.com/pydata/xarray/issues/6308#issuecomment-1305780610,https://api.github.com/repos/pydata/xarray/issues/6308,1305780610,IC_kwDOAMm_X85N1KGC,4160723,2022-11-07T15:28:35Z,2022-11-07T15:28:35Z,MEMBER,"The kind of data wrapped in an Xarray Dataset (e.g., a Numpy array, a Dask array or any other array #5648) is already something useful that `xr.doctor` or `xr.describe` may tell!
From my experience of introducing Xarray to new users, they often completely ignore what is under the hood until something or someone makes them aware, likely after they experience some weird behavior or performance issue that is hard to figure out by themselves. Xarray objects are flexible container wrappers connected to a wide range of other Python libraries, such that it is hard to give a short introduction that covers all the important aspects (lazy / non-lazy, chunked / non-chunked, etc.). For example, it may be possible that someone who has never heard of Dask nor Zarr follows an Xarray tutorial that starts by opening a chunked dataset from a zarr store. In this case the rich repr of the Xarray Dataset doesn't even help.
Rather than a performance report or a profiling tool, the proposal here (still very elusive) is to provide a helper function that returns some information and explanation in plain english (why not with some hyperlinks, pretty printing, etc.) that would help users making sense of an Xarray object and its wrapped data/metadata. Some kind of interactive documentation very specific to the actual Xarray object. Some kind of smart tool that would partially ""replace"" custom (though very basic) user support.
","{""total_count"": 2, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 2, ""rocket"": 0, ""eyes"": 0}",,1151751524
https://github.com/pydata/xarray/issues/6308#issuecomment-1301908749,https://api.github.com/repos/pydata/xarray/issues/6308,1301908749,IC_kwDOAMm_X85NmY0N,226720,2022-11-03T10:36:30Z,2022-11-03T10:36:30Z,NONE,"> Vanilla Xarray + Numpy has none of these problems because everything is in memory.
This is my understanding of xarray. Or is there a way that a xarray variable points to a dask structure?
> But isn't it fundamentally a Dask issue?
Dask has already some `performance_report` capabilities documented on https://docs.dask.org/en/stable/diagnostics-distributed.html#capture-diagnostics. Anything missing out there?","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,1151751524
https://github.com/pydata/xarray/issues/6308#issuecomment-1300863799,https://api.github.com/repos/pydata/xarray/issues/6308,1300863799,IC_kwDOAMm_X85NiZs3,1197350,2022-11-02T16:39:53Z,2022-11-02T16:39:53Z,MEMBER,Just found this issue! I agree that this would be helpful. But isn't it fundamentally a Dask issue? Vanilla Xarray + Numpy has none of these problems because everything is in memory.,"{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,1151751524
https://github.com/pydata/xarray/issues/6308#issuecomment-1052626478,https://api.github.com/repos/pydata/xarray/issues/6308,1052626478,IC_kwDOAMm_X84-vc4u,5635139,2022-02-26T21:16:05Z,2022-02-26T21:16:05Z,MEMBER,"Very much agree with the goal!
I wonder whether there's a broader approach with something like `xr.describe` — i.e. give _lots_ of useful info about the metadata of the array, including any warnings. It's not that performance sensitive, so it would be fine to throw lots of things in there.
Either way, I'm a +1","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,1151751524