issue_comments
7 rows where author_association = "MEMBER" and issue = 1376109308 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: reactions, created_at (date), updated_at (date)
issue 1
- Should Xarray stop doing automatic index-based alignment? · 7 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
1326262197 | https://github.com/pydata/xarray/issues/7045#issuecomment-1326262197 | https://api.github.com/repos/pydata/xarray/issues/7045 | IC_kwDOAMm_X85PDSe1 | benbovy 4160723 | 2022-11-24T10:35:02Z | 2022-11-24T10:35:02Z | MEMBER | I find the analogy with relational databases quite meaningful! Rectangular grids likely have been the primary use case in Xarray for a long time, but I wonder to which extent it is the case nowadays. Probably a good question to ask for the next user survey? Interestingly, the 2021 user survey results (*) show that "interoperability with pandas" is not a critical feature while "label-based indexing, interpolation, groupby, reindexing, etc." is most important, although the description of the latter is rather broad. It would be interesting to compute the correlation between these two variables. The results also show that "more flexible indexing (selection, alignment)" is very useful or critical for 2/3 of the participants. Not sure how to interpret those results within the context of this discussion, though. (*) The 2022 user survey results doesn't show significant differences in general
Not that improbable for unstructured meshes, curvilinear grids, staggered grids, etc. Xarray is often chosen to handle them too (e.g., uxarray, xgcm). |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Should Xarray stop doing automatic index-based alignment? 1376109308 | |
1251975597 | https://github.com/pydata/xarray/issues/7045#issuecomment-1251975597 | https://api.github.com/repos/pydata/xarray/issues/7045 | IC_kwDOAMm_X85Kn6Gt | benbovy 4160723 | 2022-09-20T07:51:45Z | 2022-09-20T07:51:45Z | MEMBER |
Another solution for more flexibility or a smooth transition may be to add a build option to the
I agree, although this is getting addressed slowly but surely. In Xarray internals, most of the indexes logic is now in the IMO nearly all the complication and confusion emerge from the mixed concept of a dimension coordinate in the Xarray data model. Once the concept of an index is clearly decoupled from the concept of a coordinate and both concepts are represented as 1st-class citizens, it will help users focusing on the parts of the API and/or documentation that are relevant to their needs. It will also help "selling" Xarray to users who don't need much of the index capabilities (this has been discussed several times, either as external feedback or between Xarray devs, e.g., proposal of a "xarray-lite" package). Finally it will make more affordable major changes such as the one proposed here by @shoyer. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Should Xarray stop doing automatic index-based alignment? 1376109308 | |
1250007817 | https://github.com/pydata/xarray/issues/7045#issuecomment-1250007817 | https://api.github.com/repos/pydata/xarray/issues/7045 | IC_kwDOAMm_X85KgZsJ | kmuehlbauer 5821660 | 2022-09-17T05:55:58Z | 2022-09-17T05:55:58Z | MEMBER | I still find myself struggling to understand which of those options are needed for my use cases (inner, outer etc.). Default is working in many cases, but in other cases it is trial and error. In that sense this proposal would make me have to really understand what's going on. The suggestion of another mode by @max-sixty just made me think, if this automatic alignment machinery could be moved to another package. If that package is installed the current behaviour is preserved, if not then the new behaviour proposed by @shoyer comes into play. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Should Xarray stop doing automatic index-based alignment? 1376109308 | |
1249929257 | https://github.com/pydata/xarray/issues/7045#issuecomment-1249929257 | https://api.github.com/repos/pydata/xarray/issues/7045 | IC_kwDOAMm_X85KgGgp | max-sixty 5635139 | 2022-09-16T23:14:26Z | 2022-09-16T23:14:26Z | MEMBER | I think I really empathize with the pain here. There's a very real explicitness vs "helpfulness" tradeoff, often depending on whether people are doing exploratory research vs hardened production (a bit like Ask vs Guess culture!). But from the perspective of someone who works with lots of people who use Xarray for their daily research, I think this would be a big hurdle, even without considering the change costs. One analogy is xarray vs. pandas for 2D data — among my colleagues xarray is known to be a smaller, more reliable API surface, while pandas is more fully featured but also a maze of surprising methods and behavior ( "Make another mode" can seem like an easy decision — "who doesn't want another mode" — but it could make development more difficult, since we'd need calls to check which mode we're in & tests for those. It's not insurmountable though, and maybe it would only be required in a couple of methods, so testing those would be sufficient to ensure the resulting behavior would be correct? (FWIW we don't use float indexes, so it could be fine to dispense with those) |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Should Xarray stop doing automatic index-based alignment? 1376109308 | |
1249910951 | https://github.com/pydata/xarray/issues/7045#issuecomment-1249910951 | https://api.github.com/repos/pydata/xarray/issues/7045 | IC_kwDOAMm_X85KgCCn | shoyer 1217238 | 2022-09-16T22:26:36Z | 2022-09-16T22:26:36Z | MEMBER | As a concrete example, suppose we have two datasets: 1. Hourly predictions for 10 days 2. Daily observations for a month. ```python import numpy as np import pandas as pd import xarray predictions = xarray.DataArray( np.random.RandomState(0).randn(24*10), {'time': pd.date_range('2022-01-01', '2022-01-11', freq='1h', closed='left')}, ) observations = xarray.DataArray( np.random.RandomState(1).randn(31), {'time': pd.date_range('2022-01-01', '2022-01-31', freq='24h')}, ) ``` Today, if you compare these datasets, they automatically align: ```
With this proposed change, you would get an error, e.g., something like: ```
Instead, you would need to manually align these objects, e.g., with
To (partially) simulate the effect of this change on a codebase today, you could write |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Should Xarray stop doing automatic index-based alignment? 1376109308 | |
1249601076 | https://github.com/pydata/xarray/issues/7045#issuecomment-1249601076 | https://api.github.com/repos/pydata/xarray/issues/7045 | IC_kwDOAMm_X85Ke2Y0 | shoyer 1217238 | 2022-09-16T17:16:52Z | 2022-09-16T17:18:38Z | MEMBER |
The problem is that user expectations are actually rather different for different options:
This would definitely be a step forward! However, it's a tricky nut to crack. We would both need a heuristic for defining Even then, automatic alignment is often problematic, e.g., imagine cases where a coordinate is defined in separate units. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Should Xarray stop doing automatic index-based alignment? 1376109308 | |
1249580349 | https://github.com/pydata/xarray/issues/7045#issuecomment-1249580349 | https://api.github.com/repos/pydata/xarray/issues/7045 | IC_kwDOAMm_X85KexU9 | dcherian 2448579 | 2022-09-16T16:51:55Z | 2022-09-16T16:51:55Z | MEMBER | I think I agree here but a lot of things are going to break. IMO we could first align (hah) these choices to be the same:
so that they're all controlled by
What do you think of making the default FloatIndex use a reasonable (hard to define!) |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Should Xarray stop doing automatic index-based alignment? 1376109308 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 5