issue_comments
12 rows where author_association = "MEMBER", issue = 262642978 and user = 4160723 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: reactions, created_at (date), updated_at (date)
issue 1
- Explicit indexes in xarray's data-model (Future of MultiIndex) · 12 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
1259326037 | https://github.com/pydata/xarray/issues/1603#issuecomment-1259326037 | https://api.github.com/repos/pydata/xarray/issues/1603 | IC_kwDOAMm_X85LD8pV | benbovy 4160723 | 2022-09-27T10:50:36Z | 2022-09-27T10:50:36Z | MEMBER | Should we close this issue and continue the discussion in #6293? For anyone who wants to track the progress on this topic: https://github.com/pydata/xarray/projects/1 |
{ "total_count": 2, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 2, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
949494376 | https://github.com/pydata/xarray/issues/1603#issuecomment-949494376 | https://api.github.com/repos/pydata/xarray/issues/1603 | IC_kwDOAMm_X844mCJo | benbovy 4160723 | 2021-10-22T10:27:26Z | 2021-10-22T10:27:26Z | MEMBER |
Agreed, and both are supported by xarray actually. In case we want to keep the original dimensions like ("x", "y") in the example above, it's better to use masking. This discussion is broader than the topic covered in this issue so I'd suggest you start a new discussion if you want to further discuss this with the xarray community. Thanks. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
949449312 | https://github.com/pydata/xarray/issues/1603#issuecomment-949449312 | https://api.github.com/repos/pydata/xarray/issues/1603 | IC_kwDOAMm_X844l3Jg | benbovy 4160723 | 2021-10-22T09:28:01Z | 2021-10-22T09:28:01Z | MEMBER | For such case you could already do After the explicit index refactor, we could imagine a custom index that supports multi-dimension coordinates such that you would only need to do something like ```python
or without explicitly providing the name of the packed dimension: ```python
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
949413144 | https://github.com/pydata/xarray/issues/1603#issuecomment-949413144 | https://api.github.com/repos/pydata/xarray/issues/1603 | IC_kwDOAMm_X844luUY | benbovy 4160723 | 2021-10-22T08:41:36Z | 2021-10-22T08:41:36Z | MEMBER | Sorry but this is confusing. To me It still looks like you want implicit broadcasting of the |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
949358898 | https://github.com/pydata/xarray/issues/1603#issuecomment-949358898 | https://api.github.com/repos/pydata/xarray/issues/1603 | IC_kwDOAMm_X844lhEy | benbovy 4160723 | 2021-10-22T07:22:24Z | 2021-10-22T07:22:24Z | MEMBER | Thanks for the detailed description @weipeng1999. For the first 4 slides I don't see how this is different from how does |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
946474674 | https://github.com/pydata/xarray/issues/1603#issuecomment-946474674 | https://api.github.com/repos/pydata/xarray/issues/1603 | IC_kwDOAMm_X844ag6y | benbovy 4160723 | 2021-10-19T08:19:54Z | 2021-10-19T08:19:54Z | MEMBER | Hi @weipeng1999, I'm not sure to fully understand your suggestion, would you mind sharing some illustrative examples? It is useful to have two distinct It also helps to have a clear separation between the Currently in Xarray the It looks like what you suggest is some kind of implicit (co-)indexes hidden behind any dataset variable(s)? We actually took the opposite direction, trying to make everything explicit. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
444403484 | https://github.com/pydata/xarray/issues/1603#issuecomment-444403484 | https://api.github.com/repos/pydata/xarray/issues/1603 | MDEyOklzc3VlQ29tbWVudDQ0NDQwMzQ4NA== | benbovy 4160723 | 2018-12-05T08:39:35Z | 2018-12-05T08:39:35Z | MEMBER |
Agreed. It seems very strict indeed, but it will be easier to relax this later than the other way. There is also a (very rare?) case where the two indexed coordinates have the same labels but are named differently in the two datasets (e.g., |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
444132393 | https://github.com/pydata/xarray/issues/1603#issuecomment-444132393 | https://api.github.com/repos/pydata/xarray/issues/1603 | MDEyOklzc3VlQ29tbWVudDQ0NDEzMjM5Mw== | benbovy 4160723 | 2018-12-04T15:06:21Z | 2018-12-04T15:19:08Z | MEMBER |
Sorry for maybe asking this again but I'm a bit confused now: is there any good reason of supporting "multiple single indexes" along the same dimension? After all, perhaps better defaults would be to set indexes ( If you want a different behavior, then you need to use
I think that one big source of confusion has been so far mixing coordinates/variables and indexes. These are really two separate concepts, and the indexes refactoring should address that IMHO. For example, I think that Take for example ```python
I find it so weird being able to do this: ```python
Where does come from I might be a good thing explicitly requiring |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
443172604 | https://github.com/pydata/xarray/issues/1603#issuecomment-443172604 | https://api.github.com/repos/pydata/xarray/issues/1603 | MDEyOklzc3VlQ29tbWVudDQ0MzE3MjYwNA== | benbovy 4160723 | 2018-11-30T11:14:24Z | 2018-11-30T11:14:24Z | MEMBER | A couple of thoughts: If nothing useful can be done in the case of "multiple single indexes", would it make sense to discourage users explicitly creating multiple single indexes along a dimension? "Multiple single indexes" would be just a default situation when nothing specific as been defined yet or resulting from a failback. For example, why not requiring that Hence, would it be possible to avoid |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
442907394 | https://github.com/pydata/xarray/issues/1603#issuecomment-442907394 | https://api.github.com/repos/pydata/xarray/issues/1603 | MDEyOklzc3VlQ29tbWVudDQ0MjkwNzM5NA== | benbovy 4160723 | 2018-11-29T16:49:12Z | 2018-11-29T17:18:10Z | MEMBER |
Indeed I haven't really thought about How do you currently Contrary to Wouldn't be possible to easily support
This is a good question. A related question: apart from
I agree, although whether or not we will eventually support custom indexes might influence the design choices that we have to do now, IMO. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
442797084 | https://github.com/pydata/xarray/issues/1603#issuecomment-442797084 | https://api.github.com/repos/pydata/xarray/issues/1603 | MDEyOklzc3VlQ29tbWVudDQ0Mjc5NzA4NA== | benbovy 4160723 | 2018-11-29T11:15:17Z | 2018-11-29T11:15:17Z | MEMBER |
Looking at the reported issues related to multi-indexes in xarray, I have the same feeling. Simply reusing If we re-design indexes so that we allow 3rd-party indexes, maybe we could support both and let the user choose the one (xarray or pandas baked) that best suits his needs? Regarding MultiIndex as part of the data schema vs an implementation detail, if we support extending indexes (and already given the different kinds of multi-coordinate indexes: MultiIndex, KDTree, etc.), then I think that it should be transparent to the user. However, I don't really see why a multi-coordinate index should have its own variable (with tuples of values). I don't want to speak for others, but IMHO If a variable for each multi-coordinate index is "just" for data schema consistency, then why not showing all those indexes in a separate section of the repr? For example:
It is equally transparent, not more verbose, and it is clear that multi-indexes are not part of the coordinates (in fact there is no need of "virtual" coordinates either, nor to name the index). I don't think single indexes should be shown here as it would results in duplicated, uninformative lines. More generally, here is how I would see indexes handled in xarray (I might be missing important aspects, though):
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 | |
334091075 | https://github.com/pydata/xarray/issues/1603#issuecomment-334091075 | https://api.github.com/repos/pydata/xarray/issues/1603 | MDEyOklzc3VlQ29tbWVudDMzNDA5MTA3NQ== | benbovy 4160723 | 2017-10-04T08:52:08Z | 2017-10-04T08:52:08Z | MEMBER | I think that promoting "Indexes" to a first-class concept is indeed a very good idea, at both internal and public levels, even if at the latter level it would be another concept for users (it should be already familiar for pandas users, though). IMHO the "coordinate" and "index" concepts are different enough to consider them separately. I like the proposed repr for I have to think a bit more about the details but I like the idea. |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 1