issue_comments
14 rows where author_association = "MEMBER", issue = 628719058 and user = 35968931 sorted by updated_at descending
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
904817641 | https://github.com/pydata/xarray/issues/4118#issuecomment-904817641 | https://api.github.com/repos/pydata/xarray/issues/4118 | IC_kwDOAMm_X8417mvp | TomNicholas 35968931 | 2021-08-24T17:00:24Z | 2022-05-19T16:33:26Z | MEMBER | So I had a crack at making a full It's based on @benbovy's Some limitations of the approach I used are:
- Each dataset in the tree is entirely separate, so doing something like You can create a It's about 70% working, but some things I could do with some help with are:
1) ~Fundamental design questions about the class structure, such as whether There will definitely be many bugs, but any thoughts or input appreciated! |
{ "total_count": 8, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 8, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
1047944213 | https://github.com/pydata/xarray/issues/4118#issuecomment-1047944213 | https://api.github.com/repos/pydata/xarray/issues/4118 | IC_kwDOAMm_X84-dlwV | TomNicholas 35968931 | 2022-02-22T15:58:48Z | 2022-02-22T15:58:48Z | MEMBER | Also thanks @OriolAbril, it's useful to have an ArviZ perspective.
I see In either case I imagine all we might need to do is slightly extend |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
1047932340 | https://github.com/pydata/xarray/issues/4118#issuecomment-1047932340 | https://api.github.com/repos/pydata/xarray/issues/4118 | IC_kwDOAMm_X84-di20 | TomNicholas 35968931 | 2022-02-22T15:47:15Z | 2022-02-22T15:50:41Z | MEMBER | Hi @LunarLanding , thanks for your ideas!
It sounds a bit like what you are suggesting is essentially a model in which dimensions are explicit objects, which can be referred to from other groups, like in netCDF. (NetCDF has "dimension IDs".) This would be a bit of a departure from the model that
By "variable" length, do you mean that the length of dimensions differs between variables in the same group, or just that you don't know the length of the dimension in advance? Is there a specific use case which you think would require explicit dimensions to solve? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
1043638105 | https://github.com/pydata/xarray/issues/4118#issuecomment-1043638105 | https://api.github.com/repos/pydata/xarray/issues/4118 | IC_kwDOAMm_X84-NKdZ | TomNicholas 35968931 | 2022-02-17T23:47:44Z | 2022-02-17T23:47:44Z | MEMBER |
@alexamici can you expand on the role of the CF conventions in this statement? Are you talking about CF conventions allowing one variable in one group to refer to dimension present in another group, or something else? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
1039572760 | https://github.com/pydata/xarray/issues/4118#issuecomment-1039572760 | https://api.github.com/repos/pydata/xarray/issues/4118 | IC_kwDOAMm_X8499p8Y | TomNicholas 35968931 | 2022-02-14T21:19:56Z | 2022-02-14T21:40:21Z | MEMBER | We would like some opinions from the community on two different possible models for a tree-like structure in xarray. A tree contains many groups, but the question is what constraints should be imposed on the contents of those groups.
This is by no means the only question, and we have various choices to make within these options. The questions for the potential users here are:
- Do you have use cases which one of these designs could handle but the other couldn't?
- How important to you is being able to support all valid files of these certain formats?
- Which of these designs is clearer/more intuitive/more appealing to you?
(@alexamici, @shoyer, @jhamman, @aurghs please edit this comment to add anything I've missed) |
{ "total_count": 2, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 2, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
905472692 | https://github.com/pydata/xarray/issues/4118#issuecomment-905472692 | https://api.github.com/repos/pydata/xarray/issues/4118 | IC_kwDOAMm_X841-Gq0 | TomNicholas 35968931 | 2021-08-25T12:50:04Z | 2021-08-25T13:02:10Z | MEMBER | Thanks @benbovy !
I don't know much about HTML, but graphs where you can mouseover nodes to see node information sound awesome!
They aren't separate: The idea was that creating a single node of a tree by specifying only its We could just merge the two signatures into one
They were originally separate (I had
Good to know that other nested structures took a similar approach. I think that as we want to be able to save and load any subtree even after changing parents etc. then we ideally don't want to treat any one node as special. |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
904987705 | https://github.com/pydata/xarray/issues/4118#issuecomment-904987705 | https://api.github.com/repos/pydata/xarray/issues/4118 | IC_kwDOAMm_X8418QQ5 | TomNicholas 35968931 | 2021-08-24T21:25:17Z | 2021-08-24T21:25:37Z | MEMBER | Thanks @jhamman - expect things to break as I keep realizing certain methods have to be defined differently from in Dataset for things to work. Help with 3 would be especially appreciated, as at the moment whilst I can open and alter a file with groups, I can't save my resulting tree :sweat_smile: |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
901954045 | https://github.com/pydata/xarray/issues/4118#issuecomment-901954045 | https://api.github.com/repos/pydata/xarray/issues/4118 | IC_kwDOAMm_X841wrn9 | TomNicholas 35968931 | 2021-08-19T14:16:45Z | 2021-08-19T14:16:45Z | MEMBER | Oh excellent, thanks for the clarification Stephan!
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
901594249 | https://github.com/pydata/xarray/issues/4118#issuecomment-901594249 | https://api.github.com/repos/pydata/xarray/issues/4118 | IC_kwDOAMm_X841vTyJ | TomNicholas 35968931 | 2021-08-19T04:10:30Z | 2021-08-19T04:10:30Z | MEMBER | I think that xarray's current use of both dict-like access and attribute-like access for variables makes representing a general netCDF file in a single Consider a tree with a node structure for a hypothetical
We ideally want to be able to seamlessly access both subtrees and individual variables via chains of keys, e.g.
This particular example is fine, and would correspond to a netCDF file with groups "root", "root/weather", and "root/weather/temperature", plus the four stored DataArray variables. However, if one of the variables has the same name as one of the groups (which I think is permitted in the netCDF format), then there is no easy way to access all the elements whilst retaining the nice syntax. For example consider
Now we have a key collision between the group named "B" and the DataArray named "B", i.e. We can't just forbid this type of tree because then there would be netCDF files that we couldn't represent as a We can't use different types of access (e.g. (We could divide access through The only way I can see around this is to hide a node's data variables behind a It sounds like @emilbiju avoided this by not satisfying
so I'm wondering if anyone else has other suggestions or thoughts? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
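The key collision described in the comment above (a group and a DataArray sharing the name "B" under the same parent) can be illustrated with a minimal sketch. The `Node` class here is hypothetical and is not the datatree or xarray API; it only assumes that a node resolves a key against both its variables and its child groups, as a single dict-like interface would have to.

```python
# Hypothetical minimal tree node, for illustration only (not the xarray/datatree API).
class Node:
    def __init__(self, name, variables=None, children=None):
        self.name = name
        self.variables = dict(variables or {})
        self.children = {child.name: child for child in (children or [])}

    def __getitem__(self, key):
        in_vars = key in self.variables
        in_children = key in self.children
        if in_vars and in_children:
            # One key, two possible meanings: the lookup cannot choose.
            raise KeyError(f"ambiguous key {key!r}: names both a variable and a group")
        if in_vars:
            return self.variables[key]
        if in_children:
            return self.children[key]
        raise KeyError(key)


# No collision: chains of keys reach both subtrees and variables.
root = Node("root", children=[Node("weather", variables={"temperature": [280.0, 281.5]})])
temperature = root["weather"]["temperature"]

# Collision: node "A" holds a child group "B" and a variable "B".
clash = Node("A", variables={"B": [1, 2, 3]}, children=[Node("B")])
try:
    clash["B"]
    collided = False
except KeyError:
    collided = True
```

In this sketch the ambiguity is surfaced as an error; the alternatives discussed in the comment (forbidding such trees, or separating variable access from group access) each give up something else.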
873492892 | https://github.com/pydata/xarray/issues/4118#issuecomment-873492892 | https://api.github.com/repos/pydata/xarray/issues/4118 | MDEyOklzc3VlQ29tbWVudDg3MzQ5Mjg5Mg== | TomNicholas 35968931 | 2021-07-04T00:51:19Z | 2021-07-04T00:51:19Z | MEMBER | Some other thoughts about tags: 1) Does the definition of tags include variable names of DataArrays? I think it should. 2) As @martinitus mentioned, a 3) Selecting via tags would need to allow a distinction between "get me all leaves with these exact tags" and "get me all leaves whose tags include these ones". Maybe 4) The latter type of tag-based access would make plotting different leaves against one another easier too - given a multi-resolution (or multi-model) datatree like this:
then assuming that the definition of tags included the DataArray variable names, then
would select all leaves with a temperature tag, check that the temperature DataArrays had the same dimensions (but no need for any 5) With a tag-based system you can create cycles of tags, like A&B, B&C, C&A, which you can't really do with hierarchical trees. I don't think that actually causes any problems though... |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
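The distinction in point 3 of the comment above, between "all leaves with these exact tags" and "all leaves whose tags include these ones", might be sketched as follows. The storage layout (a dict keyed by `frozenset` of tags) and the function names `select_exact` and `select_including` are assumptions made for this illustration, not a proposed xarray API.

```python
# Hypothetical leaves keyed by their tag sets; names and values are illustrative only.
leaves = {
    frozenset({"model_a", "temperature"}): "leaf 1",
    frozenset({"model_b", "temperature"}): "leaf 2",
    frozenset({"model_a", "pressure"}): "leaf 3",
}


def select_exact(leaves, tags):
    """Return leaves whose tag set is exactly `tags`."""
    wanted = frozenset(tags)
    return [leaf for leaf_tags, leaf in leaves.items() if leaf_tags == wanted]


def select_including(leaves, tags):
    """Return leaves whose tag set contains every tag in `tags`."""
    wanted = frozenset(tags)
    return [leaf for leaf_tags, leaf in leaves.items() if wanted <= leaf_tags]


exact_hits = select_exact(leaves, {"model_a", "temperature"})
including_hits = select_including(leaves, {"temperature"})
```

The subset-style query is the one that would gather the temperature leaves from every model for comparison or plotting.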
873307873 | https://github.com/pydata/xarray/issues/4118#issuecomment-873307873 | https://api.github.com/repos/pydata/xarray/issues/4118 | MDEyOklzc3VlQ29tbWVudDg3MzMwNzg3Mw== | TomNicholas 35968931 | 2021-07-02T23:54:09Z | 2021-07-02T23:54:09Z | MEMBER | @shoyer if you used tags wouldn't you lose the ability to round-trip a netCDF file with groups? When you read in the groups from the file you would be throwing information away by going from a hierarchy A/B to simply tags A&B, and there wouldn't be a way to restore that before calling |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
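The round-trip concern in the comment above can be made concrete with a small sketch: if a group path is reduced to an unordered set of tags, two different hierarchies can become indistinguishable. The helper `path_to_tags` is hypothetical and only assumes that each path component becomes one tag.

```python
def path_to_tags(path):
    """Collapse a group path such as "A/B" into an unordered set of tags."""
    return frozenset(part for part in path.split("/") if part)


tags_ab = path_to_tags("A/B")
tags_ba = path_to_tags("B/A")

# Both hierarchies collapse to the same tag set, so the original nesting
# order cannot be recovered from the tags alone.
indistinguishable = tags_ab == tags_ba
```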
873231425 | https://github.com/pydata/xarray/issues/4118#issuecomment-873231425 | https://api.github.com/repos/pydata/xarray/issues/4118 | MDEyOklzc3VlQ29tbWVudDg3MzIzMTQyNQ== | TomNicholas 35968931 | 2021-07-02T20:05:06Z | 2021-07-02T20:05:06Z | MEMBER |
That is interesting. I think there is an argument for using a hierarchical model to map onto the full netCDF data model with groups, but perhaps methods to select elements via tags could be included too, for the best of both? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
873179375 | https://github.com/pydata/xarray/issues/4118#issuecomment-873179375 | https://api.github.com/repos/pydata/xarray/issues/4118 | MDEyOklzc3VlQ29tbWVudDg3MzE3OTM3NQ== | TomNicholas 35968931 | 2021-07-02T18:22:49Z | 2021-07-02T18:22:49Z | MEMBER | Flagging another possible use case, this time in Magnetic Confinement Fusion: representing the IMAS data model. IMAS is currently closed-source (being part of the ITER project), but there is a big push to make it open-source and the standard data model for tokamak plasma data. I'm not very familiar with IMAS (@smithsp and @orso82 are more so), but it is hierarchical. There is some more information in appendix A3 of this paper, which talks about "taking advantage of the homogeneity of grid sizes that is commonly found across arrays of structures", which sounds very closely related to the This might allow the |
{ "total_count": 2, "+1": 2, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 | |
808366093 | https://github.com/pydata/xarray/issues/4118#issuecomment-808366093 | https://api.github.com/repos/pydata/xarray/issues/4118 | MDEyOklzc3VlQ29tbWVudDgwODM2NjA5Mw== | TomNicholas 35968931 | 2021-03-26T16:47:53Z | 2021-03-26T16:47:53Z | MEMBER | This sounds like an interesting project - I'm also about to be able to work on xarray much more directly (thanks @rabernat ). Should I add this as another xarray project board alongside explicit indexes and so on? I wonder if this could find another domain use case in plasmapy as part of the overall |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Feature Request: Hierarchical storage and processing in xarray 628719058 |
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);