github: issue_comments: 11 rows where author_association = "NONE" and issue = 262642978 sorted by updated

11 rows where author_association = "NONE" and issue = 262642978 sorted by updated_at descending

Search:

descending

id	html_url	issue_url	node_id	user	created_at	updated_at ▲	author_association	body	reactions	issue
949485684	https://github.com/pydata/xarray/issues/1603#issuecomment-949485684	https://api.github.com/repos/pydata/xarray/issues/1603	IC_kwDOAMm_X844mAB0	weipeng1999 38346144	2021-10-22T10:15:39Z	2021-10-22T10:15:39Z	NONE	So I think maintain the origin dims may do less broken on current code.	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
949484507	https://github.com/pydata/xarray/issues/1603#issuecomment-949484507	https://api.github.com/repos/pydata/xarray/issues/1603	IC_kwDOAMm_X844l_vb	weipeng1999 38346144	2021-10-22T10:14:01Z	2021-10-22T10:14:01Z	NONE	For such case you could already do `ds.stack(z=("t", "x")).set_index(z="C2").sel(z=["a", "e", "h"])`. After the explicit index refactor, we could imagine a custom index that supports multi-dimension coordinates such that you would only need to do something like ```python S_res = S4.sel(C2=("z", ["a", "e", "h"])) S_res <xarray.Dataset> Dimensions: (z: 3) Coordinates: * C2 (z) <U1 'a' 'e' 'h' Data variables: A1 (z) float64 4 3 3 ``` or without explicitly providing the name of the packed dimension: ```python S_res = S4.sel(C2=["a", "e", "h"]) S_res <xarray.Dataset> Dimensions: (C2: 3) Coordinates: * C2 (C2) <U1 'a' 'e' 'h' Data variables: A1 (C2) float64 4 3 3 ``` well, both "contain the origin dims" or just "generate another one" have its benefit. if we contain origin dims, we can ensure that: - less difference between 1d coordinate and multi dims ones, both can run like S1.sel(C1=["a", "e", "h"]) S4.sel(C2=["a", "e", "h"]) and return a new data set with origin dims ( that's why I highly not recommended the implicit one ) - return a new data set have original dims which means if you change C1 to C2, and the rest code have S_res.sel(x=[1,2,3]) still work.	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
949423480	https://github.com/pydata/xarray/issues/1603#issuecomment-949423480	https://api.github.com/repos/pydata/xarray/issues/1603	IC_kwDOAMm_X844lw14	weipeng1999 38346144	2021-10-22T08:56:38Z	2021-10-22T09:15:17Z	NONE	well, here are my ideas on how to define coordinates with multi dims.(because of github's bug, the characters of 1st image are white, I can not fix it)	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
949401881	https://github.com/pydata/xarray/issues/1603#issuecomment-949401881	https://api.github.com/repos/pydata/xarray/issues/1603	IC_kwDOAMm_X844lrkZ	weipeng1999 38346144	2021-10-22T08:25:54Z	2021-10-22T08:25:54Z	NONE	Thanks for the detailed description @weipeng1999. For the first 4 slides I don't see how this is different from how does `S_res = S1.sel(C1=['a', 'b']` and `S_res = S2.sel(C1=['a', 'b'])` currently? And for the last 2 slides, I don't think that we always want such implicit broadcasting for dimensions that are not involved in the indexed coordinates. thank you for figuring out the wrong things what I done. Well, it' is hard to explain the idea because it is a bit complicated, the last two picture is wrong and make misunderstanding, here are two images explain what I actuarily mean:	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
947480352	https://github.com/pydata/xarray/issues/1603#issuecomment-947480352	https://api.github.com/repos/pydata/xarray/issues/1603	IC_kwDOAMm_X844eWcg	weipeng1999 38346144	2021-10-20T09:15:41Z	2021-10-20T09:15:41Z	NONE	Hi @weipeng1999, I'm not sure to fully understand your suggestion, would you mind sharing some illustrative examples? It is useful to have two distinct `coordinate variable` vs `data variable` concepts. Although both are data arrays, the former is used to locate data in the dimensional space(s) defined by all dimensions in the dataset while the latter is used to store field data. It also helps to have a clear separation between the `coordinate variable` and `index` concepts. An index is a specific data structure or object that allows efficient data extraction or alignment based one or more coordinate labels. Sometimes an index object may be handled like a data array (like pandas indexes) but this is not always the case (e.g., a KD-Tree). Currently in Xarray the `index` concept is hidden behind "dimension" coordinate variables. The goal of the explicit index refactor is to bring it to the light and make it available to any coordinate (and also open it to custom index structures, not only pandas indexes). It looks like what you suggest is some kind of implicit (co-)indexes hidden behind any dataset variable(s)? We actually took the opposite direction, trying to make everything explicit. Try to explain my idea, I make a PPT.	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
946337314	https://github.com/pydata/xarray/issues/1603#issuecomment-946337314	https://api.github.com/repos/pydata/xarray/issues/1603	IC_kwDOAMm_X844Z_Yi	weipeng1999 38346144	2021-10-19T03:32:13Z	2021-10-19T03:33:54Z	NONE	Well, maybe we can consider the coordinates in a more generic way. Let us define coordinate an array in data set cause co-indexed when we index its data set. It means that: If A1,A2,A3 are in a same data set S, we index S[ {'A1':I} ] will return a new data set which not only have indexed A1, but they also been Indexed that the A2 A3 which have dims shared with A1. This behavior I call it co-index. Use dims to determined the way how other array of the data set will be co-indexed. If all dims of A1(as coordinate) are also in A2(as regular array co-indexed), obviously the behavior can simply follow the old behavior, just change at the same dim and contain others. If A1 has a dim which not in A2, we should broadcast A2 at the dim, because the older behavior is to consider None dim as broadcast-able dim during other operation so co-index should follow it. Some compatibility issues: maybe need a New Type like DataArray but only have dims instead of both dims and coordinate just define how Dataset to deal with index, maybe DataArray is simlar.	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
822122172	https://github.com/pydata/xarray/issues/1603#issuecomment-822122172	https://api.github.com/repos/pydata/xarray/issues/1603	MDEyOklzc3VlQ29tbWVudDgyMjEyMjE3Mg==	Hoeze 1200058	2021-04-19T02:18:58Z	2021-04-19T02:19:24Z	NONE	Many array types do have implicit indices. For example, sparse arrays do have their coordinates / CSR representation as primary index (`.sel()`) while dense array's primary index is the position (`.isel()`). Every labeled dimension is therefore just a separate mapping of a string to the index position in the array. Going one step further, one could have continuous dimensions where positional indexing (`.isel()`) does not really make sense. Looking at TileDB's dimensions provides an example for this. => Having explicit and implicit indices on arrays would be awesome, even if they don't support all xarray features!	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
557579503	https://github.com/pydata/xarray/issues/1603#issuecomment-557579503	https://api.github.com/repos/pydata/xarray/issues/1603	MDEyOklzc3VlQ29tbWVudDU1NzU3OTUwMw==	NowanIlfideme 2067093	2019-11-22T15:34:57Z	2019-11-22T15:34:57Z	NONE	Thanks @NowanIlfideme for your feedback. Could you perhaps share a gist of code related to your use case? The first example in this comment is similar to my use case: https://github.com/pydata/xarray/issues/3213#issuecomment-520741706 . There are several "core" dimensions, but some part of the coordinates may be hierarchical or cross-defined (e.g. country > province > city > building, but also country > province > voting district > building). We might have a full or nearly-full panel in the MultiIndex representation, but have a huge cross product (even if we keep strictly hierarchical dimensions out). Meanwhile using a true COO sparse representation (as I understand it) will likely end up with slower operations overall, since nearly all machine learning models (think: linear regression) require a dense array input anyways. I'll make an example of this when I find some free time, along with a contrasting one in Pandas. :)	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
557563566	https://github.com/pydata/xarray/issues/1603#issuecomment-557563566	https://api.github.com/repos/pydata/xarray/issues/1603	MDEyOklzc3VlQ29tbWVudDU1NzU2MzU2Ng==	NowanIlfideme 2067093	2019-11-22T14:59:29Z	2019-11-22T14:59:29Z	NONE	I've noticed that basically all my current troubles with xarray lead to this issue (lack of MultiIndex support). I use xarray for machine learning/data science/econometrics. My current problem requires a semi-hierarchical indexing on one of the dimensions, and slicing/aggregation along some levels of those dimensions. My first attempt was to just assume each dimension was orthogonal, which resulted in out-of-memory errors. I ended up using a MultiIndex for the hierarchy dimension to have a "dense" representation of a sparse subspace. Unfortunately, currently `.sel()` and such will cut out MultiIndex dimensions, and I've had to do boolean masking to keep all the dimensions I need. Multidimensional groupby, especially within the MultiIndex, is a headache as it currently stands. I had to resort to making auxilliary dimensions with one-hot encoded levels (dummy variables) and doing multiply-aggregate operations by hand. `xarray` is really beautiful and should be used more by data scientists, but it's really difficult to recommend it to colleagues when not all the familiar `pandas`-style operations are supported.	{ "total_count": 2, "+1": 2, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
491229992	https://github.com/pydata/xarray/issues/1603#issuecomment-491229992	https://api.github.com/repos/pydata/xarray/issues/1603	MDEyOklzc3VlQ29tbWVudDQ5MTIyOTk5Mg==	aldanor 2418513	2019-05-10T09:47:39Z	2019-05-10T09:47:39Z	NONE	There's now a good few dozen issues that reference this PR. Wondering if there's any particular help needed (in the form of coding, discussion, or any other fashion), so as to try and speed it up and unblock those issues? (I'm personally interested in resolving problems like #934 myself - allowing selection on non-dim coords, which seems to be a major hassle for a lot of use cases.)	{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978
340005903	https://github.com/pydata/xarray/issues/1603#issuecomment-340005903	https://api.github.com/repos/pydata/xarray/issues/1603	MDEyOklzc3VlQ29tbWVudDM0MDAwNTkwMw==	jjpr-mit 25231875	2017-10-27T15:34:42Z	2017-10-27T15:34:42Z	NONE	Will the new API preserve the order of the levels? One of the features that's necessary for `MultiIndex` to be truly hierarchical is that there is a defined order to the levels.	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }	Explicit indexes in xarray's data-model (Future of MultiIndex) 262642978

Advanced export

JSON shape: default, array, newline-delimited, object

CREATE TABLE [issue_comments] (
   [html_url] TEXT,
   [issue_url] TEXT,
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [created_at] TEXT,
   [updated_at] TEXT,
   [author_association] TEXT,
   [body] TEXT,
   [reactions] TEXT,
   [performed_via_github_app] TEXT,
   [issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
    ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
    ON [issue_comments] ([user]);

issue_comments

11 rows where author_association = "NONE" and issue = 262642978 sorted by updated_at descending

Let us define coordinate an array in data set cause co-indexed when we index its data set. It means that:

Use dims to determined the way how other array of the data set will be co-indexed.

Some compatibility issues:

Advanced export