issue_comments
7 rows where issue = 166439490 and user = 6213168 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: created_at (date), updated_at (date)
issue 1
- unstack() sorts data alphabetically · 7 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
269479071 | https://github.com/pydata/xarray/issues/906#issuecomment-269479071 | https://api.github.com/repos/pydata/xarray/issues/906 | MDEyOklzc3VlQ29tbWVudDI2OTQ3OTA3MQ== | crusaderky 6213168 | 2016-12-28T13:46:19Z | 2016-12-28T13:46:19Z | MEMBER | @shoyer, are you happy for me to go ahead and change unstack() to respect the order of the first found series? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
unstack() sorts data alphabetically 166439490 | |
234687071 | https://github.com/pydata/xarray/issues/906#issuecomment-234687071 | https://api.github.com/repos/pydata/xarray/issues/906 | MDEyOklzc3VlQ29tbWVudDIzNDY4NzA3MQ== | crusaderky 6213168 | 2016-07-23T00:27:49Z | 2016-07-23T00:27:49Z | MEMBER | Thanks, didn't know https://gist.github.com/crusaderky/002ba64ee270164931d32ea3366dce1f |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
unstack() sorts data alphabetically 166439490 | |
234686438 | https://github.com/pydata/xarray/issues/906#issuecomment-234686438 | https://api.github.com/repos/pydata/xarray/issues/906 | MDEyOklzc3VlQ29tbWVudDIzNDY4NjQzOA== | crusaderky 6213168 | 2016-07-23T00:20:41Z | 2016-07-23T00:20:41Z | MEMBER | Fixed in attachment. The code uses the first found series as the order. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
unstack() sorts data alphabetically 166439490 | |
234004910 | https://github.com/pydata/xarray/issues/906#issuecomment-234004910 | https://api.github.com/repos/pydata/xarray/issues/906 | MDEyOklzc3VlQ29tbWVudDIzNDAwNDkxMA== | crusaderky 6213168 | 2016-07-20T16:33:15Z | 2016-07-20T16:33:15Z | MEMBER | I see. I'll see if I can think a good way to cope with your two examples. BTW, my code above is buggy as it blindly assumes that the first dim is also the outermost. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
unstack() sorts data alphabetically 166439490 | |
233904555 | https://github.com/pydata/xarray/issues/906#issuecomment-233904555 | https://api.github.com/repos/pydata/xarray/issues/906 | MDEyOklzc3VlQ29tbWVudDIzMzkwNDU1NQ== | crusaderky 6213168 | 2016-07-20T09:52:42Z | 2016-07-20T09:52:42Z | MEMBER | This preamble should be integrated inside unstack(): ``` python import operator from functools import reduce def proper_unstack(array, dim):
proper_unstack(a, 'dim_0') ```
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
unstack() sorts data alphabetically 166439490 | |
233888081 | https://github.com/pydata/xarray/issues/906#issuecomment-233888081 | https://api.github.com/repos/pydata/xarray/issues/906 | MDEyOklzc3VlQ29tbWVudDIzMzg4ODA4MQ== | crusaderky 6213168 | 2016-07-20T08:42:19Z | 2016-07-20T08:42:19Z | MEMBER | the order of appearance should be what dictates the output.
Not true. Using the order of appearance requires you to do a pick-by-index on the index. At the moment, you're doing a pick-by-index on the data. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
unstack() sorts data alphabetically 166439490 | |
233794061 | https://github.com/pydata/xarray/issues/906#issuecomment-233794061 | https://api.github.com/repos/pydata/xarray/issues/906 | MDEyOklzc3VlQ29tbWVudDIzMzc5NDA2MQ== | crusaderky 6213168 | 2016-07-19T23:11:57Z | 2016-07-19T23:11:57Z | MEMBER | this workaround works:
However, I think that the whole thing is incredibly convoluted. Namely, because everything looks good both if you visualize the original pandas Series/DataFrame, as well as the stacked DataArray. unstack() is causing an internal technicality of pandas to produce real change in the data. I came through this issue because I am using pandas to load a multi-index CSV from disk, and then convert it to a n-dimensional xarray. In this situation, I have no control over the multiindex - short of manually rebuilding it after the CSV load. The pandas dataframe looks right, the stacked xarray looks right, the unstacked xarray gets magically sorted :$ Also I don't understand why you say there's no performance implications. You're basically doing a pick-by-index rebuild of the array, which does potentially random access to the whole input array - thus nullifying the benefits of the CPU cache. This is compared to a numpy.ndarray.reshape(), which has the cost of a memcpy(). I was going to add something about doing pick-by-index with a dask array will be even worse, when I realised that multiindex does not work at all when you chunk()... :( |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
unstack() sorts data alphabetically 166439490 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 1