home / github

Menu
  • Search all tables
  • GraphQL API

issue_comments

Table actions
  • GraphQL API for issue_comments

8 rows where author_association = "MEMBER" and issue = 168272291 sorted by updated_at descending

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: reactions, created_at (date), updated_at (date)

user 4

  • shoyer 5
  • rabernat 1
  • jhamman 1
  • benbovy 1

issue 1

  • WIP: progress toward making groupby work with multiple arguments · 8 ✖

author_association 1

  • MEMBER · 8 ✖
id html_url issue_url node_id user created_at updated_at ▲ author_association body reactions performed_via_github_app issue
371866327 https://github.com/pydata/xarray/pull/924#issuecomment-371866327 https://api.github.com/repos/pydata/xarray/issues/924 MDEyOklzc3VlQ29tbWVudDM3MTg2NjMyNw== benbovy 4160723 2018-03-09T16:38:32Z 2018-03-09T16:38:32Z MEMBER

I'd very happy to start working on #1603, but unfortunately I wish I could have more time for this at the moment (and before this, it is also probably a good idea that I continue the work in #1820, which has stalled for quite a while).

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: progress toward making groupby work with multiple arguments 168272291
371672217 https://github.com/pydata/xarray/pull/924#issuecomment-371672217 https://api.github.com/repos/pydata/xarray/issues/924 MDEyOklzc3VlQ29tbWVudDM3MTY3MjIxNw== shoyer 1217238 2018-03-09T00:27:32Z 2018-03-09T00:27:32Z MEMBER

@pwolfram Personally, I am waiting to implement this until after https://github.com/pydata/xarray/issues/1603. I'm pretty sure that the alternative model for indexes proposed there will make this far easier to implement.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: progress toward making groupby work with multiple arguments 168272291
315174359 https://github.com/pydata/xarray/pull/924#issuecomment-315174359 https://api.github.com/repos/pydata/xarray/issues/924 MDEyOklzc3VlQ29tbWVudDMxNTE3NDM1OQ== jhamman 2443309 2017-07-13T19:11:38Z 2017-07-13T19:11:38Z MEMBER

@chunweiyuan - yes, you are welcome to give this a shot.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: progress toward making groupby work with multiple arguments 168272291
286517390 https://github.com/pydata/xarray/pull/924#issuecomment-286517390 https://api.github.com/repos/pydata/xarray/issues/924 MDEyOklzc3VlQ29tbWVudDI4NjUxNzM5MA== shoyer 1217238 2017-03-14T18:31:12Z 2017-03-14T18:31:12Z MEMBER

I don't have any progress to report since my last comment.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: progress toward making groupby work with multiple arguments 168272291
280441902 https://github.com/pydata/xarray/pull/924#issuecomment-280441902 https://api.github.com/repos/pydata/xarray/issues/924 MDEyOklzc3VlQ29tbWVudDI4MDQ0MTkwMg== shoyer 1217238 2017-02-16T20:00:44Z 2017-02-16T20:00:44Z MEMBER

@RafalSkolasinski Sure, here is the current list:

  1. The implementation should probably switch to use set_index() (after stacking) to combine groupby arguments into a single MultiIndex, instead of the current version which constructs the MultiIndex explicitly in GroupBy.__init__, duplicating a lot of the set_index code.
  2. We need to consider how to handle multiple group-by arguments along orthogonal dimensions. For example, consider the 4x4 array from the gist at the top of this PR where grouping is done over 2x2 blocks. With the current PR, the array is flattened, so iteration is done over flat arrays of length 4, but we probably want to actually iterate over 2x2 arrays that match the shape of the original data. This could be done either by stacking/unstacking at each iteration step (slow, but would work) or by making groupby truly do indexing and concatenation over multiple dimensions at once (certainly faster and more elegant, but significantly more work to implement).
  3. The change also needs to be rebased to account for changes on master, but by the time you account for the above changes there may not be much of the current version remaining.
  4. We also need comprehensive test coverage for the desired behavior.
{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: progress toward making groupby work with multiple arguments 168272291
279100389 https://github.com/pydata/xarray/pull/924#issuecomment-279100389 https://api.github.com/repos/pydata/xarray/issues/924 MDEyOklzc3VlQ29tbWVudDI3OTEwMDM4OQ== shoyer 1217238 2017-02-11T00:08:18Z 2017-02-11T00:08:18Z MEMBER

@RafalSkolasinski This pull request was mostly working, but still needs some significant work to clean it up and update it to the current version of the codebase.

I don't think anyone is working on it currently (I'm not) but I'm sure someone will get to it eventually.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: progress toward making groupby work with multiple arguments 168272291
236402472 https://github.com/pydata/xarray/pull/924#issuecomment-236402472 https://api.github.com/repos/pydata/xarray/issues/924 MDEyOklzc3VlQ29tbWVudDIzNjQwMjQ3Mg== shoyer 1217238 2016-07-31T01:49:15Z 2016-07-31T01:49:15Z MEMBER

@rabernat I updated the top post with examples. So yes, for your example, the coordinates of the output would have every unique combination of x and y. More generally, something like ds.groupby(['x', 'y']).mean() will be equivalent to ds.mean('z') for a dataset with dimensions (x, y, z). I think this is the only sane way to define these grouped operations.

Once we figure out squeezing out grouped/stacked dimensions (not quite working yet), this will let us write things like ds.groupby(['x', 'y']).apply(calculate_trend) or better yet with group_over, ds.group_over('time').apply(calculate_trend).

{
    "total_count": 1,
    "+1": 1,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: progress toward making groupby work with multiple arguments 168272291
236358965 https://github.com/pydata/xarray/pull/924#issuecomment-236358965 https://api.github.com/repos/pydata/xarray/issues/924 MDEyOklzc3VlQ29tbWVudDIzNjM1ODk2NQ== rabernat 1197350 2016-07-30T10:54:50Z 2016-07-30T10:54:50Z MEMBER

This looks like a really useful addition. It would be useful for me to have an example of how this is supposed to work. The tests are a starting point, but perhaps kind of trivial cases.

If I create the following 2D dataset:

python ds = xr.Dataset({'foo': (['x','y'], np.random.rand(2,4))})

and then do

python ds.groupby(['x','y']).sum()

What should the coordinates of the output be? Every unique combination of x and y?

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: progress toward making groupby work with multiple arguments 168272291

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE [issue_comments] (
   [html_url] TEXT,
   [issue_url] TEXT,
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [created_at] TEXT,
   [updated_at] TEXT,
   [author_association] TEXT,
   [body] TEXT,
   [reactions] TEXT,
   [performed_via_github_app] TEXT,
   [issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
    ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
    ON [issue_comments] ([user]);
Powered by Datasette · Queries took 11.164ms · About: xarray-datasette