issue_comments
8 rows where issue = 1295939038 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: reactions, created_at (date), updated_at (date)
issue 1
- simple groupby_bins 10x slower than numpy · 8 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
1177245770 | https://github.com/pydata/xarray/issues/6758#issuecomment-1177245770 | https://api.github.com/repos/pydata/xarray/issues/6758 | IC_kwDOAMm_X85GK1hK | vnoel 731499 | 2022-07-07T08:26:26Z | 2022-07-07T08:26:26Z | CONTRIBUTOR | @dcherian Just to be complete, I thought the following one-liner would work as well:
but apparently it produces slightly different results for reasons I don't understand |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
simple groupby_bins 10x slower than numpy 1295939038 | |
1177163992 | https://github.com/pydata/xarray/issues/6758#issuecomment-1177163992 | https://api.github.com/repos/pydata/xarray/issues/6758 | IC_kwDOAMm_X85GKhjY | vnoel 731499 | 2022-07-07T06:53:52Z | 2022-07-07T06:53:52Z | CONTRIBUTOR | { "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
simple groupby_bins 10x slower than numpy 1295939038 | ||
1177106457 | https://github.com/pydata/xarray/issues/6758#issuecomment-1177106457 | https://api.github.com/repos/pydata/xarray/issues/6758 | IC_kwDOAMm_X85GKTgZ | kmuehlbauer 5821660 | 2022-07-07T05:44:41Z | 2022-07-07T05:44:41Z | MEMBER | I'm getting a bit off topic now, but ... @dcherian Thanks for bringing back fond memories of the past. I still have @davidwfanning's IDL books on the shelf. And for sure it was a great pleasure reading @jdtsmith's IDL tricks and trying to understand those helped a lot. Great stuff. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
simple groupby_bins 10x slower than numpy 1295939038 | |
1176918900 | https://github.com/pydata/xarray/issues/6758#issuecomment-1176918900 | https://api.github.com/repos/pydata/xarray/issues/6758 | IC_kwDOAMm_X85GJlt0 | dcherian 2448579 | 2022-07-07T01:03:17Z | 2022-07-07T01:03:17Z | MEMBER | { "total_count": 2, "+1": 2, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
simple groupby_bins 10x slower than numpy 1295939038 | ||
1176777842 | https://github.com/pydata/xarray/issues/6758#issuecomment-1176777842 | https://api.github.com/repos/pydata/xarray/issues/6758 | IC_kwDOAMm_X85GJDRy | vnoel 731499 | 2022-07-06T21:40:37Z | 2022-07-06T21:40:37Z | CONTRIBUTOR | @dcherian I just tested your numpy suggestions, and I'm getting 100x speedups compared to my naive numpy approach (~200µs vs ~20ms). Thankyouthankyouthankyou! I've been doing this for years, I can't believe I've never run into that particular solution. It's like the IDL histogram function but in numpy. I'm going to use this like crazy Thanks again |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
simple groupby_bins 10x slower than numpy 1295939038 | |
1176730713 | https://github.com/pydata/xarray/issues/6758#issuecomment-1176730713 | https://api.github.com/repos/pydata/xarray/issues/6758 | IC_kwDOAMm_X85GI3xZ | dcherian 2448579 | 2022-07-06T20:54:23Z | 2022-07-06T20:54:23Z | MEMBER | Yes that's right. For this simple problem you could combine
And then wrap this using apply_ufunc. See https://github.com/ml31415/numpy-groupies/blob/412be938dcdfd74c6d673dd29012d18dc25dc94f/numpy_groupies/aggregate_numpy.py#L8-L28 for inspiration. |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
simple groupby_bins 10x slower than numpy 1295939038 | |
1176701867 | https://github.com/pydata/xarray/issues/6758#issuecomment-1176701867 | https://api.github.com/repos/pydata/xarray/issues/6758 | IC_kwDOAMm_X85GIwur | vnoel 731499 | 2022-07-06T20:37:12Z | 2022-07-06T20:37:12Z | CONTRIBUTOR | @dcherian this means that xarray's groupby_bins will always be slow unless flox is installed, correct? I have unfortunately little or no say on what packages are installed on the system that runs my code. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
simple groupby_bins 10x slower than numpy 1295939038 | |
1176441916 | https://github.com/pydata/xarray/issues/6758#issuecomment-1176441916 | https://api.github.com/repos/pydata/xarray/issues/6758 | IC_kwDOAMm_X85GHxQ8 | dcherian 2448579 | 2022-07-06T16:40:47Z | 2022-07-06T16:40:47Z | MEMBER | On xarray main with flox installed: ``` python import numpy as np import xarray as xr display(xr.version) N = 3728 ds = xr.Dataset() ds["latitude"] = ("x", 0 + 20 * np.random.standard_normal(N)) ds["data"] = ("x", 0 + 100 * np.random.standard_normal(N)) %timeit ds.groupby_bins("latitude", np.arange(-40, 40, 0.1)).sum() ``` 50.3 ms ± 203 µs per loop (mean ± std. dev. of 7 runs, 10 loops each) You could try it on our pre-release (https://docs.xarray.dev/en/latest/whats-new.html#v2022-06-0rc0-9-june-2022) or use xhistogram which should be faster even. |
{ "total_count": 1, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 1, "rocket": 0, "eyes": 0 } |
simple groupby_bins 10x slower than numpy 1295939038 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 3