issue_comments
14 rows where issue = 1475567394 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: reactions, created_at (date), updated_at (date)
issue 1
- Avoid loading entire dataset by getting the nbytes in an array · 14 ✖
| id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1474176353 | https://github.com/pydata/xarray/pull/7356#issuecomment-1474176353 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85X3iVh | dcherian 2448579 | 2023-03-17T17:30:51Z | 2023-03-17T17:31:22Z | MEMBER | Because we have lazy data reading functionality ```python import xarray as xr ds = xr.tutorial.open_dataset("air_temperature") var = ds.air.variable print(type(var._data)) # memory cached array print(type(var._data.array.array)) # ah that's wrapping a lazy array, no data read in yet print(var._data.size) # can access size print(type(var._data.array.array)) # still a lazy array .data forces a disk loadprint(type(var.data)) # oops disk-load print(type(var._data)) # "still memory cached array" print(type(var._data.array.array)) # but that's wrapping numpy data in memory ```
|
{
"total_count": 2,
"+1": 2,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1474149056 | https://github.com/pydata/xarray/pull/7356#issuecomment-1474149056 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85X3brA | TomNicholas 35968931 | 2023-03-17T17:10:44Z | 2023-03-17T17:10:44Z | MEMBER | This came up in the xarray office hours today, and I'm confused why this PR made any difference to the behavior at all? The |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1362322800 | https://github.com/pydata/xarray/pull/7356#issuecomment-1362322800 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85RM2Vw | hmaarrfk 90008 | 2022-12-22T02:40:59Z | 2022-12-22T02:40:59Z | CONTRIBUTOR | Any chance of a release, this is quite breaking for large datasets that can only be out of memory. |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1346924547 | https://github.com/pydata/xarray/pull/7356#issuecomment-1346924547 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85QSHAD | hmaarrfk 90008 | 2022-12-12T17:27:47Z | 2022-12-12T17:27:47Z | CONTRIBUTOR | 👍🏾 |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1339624818 | https://github.com/pydata/xarray/pull/7356#issuecomment-1339624818 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85P2Q1y | hmaarrfk 90008 | 2022-12-06T16:19:19Z | 2022-12-06T16:19:19Z | CONTRIBUTOR | Yes, without chunks of anything |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1339624418 | https://github.com/pydata/xarray/pull/7356#issuecomment-1339624418 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85P2Qvi | hmaarrfk 90008 | 2022-12-06T16:18:59Z | 2022-12-06T16:18:59Z | CONTRIBUTOR | Very smart test! |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1339575144 | https://github.com/pydata/xarray/pull/7356#issuecomment-1339575144 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85P2Eto | Illviljan 14371165 | 2022-12-06T15:44:01Z | 2022-12-06T15:44:01Z | MEMBER | I'm not really opposed to this change, shape and dtype uses Without using This test just looked so similar to the tests in #6797. I think you can do a similar lazy test taking inspiration from: https://github.com/pydata/xarray/blob/ed60c6ccd3d6725cd91190b8796af4355f3085c2/xarray/tests/test_formatting.py#L715-L727 |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1339457617 | https://github.com/pydata/xarray/pull/7356#issuecomment-1339457617 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85P1oBR | hmaarrfk 90008 | 2022-12-06T14:18:11Z | 2022-12-06T14:18:11Z | CONTRIBUTOR | The data is loaded from an NetCDF store through open_dataset |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1339452942 | https://github.com/pydata/xarray/pull/7356#issuecomment-1339452942 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85P1m4O | hmaarrfk 90008 | 2022-12-06T14:14:57Z | 2022-12-06T14:14:57Z | CONTRIBUTOR | No explicit test was added to ensure that the data wasn't loaded. I just experienced this bug enough (we would accidentally load 100GB files in our code base) that I knew exactly how to fix it. If you want i can add a test to ensure that future optimizations to nbytes do not trigger a data load. I was hoping the 1 line fix would be a shoe in. |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1339423992 | https://github.com/pydata/xarray/pull/7356#issuecomment-1339423992 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85P1fz4 | Illviljan 14371165 | 2022-12-06T13:53:03Z | 2022-12-06T13:53:03Z | MEMBER | Is that test targetting your issue with RAM crashing the laptop? Shouldn't there be some check if the values were loaded? How did you import your data? self.data looks like this: https://github.com/pydata/xarray/blob/ed60c6ccd3d6725cd91190b8796af4355f3085c2/xarray/core/variable.py#L420-L435 I was expecting your data to be a duck_array? |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1336731702 | https://github.com/pydata/xarray/pull/7356#issuecomment-1336731702 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85PrOg2 | hmaarrfk 90008 | 2022-12-05T04:20:08Z | 2022-12-05T04:20:08Z | CONTRIBUTOR | It seems that checking hasattr on the |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1336711830 | https://github.com/pydata/xarray/pull/7356#issuecomment-1336711830 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85PrJqW | hmaarrfk 90008 | 2022-12-05T03:58:50Z | 2022-12-05T03:58:50Z | CONTRIBUTOR | I think that at the very lease, the current implementation works as well as the old one for arrays that are defined by the |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1336700669 | https://github.com/pydata/xarray/pull/7356#issuecomment-1336700669 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85PrG79 | hmaarrfk 90008 | 2022-12-05T03:36:31Z | 2022-12-05T03:36:31Z | CONTRIBUTOR | Looking into the history a little more. I seem to be proposing to revert: https://github.com/pydata/xarray/commit/60f8c3d3488d377b0b21009422c6121e1c8f1f70 I think this is important since many users have arrays that are larger than memory. For me, I found this bug when trying to access the number of bytes in a 16GB dataset that I'm trying to load on my wimpy laptop. Not fun to start swapping. I feel like others might be hitting this too. xref: https://github.com/pydata/xarray/pull/6797 https://github.com/pydata/xarray/issues/4842 |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 | |
| 1336696899 | https://github.com/pydata/xarray/pull/7356#issuecomment-1336696899 | https://api.github.com/repos/pydata/xarray/issues/7356 | IC_kwDOAMm_X85PrGBD | hmaarrfk 90008 | 2022-12-05T03:30:31Z | 2022-12-05T03:30:31Z | CONTRIBUTOR | I personally do not even think the |
{
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
Avoid loading entire dataset by getting the nbytes in an array 1475567394 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] (
[html_url] TEXT,
[issue_url] TEXT,
[id] INTEGER PRIMARY KEY,
[node_id] TEXT,
[user] INTEGER REFERENCES [users]([id]),
[created_at] TEXT,
[updated_at] TEXT,
[author_association] TEXT,
[body] TEXT,
[reactions] TEXT,
[performed_via_github_app] TEXT,
[issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
ON [issue_comments] ([user]);
user 4