issue_comments
13 rows where issue = 942738904 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: reactions, created_at (date), updated_at (date)
issue 1
- Decoding netCDF is giving incorrect values for a large file · 13 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
1492937244 | https://github.com/pydata/xarray/issues/5597#issuecomment-1492937244 | https://api.github.com/repos/pydata/xarray/issues/5597 | IC_kwDOAMm_X85Y_Goc | kmuehlbauer 5821660 | 2023-04-01T11:03:02Z | 2023-04-01T11:03:02Z | MEMBER |
This is aimed at in #7654 |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
879561954 | https://github.com/pydata/xarray/issues/5597#issuecomment-879561954 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3OTU2MTk1NA== | shoyer 1217238 | 2021-07-14T03:43:37Z | 2021-07-14T03:44:00Z | MEMBER | Thanks for sharing the subset netCDF file, that is very helpful for debugging indeed! The weird thing is that the dtype picking logic seems to have a special case that, per the code comment, suggesting we want to be using float64 here: https://github.com/pydata/xarray/blob/eea76733770be03e78a0834803291659136bca31/xarray/coding/variables.py#L231-L238 But in fact, the dtype picking logic doesn't do that, because the dtype is already converted into float32, first. The culprit seems to be this line in CFMaskCoder, which promotes the dtype to float32 to fit a fill-value of NaN: https://github.com/pydata/xarray/blob/eea76733770be03e78a0834803291659136bca31/xarray/coding/variables.py#L202 To fix this, I think logic in |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
879554360 | https://github.com/pydata/xarray/issues/5597#issuecomment-879554360 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3OTU1NDM2MA== | ohsqueezy 1373406 | 2021-07-14T03:19:53Z | 2021-07-14T03:19:53Z | NONE | That explains it to me! Not sure if it's still useful but I exported the subset as a netCDF file. ```python In [59]: packed_vals = xarray.open_dataset("packed_solar_data_subset.nc", mask_and_scale=False).ssrd.values In [60]: packed_vals[0] * numpy.float32(e["scale_factor"]) + numpy.float32(e["add_offset"]) In [61]: packed_vals[0] * numpy.float64(e["scale_factor"]) + numpy.float64(e["add_offset"]) |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
879361320 | https://github.com/pydata/xarray/issues/5597#issuecomment-879361320 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3OTM2MTMyMA== | shoyer 1217238 | 2021-07-13T19:58:39Z | 2021-07-13T19:58:39Z | MEMBER | This may just be the expected floating point error from using float32: ``` In [5]: import numpy as np In [6]: -32766 * np.float32(625.6492454183389) + np.float32(20500023.17537729) Out[6]: 1.2984619140625 ``` If you use full float64, then the data does decode to 0.0:
So the question then is why this ends up being decoded using float32 instead of float64, and if that logic should be adjusted or made customizable: https://github.com/pydata/xarray/blob/eea76733770be03e78a0834803291659136bca31/xarray/coding/variables.py#L225 |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
879282134 | https://github.com/pydata/xarray/issues/5597#issuecomment-879282134 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3OTI4MjEzNA== | ohsqueezy 1373406 | 2021-07-13T17:49:09Z | 2021-07-13T17:49:09Z | NONE | sure, no prob
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
879199563 | https://github.com/pydata/xarray/issues/5597#issuecomment-879199563 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3OTE5OTU2Mw== | keewis 14808389 | 2021-07-13T15:45:05Z | 2021-07-13T15:45:05Z | MEMBER | can you post the value of |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
878920991 | https://github.com/pydata/xarray/issues/5597#issuecomment-878920991 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3ODkyMDk5MQ== | ohsqueezy 1373406 | 2021-07-13T09:16:03Z | 2021-07-13T09:16:03Z | NONE | h5netcdf seems to be a separate issue for me as it gives me the error
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
878914016 | https://github.com/pydata/xarray/issues/5597#issuecomment-878914016 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3ODkxNDAxNg== | kmuehlbauer 5821660 | 2021-07-13T09:07:14Z | 2021-07-13T09:07:14Z | MEMBER | @ohsqueezy You might also try |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
878910801 | https://github.com/pydata/xarray/issues/5597#issuecomment-878910801 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3ODkxMDgwMQ== | ohsqueezy 1373406 | 2021-07-13T09:03:04Z | 2021-07-13T09:03:04Z | NONE | Thanks for your help! I checked using the netCDF4 module, and the data is returned correctly ```python $ d = netCDF4.Dataset("BIG_FILE_packed.nc") $ d["ssrd"][d["time"][:] < d["time"][24], d["latitude"][:] == 44.8, d["longitude"][:] == 287.1]
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
878833159 | https://github.com/pydata/xarray/issues/5597#issuecomment-878833159 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3ODgzMzE1OQ== | max-sixty 5635139 | 2021-07-13T07:03:43Z | 2021-07-13T07:03:43Z | MEMBER | Thanks. Does passing different values to |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
878830308 | https://github.com/pydata/xarray/issues/5597#issuecomment-878830308 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3ODgzMDMwOA== | kmuehlbauer 5821660 | 2021-07-13T06:58:24Z | 2021-07-13T06:58:24Z | MEMBER | @ohsqueezy Does this issue also show up, if just plain netCDF4 is used to open the files? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
878824626 | https://github.com/pydata/xarray/issues/5597#issuecomment-878824626 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3ODgyNDYyNg== | ohsqueezy 1373406 | 2021-07-13T06:46:55Z | 2021-07-13T06:46:55Z | NONE | That example is actually a different file than the original. I unpacked the original file externally using In all the examples, the data is the same time and location, so they should be the same values outside of whatever is lost from compressing to int16 and decompressing, and the output arrays are from selecting a single day (24 hours) at a single location from the dataset returned by So actually there are three files I've tested with, all of which should have the same data (assuming the issue isn't with how the files are built, which could be the case): |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 | |
878792388 | https://github.com/pydata/xarray/issues/5597#issuecomment-878792388 | https://api.github.com/repos/pydata/xarray/issues/5597 | MDEyOklzc3VlQ29tbWVudDg3ODc5MjM4OA== | max-sixty 5635139 | 2021-07-13T05:32:27Z | 2021-07-13T05:32:27Z | MEMBER | A small question to clarify:
Is that the output of the |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
Decoding netCDF is giving incorrect values for a large file 942738904 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 5