home / github

Menu
  • Search all tables
  • GraphQL API

issue_comments

Table actions
  • GraphQL API for issue_comments

11 rows where issue = 285294506 sorted by updated_at descending

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: created_at (date), updated_at (date)

user 3

  • AlexeyPechnikov 6
  • shoyer 4
  • stale[bot] 1

author_association 2

  • NONE 7
  • MEMBER 4

issue 1

  • Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) · 11 ✖
id html_url issue_url node_id user created_at updated_at ▲ author_association body reactions performed_via_github_app issue
561264615 https://github.com/pydata/xarray/issues/1805#issuecomment-561264615 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDU2MTI2NDYxNQ== stale[bot] 26384082 2019-12-03T17:09:29Z 2019-12-03T17:09:29Z NONE

In order to maintain a list of currently relevant issues, we mark issues as stale after a period of inactivity

If this issue remains relevant, please comment here or remove the stale label; otherwise it will be marked as closed automatically

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354810393 https://github.com/pydata/xarray/issues/1805#issuecomment-354810393 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDgxMDM5Mw== shoyer 1217238 2018-01-02T16:41:46Z 2018-01-02T16:41:46Z MEMBER

@mobigroup copy(deep=True) makes an independent copy but again (as of #1024) does not load data into memory. You need to use .compute() or .load() to load the dask arrays into memory.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354709385 https://github.com/pydata/xarray/issues/1805#issuecomment-354709385 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDcwOTM4NQ== AlexeyPechnikov 7342379 2018-01-02T06:33:38Z 2018-01-02T06:34:15Z NONE

And in-memory dataarray couldn't be modified too: import xarray as xr ds = xr.open_mfdataset('./nctiles_monthly/TFLUX/TFLUX*.nc',concat_dim='tile') TFLUX = ds.TFLUX.copy(deep=True) TFLUX.values[1,1,1] = 0*TFLUX.values[1,1,1] TFLUX.values[1,1,1] array([ 60.46262741, 57.78710175, 56.33338547, 56.44807434, ..., 76.84630585, 74.65901184], dtype=float32) But why?

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354707556 https://github.com/pydata/xarray/issues/1805#issuecomment-354707556 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDcwNzU1Ng== AlexeyPechnikov 7342379 2018-01-02T06:06:42Z 2018-01-02T06:06:42Z NONE

Is it possible to fix the "cache" option for open_mfdataset()? And raise error in other case.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354705941 https://github.com/pydata/xarray/issues/1805#issuecomment-354705941 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDcwNTk0MQ== shoyer 1217238 2018-01-02T05:40:38Z 2018-01-02T05:57:03Z MEMBER

OK, I can reproduce this. The problem is assigning to specific elements of .values when a DataArray's data is not stored as a NumPy array, e.g., if it is a dask array or one of xarray's lazy array classes (used internally for lazy access when opening netCDF files). In practice, this arises with open_mfdataset() or open_dataset(..., cache=False).

It's not entirely obvious how to fix this. Probably the simplest fix is to make .values a readonly numpy array in these cases, so that an error is raised in these cases. This would usually works, but could lead to issues with passing arrays to scikit-learn or pandas due to a Cython bug with read-only memoryview buffers (https://github.com/cython/cython/issues/1605).

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354706872 https://github.com/pydata/xarray/issues/1805#issuecomment-354706872 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDcwNjg3Mg== shoyer 1217238 2018-01-02T05:56:23Z 2018-01-02T05:56:33Z MEMBER

Hm, I tested with "cache=True" but it still doesn't work.

I meant open_dataset(cache=False) (it arises with either value for cache for open_mfdataset).

So old code is broken now.

Yes, this is an unfortunate/unintended consequence of https://github.com/pydata/xarray/pull/1024 ("Disable automatic cache with dask").

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354706693 https://github.com/pydata/xarray/issues/1805#issuecomment-354706693 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDcwNjY5Mw== AlexeyPechnikov 7342379 2018-01-02T05:53:26Z 2018-01-02T05:53:26Z NONE

In practice, this arises with open_mfdataset() or open_dataset() with cache=False.

Hm, I tested with "cache=True" but it still doesn't work.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354706420 https://github.com/pydata/xarray/issues/1805#issuecomment-354706420 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDcwNjQyMA== AlexeyPechnikov 7342379 2018-01-02T05:49:06Z 2018-01-02T05:49:06Z NONE

pip2 install xarray==0.8.2

``` import xarray as xr ds = xr.open_mfdataset('./nctiles_monthly/TFLUX/TFLUX.nc',concat_dim='tile') TFLUX = ds.TFLUX TFLUX.values[1,1,1] = 0TFLUX.values[1,1,1] TFLUX.values[1,1,1]

array([ 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., ..., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.], dtype=float32) ```

So old code is broken now.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354706237 https://github.com/pydata/xarray/issues/1805#issuecomment-354706237 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDcwNjIzNw== AlexeyPechnikov 7342379 2018-01-02T05:45:55Z 2018-01-02T05:45:55Z NONE

It's works correct for xarray 0.8.2...

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354705066 https://github.com/pydata/xarray/issues/1805#issuecomment-354705066 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDcwNTA2Ng== AlexeyPechnikov 7342379 2018-01-02T05:25:31Z 2018-01-02T05:25:31Z NONE

I use this dataset: ftp://mit.ecco-group.org/ecco_for_las/version_4/release2/nctiles_monthly/TFLUX My code: ``` import xarray as xr ds = xr.open_mfdataset('./nctiles_monthly/TFLUX/TFLUX.nc',concat_dim='tile') TFLUX = ds.TFLUX TFLUX.values[1,1,1] = 0TFLUX.values[1,1,1] TFLUX.values[1,1,1]

array([ 60.46262741, 57.78710175, 56.33338547, 56.44807434, ..., 76.84630585, 74.65901184], dtype=float32)

Also xarray.show_versions()

INSTALLED VERSIONS

commit: None python: 2.7.14.final.0 python-bits: 64 OS: Darwin OS-release: 17.3.0 machine: x86_64 processor: i386 byteorder: little LC_ALL: en_US.UTF-8 LANG: en_US.UTF-8 LOCALE: None.None

xarray: 0.10.0 pandas: 0.22.0 numpy: 1.13.3 scipy: 0.18.1 netCDF4: 1.2.6 h5netcdf: None Nio: None bottleneck: None cyordereddict: None dask: 0.16.0 matplotlib: 2.1.0 cartopy: None seaborn: None setuptools: 36.5.0 pip: 9.0.1 conda: None pytest: 3.2.3 IPython: 5.1.0 sphinx: None ```

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506
354674571 https://github.com/pydata/xarray/issues/1805#issuecomment-354674571 https://api.github.com/repos/pydata/xarray/issues/1805 MDEyOklzc3VlQ29tbWVudDM1NDY3NDU3MQ== shoyer 1217238 2018-01-01T20:28:59Z 2018-01-01T20:28:59Z MEMBER

Can you share full information for how you created this dataset? I cannot reproduce this: ``` In [36]: test = xr.DataArray(np.ones(10), ...: coords={ ...: 'x': range(10) ...: }, ...: dims=['x'] ...: ).chunk() ...:

In [37]: test Out[37]: <xarray.DataArray (x: 10)> dask.array<shape=(10,), dtype=float64, chunksize=(10,)> Coordinates: * x (x) int64 0 1 2 3 4 5 6 7 8 9

In [38]: test.values = 0 * test.values

In [39]: test.values Out[39]: array([ 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.]) ```

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Couldn't change values for on-disk datasets opened by open_mfdataset() (xarray > 0.8.2) 285294506

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE [issue_comments] (
   [html_url] TEXT,
   [issue_url] TEXT,
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [created_at] TEXT,
   [updated_at] TEXT,
   [author_association] TEXT,
   [body] TEXT,
   [reactions] TEXT,
   [performed_via_github_app] TEXT,
   [issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
    ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
    ON [issue_comments] ([user]);
Powered by Datasette · Queries took 14.72ms · About: xarray-datasette