issues: 440988633
This data as json
id | node_id | number | title | user | state | locked | assignee | milestone | comments | created_at | updated_at | closed_at | author_association | active_lock_reason | draft | pull_request | body | reactions | performed_via_github_app | state_reason | repo | type |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
440988633 | MDU6SXNzdWU0NDA5ODg2MzM= | 2943 | Rolling operations loose chunking with dask and bottleneck | 161133 | closed | 0 | 1 | 2019-05-07T01:52:05Z | 2019-05-07T02:01:13Z | 2019-05-07T02:01:13Z | CONTRIBUTOR | Code Sample, a copy-pastable example if possibleA "Minimal, Complete and Verifiable Example" will make it much easier for maintainers to help you: http://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports ```python import bottleneck import xarray import dask data = dask.array.ones((100,), chunks=(10,)) da = xarray.DataArray(data, dims=['time']) rolled = da.rolling(time=15).mean() Expect the 'rolled' dataset to be chunked approximately the same as 'data',however there is only one chunk in 'rolled' instead of 10assert len(rolled.chunks[0]) > 1 ``` Problem descriptionRolling operations loose chunking over the rolled dimension when using dask datasets with bottleneck installed, which is a problem for large datasets where we don't want to load the entire thing. The issue appears to be caused by Expected OutputChunks should be preserved through Output of
|
{ "url": "https://api.github.com/repos/pydata/xarray/issues/2943/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
completed | 13221727 | issue |