issues: 440988633
This data as json
| id | node_id | number | title | user | state | locked | assignee | milestone | comments | created_at | updated_at | closed_at | author_association | active_lock_reason | draft | pull_request | body | reactions | performed_via_github_app | state_reason | repo | type |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 440988633 | MDU6SXNzdWU0NDA5ODg2MzM= | 2943 | Rolling operations loose chunking with dask and bottleneck | 161133 | closed | 0 | 1 | 2019-05-07T01:52:05Z | 2019-05-07T02:01:13Z | 2019-05-07T02:01:13Z | CONTRIBUTOR | Code Sample, a copy-pastable example if possibleA "Minimal, Complete and Verifiable Example" will make it much easier for maintainers to help you: http://matthewrocklin.com/blog/work/2018/02/28/minimal-bug-reports ```python import bottleneck import xarray import dask data = dask.array.ones((100,), chunks=(10,)) da = xarray.DataArray(data, dims=['time']) rolled = da.rolling(time=15).mean() Expect the 'rolled' dataset to be chunked approximately the same as 'data',however there is only one chunk in 'rolled' instead of 10assert len(rolled.chunks[0]) > 1 ``` Problem descriptionRolling operations loose chunking over the rolled dimension when using dask datasets with bottleneck installed, which is a problem for large datasets where we don't want to load the entire thing. The issue appears to be caused by Expected OutputChunks should be preserved through Output of
|
{
"url": "https://api.github.com/repos/pydata/xarray/issues/2943/reactions",
"total_count": 0,
"+1": 0,
"-1": 0,
"laugh": 0,
"hooray": 0,
"confused": 0,
"heart": 0,
"rocket": 0,
"eyes": 0
} |
completed | 13221727 | issue |