issues: 103966799
id | node_id | number | title | user | state | locked | assignee | milestone | comments | created_at | updated_at | closed_at | author_association | active_lock_reason | draft | pull_request | body | reactions | performed_via_github_app | state_reason | repo | type |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
103966799 | MDExOlB1bGxSZXF1ZXN0NDM3MjQ0Njk= | 555 | Use deterministic names for dask arrays from open_dataset | 1217238 | closed | 0 | 1307323 | 0 | 2015-08-31T00:50:23Z | 2015-10-21T07:05:47Z | 2015-09-14T20:33:16Z | MEMBER | 0 | pydata/xarray/pulls/555 | This will allow xray users to take advantage of dask's nascent support for caching intermediate results (https://github.com/blaze/dask/pull/502). For example: ``` In [1]: import xray In [2]: from dask.diagnostics.cache import Cache In [3]: c = Cache(5e7) In [4]: c.register() In [5]: ds = xray.open_mfdataset('/Users/shoyer/data/era-interim/2t/2014-*.nc', engine='scipy') In [6]: %time ds.sum().load() CPU times: user 2.72 s, sys: 2.7 s, total: 5.41 s Wall time: 3.85 s Out[6]: <xray.Dataset> Dimensions: () Coordinates: empty Data variables: t2m float64 5.338e+10 In [7]: %time ds.mean().load() CPU times: user 5.31 s, sys: 1.86 s, total: 7.17 s Wall time: 1.81 s Out[7]: <xray.Dataset> Dimensions: () Coordinates: empty Data variables: t2m float64 279.0 In [8]: %time ds.mean().load() CPU times: user 7.73 ms, sys: 2.73 ms, total: 10.5 ms Wall time: 8.45 ms Out[8]: <xray.Dataset> Dimensions: () Coordinates: empty Data variables: t2m float64 279.0 ``` Still needs docs (probably in the dask section) and a what's new item. Also, this will update the minimum required version of dask to 0.7 (which should be called out in docs). |
{ "url": "https://api.github.com/repos/pydata/xarray/issues/555/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
13221727 | pull |
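
The PR body above says that deterministic names let dask's cache reuse intermediate results, but it does not show how such a name might be derived. The following is a minimal sketch of the idea using only the standard library. It assumes the name is built by hashing the file's absolute path and modification time. The helpers `deterministic_name` and `cached_compute` are hypothetical illustrations, not xray or dask functions.

```python
import hashlib
import os


def deterministic_name(path, prefix="open_dataset"):
    """Hypothetical helper: build a stable name from a file's path and mtime.

    The same unmodified file always yields the same name, so a cache keyed
    on that name can recognize repeated work. Touching the file changes the
    mtime and therefore the name, which invalidates stale entries.
    """
    key = (os.path.abspath(path), os.path.getmtime(path))
    token = hashlib.md5(repr(key).encode("utf-8")).hexdigest()
    return "{}-{}".format(prefix, token)


# Toy stand-in for a results cache keyed by task name (assumption: a plain
# dict is enough to show the effect; dask's Cache is more sophisticated).
_results = {}


def cached_compute(name, func):
    """Run func() only if no result is stored under this name yet."""
    if name not in _results:
        _results[name] = func()
    return _results[name]
```

With random names, opening the same file twice would produce two different keys and the second computation could never hit the cache. With names derived from the file's identity, the second call finds the stored result, which is consistent with the drop in wall time between the two `ds.mean().load()` calls shown in the PR body.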