id,node_id,number,title,user,state,locked,assignee,milestone,comments,created_at,updated_at,closed_at,author_association,active_lock_reason,draft,pull_request,body,reactions,performed_via_github_app,state_reason,repo,type
2045596856,I_kwDOAMm_X8557VS4,8555,Docs look odd in dark mode,306380,open,0,,,1,2023-12-18T02:31:26Z,2023-12-19T15:32:11Z,,MEMBER,,,,"### What happened?
### What did you expect to happen?
_No response_
### Minimal Complete Verifiable Example
_No response_
### MVCE confirmation
- [ ] Minimal example — the example is as focused as reasonably possible to demonstrate the underlying issue in xarray.
- [ ] Complete example — the example is self-contained, including all data and the text of any traceback.
- [ ] Verifiable example — the example copy & pastes into an IPython prompt or [Binder notebook](https://mybinder.org/v2/gh/pydata/xarray/main?urlpath=lab/tree/doc/examples/blank_template.ipynb), returning the result.
- [ ] New issue — a search of GitHub Issues suggests this is not a duplicate.
- [ ] Recent environment — the issue occurs with the latest version of xarray and its dependencies.
### Relevant log output
_No response_
### Anything else we need to know?
_No response_
### Environment
","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/8555/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,issue
1821467933,I_kwDOAMm_X85skWUd,8021,Specify chunks in bytes,306380,open,0,,,4,2023-07-26T02:29:43Z,2023-10-06T10:09:33Z,,MEMBER,,,,"### Is your feature request related to a problem?
I'm playing around with xarray performance and would like a way to easily tweak chunk sizes. I'm able to do this by backing out what xarray chooses in an `open_zarr` call and then providing the right `chunks=` argument. I'll admit, though, that I wouldn't mind giving Xarray a value like `""1 GiB""` and having it use that when determining `""auto""` chunk sizes.
Dask array does this in two ways. We can provide a value directly in `chunks=`, like the following:
```python
x = da.random.random(..., chunks=""1 GiB"")
```
We can also refer to a value in the Dask config:
```python
In [1]: import dask
In [2]: dask.config.get(""array.chunk-size"")
Out[2]: '128MiB'
```
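For illustration, the requested Xarray API might look something like this (hypothetical; `open_zarr` does not currently accept a byte size here, and the path is illustrative):
```python
import xarray as xr

store = 'path/to/data.zarr'  # illustrative path to any Zarr store
# hypothetical usage: accepting a byte size here is exactly the feature requested
ds = xr.open_zarr(store, chunks='1 GiB')
```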
This is not very important (I'm unblocked) but I thought I'd mention it in case someone is looking for some fun work 🙂
### Describe the solution you'd like
_No response_
### Describe alternatives you've considered
_No response_
### Additional context
_No response_","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/8021/reactions"", ""total_count"": 2, ""+1"": 2, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,issue
295270362,MDU6SXNzdWUyOTUyNzAzNjI=,1895,Avoid Adapters in task graphs?,306380,closed,0,,,13,2018-02-07T19:52:02Z,2022-05-11T20:26:42Z,2022-05-11T20:26:42Z,MEMBER,,,,"Looking at an `open_zarr` computation from @rabernat I'm coming across intermediate values like the following:
```python
>>> Future('zarr-adt-0f90b3f56f247f966e5ef01277f31374').result()
ImplicitToExplicitIndexingAdapter(array=LazilyIndexedArray(array=, key=BasicIndexer((slice(None, None, None), slice(None, None, None), slice(None, None, None)))))
```
This object has many dependents, and so will presumably have to float around the network to all of the workers:
```python
>>> len(dep.dependents)
1781
```
In principle this is fine, especially if this object is cheap to serialize, move, and deserialize. It does introduce a bit of friction though. I'm curious how hard it would be to build task graphs that generated these objects on the fly, or else removed them altogether. It is slightly more convenient from a task scheduling perspective for data access tasks to not have any dependencies.","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1895/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
336371511,MDExOlB1bGxSZXF1ZXN0MTk3ODQwODgw,2255,Add automatic chunking to open_rasterio,306380,closed,0,,,10,2018-06-27T20:15:07Z,2022-04-07T20:21:24Z,2022-04-07T20:21:24Z,MEMBER,,0,pydata/xarray/pulls/2255,"This uses the automatic chunking in dask 0.18+ to chunk rasterio
datasets in a nicely aligned way.
Currently this doesn't implement tests due to a difficulty in creating
chunked tiff images.
This also uncovered an inefficiency in Dask: it doesn't align rechunking to existing chunk schemes.
- [x] Closes #2093
- [ ] Tests added (for all bug fixes or enhancements)
- [ ] Tests passed (for all non-documentation changes)
- [ ] Fully documented, including `whats-new.rst` for all changes and `api.rst` for new API (remove if this change should not be visible to users, e.g., if it is an internal clean-up, or if this is part of a larger project that will be documented later)
I could use help with the following:
- How to create tiled TIFF files in the tests
- The right way to merge different dtypes and block shapes in the TIFF file. Currently I'm assuming that they're uniform.","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/2255/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,pull
908971901,MDU6SXNzdWU5MDg5NzE5MDE=,5426,Implement dask.sizeof for xarray.core.indexing.ImplicitToExplicitIndexingAdapter,306380,open,0,,,17,2021-06-02T01:55:23Z,2021-11-16T15:08:03Z,,MEMBER,,,,"I'm looking at a [pangeo gallery workflow](https://nbviewer.jupyter.org/github/pangeo-gallery/osm2020tutorial/blob/master/AWS-notebooks/aws_mur_sst_tutorial_short.ipynb) that suffers from poor load balancing because objects of type `xarray.core.indexing.ImplicitToExplicitIndexingAdapter` are being interpreted as 48 B when in fact, I suspect, they are significantly more expensive to move around.
I'm seeing ""number of processing tasks"" charts that look like the following, a common sign of the load balancer not making good decisions, which is most commonly caused by poor data-size measurements:

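A sketch of the requested hook (assuming the adapter exposes its wrapped array as `.array`; dask's `sizeof` dispatch is the real extension point):
```python
from dask.sizeof import sizeof
from xarray.core.indexing import ImplicitToExplicitIndexingAdapter

@sizeof.register(ImplicitToExplicitIndexingAdapter)
def sizeof_adapter(obj):
    # report the size of the wrapped array rather than the ~48 B wrapper itself
    return sizeof(obj.array)
```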
","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/5426/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,issue
207021356,MDU6SXNzdWUyMDcwMjEzNTY=,1262,Logical DTypes,306380,open,0,,,11,2017-02-12T01:26:23Z,2020-12-26T14:26:00Z,,MEMBER,,,,"**tl;dr**: Can XArray enable user-defined logical dtypes on top of physical NumPy arrays?
### The Need for New Datatypes
NumPy's dtypes (int, float, etc.) are appropriate for many, but not all cases. There are a variety of situations where we want numpy-like array semantics (broadcasting, memory layout) but with different element properties. Use cases include the following:
1. Datetimes with timezones
2. Categorical values (such as for land-use in climate data)
3. IPv4 or IPv6 addresses
4. ...
Currently dtypes need to be added directly to the NumPy source code. This is a high barrier for many community members, requires general approval (there can be only one datetime implementation, which is both good and bad), and limits experimentation. There is value in supporting user-definable datatypes.
### This is hard to do in NumPy
Ideally we would implement extensible user-defined dtypes within NumPy (and there may be long-standing plans to do just this). However, changing NumPy today is hard, both because it's hard to find developers who are comfortable operating at that level and because the backwards compatibility pressure on NumPy is large.
So as an alternative, we might consider lightly wrapping NumPy arrays in a new object that also includes extra dtype information. For example we might wrap an int64 numpy array with some datetime/timezone metadata to achieve a *logical* datetime array using a *physical* int64 array. We continue using NumPy as is but use this higher layer when necessary for more complex dtypes.
However, ""lightly wrapping"" NumPy arrays is hard to do while still maintaining a closed system where all operations remain consistent (raw NumPy arrays inevitably leak through). Additionally, asking communities to switch to new libraries is socially quite challenging.
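As a minimal sketch of the idea (all names hypothetical), a logical datetime array over a physical int64 array might look like:
```python
import numpy as np

class LogicalDatetimeArray:
    # hypothetical wrapper: logical dtype metadata over physical int64 storage
    def __init__(self, data, tz):
        assert data.dtype == np.int64  # physical storage: nanoseconds since epoch
        self.data = data
        self.tz = tz                   # logical metadata, e.g. a timezone name
```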
### XArray is well placed
Fortunately XArray appears to have already solved some of these technical and social challenges. XArray lightly wraps NumPy arrays in a consistent manner. NumPy-like operations on XArrays remain XArrays. Interactions with other NumPy arrays are well defined. XArray has also attracted an active user/developer community and has attained general respect from the broader ecosystem. XArray seems to be hackable, benefits from a decently active community, and is not yet under as much backwards compatibility pressure.
So question: *Is it sensible to add logical dtype information to XArray? Can this be done with only moderate effort and maintenance costs to the XArray project? If the answer is ""yes, probably"", then what is the right way to go about this?*","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1262/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,issue
318950038,MDU6SXNzdWUzMTg5NTAwMzg=,2093,Default chunking in GeoTIFF images,306380,closed,0,,,10,2018-04-30T16:21:30Z,2020-06-18T06:27:07Z,2020-06-18T06:27:07Z,MEMBER,,,,"Given a tiled GeoTIFF image I'm looking for the best practice in reading it as a chunked dataset. I did this in [this notebook](https://gist.github.com/mrocklin/3df315e93d4bdeccf76db93caca2a9bd) by first opening the file with rasterio, looking at the block sizes, and then using those to inform the argument to `chunks=` in `xarray.open_rasterio`. This works, but is somewhat cumbersome because I also had to dive into the rasterio API. Do we want to provide defaults here?
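The workaround looks roughly like this (a sketch of what the notebook does; the file path is illustrative):
```python
import rasterio
import xarray as xr

path = 'image.tif'  # illustrative
with rasterio.open(path) as src:
    blocky, blockx = src.block_shapes[0]  # (rows, cols) of the first band's blocks
da = xr.open_rasterio(path, chunks={'band': 1, 'y': blocky, 'x': blockx})
```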
In dask.array, every time this has come up we've shot it down: automatic chunking is error-prone and hard to do well. However, in these cases the object we're given usually also conveys its chunking in a way that matches how dask.array thinks about it, so the extra cognitive load on the user has been somewhat low. Rasterio's model and API feel much more foreign to me than a project like NetCDF or h5py does. I find myself wanting a `chunks=True` or `chunks='100MB'` option.
Thoughts on this? Is this in-scope? If so then what is the right API and what is the right policy for how to make xarray/dask.array chunks larger than GeoTIFF chunks?","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/2093/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
400948664,MDU6SXNzdWU0MDA5NDg2NjQ=,2692,Xarray tutorial at SciPy 2019?,306380,closed,0,,,10,2019-01-19T01:56:38Z,2020-03-25T04:34:27Z,2019-02-17T05:07:45Z,MEMBER,,,,"Is anyone interested in submitting a tutorial to SciPy 2019? I think that it would be useful to have an official Xarray tutorial out there somewhere on the internet. This could be good motivation to create one.
https://www.scipy2019.scipy.org/tutorials
See also: https://github.com/pydata/xarray/issues/1882","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/2692/reactions"", ""total_count"": 2, ""+1"": 2, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
287969295,MDU6SXNzdWUyODc5NjkyOTU=,1822,Use apply_ufunc in xESMF regridding package,306380,closed,0,,,4,2018-01-12T00:17:04Z,2020-01-15T00:01:49Z,2020-01-15T00:01:49Z,MEMBER,,,,"I would like to call attention to https://github.com/JiaweiZhuang/xESMF/issues/3#issuecomment-354668897. It seems like the xESMF package does regridding in a way that at least some XArray users find sensible. It does not currently use `apply_ufunc`, though it probably should, and is not particularly parallelizable (or at least that is my understanding). It could be that some modest development by someone more familiar with XArray could have a large impact by properly using `apply_ufunc` within that codebase.
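For reference, the shape of such a wrapping might look like the following (a hedged sketch; the stand-in regrid function, dataset, and dimension names are all hypothetical):
```python
import numpy as np
import xarray as xr

def regrid_slice(data):
    # hypothetical stand-in: a real version would call into ESMF here
    return data.repeat(2, axis=0).repeat(2, axis=1)

ds = xr.Dataset({'field': (('time', 'lat', 'lon'), np.zeros((3, 10, 20)))})
out = xr.apply_ufunc(
    regrid_slice, ds['field'],
    input_core_dims=[['lat', 'lon']],
    output_core_dims=[['lat_out', 'lon_out']],
    vectorize=True,  # loop regrid_slice over the remaining (time) dim
)
```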
I apologize for posting an issue about another package in this issue tracker. Feel free to close.
cc @JiaweiZhuang ","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1822/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
221858543,MDU6SXNzdWUyMjE4NTg1NDM=,1375,Sparse arrays,306380,closed,0,,,25,2017-04-14T18:00:14Z,2019-08-30T02:36:12Z,2019-08-13T03:31:14Z,MEMBER,,,,"I would like to have an XArray that has scipy.sparse arrays rather than numpy arrays. Is this in scope?
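For concreteness, a sketch of the desired usage (hypothetical here; `sparse` is the n-dimensional pydata/sparse library rather than scipy.sparse, since xarray would need numpy-like duck arrays):
```python
import numpy as np
import sparse
import xarray as xr

s = sparse.COO.from_numpy(np.eye(4))   # an n-dimensional sparse duck array
da = xr.DataArray(s, dims=('x', 'y'))  # wrap it directly, as with numpy
```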
What would need to happen within XArray to support this?","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1375/reactions"", ""total_count"": 8, ""+1"": 8, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
323785231,MDU6SXNzdWUzMjM3ODUyMzE=,2143,Upstream changes in Dask,306380,closed,0,,,1,2018-05-16T21:01:21Z,2019-08-15T15:16:54Z,2019-08-15T15:16:54Z,MEMBER,,,,"Hi All,
There are a couple of changes coming in dask that might affect XArray code:
1. We're replacing the `get=dask.threaded.get` keyword with `scheduler='threads'`
2. We're replacing `dask.set_options(...)` with `dask.config.set(...)`
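Concretely, the migration looks like this (a sketch; `x` stands in for any dask collection):
```python
import dask
import dask.array as da
import dask.threaded

x = da.ones(10)  # any dask collection

# old style (to be deprecated)
x.compute(get=dask.threaded.get)
dask.set_options(get=dask.threaded.get)

# new style
x.compute(scheduler='threads')
dask.config.set(scheduler='threads')
```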
Both of the old systems will still work, at least for a version or two, but we plan to remove them in the future. I thought I'd bring these changes up here so that we can plan a clean deprecation within XArray. Neither change has been released yet, so both are still up for discussion if this community has additional constraints.","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/2143/reactions"", ""total_count"": 1, ""+1"": 1, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
355308699,MDU6SXNzdWUzNTUzMDg2OTk=,2390,Why are there two compute calls for plot?,306380,closed,0,,,3,2018-08-29T19:53:45Z,2019-08-04T23:00:59Z,2019-08-04T23:00:59Z,MEMBER,,,,Anecdotally I find that when I call `.plot()` on a dataset object that holds dask arrays `compute` gets called twice. Why is this? I'm curious if this is something that should be resolved.,"{""url"": ""https://api.github.com/repos/pydata/xarray/issues/2390/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
456239422,MDU6SXNzdWU0NTYyMzk0MjI=,3022,LazilyOuterIndexedArray doesn't support slicing with slice objects,306380,open,0,,,2,2019-06-14T13:05:56Z,2019-06-14T21:48:12Z,,MEMBER,,,,"#### Code Sample, a copy-pastable example if possible
```python
from xarray.core.indexing import LazilyOuterIndexedArray
import numpy as np
x = LazilyOuterIndexedArray(np.ones((5, 5)))
x[:3]
```
```python-traceback
---------------------------------------------------------------------------
AttributeError Traceback (most recent call last)
<ipython-input-...> in <module>
----> 1 x[:3]
~/workspace/xarray/xarray/core/indexing.py in __getitem__(self, indexer)
518 array = LazilyVectorizedIndexedArray(self.array, self.key)
519 return array[indexer]
--> 520 return type(self)(self.array, self._updated_key(indexer))
521
522 def __setitem__(self, key, value):
~/workspace/xarray/xarray/core/indexing.py in _updated_key(self, new_key)
483
484 def _updated_key(self, new_key):
--> 485 iter_new_key = iter(expanded_indexer(new_key.tuple, self.ndim))
486 full_key = []
487 for size, k in zip(self.array.shape, self.key.tuple):
AttributeError: 'slice' object has no attribute 'tuple'
```
#### Problem description
Dask array meta computations like to run `x[:0, :0]` on input arrays, which fails for this class.
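One possible direction (a rough sketch only, reusing `expanded_indexer` and `BasicIndexer` from `xarray.core.indexing`) would be to normalize bare slices into an explicit indexer before dispatching:
```python
from xarray.core.indexing import BasicIndexer, ExplicitIndexer, expanded_indexer

class LazilyOuterIndexedArray:
    # sketch of a normalization step, not the actual implementation
    def __getitem__(self, indexer):
        if not isinstance(indexer, ExplicitIndexer):
            # wrap bare slices/integers, e.g. x[:3], into a full BasicIndexer
            indexer = BasicIndexer(expanded_indexer(indexer, self.ndim))
        ...  # continue with the existing key-merging logic
```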
This is on master","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/3022/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,issue
390443869,MDExOlB1bGxSZXF1ZXN0MjM4MjA5ODI1,2603,Support HighLevelGraphs,306380,closed,0,,,2,2018-12-12T22:52:28Z,2018-12-13T17:13:10Z,2018-12-13T17:13:00Z,MEMBER,,0,pydata/xarray/pulls/2603,"Fixes https://github.com/dask/dask/issues/4291
- [x] Fully documented, including `whats-new.rst` for all changes and `api.rst` for new API
","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/2603/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,pull
372640063,MDExOlB1bGxSZXF1ZXN0MjI0NzY5Mjg1,2500,Avoid use of deprecated get= parameter in tests,306380,closed,0,,,7,2018-10-22T18:25:58Z,2018-10-23T10:31:37Z,2018-10-23T00:22:51Z,MEMBER,,0,pydata/xarray/pulls/2500," - [x] Fully documented, including `whats-new.rst` for all changes and `api.rst` for new API (remove if this change should not be visible to users, e.g., if it is an internal clean-up, or if this is part of a larger project that will be documented later)
","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/2500/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,pull
282178751,MDU6SXNzdWUyODIxNzg3NTE=,1784,Add compute=False keywords to `to_foo` functions,306380,closed,0,,,9,2017-12-14T17:25:19Z,2018-05-16T15:05:03Z,2018-05-16T15:05:03Z,MEMBER,,,,"When working with @jhamman profiling the `to_zarr` method on large datasets I wanted the ability to run through the `to_zarr` setup code, but avoid waiting on the dask computation to finish. In many functions in Dask proper our `to_foo` methods have a `compute=False` keyword that returns a `dask.delayed` object on which people can call `compute` later if desired.
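A sketch of the requested usage (the `compute=` keyword on `to_zarr` is the proposal, not existing API; the dataset and store path are illustrative):
```python
import xarray as xr

ds = xr.Dataset({'x': ('t', range(10))})
delayed = ds.to_zarr('out.zarr', compute=False)  # run setup, skip the big compute
# ... profile, inspect, or batch with other work ...
delayed.compute()  # execute the dask computation when ready
```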
cc @jhamman @rabernat @jakirkham (who has been looking at similar questions within `dask.array.Array.store`)","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1784/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
295146502,MDU6SXNzdWUyOTUxNDY1MDI=,1894,Zarr keys include variable name,306380,closed,0,,,1,2018-02-07T13:56:32Z,2018-02-17T04:40:15Z,2018-02-17T04:40:15Z,MEMBER,,,,"When using open_zarr on a dataset with many variables the keynames include the variable name, like:
```python
('zarr-temperature-1234', 1, 3, 2)
```
In the distributed scheduler these keynames get shortened to prefixes like `zarr-temperature` and used both for scheduling heuristics (all keys with the same prefix are expected to take similar-ish amounts of time) and for diagnostics, such as in the progress bar plot below:

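For reference, the prefixing comes from dask's `key_split` (a quick illustration; import path per current dask):
```python
>>> from dask.utils import key_split
>>> key_split('zarr-temperature-1234')
'zarr-temperature'
>>> key_split('zarr-1234')
'zarr'
```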
We may want to avoid including the variable name in the keyname here, in order to avoid breaking these out into several groups. Instead you might consider putting the variable name within the key as another member of the tuple, like the following:
```python
('zarr-1234', 'temperature', 1, 3, 2)
```","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1894/reactions"", ""total_count"": 1, ""+1"": 1, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
296434660,MDExOlB1bGxSZXF1ZXN0MTY4NjIyOTM3,1904,Replace task_state with tasks in dask test,306380,closed,0,,,4,2018-02-12T16:19:14Z,2018-02-12T21:08:06Z,2018-02-12T21:08:06Z,MEMBER,,0,pydata/xarray/pulls/1904,"This internal state was changed in the latest release.
- [x] Closes #1903 (remove if there is no corresponding issue, which should only be the case for minor changes)
- [ ] Tests added (for all bug fixes or enhancements)
- [x] Tests passed (for all non-documentation changes)
- [ ] Fully documented, including `whats-new.rst` for all changes and `api.rst` for new API (remove if this change should not be visible to users, e.g., if it is an internal clean-up, or if this is part of a larger project that will be documented later)
","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1904/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,pull
276448264,MDExOlB1bGxSZXF1ZXN0MTU0NDI5ODYz,1741,Auto flake,306380,closed,0,,,2,2017-11-23T18:00:47Z,2018-01-14T20:49:20Z,2018-01-14T20:49:20Z,MEMBER,,0,pydata/xarray/pulls/1741," - [ ] Closes #xxxx
- [x] Tests added / passed
- [ ] Passes ``flake8 xarray``
- [ ] Fully documented, including `whats-new.rst` for all changes and `api.rst` for new API
I had a free half hour, so I decided to run the autoflake and autopep8 tools on the codebase. ``flake8 xarray`` passes. I copied over the exclusions that we use within `dask/distributed` and extended the line length to 120. You may wish to review these decisions.","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1741/reactions"", ""total_count"": 2, ""+1"": 2, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,pull
286448591,MDU6SXNzdWUyODY0NDg1OTE=,1810,data_array.<TAB> reads data,306380,closed,0,,,4,2018-01-06T01:34:55Z,2018-01-06T14:26:36Z,2018-01-06T14:26:36Z,MEMBER,,,,"#### Code Sample, a copy-pastable example if possible
```python
ds = xarray.open_dataset(...)
da = ds.variables['...']
da.
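# pressing <TAB> here for IPython auto-completion is what triggers the reads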
```
#### Problem description
This starts reading data. I don't know why. I'm using XArray against a FUSE system that is both expensive (it's targeting Google Cloud Storage) and also has logging. I can see that auto-completion immediately starts a lot of file reading on the file system.
#### Output of ``xr.show_versions()``
```python
>>> xr.show_versions()
INSTALLED VERSIONS
------------------
commit: None
python: 3.6.3.final.0
python-bits: 64
OS: Linux
OS-release: 4.10.0-42-generic
machine: x86_64
processor: x86_64
byteorder: little
LC_ALL: en_US.UTF-8
LANG: en_US.UTF-8
LOCALE: en_US.UTF-8
xarray: 0.10.0
pandas: 0.21.0
numpy: 1.13.3
scipy: 1.0.0
netCDF4: 1.3.1
h5netcdf: 0.5.0
Nio: None
bottleneck: 1.2.1
cyordereddict: None
dask: 0.16.0
matplotlib: None
cartopy: None
seaborn: None
setuptools: 36.6.0
pip: 9.0.1
conda: 4.3.29
pytest: 3.3.1
IPython: 6.2.1
sphinx: None
```
","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1810/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
279198672,MDExOlB1bGxSZXF1ZXN0MTU2MzQwNDk3,1760,Fix DataArray.__dask_scheduler__ to point to dask.threaded.get,306380,closed,0,,,8,2017-12-05T00:12:21Z,2017-12-07T22:13:42Z,2017-12-07T22:09:18Z,MEMBER,,0,pydata/xarray/pulls/1760,"Previously this erroneously pointed to an optimize function, likely a
copy-paste error.
For testing, this also redirects the `.compute` methods to use the
dask.compute function directly *if* dask.__version__ >= '0.16.0'.
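The core of the fix, per the dask collection interface (a sketch, not the full diff):
```python
import dask.threaded

class DataArray:
    # __dask_scheduler__ must point at a scheduler `get`, not an optimizer
    __dask_scheduler__ = staticmethod(dask.threaded.get)
```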
Closes #1759
- [x] Closes #xxxx (remove if there is no corresponding issue, which should only be the case for minor changes)
- [x] Tests added (for all bug fixes or enhancements)
- [x] Tests passed (for all non-documentation changes)
- [x] Passes ``git diff upstream/master **/*py | flake8 --diff`` (remove if you did not edit any Python files)
- [x] Fully documented, including `whats-new.rst` for all changes and `api.rst` for new API (remove if this change should not be visible to users, e.g., if it is an internal clean-up, or if this is part of a larger project that will be documented later)
","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1760/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,pull
269886091,MDExOlB1bGxSZXF1ZXN0MTQ5NzMxMDM5,1674,Support Dask interface,306380,closed,0,,,12,2017-10-31T09:15:52Z,2017-11-07T18:37:06Z,2017-11-07T18:31:45Z,MEMBER,,0,pydata/xarray/pulls/1674,"This integrates the new dask interface methods into XArray. This will make XArray a first-class dask collection and help in particular with newer dask.distributed features.
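For context, the dask collection interface being implemented looks like this (method names per the dask docs; bodies elided, so this is only a shape sketch, not xarray's implementation):
```python
import dask.threaded

class Dataset:
    def __dask_graph__(self): ...        # the task graph (or None if not lazy)
    def __dask_keys__(self): ...         # the output keys within that graph
    __dask_scheduler__ = staticmethod(dask.threaded.get)
    def __dask_postcompute__(self): ...  # (finalize, extra_args) for compute()
    def __dask_postpersist__(self): ...  # (rebuild, extra_args) for persist()
```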
- [x] Closes https://github.com/pangeo-data/pangeo/issues/5
- [x] Tests added / passed
- [x] Passes ``git diff upstream/master **/*py | flake8 --diff``
- [x] Fully documented, including `whats-new.rst` for all changes and `api.rst` for new API
Builds on work from @jcrist here: https://github.com/dask/dask/pull/2748
Depends on https://github.com/dask/dask/pull/2847","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1674/reactions"", ""total_count"": 1, ""+1"": 1, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,pull
218314868,MDU6SXNzdWUyMTgzMTQ4Njg=,1343,Some XArray key names don't group nicely,306380,closed,0,,,2,2017-03-30T20:15:44Z,2017-05-22T20:38:56Z,2017-05-22T20:38:56Z,MEMBER,,,,"Some XArray loading functions provide keys that don't adhere to dask conventions used for naming.
We can solve this in XArray by using names like `'load-' + dask.base.tokenize(stuff)`, or in dask by trying to identify and avoid names like these. It might be wise to attempt both. I expect that this will be easier to solve in XArray (though that's also in my own self-interest :))
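An illustration of the XArray-side naming (the inputs are hypothetical):
```python
import dask.base

filename, variable = 'data.nc', 'temperature'  # hypothetical inputs
name = 'load-' + dask.base.tokenize(filename, variable)  # 'load-<deterministic hash>'
```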

","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1343/reactions"", ""total_count"": 1, ""+1"": 1, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
218942553,MDExOlB1bGxSZXF1ZXN0MTEzOTM2OTk3,1349,Add persist method to DataSet,306380,closed,0,,,10,2017-04-03T13:59:02Z,2017-04-04T16:19:20Z,2017-04-04T16:14:17Z,MEMBER,,0,pydata/xarray/pulls/1349,"Fixes https://github.com/pydata/xarray/issues/1344
- [x] closes #xxxx
- [x] tests added / passed
- [x] passes ``git diff upstream/master | flake8 --diff``
- [x] whatsnew entry (not sure what to do here, is there a new section? It looks like the last release was yesterday)
","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1349/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,,13221727,pull
218315793,MDU6SXNzdWUyMTgzMTU3OTM=,1344,Dask Persist,306380,closed,0,,,5,2017-03-30T20:19:17Z,2017-04-04T16:14:17Z,2017-04-04T16:14:17Z,MEMBER,,,,"It would be convenient to load constituent dask.arrays into memory as dask.arrays rather than as numpy arrays. This would help with distributed computations where we want to load a large amount of data into distributed memory once and then iterate on the full xarray dataset repeatedly without reloading from disk every time.
We can probably solve this from either side:
1. XArray could make a `.persist` method that replaced all of its dask.arrays with a persisted version of that array
```python
import dask
dset.x, dset.y, dset.z = dask.persist(dset.x, dset.y, dset.z)
```
2. We could look into the Dask duck type solution again https://github.com/dask/dask/pull/1068
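A rough sketch of how option 1 might look internally (hedged; assumes each variable's `.data` can be reassigned):
```python
import dask

def persist(dset):
    # persist all dask-backed variables in one scheduler pass, then reassign
    names = [k for k, v in dset.variables.items()
             if dask.is_dask_collection(v.data)]
    persisted = dask.persist(*(dset.variables[k].data for k in names))
    for k, data in zip(names, persisted):
        dset.variables[k].data = data
    return dset
```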
cc @shoyer @jcrist @rabernat @pwolfram ","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1344/reactions"", ""total_count"": 1, ""+1"": 1, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue
187594293,MDU6SXNzdWUxODc1OTQyOTM=,1085,Always use absolute paths,306380,closed,0,,,3,2016-11-06T22:25:08Z,2016-12-01T16:47:40Z,2016-12-01T16:47:40Z,MEMBER,,,,"This would avoid a mismatch between clients and workers when using dask.distributed:
```python
In [2]: os.path.abspath('my-local-path')
Out[2]: '/home/mrocklin/my-local-path'
```","{""url"": ""https://api.github.com/repos/pydata/xarray/issues/1085/reactions"", ""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,completed,13221727,issue