issue_comments: 325723577

This data as json

html_url	issue_url	id	node_id	user	created_at	updated_at	author_association	body	reactions	performed_via_github_app	issue
https://github.com/pydata/xarray/pull/1528#issuecomment-325723577	https://api.github.com/repos/pydata/xarray/issues/1528	325723577	MDEyOklzc3VlQ29tbWVudDMyNTcyMzU3Nw==	1217238	2017-08-29T16:43:58Z	2017-08-29T16:44:25Z	MEMBER	Is the goal here to be able to round-trip the file, such that calling .to_netcdf() produces an identical file to the original source file? Yes, exactly. I don't understand how encoding interacts with attributes? When is something an attribute vs. an encoding (add_offset for example)? Typically, we store things in encoding that are attributes on the underlying NetCDF file, but no longer make sense to describe the decoded data. For example: - On the file, `add_offset` is an attribute. - If loaded with `open_dataset(..., mask_and_scale=True)`, `add_offset` can be found in `encoding`, not `attrs`, because the data has already been offset. - If loaded with `open_dataset(..., mask_and_scale=False)`, `add_offset` will still be on `attrs` (the data has not been offset). How does xarray know whether the store automatically encodes / decodes the encodings vs. when it has to be done by xarray, e.g. by calling mask_and_scale Currently, we assume that stores never do this, and always handle it ourselves. We might need a special exception for zarr and scale/offset encoding. Does this mean that my ZarrStore should inherit from WritableCFDataStore instead of AbstractWritableDataStore? Maybe, though again it will probably need slightly customized conventions for writing data (if we let zarr handling scale/offset encoding). I don't yet understand how to make these elements work together properly, for example, do avoid applying the scale / offset function twice, as I mentioned above. We have two options: 1. Handle it all in xarray via the machinery in `conventions.py`. Never pass the arguments to do scale/offset encoding to zarr (just save them as attributes). 2. Handle it all in zarr. We'll need special case logic to skip this part of encoding. I think (2) would be the preferred way to do this.	{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 }		253136694