issues: 309227775

id: 309227775
node_id: MDU6SXNzdWUzMDkyMjc3NzU=
number: 2022
title: Enable Append/concat to existing zarr datastore
user: 6101444
state: closed
locked: 0
comments: 7
created_at: 2018-03-28T05:39:37Z
updated_at: 2019-06-29T23:43:15Z
closed_at: 2019-06-29T23:43:15Z
author_association: NONE

Following discussion from https://github.com/pangeo-data/pangeo/issues/19

How would we go about implementing a concat or append function for zarr data stores? I am imagining something like xr.concat here. It's not clear to me how this would work when using open_mfdataset.

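```python
import xarray as xr

# gcsmap and path_to_netcdf are placeholders for a GCS-backed store and a local netCDF file
zarray_1 = xr.open_zarr(store=gcsmap)
ds2 = xr.open_dataset(path_to_netcdf)

# "time" is an assumed dimension name; use whichever dimension the data should grow along
xr.concat([zarray_1, ds2], dim="time")
```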

Problem description

If you are using a cloud storage service such as GCS, ds.to_zarr can fail before the upload completes. This is a problem for multi-TB datasets, because the entire process has to be restarted with no way to resume where it left off.
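
To make the "resume where you left off" idea concrete, here is a minimal sketch, not part of xarray or zarr. It assumes the dataset has first been written to a local key-value store and that the remote store behaves like a mutable mapping in which a key that is present is complete. The names `resumable_copy` and `FlakyStore` are hypothetical and exist only for this illustration.

```python
from collections.abc import Mapping, MutableMapping


def resumable_copy(source: Mapping, target: MutableMapping) -> int:
    """Copy every key missing from `target`; return how many keys were written.

    Keys already in `target` are assumed to be complete and are skipped,
    so calling this again after a failure resumes where the last run stopped.
    """
    written = 0
    for key in sorted(source):
        if key in target:
            continue  # uploaded by an earlier, interrupted run
        target[key] = source[key]
        written += 1
    return written


class FlakyStore(dict):
    """Hypothetical stand-in for a remote store that fails after a few writes."""

    def __init__(self, fail_after: int):
        super().__init__()
        self.writes_left = fail_after

    def __setitem__(self, key, value):
        if self.writes_left == 0:
            raise ConnectionError("simulated upload failure")
        self.writes_left -= 1
        super().__setitem__(key, value)


# Toy "local store": one metadata object plus four chunk objects.
local = {".zarray": b"{}", "0": b"a", "1": b"b", "2": b"c", "3": b"d"}
remote = FlakyStore(fail_after=3)

try:
    resumable_copy(local, remote)
except ConnectionError:
    pass  # three keys were uploaded before the simulated failure

remote.writes_left = 10  # connection restored
resumed = resumable_copy(local, remote)  # writes only the two missing keys
```

The second call uploads only what the first call did not finish. This covers resuming an interrupted upload, not appending new data along a dimension, which would also require updating the array metadata.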

Expected Output

A new zarr dataset with the additional dataset appended along the appropriate dimension.

{
    "url": "https://api.github.com/repos/pydata/xarray/issues/2022/reactions",
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
state_reason: completed
repo: 13221727
type: issue
