issues: 439875798
This data as json
id | node_id | number | title | user | state | locked | assignee | milestone | comments | created_at | updated_at | closed_at | author_association | active_lock_reason | draft | pull_request | body | reactions | performed_via_github_app | state_reason | repo | type |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
439875798 | MDU6SXNzdWU0Mzk4NzU3OTg= | 2937 | encoding of boolean dtype in zarr | 1197350 | open | 0 | 3 | 2019-05-03T03:53:27Z | 2022-04-09T01:22:42Z | MEMBER | I want to store an array with 1364688000 boolean values in zarr. I will have to read this array many times, so I am trying to do it as efficiently as possible. I have noticed that, if we try to write boolean data to zarr from xarray, zarr stores it as Example
So it seems like, during serialization of bool data, xarray is converting the data to int8 and then adding a Problem descriptionSince zarr is fully capable of storing bool data directly, we should not need to encode the data as i8. I think this happens in which calls So maybe we make the boolean encoding optional? Output of
|
{ "url": "https://api.github.com/repos/pydata/xarray/issues/2937/reactions", "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
13221727 | issue |