home / github / issues

Menu
  • Search all tables
  • GraphQL API

issues: 415802678

This data as json

id node_id number title user state locked assignee milestone comments created_at updated_at closed_at author_association active_lock_reason draft pull_request body reactions performed_via_github_app state_reason repo type
415802678 MDU6SXNzdWU0MTU4MDI2Nzg= 2796 Better explanation of 'minimal' in xarray.open_mfdataset(data_vars='minimal') in docs? 5704500 open 0     2 2019-02-28T20:11:42Z 2021-07-08T17:42:52Z   NONE      

Problem description

I'm currently troubleshooting some overly long (to me) load times using open_mfdataset on GFS data. In trying to speed things up, I'm trying to specify just the four variables I actually care about using data_vars=[strings], but to no avail. It still takes ~30 minutes to load 52 time slices from 7 files.

In the docs I do see that if data_vars =

list of str: "The listed data variables will be concatenated, in addition to the ‘minimal’ data variables."

However, I can't seem to understand what the 'minimal' variables are from this sentence in the docs:

‘minimal’: Only data variables in which the dimension already appears are included.

All the variables in the CF-compliant GFS data are associated with dimensions. So does that mean that all the variables in the files will be concatenated, regardless if I specify which ones I want? I feel like I'm misunderstanding what is included by default.

{
    "url": "https://api.github.com/repos/pydata/xarray/issues/2796/reactions",
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
    13221727 issue

Links from other tables

  • 2 rows from issues_id in issues_labels
  • 2 rows from issue in issue_comments
Powered by Datasette · Queries took 0.789ms · About: xarray-datasette