home / github

Menu
  • Search all tables
  • GraphQL API

issue_comments

Table actions
  • GraphQL API for issue_comments

2 rows where issue = 878481461 and user = 11815787 sorted by updated_at descending

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: created_at (date), updated_at (date)

user 1

  • minhhg · 2 ✖

issue 1

  • open_mfdataset: Not a valid ID · 2 ✖

author_association 1

  • NONE 2
id html_url issue_url node_id user created_at updated_at ▲ author_association body reactions performed_via_github_app issue
836869041 https://github.com/pydata/xarray/issues/5276#issuecomment-836869041 https://api.github.com/repos/pydata/xarray/issues/5276 MDEyOklzc3VlQ29tbWVudDgzNjg2OTA0MQ== minhhg 11815787 2021-05-10T15:50:37Z 2021-05-10T15:50:37Z NONE

Hi there - are you able to narrow down the problem a little bit for us? Can you get the same error by opening any one file individually with xarray.open_dataset()? What if you use the netCDF4 library to access a variable directly without xarray? Failing that can you give us a file/files for which this fails, so that we can reproduce the problem locally? Also, have you tried using a more recent version of netCDF4? The most recent version on PyPI seems to be 1.5.6, but you're using 1.4.2.

As mentioned above, the error only happens after successful call the func for a few times, i.e, first few calls it works fine as expected. "Can you get the same error by opening any one file individually with `xarray.open_dataset()" Yes, I can. But the whole point is to use open_mfdataset.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  open_mfdataset: Not a valid ID 878481461
836857104 https://github.com/pydata/xarray/issues/5276#issuecomment-836857104 https://api.github.com/repos/pydata/xarray/issues/5276 MDEyOklzc3VlQ29tbWVudDgzNjg1NzEwNA== minhhg 11815787 2021-05-10T15:41:56Z 2021-05-10T15:43:04Z NONE

I was able to generate a small code to re-produce the error. ```python import numpy as np import xarray as xr from copy import deepcopy import random import os import logging log = logging.getLogger(name)

def generate_data(xpath, m=10000,n=900,k=601): """

:param m:
:param n:
:param k:
:return:
"""
coord1=dict(dim1=range(m), dim2=range(n))
for mat in range(k):
    a = np.random.rand(m,n)
    file_name = "{:04d}.nc".format(mat)
    full_file_name = os.path.join(xpath, file_name)
    coord1['maturity'] = [mat]
    a = np.reshape(a,[m,n,1])
    ar = xr.DataArray(a, dims=['dim1', 'dim2', 'maturity'], coords=coord1)
    ar.to_netcdf(full_file_name)

def func(xpath, spec): log.debug("xpath=%s, spec=%s", xpath, spec)

doc = deepcopy(spec)

with xr.open_mfdataset(xpath + "/*.nc", concat_dim='maturity') as data:
        var_name=  list(data.data_vars)[0]
        ar = data[var_name]
        maturity = spec['maturity']
        ann = ar.cumsum(dim='maturity')
        ann = ann - 1
        ar1 = ann.sel(maturity=maturity)
        doc['data'] = ar1.load().values
return doc

def repeat_access_data_generate_error(xpath, k=601, l=1000): """ Dummy to generate error :return: """ spec={} dum=0.0 for _l1 in range(l): try: spec['maturity'] = np.random.randint(0,k,1)[0] doc = func(xpath, spec) #do dummy operation with doc dum = (dum + np.sum(doc['data']))/2.0 except Exception as err: log.error("spec=%s, err=%s", spec, err, exc_info=True)

return dum

if name == "main": FORMAT = '%(asctime)-15s, %(levelname)s %(process)d, %(filename)s:%(lineno)s - %(funcName)20s(), %(message)s' log_file_name = "errors.log" logging.basicConfig(filename=log_file_name, filemode='a+', level=logging.DEBUG, format=FORMAT) xpath="/home/ubuntu/runs/xrdata/" # folder to hold files # change values of m,n,k to re-produce the error generate_data(xpath,m=1000) repeat_access_data_generate_error(xpath)

```

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  open_mfdataset: Not a valid ID 878481461

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE [issue_comments] (
   [html_url] TEXT,
   [issue_url] TEXT,
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [created_at] TEXT,
   [updated_at] TEXT,
   [author_association] TEXT,
   [body] TEXT,
   [reactions] TEXT,
   [performed_via_github_app] TEXT,
   [issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
    ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
    ON [issue_comments] ([user]);
Powered by Datasette · Queries took 1361.609ms · About: xarray-datasette