home / github / issue_comments

Menu
  • Search all tables
  • GraphQL API

issue_comments: 210694521

This data as json

html_url issue_url id node_id user created_at updated_at author_association body reactions performed_via_github_app issue
https://github.com/pydata/xarray/issues/826#issuecomment-210694521 https://api.github.com/repos/pydata/xarray/issues/826 210694521 MDEyOklzc3VlQ29tbWVudDIxMDY5NDUyMQ== 1217238 2016-04-16T00:17:46Z 2016-04-16T00:17:46Z MEMBER

Yes, my main concern is code bloat. Storing things like the computation graph and command line flags used to invoke a script are certainly useful things to do, and I use versions of this stuff all the time. But they are orthogonal to the labeled data focus of xarray so they belong better in another library.

If you want to take this approach, you might start by using something like dask.imperative, and extracting the task dependencies from the resulting task graph. Or you could even try to work with the full dask graphs created by using dask.array with xarray, but these can get pretty big.

Getting parameters from the calling script is even easier -- just inspect sys.argv and set it as an attribute before saving files (or write your own function). The main complexity here is picking a convention, not implementing it.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  148771214
Powered by Datasette · Queries took 0.728ms · About: xarray-datasette