home / github

Menu
  • Search all tables
  • GraphQL API

issue_comments

Table actions
  • GraphQL API for issue_comments

7 rows where issue = 357808970 sorted by updated_at descending

✖
✖

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: created_at (date), updated_at (date)

user 3

  • rabernat 3
  • shoyer 3
  • pep8speaks 1

author_association 2

  • MEMBER 6
  • NONE 1

issue 1

  • WIP: don't create indexes on multidimensional dimensions · 7 ✖
id html_url issue_url node_id user created_at updated_at ▲ author_association body reactions performed_via_github_app issue
455825759 https://github.com/pydata/xarray/pull/2405#issuecomment-455825759 https://api.github.com/repos/pydata/xarray/issues/2405 MDEyOklzc3VlQ29tbWVudDQ1NTgyNTc1OQ== shoyer 1217238 2019-01-20T00:06:21Z 2019-01-20T00:06:21Z MEMBER

I think this function mostly makes sense, but we would want to drop the stuff for scalar variables.

Unfortunately I don't know a good way to fix this stuff short of auditing a lot of code manually -- this is a hazard of data model changes. On the plus side, I've also started to do some of this for the explicit index refactor.

One thing that might turn up a few bugs is to try adding such a variable to the create_test_data() helper function which gets used in lots of places.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: don't create indexes on multidimensional dimensions 357808970
455794328 https://github.com/pydata/xarray/pull/2405#issuecomment-455794328 https://api.github.com/repos/pydata/xarray/issues/2405 MDEyOklzc3VlQ29tbWVudDQ1NTc5NDMyOA== rabernat 1197350 2019-01-19T16:28:53Z 2019-01-19T16:28:53Z MEMBER

I had some spare time and started working on this again. I'm scared of how much internal refactoring it requires. For example, this function

https://github.com/pydata/xarray/blob/385b36cdd34431b4f6f14aad1f222f989e7e2de2/xarray/core/dataset.py#L92-L113

just doesn't make sense any more if dimensions are not guaranteed to be one-dimensional.

I've never touched this part of the code base before. I have no idea how many places there are that make such an assumption.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: don't create indexes on multidimensional dimensions 357808970
455790072 https://github.com/pydata/xarray/pull/2405#issuecomment-455790072 https://api.github.com/repos/pydata/xarray/issues/2405 MDEyOklzc3VlQ29tbWVudDQ1NTc5MDA3Mg== pep8speaks 24736507 2019-01-19T15:34:24Z 2019-01-19T15:34:24Z NONE

Hello @rabernat! Thanks for updating the PR.

  • In the file xarray/core/formatting.py, following are the PEP8 issues :

Line 275:45: E225 missing whitespace around operator

  • In the file xarray/core/indexes.py, following are the PEP8 issues :

Line 55:77: E225 missing whitespace around operator Line 55:80: E501 line too long (80 > 79 characters)

  • In the file xarray/tests/test_dataset.py, following are the PEP8 issues :

Line 68:1: E302 expected 2 blank lines, found 1 Line 251:9: E265 block comment should start with '# '

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: don't create indexes on multidimensional dimensions 357808970
420014715 https://github.com/pydata/xarray/pull/2405#issuecomment-420014715 https://api.github.com/repos/pydata/xarray/issues/2405 MDEyOklzc3VlQ29tbWVudDQyMDAxNDcxNQ== rabernat 1197350 2018-09-10T18:31:43Z 2018-09-10T18:31:43Z MEMBER

With 68f170c there are basically no failing tests in the original test suite. I will now work on adding more tests along the lines outlined by @shoyer.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: don't create indexes on multidimensional dimensions 357808970
419969570 https://github.com/pydata/xarray/pull/2405#issuecomment-419969570 https://api.github.com/repos/pydata/xarray/issues/2405 MDEyOklzc3VlQ29tbWVudDQxOTk2OTU3MA== shoyer 1217238 2018-09-10T16:10:27Z 2018-09-10T16:49:51Z MEMBER

It would be good to add a tests for these variables in: - The data_vars argument to the Dataset constructor (these objects should become data_vars, not coords) - The DataArray constructor. - Explicitly adding these variables: - With Dataset.__setitem__, Dataset.coords.__setitem__ and DataArray.coords.__setitem__ - When there are no existing variables matching the dimension name, and when a 1D index variable matching the dimension name already exists (the existing index should be deleted/cleared) - Converting these objects to pandas with to_dataframe, to_series and to_pandas - Concatenating along an existing dimension whose name matches existing variables (e.g., concatenate along x when a variable x with dimensions ('x', 'y') exists) - Concatenating along a new dimension whose name matches existing variables (e.g., concatenate along x when a variable x with dimensions ('y',) exists) - More generally, test variables where the name matches a dimension but the variable does not include that dimension at all, e.g., a variable 'x' with dimensions ('y',) in a Dataset where x is a dimension.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: don't create indexes on multidimensional dimensions 357808970
419960606 https://github.com/pydata/xarray/pull/2405#issuecomment-419960606 https://api.github.com/repos/pydata/xarray/issues/2405 MDEyOklzc3VlQ29tbWVudDQxOTk2MDYwNg== shoyer 1217238 2018-09-10T15:44:45Z 2018-09-10T15:44:45Z MEMBER

I turned up a few lines that probably need to be fixed, just by grepping for in\ \w+\.dims: https://github.com/pydata/xarray/blob/66a8f8dd7f5a2997ff614f3966d1951587915e7e/xarray/backends/api.py#L125-L126 https://github.com/pydata/xarray/blob/66a8f8dd7f5a2997ff614f3966d1951587915e7e/xarray/core/coordinates.py#L111-L116 https://github.com/pydata/xarray/blob/66a8f8dd7f5a2997ff614f3966d1951587915e7e/xarray/core/dataarray.py#L502

It looks like this change broke align somehow -- possibly we have a bug where some indexes were not being created as IndexVariable objects?

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: don't create indexes on multidimensional dimensions 357808970
419295930 https://github.com/pydata/xarray/pull/2405#issuecomment-419295930 https://api.github.com/repos/pydata/xarray/issues/2405 MDEyOklzc3VlQ29tbWVudDQxOTI5NTkzMA== rabernat 1197350 2018-09-07T01:56:44Z 2018-09-07T01:56:44Z MEMBER

Seeking suggestions about what else needs to be tested here.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  WIP: don't create indexes on multidimensional dimensions 357808970

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE [issue_comments] (
   [html_url] TEXT,
   [issue_url] TEXT,
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [created_at] TEXT,
   [updated_at] TEXT,
   [author_association] TEXT,
   [body] TEXT,
   [reactions] TEXT,
   [performed_via_github_app] TEXT,
   [issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
    ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
    ON [issue_comments] ([user]);
Powered by Datasette · Queries took 15.703ms · About: xarray-datasette
  • Sort ascending
  • Sort descending
  • Facet by this
  • Hide this column
  • Show all columns
  • Show not-blank rows