issue_comments
15 rows where author_association = "MEMBER", issue = 387924616 and user = 6628425 sorted by updated_at descending
This data as json, CSV (advanced)
Suggested facets: updated_at (date)
issue 1
- CFTimeIndex Resampling · 15 ✖
id | html_url | issue_url | node_id | user | created_at | updated_at ▲ | author_association | body | reactions | performed_via_github_app | issue |
---|---|---|---|---|---|---|---|---|---|---|---|
459998025 | https://github.com/pydata/xarray/pull/2593#issuecomment-459998025 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1OTk5ODAyNQ== | spencerkclark 6628425 | 2019-02-02T20:49:30Z | 2019-02-02T20:49:30Z | MEMBER | Sounds good @shoyer, thanks for bringing this to the finish line. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
456513616 | https://github.com/pydata/xarray/pull/2593#issuecomment-456513616 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1NjUxMzYxNg== | spencerkclark 6628425 | 2019-01-22T18:39:37Z | 2019-01-22T18:39:37Z | MEMBER | One more optional thing -- support for the |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
455790039 | https://github.com/pydata/xarray/pull/2593#issuecomment-455790039 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1NTc5MDAzOQ== | spencerkclark 6628425 | 2019-01-19T15:33:43Z | 2019-01-19T15:38:08Z | MEMBER |
That's great news!
Before spending too much time on that just yet, see if you can resolve the merge conflicts, and if you can think about a way to reduce the length of the existing tests. It would be helpful to see a coverage report generated by coveralls for the new logic you've added (if you resolve the merge conflicts our CI here will run and we'll be able to see that automatically). Maybe start by commenting out a bunch of the really long tests and see where things stand? Then we can think about how to add coverage back in as needed. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
455662721 | https://github.com/pydata/xarray/pull/2593#issuecomment-455662721 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1NTY2MjcyMQ== | spencerkclark 6628425 | 2019-01-18T19:35:14Z | 2019-01-18T19:35:14Z | MEMBER | @jwenfai I created an issue in cftime regarding this question: Unidata/cftime#109. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
455619034 | https://github.com/pydata/xarray/pull/2593#issuecomment-455619034 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1NTYxOTAzNA== | spencerkclark 6628425 | 2019-01-18T17:09:26Z | 2019-01-18T18:23:19Z | MEMBER |
Ah indeed; this makes sense now. Maybe we should bring this up in cftime to see what their recommendation might be? I could imagine writing a function like this that would correct for this imprecision when taking the difference between two dates: ```python from datetime import timedelta def exact_cftime_datetime_difference(a, b): """Exact computation of b - a""" seconds = b.replace(microsecond=0) - a.replace(microsecond=0) seconds = int(round(seconds.total_seconds())) microseconds = b.microsecond - a.microsecond return timedelta(seconds=seconds, microseconds=microseconds) ``` Here are a couple test cases: ```python import cftime from datetime import datetime Testing with cftime version 1.0.0 where datetime where I canreproduce precision issues.test_cases = [ [(2000, 1, 1, 0, 4, 0, 956321), (1892, 1, 3, 12, 0, 0, 112123)], [(2000, 1, 1, 0, 4, 0, 1), (1892, 1, 3, 12, 0, 0, 503432)], [(2000, 1, 1, 0, 4, 0, 999999), (1892, 1, 3, 12, 0, 0, 112123)], [(2000, 1, 1, 0, 4, 0, 11213), (1892, 1, 3, 12, 0, 0, 77777)], ] for a_args, b_args in test_cases: a_cftime = cftime.DatetimeGregorian(a_args) b_cftime = cftime.DatetimeGregorian(b_args) a = datetime(a_args) b = datetime(b_args)
``` But maybe I'm missing something important. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
455540080 | https://github.com/pydata/xarray/pull/2593#issuecomment-455540080 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1NTU0MDA4MA== | spencerkclark 6628425 | 2019-01-18T13:07:59Z | 2019-01-18T13:07:59Z | MEMBER | @jwenfai to provide some more detail for my confusion here -- my impression is that adding or subtracting a Taking the difference between two dates to produce a timedelta object takes a different code path in cftime, which is not microsecond-precise. This, as we've seen, can induce some small errors in interpolation (because in the process we need to determine the amount of time between each date in the time coordinate and a reference date).
Could you provide more details about this example? What were the resample parameters used (e.g. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
455390725 | https://github.com/pydata/xarray/pull/2593#issuecomment-455390725 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1NTM5MDcyNQ== | spencerkclark 6628425 | 2019-01-18T01:09:59Z | 2019-01-18T01:34:59Z | MEMBER |
Exactly 👍
~~I think the "values falls before first bin" errors are all from pandas, where datetime arithmetic is exact, so they could not be due to cftime, right?~~ I'll take a look at the 6AS-JUN tests. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
455393260 | https://github.com/pydata/xarray/pull/2593#issuecomment-455393260 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1NTM5MzI2MA== | spencerkclark 6628425 | 2019-01-18T01:23:20Z | 2019-01-18T01:23:20Z | MEMBER |
Oops, my bad, I should have read more carefully! I'll think about this more. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
455391923 | https://github.com/pydata/xarray/pull/2593#issuecomment-455391923 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1NTM5MTkyMw== | spencerkclark 6628425 | 2019-01-18T01:16:21Z | 2019-01-18T01:17:10Z | MEMBER |
Yeah this has for sure been helpful for development -- I did not expect to encounter the "values falls before first bin" error, but clearly it seems we do need to worry about it. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
450740815 | https://github.com/pydata/xarray/pull/2593#issuecomment-450740815 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1MDc0MDgxNQ== | spencerkclark 6628425 | 2019-01-01T16:18:40Z | 2019-01-01T16:18:40Z | MEMBER |
No worries! I just wanted to get these thoughts down when I had some time. No rush to make any updates. |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
450533035 | https://github.com/pydata/xarray/pull/2593#issuecomment-450533035 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ1MDUzMzAzNQ== | spencerkclark 6628425 | 2018-12-30T01:31:12Z | 2018-12-30T01:31:12Z | MEMBER | So the crux of the problem now seems to be in generating For data indexed by a CFTimeIndex, we do not have the luxury of a formal I've put together a gist which compares the This inspired the alternative solution proposed in the second part of the gist (I've also added back in a call to a cftime version of the Let me know if this alternative solution makes sense. Digging in to the guts of the resample code in pandas/xarray is still fairly new for me too, so I could be missing something. In the gist I'm using this branch of xarray, the development version of pandas, and the latest version of cftime. Thanks again for your hard work on this! |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
449635856 | https://github.com/pydata/xarray/pull/2593#issuecomment-449635856 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ0OTYzNTg1Ng== | spencerkclark 6628425 | 2018-12-23T13:16:41Z | 2018-12-23T13:16:41Z | MEMBER |
I think it is OK to continue working here, with the aim of implementing resample for cftime that is closest to the most up-to-date pandas version. We can add code to skip tests that depend on this behavior if the version of pandas is not recent enough (see an example of doing this here). As we're working before version 0.24 of pandas is released we can keep an eye on our "py36-pandas-dev" CI build listed under the "Allowed Failures" section in Travis. |
{ "total_count": 1, "+1": 1, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
449534277 | https://github.com/pydata/xarray/pull/2593#issuecomment-449534277 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ0OTUzNDI3Nw== | spencerkclark 6628425 | 2018-12-22T01:19:42Z | 2018-12-22T01:19:42Z | MEMBER | So I think the way you have things written now in |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
449533135 | https://github.com/pydata/xarray/pull/2593#issuecomment-449533135 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ0OTUzMzEzNQ== | spencerkclark 6628425 | 2018-12-22T01:04:32Z | 2018-12-22T01:04:32Z | MEMBER | Yes, I noticed this too. I think it's related to changes made here: https://github.com/pandas-dev/pandas/pull/24347. At least in the test cases that I've run, I've only seen it make a difference in the NaN placement at the end of the time series. For example with pandas 0.23.4: ``` In [1]: import xarray as xr; import pandas as pd In [2]: nptimes = pd.date_range('2000', periods=2000) In [3]: nptime_da = xr.DataArray(range(2000), [('time', nptimes)]) In [4]: nptime_da.resample(time='4AS').mean('time')
Out[4]:
<xarray.DataArray (time: 3)>
array([ 730., 1730., nan])
Coordinates:
* time (time) datetime64[ns] 2000-01-01 2004-01-01 2008-01-01
|
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 | |
448443826 | https://github.com/pydata/xarray/pull/2593#issuecomment-448443826 | https://api.github.com/repos/pydata/xarray/issues/2593 | MDEyOklzc3VlQ29tbWVudDQ0ODQ0MzgyNg== | spencerkclark 6628425 | 2018-12-19T02:11:03Z | 2018-12-19T02:11:03Z | MEMBER | @jwenfai thanks for the updates. It looks like there are some merge conflicts that are preventing our CI from running. Could you please resolve those when you get chance, so we can see those results? |
{ "total_count": 0, "+1": 0, "-1": 0, "laugh": 0, "hooray": 0, "confused": 0, "heart": 0, "rocket": 0, "eyes": 0 } |
CFTimeIndex Resampling 387924616 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE [issue_comments] ( [html_url] TEXT, [issue_url] TEXT, [id] INTEGER PRIMARY KEY, [node_id] TEXT, [user] INTEGER REFERENCES [users]([id]), [created_at] TEXT, [updated_at] TEXT, [author_association] TEXT, [body] TEXT, [reactions] TEXT, [performed_via_github_app] TEXT, [issue] INTEGER REFERENCES [issues]([id]) ); CREATE INDEX [idx_issue_comments_issue] ON [issue_comments] ([issue]); CREATE INDEX [idx_issue_comments_user] ON [issue_comments] ([user]);
user 1