issue_comments


21 rows where issue = 524043729 sorted by updated_at descending


user 7

  • kmuehlbauer 8
  • scottcha 5
  • shoyer 3
  • dcherian 2
  • mullenkamp 1
  • keewis 1
  • lukelbd 1

author_association 3

  • MEMBER 14
  • CONTRIBUTOR 5
  • NONE 2

issue 1

  • Add defaults during concat 508 · 21
id html_url issue_url node_id user created_at updated_at ▲ author_association body reactions performed_via_github_app issue
1362936825 https://github.com/pydata/xarray/pull/3545#issuecomment-1362936825 https://api.github.com/repos/pydata/xarray/issues/3545 IC_kwDOAMm_X85RPMP5 kmuehlbauer 5821660 2022-12-22T14:56:44Z 2022-12-22T14:56:44Z MEMBER

@scottcha @keewis I've tried hard, but finally decided to start from scratch, see #7400.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
1306187168 https://github.com/pydata/xarray/pull/3545#issuecomment-1306187168 https://api.github.com/repos/pydata/xarray/issues/3545 IC_kwDOAMm_X85N2tWg keewis 14808389 2022-11-07T21:02:35Z 2022-11-09T14:13:24Z MEMBER

> I can try to rebase the PR onto latest main

I did try that a few months ago, but a lot has changed since the PR was opened so it might actually be easier to reimplement the PR?

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
1306229983 https://github.com/pydata/xarray/pull/3545#issuecomment-1306229983 https://api.github.com/repos/pydata/xarray/issues/3545 IC_kwDOAMm_X85N23zf kmuehlbauer 5821660 2022-11-07T21:37:11Z 2022-11-07T21:37:11Z MEMBER

Thanks @keewis for the heads up. I'll have a look and if things get too complicated a reimplementation might be our best option.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
1306157550 https://github.com/pydata/xarray/pull/3545#issuecomment-1306157550 https://api.github.com/repos/pydata/xarray/issues/3545 IC_kwDOAMm_X85N2mHu kmuehlbauer 5821660 2022-11-07T20:38:21Z 2022-11-07T20:38:21Z MEMBER

Great @scottcha, I was coming back here too every once in a while, just to refresh my mind with the ideas pursued here. I can try to rebase the PR onto latest main, if I can free up some cycles in the next few days for starters.

{
    "total_count": 1,
    "+1": 1,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
1306152559 https://github.com/pydata/xarray/pull/3545#issuecomment-1306152559 https://api.github.com/repos/pydata/xarray/issues/3545 IC_kwDOAMm_X85N2k5v scottcha 775186 2022-11-07T20:33:02Z 2022-11-07T20:33:02Z CONTRIBUTOR

I'm still around and yes, I do still need this functionality (I still sync back to this PR when I have data with missing vars). The issue was that the technical requirements got beyond what I was able to account for with the time I had available. If you or someone else is interested in picking it up, I'd be happy to evaluate it against my use cases.

{
    "total_count": 1,
    "+1": 1,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
1306147080 https://github.com/pydata/xarray/pull/3545#issuecomment-1306147080 https://api.github.com/repos/pydata/xarray/issues/3545 IC_kwDOAMm_X85N2jkI kmuehlbauer 5821660 2022-11-07T20:26:57Z 2022-11-07T20:26:57Z MEMBER

@scottcha Are you still around and interested in bringing this along? If not, I could try to dive into this again.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
1165200959 https://github.com/pydata/xarray/pull/3545#issuecomment-1165200959 https://api.github.com/repos/pydata/xarray/issues/3545 IC_kwDOAMm_X85Fc44_ lukelbd 19657652 2022-06-24T05:08:38Z 2022-06-24T05:08:38Z NONE

Cool PR - looks like it's stale? Maybe someone should copy the work over to a new one? I've been coming across this issue a lot in my work recently.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
871210504 https://github.com/pydata/xarray/pull/3545#issuecomment-871210504 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDg3MTIxMDUwNA== mullenkamp 2656596 2021-06-30T08:41:29Z 2021-06-30T08:41:29Z NONE

Has this been implemented? Or is it still failing the tests?

{
    "total_count": 1,
    "+1": 1,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
586629198 https://github.com/pydata/xarray/pull/3545#issuecomment-586629198 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU4NjYyOTE5OA== scottcha 775186 2020-02-15T18:38:25Z 2020-02-15T18:38:25Z CONTRIBUTOR

I just pushed an incomplete set of changes, as @kmuehlbauer's tests demonstrated there were some cases the PR still isn't handling.
Here is a summary:
  1. I've simplified the logic based on @dcherian's comments, but in order to keep the result deterministic I needed to use list logic instead of set logic. I also kept the OrderedDict instead of going with the default dict, as the built-in ordering of dicts as of py 3.6 was still insufficient for keeping the ordering consistent (I needed to pop FIFO), which doesn't seem possible until py 3.8.
  2. I did add a failing test to capture the cases @kmuehlbauer pointed out.

I'm not sure I have my head wrapped around xarray enough to address @dcherian's latest comments, though, which is why I'm sharing the code at this point. All tests are passing except the new cases which were pointed out.

I'll try to find time to continue updating this, but wanted to at least provide a status update, as it's been a while.
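The FIFO-pop point above can be illustrated with a small sketch (illustrative only, not the PR's code): `OrderedDict.popitem(last=False)` pops the oldest entry, while a plain dict's `popitem()` is LIFO-only.

```python
from collections import OrderedDict

# OrderedDict can pop from the front (FIFO), which plain dicts
# do not support via popitem().
d = OrderedDict([("d00", 0), ("d01", 1), ("d02", 2)])
first = d.popitem(last=False)   # pops the oldest entry: ("d00", 0)
last = d.popitem()              # pops the newest entry: ("d02", 2)
```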

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
586346637 https://github.com/pydata/xarray/pull/3545#issuecomment-586346637 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU4NjM0NjYzNw== dcherian 2448579 2020-02-14T15:52:21Z 2020-02-14T15:52:33Z MEMBER

I am now wondering if we can use align or reindex to do the filling for us.

Example: the goal is to concat along 'x', with the result dataset having x=[1,2,3,4].

  1. Loop through datasets and assign coordinate values as appropriate.
  2. Break datasets up into mappings: collected = {"variable": [var1_at_x=1, var2_at_x=2, var4_at_x=4]} -> there's some stuff in merge.py that could be reused for this
  3. Concatenate these lists to get a new mapping: concatenated = {"variable": [var_at_x=[1,2,4]]}
  4. Apply reindex: reindexed = {concatenated[var].reindex(x=[1,2,3,4], fill_value=...) for var in concatenated}
  5. Create the dataset: Dataset(reindexed)

Step 1 would be where we deal with all the edge cases mentioned in @shoyer's comment viz

For example:

  • Pre-existing vs non-pre-existing dimension
  • Pre-existing dimensions of different sizes
  • Missing data variables vs coordinates vs indexed coordinates
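The reindex-based filling idea can be sketched roughly as follows (a minimal illustration, not the PR's actual code; the data values and variable name are made up):

```python
import numpy as np
import xarray as xr

# Per-dataset pieces of "variable" live at x=1, x=2 and x=4; reindexing
# onto the full axis fills the missing x=3 slot with the fill value.
pieces = [
    xr.Dataset({"variable": ("x", [10.0])}, coords={"x": [1]}),
    xr.Dataset({"variable": ("x", [20.0])}, coords={"x": [2]}),
    xr.Dataset({"variable": ("x", [40.0])}, coords={"x": [4]}),
]

concatenated = xr.concat(pieces, dim="x")                # x = [1, 2, 4]
reindexed = concatenated.reindex(x=[1, 2, 3, 4], fill_value=np.nan)
```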

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
579203497 https://github.com/pydata/xarray/pull/3545#issuecomment-579203497 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU3OTIwMzQ5Nw== kmuehlbauer 5821660 2020-01-28T11:33:41Z 2020-01-28T11:52:02Z MEMBER

@scottcha @shoyer below is a minimal example where one variable is missing in each file.

```python
import glob
import random

import netCDF4 as nc
import numpy as np
import xarray as xr

random.seed(123)
random.randint(0, 10)

# create var names list with one missing value
orig = [f'd{i:02}' for i in range(10)]
datasets = []
for i in range(1, 9):
    l1 = orig.copy()
    l1.remove(f'd{i:02}')
    datasets.append(l1)

# create files
for i, dsl in enumerate(datasets):
    foo_data = np.arange(24).reshape(2, 3, 4)
    with nc.Dataset(f'test{i:02}.nc', 'w') as ds:
        ds.createDimension('x', size=2)
        ds.createDimension('y', size=3)
        ds.createDimension('z', size=4)
        for k in dsl:
            ds.createVariable(k, int, ('x', 'y', 'z'))
            ds.variables[k][:] = foo_data

flist = glob.glob('test*.nc')
dslist = []
for f in flist:
    dslist.append(xr.open_dataset(f))

ds2 = xr.concat(dslist, dim='time')
ds2
```
Output:

```
<xarray.Dataset>
Dimensions:  (time: 8, x: 2, y: 3, z: 4)
Dimensions without coordinates: time, x, y, z
Data variables:
    d01      (x, y, z) int64 0 1 2 3 4 5 6 7 8 9 ... 15 16 17 18 19 20 21 22 23
    d00      (time, x, y, z) int64 0 1 2 3 4 5 6 7 8 ... 16 17 18 19 20 21 22 23
    d02      (time, x, y, z) float64 0.0 1.0 2.0 3.0 4.0 ... 20.0 21.0 22.0 23.0
    d03      (time, x, y, z) float64 0.0 1.0 2.0 3.0 4.0 ... 20.0 21.0 22.0 23.0
    d04      (time, x, y, z) float64 0.0 1.0 2.0 3.0 4.0 ... 20.0 21.0 22.0 23.0
    d05      (time, x, y, z) float64 0.0 1.0 2.0 3.0 4.0 ... 20.0 21.0 22.0 23.0
    d06      (time, x, y, z) float64 0.0 1.0 2.0 3.0 4.0 ... 20.0 21.0 22.0 23.0
    d07      (time, x, y, z) float64 0.0 1.0 2.0 3.0 4.0 ... 20.0 21.0 22.0 23.0
    d08      (time, x, y, z) float64 0.0 1.0 2.0 3.0 4.0 ... nan nan nan nan nan
    d09      (time, x, y, z) int64 0 1 2 3 4 5 6 7 8 ... 16 17 18 19 20 21 22 23
```

Three cases here:

  • d00 and d09 are available in all datasets, and they are concatenated correctly (keeping dtype)
  • d02 to d08 are missing in one dataset and are filled with the created dummy variable, but the dtype is converted to float64
  • d01 is not handled properly because it is missing in the first dataset; this is due to checking only the variables of the first dataset in _calc_concat_over

```python
elif opt == "all":
    concat_over.update(
        set(getattr(datasets[0], subset)) - set(datasets[0].dims)
    )
```
and from putting d01 in result_vars before iterating to find missing variables.
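A fix along those lines would take the union of variable names across all datasets instead of only the first. A minimal, hypothetical sketch (the `ordered_union` helper is illustrative, not the PR's code):

```python
def ordered_union(varname_lists):
    # Union of variable names across all datasets, preserving
    # first-seen order so the result stays deterministic.
    seen = []
    for names in varname_lists:
        for name in names:
            if name not in seen:
                seen.append(name)
    return seen

# d01 is missing from the first dataset but still ends up in the union:
ordered_union([["d00", "d02"], ["d00", "d01", "d02"]])
```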

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
577778006 https://github.com/pydata/xarray/pull/3545#issuecomment-577778006 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU3Nzc3ODAwNg== shoyer 1217238 2020-01-23T17:10:48Z 2020-01-23T17:10:48Z MEMBER

Can you explain why you think you need the nested iteration over dataset variables? What ordering are you trying to achieve?

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
577773353 https://github.com/pydata/xarray/pull/3545#issuecomment-577773353 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU3Nzc3MzM1Mw== scottcha 775186 2020-01-23T17:00:06Z 2020-01-23T17:01:07Z CONTRIBUTOR

@kmuehlbauer @dcherian @shoyer If it would be easier, I could abandon this PR and resubmit a new one, as the code has drastically changed since the original comments were provided. Essentially I'm waiting for feedback or approval of this PR.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
577745924 https://github.com/pydata/xarray/pull/3545#issuecomment-577745924 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU3Nzc0NTkyNA== kmuehlbauer 5821660 2020-01-23T15:59:47Z 2020-01-23T15:59:47Z MEMBER

@dcherian Just to clarify, the concatenation is done along a new dimension (which has to be created by expand_dims). What do you mean by short-circuit in this context?

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
577737475 https://github.com/pydata/xarray/pull/3545#issuecomment-577737475 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU3NzczNzQ3NQ== dcherian 2448579 2020-01-23T15:41:33Z 2020-01-23T15:41:33Z MEMBER

> the most time consuming part is the expand_dims for every dataset, which accounts for roughly 80% overall concat runtime.

Hmmm... maybe we need a short-circuit version of expand_dims?
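One hedged reading of "short-circuit" (my sketch, not an actual xarray API; `maybe_expand_dims` is a made-up helper name): skip the work entirely when the dimension is already present.

```python
import xarray as xr

def maybe_expand_dims(ds, dim):
    # Hypothetical short-circuit: return the dataset unchanged when the
    # dimension already exists, and only pay for expand_dims otherwise.
    if dim in ds.dims:
        return ds
    return ds.expand_dims(dim)

ds = xr.Dataset({"d00": (("x",), [1, 2, 3])})
expanded = maybe_expand_dims(ds, "time")         # adds "time" with size 1
unchanged = maybe_expand_dims(expanded, "time")  # no-op, same object back
```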

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
577696741 https://github.com/pydata/xarray/pull/3545#issuecomment-577696741 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU3NzY5Njc0MQ== kmuehlbauer 5821660 2020-01-23T14:08:57Z 2020-01-23T14:08:57Z MEMBER

@scottcha @shoyer For one of my use cases (240 datasets, 1 with missing variables) I do not see any performance penalties using this implementation compared to the current one. But this might be due to the fact that the most time consuming part is the expand_dims for every dataset, which accounts for roughly 80% of overall concat runtime.

If I can be of any help to push this over the line, please ping me.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
577530863 https://github.com/pydata/xarray/pull/3545#issuecomment-577530863 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU3NzUzMDg2Mw== kmuehlbauer 5821660 2020-01-23T06:52:06Z 2020-01-23T07:29:37Z MEMBER

@scottcha I found this while searching. I have the same requirement, i.e. missing DataArrays in some Datasets of a timeseries to be concatenated. I already have some hacks and workarounds in place for my specific use cases, but it would be really great if this could be handled by xarray.

I'll try to test your current implementation against my source data and will report my findings here.

Update: I've rebased locally on latest master and this works smoothly with my data (which uses packed data). I'll now look into performance.

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
569845354 https://github.com/pydata/xarray/pull/3545#issuecomment-569845354 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU2OTg0NTM1NA== shoyer 1217238 2019-12-31T01:40:02Z 2019-12-31T01:40:02Z MEMBER

I'll take a look at this more carefully soon. But I do think it is a hard requirement that concat runs in linear time (with respect to the total number of variables across all datasets).


{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
569842806 https://github.com/pydata/xarray/pull/3545#issuecomment-569842806 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU2OTg0MjgwNg== scottcha 775186 2019-12-31T01:18:35Z 2019-12-31T01:18:35Z CONTRIBUTOR

Hi, I've provided a new update to this PR (sorry it took me awhile both to get more familiar with the code and find the time to update the PR). I improved the logic to be a bit more performant and handle more edge cases as well as updated the test suite. I have a few questions:

  1. The tests I wrote are a bit more verbose than the tests previously. I can tighten them down but I found it was easier for me to read the logic in this form. Please let me know what you prefer.
  2. I'm still not quite sure I've captured all the scenarios as I'm a pretty basic xarray user so please let me know if there is still something I'm missing.
{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
555791968 https://github.com/pydata/xarray/pull/3545#issuecomment-555791968 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU1NTc5MTk2OA== scottcha 775186 2019-11-20T01:22:51Z 2019-11-20T01:22:51Z CONTRIBUTOR

Ok, I'll work on extending the updates with the feedback and additional tests. Thanks!

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729
555299451 https://github.com/pydata/xarray/pull/3545#issuecomment-555299451 https://api.github.com/repos/pydata/xarray/issues/3545 MDEyOklzc3VlQ29tbWVudDU1NTI5OTQ1MQ== shoyer 1217238 2019-11-19T02:12:45Z 2019-11-19T02:12:45Z MEMBER

Thanks for working on this important issue!

There are a lot of edge cases that can come up in concat, so I think it would be very helpful to try to enumerate a broader set of unit tests for thoroughly testing this. For example:

  • Pre-existing vs non-pre-existing dimension
  • Pre-existing dimensions of different sizes
  • Missing data variables vs coordinates vs indexed coordinates
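Inputs for those three edge cases might be constructed along these lines (an illustrative sketch only; variable names and values are made up, and no claim is made about what concat should return for each):

```python
import xarray as xr

# 1. pre-existing vs non-pre-existing dimension:
#    "t" exists in one dataset but would have to be created in the other.
has_dim = xr.Dataset({"a": ("t", [1.0, 2.0])})
no_dim = xr.Dataset({"a": ((), 3.0)})  # scalar variable, no "t"

# 2. pre-existing dimensions of different sizes along "x"
short = xr.Dataset({"a": ("x", [1.0])})
long = xr.Dataset({"a": ("x", [1.0, 2.0])})

# 3. missing data variables vs coordinates vs indexed coordinates
missing_var = xr.Dataset({"b": ("x", [1.0, 2.0])})           # no "a" at all
missing_coord = xr.Dataset({"a": ("x", [1.0, 2.0])})         # no "x" coord
with_coord = xr.Dataset({"a": ("x", [1.0, 2.0])}, coords={"x": [0, 1]})
```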

{
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
  Add defaults during concat 508 524043729

CREATE TABLE [issue_comments] (
   [html_url] TEXT,
   [issue_url] TEXT,
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [created_at] TEXT,
   [updated_at] TEXT,
   [author_association] TEXT,
   [body] TEXT,
   [reactions] TEXT,
   [performed_via_github_app] TEXT,
   [issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
    ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
    ON [issue_comments] ([user]);
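The view at the top of the page ("21 rows where issue = 524043729 sorted by updated_at descending") corresponds roughly to a query against this schema. A self-contained sketch via Python's sqlite3, with the foreign-key REFERENCES clauses dropped and one illustrative row inserted:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Schema as above, minus REFERENCES so the sketch stands alone
# without the users/issues tables.
conn.executescript("""
CREATE TABLE issue_comments (
   html_url TEXT, issue_url TEXT, id INTEGER PRIMARY KEY,
   node_id TEXT, user INTEGER, created_at TEXT, updated_at TEXT,
   author_association TEXT, body TEXT, reactions TEXT,
   performed_via_github_app TEXT, issue INTEGER
);
CREATE INDEX idx_issue_comments_issue ON issue_comments (issue);
""")
conn.execute(
    "INSERT INTO issue_comments (id, user, updated_at, issue) VALUES (?, ?, ?, ?)",
    (1362936825, 5821660, "2022-12-22T14:56:44Z", 524043729),
)
rows = conn.execute(
    "SELECT id FROM issue_comments WHERE issue = ? ORDER BY updated_at DESC",
    (524043729,),
).fetchall()
```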
Powered by Datasette · About: xarray-datasette