issue_comments

2 rows where author_association = "MEMBER", issue = 1047608434, and user = 226037, sorted by updated_at descending

id: 964318162
html_url: https://github.com/pydata/xarray/issues/5954#issuecomment-964318162
issue_url: https://api.github.com/repos/pydata/xarray/issues/5954
node_id: IC_kwDOAMm_X845elPS
user: alexamici (226037)
created_at: 2021-11-09T16:28:59Z
updated_at: 2021-11-09T16:28:59Z
author_association: MEMBER

> > […] but most backends serialise writes anyway, so the advantage is limited.
>
> I'm not sure I understand this comment, specifically what is meant by "serialise writes". I often use Xarray to do distributed writes to Zarr stores using 100+ distributed dask workers. It works great. We would need the same thing from a TileDB backend.

I should have added "except Zarr" 😅.

All netCDF writers use xr.backends.locks.get_write_lock to get a scheduler-appropriate write lock. The code is intricate and I can't point you to the exact spot, but as I recall the lock was used so that only one worker/process/thread could write to disk at a time.
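
As a rough sketch of that serialised-write pattern (get_write_lock is the real internal helper named above; the locked_write function and the raw-bytes file handling are hypothetical stand-ins for an actual file-format writer):

import xarray as xr
from xarray.backends.locks import get_write_lock

def locked_write(path, payload: bytes):
    # One scheduler-appropriate lock per target file: a plain threading
    # lock under the default scheduler, a multiprocessing or distributed
    # lock under those schedulers.
    lock = get_write_lock(path)
    # Only one worker/process/thread holds the lock at a time, so writes
    # are serialised rather than concurrent.
    with lock:
        with open(path, "ab") as f:
            f.write(payload)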

Concurrent writes à la Zarr are awesome, and xarray supports them now, so my point was: we can add non-concurrent write support to the plugin architecture quite easily, and that will serve a lot of users. But supporting Zarr and other advanced backends via the plugin architecture is a lot more work.
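
For reference, the concurrent path that already works uses only public xarray API (the example.zarr path is illustrative; this needs dask and zarr installed):

import numpy as np
import xarray as xr

# A dask-backed dataset: each 100-row chunk becomes an independent
# write task.
ds = xr.Dataset({"t": (("x", "y"), np.zeros((1000, 1000)))}).chunk({"x": 100})

# Chunks are written concurrently by whatever dask scheduler is
# active; no global write lock is involved.
ds.to_zarr("example.zarr", mode="w")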

reactions: none (total_count: 0)
issue: Writeable backends via entrypoints (1047608434)
id: 963895846
html_url: https://github.com/pydata/xarray/issues/5954#issuecomment-963895846
issue_url: https://api.github.com/repos/pydata/xarray/issues/5954
node_id: IC_kwDOAMm_X845c-Im
user: alexamici (226037)
created_at: 2021-11-09T07:53:00Z
updated_at: 2021-11-09T08:02:20Z
author_association: MEMBER

@rabernat and all, at the time of the read-only backend refactor @aurghs and I spent quite some time analysing write support and thinking of a unifying strategy. This is my interpretation of our findings:

  1. one of the big advantages of the unified xr.open_dataset API is that you don't need to specify the engine of the input data: you can rely on xarray guessing it. This is in general not true when you write your data, as you care about what format you are storing it in.

  2. another advantage of xr.open_dataset is that xarray manages all the functionality related to dask and to in-memory caching, so backends only need to know how to lazily read from storage. The current (rather complex) implementation has support for writing from dask and distributed workers, but most backends serialise writes anyway, so the advantage is limited. This is not to say that it isn't worth doing, but the cost/benefit ratio of supporting potentially distributed writes is much lower than for read support.

  3. that said, I'd really welcome a unified write API like ds.save(engine=...) or even xr.save_dataset(ds, engine=...), with an engine keyword argument and possibly other common options (see the sketch right after this list). Adding support for a single save_dataset entry point to the backend API is trivial, but adding full support for possibly distributed writes looks like much more work.
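
A minimal sketch of what such a dispatcher could look like, assuming nothing beyond today's public to_netcdf/to_zarr methods; the save_dataset function and its _WRITERS registry are hypothetical, not existing xarray API:

import xarray as xr

# Hypothetical registry mapping engine names to writer callables.
_WRITERS = {
    "netcdf4": lambda ds, path, **kw: ds.to_netcdf(path, engine="netcdf4", **kw),
    "zarr": lambda ds, path, **kw: ds.to_zarr(path, **kw),
}

def save_dataset(ds: xr.Dataset, path, *, engine, **kwargs):
    # Unlike open_dataset, the engine is never guessed: when writing,
    # the caller cares about the output format, so it must be explicit.
    try:
        writer = _WRITERS[engine]
    except KeyError:
        raise ValueError(f"unknown engine {engine!r}") from None
    return writer(ds, path, **kwargs)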

Also note that at the moment @aurghs and I are overloaded at work and would have very little time to spend on this :/

reactions: none (total_count: 0)
issue: Writeable backends via entrypoints (1047608434)


CREATE TABLE [issue_comments] (
   [html_url] TEXT,
   [issue_url] TEXT,
   [id] INTEGER PRIMARY KEY,
   [node_id] TEXT,
   [user] INTEGER REFERENCES [users]([id]),
   [created_at] TEXT,
   [updated_at] TEXT,
   [author_association] TEXT,
   [body] TEXT,
   [reactions] TEXT,
   [performed_via_github_app] TEXT,
   [issue] INTEGER REFERENCES [issues]([id])
);
CREATE INDEX [idx_issue_comments_issue]
    ON [issue_comments] ([issue]);
CREATE INDEX [idx_issue_comments_user]
    ON [issue_comments] ([user]);
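
The row selection described at the top of this page can be reproduced against a local SQLite copy of the database (the github.db filename is an assumption):

import sqlite3

conn = sqlite3.connect("github.db")

# MEMBER comments by user 226037 on issue 1047608434, newest
# update first.
rows = conn.execute(
    """
    SELECT id, created_at, updated_at, body
    FROM issue_comments
    WHERE author_association = 'MEMBER'
      AND issue = 1047608434
      AND [user] = 226037
    ORDER BY updated_at DESC
    """
).fetchall()
print(len(rows))  # 2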