html_url,issue_url,id,node_id,user,created_at,updated_at,author_association,body,reactions,performed_via_github_app,issue
https://github.com/pydata/xarray/issues/3268#issuecomment-539463105,https://api.github.com/repos/pydata/xarray/issues/3268,539463105,MDEyOklzc3VlQ29tbWVudDUzOTQ2MzEwNQ==,6213168,2019-10-08T11:04:06Z,2019-10-08T11:04:06Z,MEMBER,"Thing is, the whole thing is undefined.
What does the accessor state contain? As an xarray developer, I don't know.
Is it variable names? Is it references to objects that make up the Dataset, e.g. Variables or the attrs dict? Is it objects whose contents rely on the current state of the Dataset, e.g. aggregated measures? Is it objects whose contents rely on _historical events_ (like in your case)?
Dataset.copy() will create a copy of everything up to and excluding the numpy arrays. In order to allow you to retain accessor state, we'd need to plant a hook in it and invoke some agreed duck-type API in your object that basically states, ""I called copy(), and this is the new object I created, please create a copy of yourself accordingly making extra sure you don't retain references to components of the previous object"".
And then there are _all the other methods_ that currently nuke the accessor state - including many in-place ones - because they could potentially invalidate it. What should they do? Invoke a special API on the accessor? If not, why should copy() trigger a special accessor API while e.g. roll() shouldn't?
Planting accessor-refresher hooks in every single method that currently just wipes it away is out of the question, as it would need to be almost everywhere and - more importantly - it would be born broken.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,485708282
https://github.com/pydata/xarray/issues/3268#issuecomment-539240950,https://api.github.com/repos/pydata/xarray/issues/3268,539240950,MDEyOklzc3VlQ29tbWVudDUzOTI0MDk1MA==,6213168,2019-10-07T23:03:28Z,2019-10-07T23:04:25Z,MEMBER,"> Would that make any sense that the xr.DataSet.copy() method also return a copy of the accessors ?
It's been discussed above in this same thread. It's impossible without breaking the accessor API, as it would require you (the accessor developer) to define a copy method.
The higher-level point is that the statefulness of the accessor is OK to use for caching and performance improvements, and not OK for storing functional information like yours.
Have you considered storing a flag in ``Variable.attrs`` instead?
```python
def add(self, da):
    da.attrs[""cleanable""] = True
    self.obj[da.name] = da
    return self.obj

def clean(self):
    return self.obj.drop([
        k for k, v in self.obj.variables.items()
        if v.attrs.get(""cleanable"")
    ])
```
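A minimal sketch of why the flag approach survives a copy, without any accessor involved. It assumes a recent xarray where ``Dataset.drop_vars`` is available (the snippet above uses the older ``Dataset.drop``); the variable names ``a`` and ``b`` are made up for illustration.
```python
import xarray as xr

# Toy dataset with one regular variable
ds = xr.Dataset({'a': ('x', [1, 2, 3])})

# Add a second variable and flag it through its attrs
extra = xr.DataArray([10, 20, 30], dims='x', name='b')
extra.attrs['cleanable'] = True
ds['b'] = extra

# The flag lives on the variable, so it travels with the copy
ds2 = ds.copy()
flagged = [k for k, v in ds2.variables.items() if v.attrs.get('cleanable')]

# drop_vars returns a new Dataset and leaves ds2 untouched
cleaned = ds2.drop_vars(flagged)
print(flagged, list(cleaned.data_vars))
```
Since the information is stored in the data itself rather than in the accessor instance, it does not matter that ``ds2`` gets a fresh accessor.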
","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,485708282
https://github.com/pydata/xarray/issues/3268#issuecomment-538577126,https://api.github.com/repos/pydata/xarray/issues/3268,538577126,MDEyOklzc3VlQ29tbWVudDUzODU3NzEyNg==,6213168,2019-10-04T22:14:38Z,2019-10-04T22:16:03Z,MEMBER,"@gmaze ``Dataset.drop`` does not mutate the state of the original object, so it's conceptually wrong for your clean() method to mutate the accessor state too. It should be:
```python
def clean(self):
    return self.obj.drop(self.added)
```
The new dataset returned will have no accessor cache, and will recreate an instance on the fly on first access.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,485708282
https://github.com/pydata/xarray/issues/3268#issuecomment-525260148,https://api.github.com/repos/pydata/xarray/issues/3268,525260148,MDEyOklzc3VlQ29tbWVudDUyNTI2MDE0OA==,6213168,2019-08-27T11:30:22Z,2019-08-27T11:32:13Z,MEMBER,"The circular reference issue could also be worked around in a user-friendly way by having the decorator automatically add methods to the decorated class, copying the design of ``@dataclass``:
```python
import weakref


class C:
    def __init__(self, owner):
        self._owner = weakref.ref(owner)
        if hasattr(self, ""__post_init__""):
            self.__post_init__()

    @property
    def owner(self):
        out = self._owner()
        if out is None:
            raise AttributeError(""Orphaned accessor"")
        return out
```
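A rough, stdlib-only illustration of the effect of the pattern above. ``Owner`` is a hypothetical stand-in for a Dataset that caches a single accessor; it is not xarray code, and the immediate deallocation relies on CPython reference counting.
```python
import weakref


class Accessor:
    def __init__(self, owner):
        # Only a weak reference: the accessor does not keep the owner alive
        self._owner = weakref.ref(owner)

    @property
    def owner(self):
        out = self._owner()
        if out is None:
            raise AttributeError('Orphaned accessor')
        return out


class Owner:
    '''Hypothetical stand-in for a Dataset with a one-slot accessor cache.'''

    def __init__(self):
        self._cache = None

    @property
    def acc(self):
        if self._cache is None:
            self._cache = Accessor(self)
        return self._cache


ds = Owner()
acc = ds.acc
same_owner = acc.owner is ds

w = weakref.ref(ds)
del ds
# No reference cycle, so no gc pass is needed (CPython)
freed_without_gc = w() is None

try:
    acc.owner
    orphan_error = False
except AttributeError:
    orphan_error = True

print(same_owner, freed_without_gc, orphan_error)
```
The owner holds a strong reference to the accessor, but the accessor only holds a weak one back, so dropping the last external reference to the owner releases it right away.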
This would also allow for shallow copies to change the pointer.","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,485708282
https://github.com/pydata/xarray/issues/3268#issuecomment-525257435,https://api.github.com/repos/pydata/xarray/issues/3268,525257435,MDEyOklzc3VlQ29tbWVudDUyNTI1NzQzNQ==,6213168,2019-08-27T11:20:50Z,2019-08-27T11:21:14Z,MEMBER,"``store the names of the coordinate variables we know are going to be useful to us later. ``
So you work on the assumption that no new potentially useful coords will be added after the first invocation of your accessor? Or do you have logic that invalidates your cache every time the state of the coords changes?","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,485708282
https://github.com/pydata/xarray/issues/3268#issuecomment-525256306,https://api.github.com/repos/pydata/xarray/issues/3268,525256306,MDEyOklzc3VlQ29tbWVudDUyNTI1NjMwNg==,6213168,2019-08-27T11:16:48Z,2019-08-27T11:16:48Z,MEMBER,@fmaussion could you change your accessor code to store its state in ``Dataset.attrs`` instead?,"{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,485708282
https://github.com/pydata/xarray/issues/3268#issuecomment-525240363,https://api.github.com/repos/pydata/xarray/issues/3268,525240363,MDEyOklzc3VlQ29tbWVudDUyNTI0MDM2Mw==,6213168,2019-08-27T10:24:38Z,2019-08-27T10:24:38Z,MEMBER,"Demonstration of the circular reference issue:
```python
import gc
import weakref
import xarray


class C:
    pass


@xarray.register_dataset_accessor('foo')
class Foo:
    def __init__(self, obj):
        self.obj = obj


ds = xarray.Dataset()
w = weakref.ref(ds)
print(""No accessor, in scope:"", w() is not None)
del ds
print(""No accessor, descoped:"", w() is not None)

ds = xarray.Dataset()
ds.foo
w = weakref.ref(ds)
print(""with accessor, in scope:"", w() is not None)
del ds
print(""with accessor, descoped:"", w() is not None)
gc.collect()
print(""with accessor, after gc pass:"", w() is not None)
```
Output:
```
No accessor, in scope: True
No accessor, descoped: False
with accessor, in scope: True
with accessor, descoped: True
with accessor, after gc pass: False
```","{""total_count"": 0, ""+1"": 0, ""-1"": 0, ""laugh"": 0, ""hooray"": 0, ""confused"": 0, ""heart"": 0, ""rocket"": 0, ""eyes"": 0}",,485708282