issues: 62585113

id: 62585113
node_id: MDExOlB1bGxSZXF1ZXN0MzEzOTk0OTI=
number: 378
title: ENH: fillna method for Dataset, DataArray and GroupBy objects
user: 1217238
state: closed
locked: 0
assignee:
milestone: 1028398
comments: 0
created_at: 2015-03-18T04:16:29Z
updated_at: 2015-03-20T23:00:42Z
closed_at: 2015-03-20T23:00:41Z
author_association: MEMBER
active_lock_reason:
draft: 0
pull_request: pydata/xarray/pulls/378
body reactions performed_via_github_app state_reason repo type

This PR adds a new `fillna` method for Dataset, DataArray and GroupBy objects. For the most part, it follows the standard broadcasting and alignment rules for binary operations.
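To make "alignment" concrete, here is a minimal pure-Python sketch of filling by label rather than by position. It is only an illustration of the idea, not the code in this PR: the dict-based model and the name `fillna_by_label` are hypothetical, and keeping exactly the labels of the object being filled is an assumption of the sketch.

```python
import math

# Toy model (an assumption for illustration): a 1-D labelled array is a
# dict mapping each label to its value.
data = {"a": 1.0, "b": math.nan, "c": math.nan, "d": 4.0}
fill = {"c": 30.0, "b": 20.0, "z": 99.0}  # different order, extra label "z"


def fillna_by_label(obj, other):
    """Fill NaNs in obj from other, matching on labels, not position."""
    if not isinstance(other, dict):
        # A scalar is broadcast to every label of obj.
        other = {label: other for label in obj}
    return {
        # Keep existing values; for NaNs, look up the same label in other
        # and leave the NaN in place if other has no such label.
        label: other.get(label, value) if math.isnan(value) else value
        for label, value in obj.items()
    }


by_label = fillna_by_label(data, fill)
by_scalar = fillna_by_label(data, 0.0)
print(by_label)
print(by_scalar)
```

In this sketch the extra label `"z"` is ignored and the order of `fill` does not matter, which is the point of matching on labels.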

Example usage

Setup:

```
In [1]: import xray

In [2]: import pandas as pd

In [3]: import numpy as np

In [4]: array = xray.DataArray(np.arange(75.0), [('time', pd.date_range('2000-01-01', periods=75, freq='5D'))])

In [5]: array[::3] = np.nan

In [6]: array
Out[6]:
<xray.DataArray (time: 75)>
array([ nan,   1.,   2.,  nan,   4.,   5.,  nan,   7.,   8.,  nan,  10.,
        11.,  nan,  13.,  14.,  nan,  16.,  17.,  nan,  19.,  20.,  nan,
        22.,  23.,  nan,  25.,  26.,  nan,  28.,  29.,  nan,  31.,  32.,
        nan,  34.,  35.,  nan,  37.,  38.,  nan,  40.,  41.,  nan,  43.,
        44.,  nan,  46.,  47.,  nan,  49.,  50.,  nan,  52.,  53.,  nan,
        55.,  56.,  nan,  58.,  59.,  nan,  61.,  62.,  nan,  64.,  65.,
        nan,  67.,  68.,  nan,  70.,  71.,  nan,  73.,  74.])
Coordinates:
  * time     (time) datetime64[ns] 2000-01-01 2000-01-06 2000-01-11 2000-01-16 ...
```

Simple example:

```
In [7]: array.fillna(0)
Out[7]:
<xray.DataArray (time: 75)>
array([  0.,   1.,   2.,   0.,   4.,   5.,   0.,   7.,   8.,   0.,  10.,
        11.,   0.,  13.,  14.,   0.,  16.,  17.,   0.,  19.,  20.,   0.,
        22.,  23.,   0.,  25.,  26.,   0.,  28.,  29.,   0.,  31.,  32.,
         0.,  34.,  35.,   0.,  37.,  38.,   0.,  40.,  41.,   0.,  43.,
        44.,   0.,  46.,  47.,   0.,  49.,  50.,   0.,  52.,  53.,   0.,
        55.,  56.,   0.,  58.,  59.,   0.,  61.,  62.,   0.,  64.,  65.,
         0.,  67.,  68.,   0.,  70.,  71.,   0.,  73.,  74.])
Coordinates:
  * time     (time) datetime64[ns] 2000-01-01 2000-01-06 2000-01-11 2000-01-16 ...
```

Fill missing values with the average for that month:

```
In [8]: g = array.groupby('time.month')

In [9]: g.fillna(g.mean('time'))
Out[9]:
<xray.DataArray (time: 75)>
array([ 17.2,   1. ,   2. ,  17.2,   4. ,   5. ,  17.2,   7. ,   8. ,   9. ,  10. ,
        11. ,  15. ,  13. ,  14. ,  15. ,  16. ,  17. ,  15. ,  19. ,  20. ,  21. ,
        22. ,  23. ,  21. ,  25. ,  26. ,  27. ,  28. ,  29. ,  27. ,  31. ,  32. ,
        33. ,  34. ,  35. ,  33. ,  37. ,  38. ,  39. ,  40. ,  41. ,  39. ,  43. ,
        44. ,  45. ,  46. ,  47. ,  45. ,  49. ,  50. ,  51. ,  52. ,  53. ,  51. ,
        55. ,  56. ,  57. ,  58. ,  59. ,  57. ,  61. ,  62. ,  63. ,  64. ,  65. ,
        63. ,  67. ,  68. ,  69.8,  70. ,  71. ,  69.8,  73. ,  74. ])
Coordinates:
  * time     (time) datetime64[ns] 2000-01-01 2000-01-06 2000-01-11 2000-01-16 ...
    month    (time) int32 1 1 1 1 1 1 1 2 2 2 2 2 3 3 3 3 3 3 3 4 4 4 4 4 4 5 5 5 5 5 5 6 6 6 ...
```
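As a cross-check on the numbers above, the following standard-library sketch recomputes the per-month fill values by hand. It is not how the PR implements `fillna`; the helper `fill_with_group_mean` is a hypothetical name introduced here, and the sketch assumes every group has at least one non-missing value.

```python
import math
from collections import defaultdict
from datetime import date, timedelta

# Rebuild the example data: 75 values at 5-day steps from 2000-01-01,
# with every third value missing.
start = date(2000, 1, 1)
times = [start + timedelta(days=5 * i) for i in range(75)]
values = [math.nan if i % 3 == 0 else float(i) for i in range(75)]


def fill_with_group_mean(keys, data):
    """Replace NaNs with the mean of the non-missing values sharing a key."""
    groups = defaultdict(list)
    for key, value in zip(keys, data):
        if not math.isnan(value):
            groups[key].append(value)
    # Assumes each key has at least one non-missing value.
    means = {key: sum(vals) / len(vals) for key, vals in groups.items()}
    return [means[key] if math.isnan(value) else value
            for key, value in zip(keys, data)]


months = [t.month for t in times]
filled = fill_with_group_mean(months, values)
print(filled[0], filled[9], filled[69])  # prints 17.2 9.0 69.8
```

The January fill value is 17.2 rather than 3 because the last timestamp, 2001-01-05, also falls in month 1: grouping by `time.month` pools both Januaries, so the mean is taken over 1, 2, 4, 5 and 74.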

CC @nicolasfauchereau

{
    "url": "https://api.github.com/repos/pydata/xarray/issues/378/reactions",
    "total_count": 0,
    "+1": 0,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}
    13221727 pull
