issue_comments: 497629988


html_url: https://github.com/pydata/xarray/issues/2281#issuecomment-497629988
issue_url: https://api.github.com/repos/pydata/xarray/issues/2281
id: 497629988
node_id: MDEyOklzc3VlQ29tbWVudDQ5NzYyOTk4OA==
user: 1217238
created_at: 2019-05-31T08:46:38Z
updated_at: 2019-05-31T08:46:38Z
author_association: MEMBER
body:

Yes, if we cache the Delaunay triangulation we could probably do the entire thing in about the time it currently takes to do one time step.
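A minimal sketch of that caching idea, using invented stand-in arrays rather than the actual cubes from this thread: scipy's LinearNDInterpolator accepts a precomputed Delaunay triangulation, so the expensive triangulation is built once and each time step pays only for the cheap linear evaluation.

```python
import numpy as np
from scipy.spatial import Delaunay
from scipy.interpolate import LinearNDInterpolator

# Invented stand-ins for the source grid, target grid, and data cube.
nt, ny, nx = 5, 20, 30
src_x, src_y = np.meshgrid(np.linspace(0, 1, nx), np.linspace(0, 1, ny))
dst_x, dst_y = np.meshgrid(np.linspace(0, 1, 2 * nx),
                           np.linspace(0, 1, 2 * ny))
cube = np.random.rand(nt, ny, nx)

# The expensive part, computed once and reused for every time step.
points = np.column_stack([src_x.ravel(), src_y.ravel()])
tri = Delaunay(points)

out = np.empty((nt,) + dst_x.shape)
for k in range(nt):
    # Only the cheap linear evaluation happens inside the loop.
    interp = LinearNDInterpolator(tri, cube[k].ravel(), fill_value=np.nan)
    out[k] = interp(dst_x, dst_y)
```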

On Thu, May 30, 2019 at 10:50 AM Fernando Paolo notifications@github.com wrote:

@shoyer and @crusaderky That's right, that is how I was actually dealing with this problem prior to trying xarray: by flattening the grid coordinates and performing either gridding (with scipy's griddata) or interpolation (with scipy's map_coordinates), instead of performing proper regridding (from cube to cube, without having to flatten anything).
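For context, a hedged sketch of the map_coordinates route mentioned above (everything here is invented for illustration); it applies when the target locations can be expressed as fractional indices into a regular source grid:

```python
import numpy as np
from scipy.ndimage import map_coordinates

# Invented example: a field on a regular 50x60 source grid.
field = np.random.rand(50, 60)

# Target locations as fractional (row, col) indices into that grid.
rows = np.linspace(0, 49, 100)
cols = np.linspace(0, 59, 120)
rr, cc = np.meshgrid(rows, cols, indexing='ij')

# order=1 gives bilinear interpolation; points falling outside the
# grid receive NaN via mode='constant' and cval.
resampled = map_coordinates(field, [rr, cc], order=1,
                            mode='constant', cval=np.nan)
```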

As a rule of thumb, any fancy algorithm should first exist for numpy-only data; then it can potentially be wrapped by the xarray library.

This is important information.
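As a hedged illustration of that rule of thumb (names and data are invented): the core routine below only knows about plain numpy arrays, and xarray.apply_ufunc lifts it onto labeled data afterwards.

```python
import numpy as np
import xarray as xr

def demean(arr, axis=-1):
    """numpy-only core: remove the mean along one axis."""
    return arr - arr.mean(axis=axis, keepdims=True)

da = xr.DataArray(np.random.rand(4, 5), dims=('t', 'x'))

# apply_ufunc moves the core dim 'x' to the last axis before calling
# the numpy function, so the wrapper stays label-aware.
result = xr.apply_ufunc(
    demean, da,
    input_core_dims=[['x']],
    output_core_dims=[['x']],
)
```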

For the record, here is what I have found to perform best so far:

```python
import numpy as np
import xarray as xr
from scipy.interpolate import griddata

# Here x/y are dummy 1D coords that won't be used.
da1 = xr.DataArray(cube1, [('t', t_cube1),
                           ('y', range(cube1.shape[1])),
                           ('x', range(cube1.shape[2]))])

# Regrid t_cube1 onto t_cube2 first, since time will always map
# 1 to 1 between cubes. This operation is very fast.
print('regridding in time ...')
cube1 = da1.interp(t=t_cube2).values

# Regrid each 2D field (X_cube1/Y_cube1 onto X_cube2/Y_cube2)
# one at a time. cube2 is assumed to store time in the last axis.
print('regridding in space ...')
cube3 = np.full_like(cube2, np.nan)
for k in range(t_cube2.shape[0]):  # cube1 now has t_cube2's time steps
    print('regridding:', k)
    cube3[:, :, k] = griddata((X_cube1.ravel(), Y_cube1.ravel()),
                              cube1[k, :, :].ravel(),
                              (X_cube2, Y_cube2),
                              fill_value=np.nan, method='linear')
```

Performance is not that bad: for ~150 time steps and ~1500 nodes in x and y, it takes about 10-15 minutes.

I think this can be sped up by computing the interpolation weights between grids in the first iteration and caching them (I think xESMF does this).
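A hedged sketch of that weight-caching idea, reusing the invented arrays from the triangulation sketch further up; this is the standard barycentric-weights recipe for linear interpolation on a triangulation, not necessarily what xESMF does internally.

```python
import numpy as np
from scipy.spatial import Delaunay

# points, dst_x/dst_y, cube, and nt are the invented stand-ins from
# the triangulation sketch above.
targets = np.column_stack([dst_x.ravel(), dst_y.ravel()])

# First iteration: locate each target and cache indices + weights.
tri = Delaunay(points)
simplex = tri.find_simplex(targets)          # enclosing triangle per target
vertices = tri.simplices[simplex]            # (m, 3) source-point indices
T = tri.transform[simplex]                   # per-simplex affine transforms
bary = np.einsum('mij,mj->mi', T[:, :2], targets - T[:, 2])
weights = np.column_stack([bary, 1 - bary.sum(axis=1)])

# Every later time step reduces to a gather plus a weighted sum.
for k in range(nt):
    vals = cube[k].ravel()[vertices]         # (m, 3) surrounding values
    out_k = (vals * weights).sum(axis=1)
    out_k[simplex < 0] = np.nan              # targets outside the hull
```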


reactions:
{
    "total_count": 1,
    "+1": 1,
    "-1": 0,
    "laugh": 0,
    "hooray": 0,
    "confused": 0,
    "heart": 0,
    "rocket": 0,
    "eyes": 0
}

issue: 340486433