commits: 03d409ec35034d78a3a625dcaf1744117587b93c

sha: 03d409ec35034d78a3a625dcaf1744117587b93c
author_date: 2020-07-02T20:39:00Z
committer_date: 2020-07-02T20:39:00Z
raw_author: f10b21bed2846b879806f87039b77245b18e7671
raw_committer: cd792325681cbad9f663f2879d8b69f1edbb678f
repo: 13221727
author: 1217238
committer: 19864447
message:

    Improve the speed of from_dataframe with a MultiIndex (by 40x!) (#4184)

    * Add MultiIndexSeries.time_to_xarray() benchmark

    * Improve the speed of from_dataframe with a MultiIndex

    Fixes GH-2459

    Before:

        pandas.MultiIndexSeries.time_to_xarray
        ======= ========= ==========
        --             subset
        ------- --------------------
         dtype     True     False
        ======= ========= ==========
          int    505±0ms   37.1±0ms
         float   485±0ms   38.3±0ms
        ======= ========= ==========

    After:

        pandas.MultiIndexSeries.time_to_xarray
        ======= ========= ==========
        --             subset
        ------- --------------------
         dtype     True     False
        ======= ========= ==========
          int    11.5±0ms  39.2±0ms
         float   12.5±0ms  26.6±0ms
        ======= ========= ==========

    There are still some cases where we have to fall back to the existing
    slow implementation, but hopefully they should now be relatively rare.

    * remove unused import

    * Simplify converting MultiIndex dataframes

    * remove comments

    * remove types with NA

    * more multiindex dataframe tests

    * add whats new note

    * Preserve order of MultiIndex levels in from_dataframe

    * Add todo note

    * Rewrite from_dataframe to avoid passing around a dataframe

    * Require that MultiIndexes are unique even with sparse=True

    * clarify comment