pull_requests: 1088467433
id | node_id | number | state | locked | title | user | body | created_at | updated_at | closed_at | merged_at | merge_commit_sha | assignee | milestone | draft | head | base | author_association | auto_merge | repo | url | merged_by |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1088467433 | PR_kwDOAMm_X85A4LHp | 7172 | closed | 0 | Lazy import dask.distributed to reduce import time of xarray | 90008 | I was auditing the import time of my software and found that distributed added a not insignificant amount of time to the import of xarray: Using `tuna`, one can find that the following are sources of delay in import time for xarray: To audit, one can use the command ``` python -X importtime -c "import numpy as np; import pandas as pd; import dask.array; import xarray as xr" 2>import.log && tuna import.log ``` As written, the command breaks out the import time of numpy, pandas, and dask.array to allow you to focus on "other" costs within xarray. Main branch:  Proposed:  One would be tempted to think that this is due to xarray.testing and xarray.tutorial but those just move the imports one level down in tuna graphs.  - [x] ~~Closes~~ - [x] ~~Tests added~~ - [x] User visible changes (including notable bug fixes) are documented in `whats-new.rst` - [x] ~~New functions/methods are listed in `api.rst`~~ | 2022-10-16T18:25:31Z | 2022-10-18T17:41:50Z | 2022-10-18T17:06:34Z | 2022-10-18T17:06:34Z | 89f7de888468eb37979faa686e7d70dbe11fb83c | 0 | 6b4aa3c401720e324ffa407c4da7bad6ecaf6fa2 | 9df2dfca57e1c672f6faf0f7945d2f38921a4bb2 | CONTRIBUTOR | { "enabled_by": { "login": "dcherian", "id": 2448579, "node_id": "MDQ6VXNlcjI0NDg1Nzk=", "avatar_url": "https://avatars.githubusercontent.com/u/2448579?v=4", "gravatar_id": "", "url": "https://api.github.com/users/dcherian", "html_url": "https://github.com/dcherian", "followers_url": "https://api.github.com/users/dcherian/followers", "following_url": "https://api.github.com/users/dcherian/following{/other_user}", "gists_url": "https://api.github.com/users/dcherian/gists{/gist_id}", "starred_url": "https://api.github.com/users/dcherian/starred{/owner}{/repo}", "subscriptions_url": "https://api.github.com/users/dcherian/subscriptions", "organizations_url": "https://api.github.com/users/dcherian/orgs", "repos_url": 
"https://api.github.com/users/dcherian/repos", "events_url": "https://api.github.com/users/dcherian/events{/privacy}", "received_events_url": "https://api.github.com/users/dcherian/received_events", "type": "User", "site_admin": false }, "merge_method": "squash", "commit_title": "Lazy import dask.distributed to reduce import time of xarray (#7172)", "commit_message": "* Lazy import testing and tutorial\r\n\r\n* Lazy import distributed to avoid a costly import\r\n\r\n* Revert changes to __init__\r\n\r\n* Explain why we lazy import\r\n\r\n* Add release note\r\n\r\n* dask.distritubed.lock now supports blocking argument\r\n\r\nCo-authored-by: Deepak Cherian <dcherian@users.noreply.github.com>" } |
13221727 | https://github.com/pydata/xarray/pull/7172 |
Links from other tables
- 3 rows from pull_requests_id in labels_pull_requests