Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize H5Store init. #443

Merged
merged 5 commits into from
Jan 5, 2021
Merged

Optimize H5Store init. #443

merged 5 commits into from
Jan 5, 2021

Conversation

bdice
Copy link
Member

@bdice bdice commented Dec 22, 2020

Description

Optimizes the H5Store initializer to use os.path.abspath instead of os.path.realpath. As mentioned in #442, the cost of realpath is fairly high and it isn't necessary to use a "real" path in this case.

(Note that this optimization would not apply to job.document or anything that must interface with signac's buffering logic, in my understanding. Buffering relies on "real" paths to ensure that data is correctly loaded in the buffer.)

Motivation and Context

Speeds up H5Store initialization.

I noticed this optimization while using a signac-flow project with a condition that checks for a key in job.data. For my simple test case with 157 jobs, the time spent in the condition function went from 2.72 seconds to 2.15 seconds (21% faster).

Note: Another 20-30% of the time could be trimmed by adding lazy state point loading and avoiding the state point validation when accessing the job.data (which never needs to know the state point).

Types of Changes

  • Documentation update
  • Bug fix
  • New feature
  • Breaking change1

1The change breaks (or has the potential to break) existing functionality.

Checklist:

If necessary:

  • I have updated the API documentation as part of the package doc-strings.
  • I have created a separate pull request to update the framework documentation on signac-docs and linked it here.
  • I have updated the changelog and added all related issue and pull request numbers for future reference (if applicable). See example below.

@bdice bdice requested review from a team as code owners December 22, 2020 08:51
@bdice bdice requested review from kidrahahjo and lyrivera and removed request for a team December 22, 2020 08:51
@codecov
Copy link

codecov bot commented Dec 22, 2020

Codecov Report

Merging #443 (8ec81e1) into master (ffc9116) will not change coverage.
The diff coverage is 100.00%.

Impacted file tree graph

@@           Coverage Diff           @@
##           master     #443   +/-   ##
=======================================
  Coverage   77.13%   77.13%           
=======================================
  Files          42       42           
  Lines        5738     5738           
  Branches     1119     1119           
=======================================
  Hits         4426     4426           
  Misses       1027     1027           
  Partials      285      285           
Impacted Files Coverage Δ
signac/core/h5store.py 89.00% <100.00%> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ffc9116...8ec81e1. Read the comment docs.

@bdice bdice self-assigned this Dec 29, 2020
@bdice bdice added the enhancement New feature or request label Dec 29, 2020
@bdice bdice added this to the v1.6.0 milestone Dec 29, 2020
Copy link
Collaborator

@kidrahahjo kidrahahjo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't have expertise in the buffering logic currently but, in the interest of this PR, this is a straight forward change, so I am approving this PR.

@vyasr vyasr merged commit 8cfa82d into master Jan 5, 2021
@vyasr vyasr deleted the feature/optimize-h5store-init branch January 5, 2021 17:26
@bdice bdice mentioned this pull request Feb 19, 2021
12 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants