Benchmarking #2454

benjeffery · 2022-08-01T11:06:27Z

Needs some polish, but probably a good point to get early feedback on the direction. The results are appended to a JSON file that will be committed to the repo, and a very basic HTML report generated:

My thinking is that at under a minute we can add this to CI.

jeromekelleher · 2022-08-01T11:16:31Z

Looks great!

Some unexpected differences here in the e.g. tree.parent_array, which make me wonder if anything under 50ns should just be "green". Do you think these are significant?

Re running in CI, I guess we just run it to make sure it hasn't broken but don't pay any attention to the numbers?

Probably worth spinning into its own Workflow, so we're not running it n-times in the Tests?

codecov · 2022-08-01T11:18:40Z

Codecov Report

Merging #2454 (44e83aa) into main (4c4ca2c) will decrease coverage by 0.93%.
The diff coverage is n/a.

❗ Current head 44e83aa differs from pull request most recent head 7fa6187. Consider uploading reports for the commit 7fa6187 to get more accurate results

@@            Coverage Diff             @@
##             main    #2454      +/-   ##
==========================================
- Coverage   93.44%   92.51%   -0.94%     
==========================================
  Files          28       28              
  Lines       27380    27732     +352     
  Branches     1253     1350      +97     
==========================================
+ Hits        25584    25655      +71     
- Misses       1762     2040     +278     
- Partials       34       37       +3

Flag	Coverage Δ
c-tests	`92.26% <ø> (ø)`
lwt-tests	`89.05% <ø> (ø)`
python-c-tests	`71.24% <ø> (+0.04%)`	⬆️
python-tests	`98.95% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
python/tskit/vcf.py	`72.06% <0.00%> (-26.36%)`	⬇️
python/tskit/trees.py	`88.57% <0.00%> (-10.15%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4c4ca2c...7fa6187. Read the comment docs.

benjeffery · 2022-08-01T14:16:43Z

Looks great!

Some unexpected differences here in the e.g. tree.parent_array, which make me wonder if anything under 50ns should just be "green". Do you think these are significant?

Yes, the colouring was a quick hack. I think for those fast ones that is usual variance.

Re running in CI, I guess we just run it to make sure it hasn't broken but don't pay any attention to the numbers?

Exactly, mostly run it to check we didn't break it, occasionally look at the numbers if useful.

Probably worth spinning into its own Workflow, so we're not running it n-times in the Tests?

Yeah, very simple action that will finish before all the other tests, so not making our CI longer in total.

benjeffery · 2022-08-02T11:23:32Z

Ok, this needs some docs, but is ready for a code review.
Here is the full table - there are a couple of regressions that might be worth looking into.

jeromekelleher

Looks great!

The only major thing that's missing from the benchmarks is row-by-row access with/out metadata decoding, but we can add that in later.

petrelharp · 2022-08-02T13:57:52Z

Wow!!!! This is great!

benjeffery · 2022-08-02T21:47:01Z

I've added a test for decoding metadata. I've also added a script that runs the benchmarks across all released versions.
I've also added the script to CI (takes ~5m), the results are available as an artifact - they won't be directly comparable as it is a different CPU, but the relative changes should still be useful. I think this is ready to merge, it could be more polished, e.g. making the script a proper command line tool with args, but I don't think that is worth the dev time for the benefits currently.

I think we should keep an issue open for RAM benching.

benjeffery · 2022-08-02T22:15:58Z

BTW, this shows how great the backwards compatibility has been. Same 0.5.2 file for all these benchmarks!

jeromekelleher

Awesome!

Few minor suggestions, merge away whenever you're happy.

jeromekelleher · 2022-08-03T08:17:31Z

python/benchmark/run-for-all-releases.py

+if __name__ == "__main__":
+    versions = [v for v in versions("tskit") if "a" not in v and "b" not in v]
+    for v in tqdm.tqdm(versions):
+        os.system(f"pip install tskit=={v}")


I think it's probably worth automating in a venv here, so that the devs environment doesn't get messed up by running this.

jeromekelleher · 2022-08-03T08:17:54Z

python/benchmark/run-for-all-releases.py

+if __name__ == "__main__":
+    versions = [v for v in versions("tskit") if "a" not in v and "b" not in v]
+    for v in tqdm.tqdm(versions):
+        os.system(f"pip install tskit=={v}")


subprocess.run(check=True, shell=True) is a better option here

jeromekelleher · 2022-08-03T08:20:37Z

python/benchmark/run.py

+    return ret
+
+
+def make_file():


Slightly nicer:

benchfile = tskit_dir / "benchmark" / "bench.trees" if not benchfile.exists(): ... ts.dump(benchfile)

similar patterns below

benjeffery marked this pull request as draft August 1, 2022 11:06

benjeffery force-pushed the bench branch 2 times, most recently from 08fb3a8 to e9b106a Compare August 2, 2022 11:21

benjeffery marked this pull request as ready for review August 2, 2022 11:23

jeromekelleher approved these changes Aug 2, 2022

View reviewed changes

benjeffery force-pushed the bench branch 11 times, most recently from 0db6de8 to 5efe70b Compare August 2, 2022 21:38

jeromekelleher approved these changes Aug 3, 2022

View reviewed changes

benjeffery force-pushed the bench branch 2 times, most recently from 7c24452 to 1d46672 Compare August 4, 2022 08:25

benjeffery added the AUTOMERGE-REQUESTED label Aug 4, 2022

Add benchmarks

7fa6187

benjeffery force-pushed the bench branch from 1d46672 to 7fa6187 Compare August 4, 2022 08:56

mergify bot merged commit 46d3e4a into tskit-dev:main Aug 4, 2022

mergify bot removed the AUTOMERGE-REQUESTED label Aug 4, 2022

This was referenced Aug 4, 2022

Performance statistics report #2444

Closed

Memory profiling #2457

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Benchmarking #2454

Benchmarking #2454

Uh oh!

benjeffery commented Aug 1, 2022

Uh oh!

jeromekelleher commented Aug 1, 2022

Uh oh!

codecov bot commented Aug 1, 2022 •

edited

Loading

Uh oh!

benjeffery commented Aug 1, 2022 •

edited

Loading

Uh oh!

benjeffery commented Aug 2, 2022

Uh oh!

jeromekelleher left a comment

Uh oh!

petrelharp commented Aug 2, 2022

Uh oh!

benjeffery commented Aug 2, 2022 •

edited

Loading

Uh oh!

benjeffery commented Aug 2, 2022

Uh oh!

jeromekelleher left a comment

Uh oh!

jeromekelleher Aug 3, 2022

Uh oh!

jeromekelleher Aug 3, 2022

Uh oh!

jeromekelleher Aug 3, 2022

Uh oh!

Uh oh!

Benchmarking #2454

Benchmarking #2454

Uh oh!

Conversation

benjeffery commented Aug 1, 2022

Uh oh!

jeromekelleher commented Aug 1, 2022

Uh oh!

codecov bot commented Aug 1, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

benjeffery commented Aug 1, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

benjeffery commented Aug 2, 2022

Uh oh!

jeromekelleher left a comment

Choose a reason for hiding this comment

Uh oh!

petrelharp commented Aug 2, 2022

Uh oh!

benjeffery commented Aug 2, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

benjeffery commented Aug 2, 2022

Uh oh!

jeromekelleher left a comment

Choose a reason for hiding this comment

Uh oh!

jeromekelleher Aug 3, 2022

Choose a reason for hiding this comment

Uh oh!

jeromekelleher Aug 3, 2022

Choose a reason for hiding this comment

Uh oh!

jeromekelleher Aug 3, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

codecov bot commented Aug 1, 2022 •

edited

Loading

benjeffery commented Aug 1, 2022 •

edited

Loading

benjeffery commented Aug 2, 2022 •

edited

Loading