New results #399
Will add a couple more datasets to this shortly!
Oh no, looks like fast_pq/tinyknn regressed substantially since #363 (comment) and is completely broken on mnist.
If you fix it in the next few hours, then I can re-run it. But I want to tear down the r6.16xlarge instance soon (it's ~$100/day).
Is the run actually using the latest ann-benchmarks code? It seems the string fast_pq should have been replaced in the repo.
I just realized I haven't re-run install.py in the last few days. Let me wipe the fast_pq data and re-run all benchmarks (will also wipe opensearchknn).
I found the problem. Just testing my fix now. Will push it asap.
It's pushed. Building the Docker image now should git clone the newest version.
Pushed latest results. Going to run gist-960-euclidean as well just to have a high dimensional dataset that's a bit larger. It will probably take 10h.
Hi @erikbern, there is only one line (tinyknn) in the new results for glove-25. Would you please check it?
You're right, I accidentally ran glove-25 only for tinyknn. Let me run it for all algos.
It's possible the image didn't rebuild because of Docker layer caching. I didn't check closely. I'll rerun soon though.
If the Docker container times out during querying, is the data gathered so far still saved? Or is saving the data part of the job being run inside the Docker container, so it gets lost? In the latter case I can try to reduce the number of query_args to prevent timeouts.
The process running inside the container saves the data to a directory that's mounted into the container from the outside, so if the container is killed after 2h, any data saved up to that point is kept. Generally the 2h timeout happens during index building though, not afterwards during query processing. That step is usually pretty fast.
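To illustrate why results survive a killed container, here is a minimal sketch of the pattern described above: each result is written to an output directory as soon as it is computed, so anything finished before the kill stays on disk. This is not the ann-benchmarks code; `run_and_save`, `run_query`, the JSON format, and the file naming are all hypothetical, and `out_dir` stands in for a directory mounted from the host.

```python
import json
import os


def run_and_save(query_args, run_query, out_dir):
    """Run each query configuration and persist its result immediately.

    `out_dir` is assumed to be a host directory mounted into the container,
    so files written here outlive the container itself.
    """
    os.makedirs(out_dir, exist_ok=True)
    for i, args in enumerate(query_args):
        result = run_query(args)  # may be interrupted if the container is killed
        path = os.path.join(out_dir, f"run_{i}.json")
        tmp_path = path + ".tmp"
        # Write to a temporary file first, then rename, so a kill in the
        # middle of a write never leaves a half-written result behind.
        with open(tmp_path, "w") as f:
            json.dump({"args": args, "result": result}, f)
        os.replace(tmp_path, path)
```

If the process dies while handling the third configuration, `run_0.json` and `run_1.json` are already complete and usable.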
It seems some of the last args were taking more than half an hour, so I trimmed the list down. Now none of the rounds should take more than a minute or two.
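For anyone wanting to do the same kind of trimming less manually, here is a rough sketch that drops any query argument set whose round exceeds a time budget. It is only an illustration under assumptions: `time_round`, `trim_query_args`, and the 120-second default are hypothetical and not part of ann-benchmarks, and in practice you would probably time a small query sample rather than a full round.

```python
import time


def time_round(run_round, args):
    """Return the wall-clock seconds one round takes for the given args."""
    start = time.perf_counter()
    run_round(args)
    return time.perf_counter() - start


def trim_query_args(query_args, seconds_for, budget_seconds=120.0):
    """Keep only the argument sets whose round fits within the budget.

    `seconds_for` is any callable mapping an argument set to a duration in
    seconds, e.g. `lambda a: time_round(run_round, a)` or a lookup into
    timings recorded from a previous run.
    """
    return [args for args in query_args if seconds_for(args) <= budget_seconds]
```

Passing the timing function in separately keeps the filter itself deterministic, so timings recorded from an earlier run can be reused instead of re-measuring.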
Merging this for now, but I'm planning to run this again in just a week or two. Will also polish the graphs a bit. But I don't want perfection to get in the way of getting something updated out. Won't promote this for now.
See new plots