Skip to content

docs: update vector benchmarks after insert optimization#273

Merged
kacy merged 1 commit intomainfrom
docs/update-vector-benchmarks
Feb 24, 2026
Merged

docs: update vector benchmarks after insert optimization#273
kacy merged 1 commit intomainfrom
docs/update-vector-benchmarks

Conversation

@kacy
Copy link
Owner

@kacy kacy commented Feb 24, 2026

summary

updates vector benchmark numbers in README.md and bench/README.md with results from the binary encoding + parallel HNSW construction optimizations (#272).

new numbers from GCP c2-standard-8 (100k vectors, 128-dim, cosine, M=16, ef=64):

config insert (vec/sec) query (q/sec) p99 memory
ember (1 key) 2,432 1,217 0.99ms 29 MB
ember (8 shards) 5,482 1,793 0.62ms ~31 MB
chromadb 4,879 381 2.91ms 139 MB
pgvector 1,702 782 1.67ms 178 MB
qdrant 7,699 560 2.00ms 168 MB

also adds a shard scaling table showing near-linear insert throughput scaling.

what was tested

benchmarks run on existing GCP c2-standard-8 VM with latest main (includes #272).

new numbers from GCP c2-standard-8 after binary VADD_BATCH encoding
and parallel HNSW construction optimizations:

- single key: 2,432 insert/sec (was 1,501), 1,217 query/sec
- 8 shards: 5,482 insert/sec, 1,793 query/sec, p99=0.62ms
- 4-6x less memory than all competitors (29 MB vs 139-178 MB)

adds shard scaling table showing near-linear insert throughput
scaling across shard counts.
@kacy kacy merged commit e3665f4 into main Feb 24, 2026
@kacy kacy deleted the docs/update-vector-benchmarks branch February 24, 2026 16:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant