Skip to content

Add multi-model embedding support#52

Merged
rendro merged 3 commits intomainfrom
multi-model-embeddings
Feb 8, 2026
Merged

Add multi-model embedding support#52
rendro merged 3 commits intomainfrom
multi-model-embeddings

Conversation

@rendro
Copy link
Owner

@rendro rendro commented Feb 8, 2026

Summary

  • Support 4 embedding models selectable via SEDIMENT_EMBEDDING_MODEL env var: all-MiniLM-L6-v2 (default), e5-small-v2, bge-small-en-v1.5, bge-base-en-v1.5
  • Add asymmetric query/document prefixes per model spec and dynamic embedding dimensions (384/768)
  • Pin SHA-256 hashes for all model artifacts (weights, tokenizer, config)
  • Update BENCHMARKS.md with results from 6 competitor systems and 4 model variants
  • Bump version to 0.4.4

Test plan

  • cargo test — all 80 tests pass
  • cargo test -- --ignored — model download tests pass for all 4 variants
  • Full benchmark run validates correct model loading and distinct embeddings per model

Support 4 embedding models (all-MiniLM-L6-v2, e5-small-v2,
bge-small-en-v1.5, bge-base-en-v1.5) selectable via the
SEDIMENT_EMBEDDING_MODEL env var. Add asymmetric query/document
prefixes per model spec, dynamic embedding dimensions (384/768),
and pinned SHA-256 hashes for all model artifacts.

Benchmark results across 6 systems confirm all-MiniLM-L6-v2 remains
the best default (tied with bge-base on quality, 2x faster).
@rendro rendro merged commit 69fed70 into main Feb 8, 2026
2 checks passed
@rendro rendro deleted the multi-model-embeddings branch February 8, 2026 03:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant