Comparing changes
base repository: thewebscraping/crossvector
base: v0.1.3
head repository: thewebscraping/crossvector
compare: v1.0.0
- 14 commits
- 52 files changed
- 2 contributors
Commits on Dec 6, 2025
feat: Add comprehensive benchmarking tool and improvements
Benchmarking Features:
- Created scripts/benchmark.py for performance testing
- Support 4 backends (pgvector, astradb, milvus, chroma)
- Support 2 embedding providers (OpenAI, Gemini)
- Test 7 operations: bulk/individual create, vector/metadata search, Query DSL, update, delete
- Added --skip-slow flag to skip cloud backends for faster testing
- Smart Query DSL optimization: 4 operators for slow backends, 10 for fast
- Performance summary shows tested vs skipped backends
- Comprehensive markdown reports with detailed metrics

Engine Enhancements:
- Added VectorEngine.drop_collection() method for cleanup
- Fixed DEFAULT_COLLECTION_NAME to use api_settings.VECTOR_COLLECTION_NAME

Architecture Improvements:
- Enhanced ABC base class with unified initialization
- Improved adapter collection name handling
- Better error reporting in benchmarks

Documentation:
- Added benchmarking section to README.md (102 lines)
- Created docs/benchmarking.md (385 lines complete guide)
- Updated docs/contributing.md with benchmarking workflow
- Added usage examples and best practices

Testing:
- All 365 unit tests passing
- Updated tests for collection name changes
- 40% overall coverage (core components 70-100%)

Breaking Changes:
- Removed DEFAULT_COLLECTION_NAME class constant (use settings instead)
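As a rough illustration of how a `--skip-slow` flag could separate tested from skipped backends, here is a minimal sketch. It is not the code in scripts/benchmark.py: the function name `select_backends` and the choice of which backends count as "slow" are assumptions made only for this example.

```python
import argparse

# Backend names come from the commit message.
ALL_BACKENDS = ["pgvector", "astradb", "milvus", "chroma"]

# ASSUMPTION: which backends are treated as slow/cloud is illustrative only.
SLOW_BACKENDS = {"astradb", "milvus"}


def select_backends(argv=None):
    """Return (tested, skipped) backend lists based on the --skip-slow flag."""
    parser = argparse.ArgumentParser(description="Benchmark backend selection sketch")
    parser.add_argument(
        "--skip-slow",
        action="store_true",
        help="skip backends marked as slow for faster runs",
    )
    args = parser.parse_args(argv)

    tested = [
        name
        for name in ALL_BACKENDS
        if not (args.skip_slow and name in SLOW_BACKENDS)
    ]
    skipped = [name for name in ALL_BACKENDS if name not in tested]
    return tested, skipped
```

Returning both lists makes it straightforward for a report to show tested versus skipped backends, as the commit message describes.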
Commit 303ee58

Commit 60ca97f

chore: Release v1.0.0 - First Production Release! 🎉
Version: 1.0.0 (0.1.3 → 1.0.0)
Status: Production Ready ✅

Production Readiness:
- 365 tests passing (100% success rate)
- 40% overall coverage, core components 70-100%
- 19 PgVector integration tests verified
- Comprehensive benchmarking and documentation

Removed Beta Status:
- No longer in beta
- Production-ready and stable
- Removed all beta warnings from README
- Updated badges to v1.0.0

Recommended Backends:
- ChromaDB Cloud - Best for cloud deployments
- PgVector - Best for self-hosted/on-premise
- Both verified with production-grade performance
Commit 0adc123

Commit 5acf0e0

Merge pull request #3 from thewebscraping/standard-dbs
Release v1.0.0 - First Production Release!
Commit 930ba09

docs: Complete documentation overhaul for v1.0.0
- **Gemini Recommended**: Updated all guides to recommend Gemini (free tier, faster) over OpenAI.
- **Reference Docs Updated**: Rewrote embeddings.md, api.md, databases.md, and architecture.md for 100% consistency.
- **Environment Variables**: Aligned all documentation with settings.py (removed obsolete AstraDB keys).
- **Benchmarks**: Added performance data to recommendation boxes.
Commit b2d6a0d

Merge pull request #4 from thewebscraping/standard-dbs
docs: Complete documentation overhaul for v1.0.0
Commit 00dd560

Commit 6d177ff

Commit ea8ef4d

feat: optimize benchmark with static vectors and timing constants
- Pre-compute documents and query vectors once in main() to eliminate embedding API calls
- Add timing constants for collection lifecycle (drop/init/cleanup) operations
- Implement graceful error handling to skip failed backends without crashing
- Fix vector search API call to use query parameter
- Update markdown report to show test results summary and failed tests section
- Increased sleep timings (0.5s drop, 0.2s wait, 0.3s init) for cloud backend stability

Changes:
- SLEEP_* constants for timing (easily adjustable)
- Vectors computed once globally, reused across all backends
- Error handling with try-except in run_all() loop
- Markdown output with error summary and skipped backends info
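The pattern described in this commit (vectors computed once, adjustable sleep constants, and a try-except around each backend) might look roughly like the sketch below. This is an illustration under assumptions, not the project's code: the constant names `SLEEP_AFTER_DROP`, `SLEEP_WAIT`, and `SLEEP_AFTER_INIT`, the `run_all` signature, and the idea that each backend is a plain callable are all hypothetical. Only the sleep durations come from the commit message.

```python
import time

# Durations from the commit message; the constant names are ASSUMED.
SLEEP_AFTER_DROP = 0.5  # pause after dropping a collection
SLEEP_WAIT = 0.2        # general settle time
SLEEP_AFTER_INIT = 0.3  # pause after initializing a collection


def run_all(backends, doc_vectors, query_vector, sleep=time.sleep):
    """Benchmark every backend, recording failures instead of aborting.

    `backends` maps a backend name to a callable taking the pre-computed
    document vectors and query vector (a hypothetical interface).
    """
    results, failures = {}, {}
    for name, bench in backends.items():
        try:
            sleep(SLEEP_AFTER_DROP)  # stand-in for the drop step
            sleep(SLEEP_WAIT)
            sleep(SLEEP_AFTER_INIT)  # stand-in for the init step
            # The same pre-computed vectors are reused for every backend,
            # so no embedding API call happens inside the loop.
            results[name] = bench(doc_vectors, query_vector)
        except Exception as exc:
            # A failing backend is recorded and skipped, not fatal.
            failures[name] = f"{type(exc).__name__}: {exc}"
    return results, failures
```

Passing `sleep` as a parameter is a design choice for this sketch only: it lets tests replace the real delays with a no-op.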
Commit 7cb5dee

Commit 78c9697

Commit db1a79c

Commit 1007fdb

Commit 824e18c

This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff v0.1.3...v1.0.0