Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: thewebscraping/crossvector
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: v0.1.3
Choose a base ref
...
head repository: thewebscraping/crossvector
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: v1.0.0
Choose a head ref
  • 14 commits
  • 52 files changed
  • 2 contributors

Commits on Dec 6, 2025

  1. feat: Add comprehensive benchmarking tool and improvements

    Benchmarking Features:
    - Created scripts/benchmark.py for performance testing
    - Support 4 backends (pgvector, astradb, milvus, chroma)
    - Support 2 embedding providers (OpenAI, Gemini)
    - Test 7 operations: bulk/individual create, vector/metadata search, Query DSL, update, delete
    - Added --skip-slow flag to skip cloud backends for faster testing
    - Smart Query DSL optimization: 4 operators for slow backends, 10 for fast
    - Performance summary shows tested vs skipped backends
    - Comprehensive markdown reports with detailed metrics
    
    Engine Enhancements:
    - Added VectorEngine.drop_collection() method for cleanup
    - Fixed DEFAULT_COLLECTION_NAME to use api_settings.VECTOR_COLLECTION_NAME
    
    Architecture Improvements:
    - Enhanced ABC base class with unified initialization
    - Improved adapter collection name handling
    - Better error reporting in benchmarks
    
    Documentation:
    - Added benchmarking section to README.md (102 lines)
    - Created docs/benchmarking.md (385 lines complete guide)
    - Updated docs/contributing.md with benchmarking workflow
    - Added usage examples and best practices
    
    Testing:
    - All 365 unit tests passing
    - Updated tests for collection name changes
    - 40% overall coverage (core components 70-100%)
    
    Breaking Changes:
    - Removed DEFAULT_COLLECTION_NAME class constant (use settings instead)
    thewebscraping committed Dec 6, 2025
    Configuration menu
    Copy the full SHA
    303ee58 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    60ca97f View commit details
    Browse the repository at this point in the history
  3. chore: Release v1.0.0 - First Production Release! 🎉

    Version: 1.0.0 (0.1.3 → 1.0.0)
    Status: Production Ready ✅
    
    Production Readiness:
    - 365 tests passing (100% success rate)
    - 40% overall coverage, core components 70-100%
    - 19 PgVector integration tests verified
    - Comprehensive benchmarking and documentation
    
    Removed Beta Status:
    - No longer in beta
    - Production-ready and stable
    - Removed all beta warnings from README
    - Updated badges to v1.0.0
    
    Recommended Backends:
    - ChromaDB Cloud - Best for cloud deployments
    - PgVector - Best for self-hosted/on-premise
    - Both verified with production-grade performance
    thewebscraping committed Dec 6, 2025
    Configuration menu
    Copy the full SHA
    0adc123 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    5acf0e0 View commit details
    Browse the repository at this point in the history
  5. Merge pull request #3 from thewebscraping/standard-dbs

    Release v1.0.0 - First Production Release!
    thewebscraping authored Dec 6, 2025
    Configuration menu
    Copy the full SHA
    930ba09 View commit details
    Browse the repository at this point in the history
  6. docs: Complete documentation overhaul for v1.0.0

    - **Gemini Recommended**: Updated all guides to recommend Gemini (free tier, faster) over OpenAI.
    - **Reference Docs Updated**: Rewrote embeddings.md, api.md, databases.md, and architecture.md for 100% consistency.
    - **Environment Variables**: Aligned all documentation with settings.py (removed obsolete AstraDB keys).
    - **Benchmarks**: Added performance data to recommendation boxes.
    thewebscraping committed Dec 6, 2025
    Configuration menu
    Copy the full SHA
    b2d6a0d View commit details
    Browse the repository at this point in the history
  7. Merge pull request #4 from thewebscraping/standard-dbs

    docs: Complete documentation overhaul for v1.0.0
    thewebscraping authored Dec 6, 2025
    Configuration menu
    Copy the full SHA
    00dd560 View commit details
    Browse the repository at this point in the history
  8. chore: README.md

    thewebscraping committed Dec 6, 2025
    Configuration menu
    Copy the full SHA
    6d177ff View commit details
    Browse the repository at this point in the history
  9. chore: README.md

    thewebscraping committed Dec 6, 2025
    Configuration menu
    Copy the full SHA
    ea8ef4d View commit details
    Browse the repository at this point in the history
  10. feat: optimize benchmark with static vectors and timing constants

    - Pre-compute documents and query vectors once in main() to eliminate embedding API calls
    - Add timing constants for collection lifecycle (drop/init/cleanup) operations
    - Implement graceful error handling to skip failed backends without crashing
    - Fix vector search API call to use query parameter
    - Update markdown report to show test results summary and failed tests section
    - Increased sleep timings (0.5s drop, 0.2s wait, 0.3s init) for cloud backend stability
    
    Changes:
    - SLEEP_* constants for timing (easily adjustable)
    - Vectors computed once globally, reused across all backends
    - Error handling with try-except in run_all() loop
    - Markdown output with error summary and skipped backends info
    thewebscraping committed Dec 6, 2025
    Configuration menu
    Copy the full SHA
    7cb5dee View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    78c9697 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    db1a79c View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    1007fdb View commit details
    Browse the repository at this point in the history
  14. Bump version to 1.0.0

    thewebscraping committed Dec 6, 2025
    Configuration menu
    Copy the full SHA
    824e18c View commit details
    Browse the repository at this point in the history
Loading