Improve collection handling and hashing consistency by m1rl0k · Pull Request #203 · Context-Engine-AI/Context-Engine

m1rl0k · 2026-01-26T03:38:24Z

Added robust parsing for SCHEMA_CACHE_TTL_SECS in indexing_admin.py to handle invalid environment values. Updated upload_service.py to better support demo mode and fall back to Qdrant collections when authentication is disabled. Modified watch_index_core/processor.py to use xxhash for file hashing, for consistency with the pipeline, with a fallback to hashlib if xxhash is unavailable.
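To illustrate the demo-mode fallback described above, here is a minimal sketch. It is not the code in upload_service.py: `AuthDisabledError`, `list_collections`, and the two injected callables are hypothetical names chosen for the example, and the real endpoints may be structured differently.

```python
from typing import Callable, List


class AuthDisabledError(RuntimeError):
    """Hypothetical error: the auth layer reports that it is disabled."""


def list_collections(
    auth_enabled: bool,
    list_from_auth: Callable[[], List[str]],
    list_from_qdrant: Callable[[], List[str]],
) -> List[str]:
    """Return collection names, preferring the auth-backed listing.

    Both callables are assumptions for this sketch: one stands in for the
    auth-aware lookup, the other for a direct Qdrant collection listing.
    """
    if not auth_enabled:
        # Demo mode: there is no auth layer to ask, so list Qdrant directly.
        return list_from_qdrant()
    try:
        return list_from_auth()
    except AuthDisabledError:
        # Auth is configured but reports itself disabled: same fallback.
        return list_from_qdrant()
```

Passing the two listing functions as arguments keeps the fallback decision testable without a running Qdrant instance.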

augmentcode · 2026-01-26T03:41:04Z

🤖 Augment PR Summary

Summary: Improves robustness around schema-cache TTL parsing, admin collection handling in demo mode, and watcher file hashing consistency.

Changes:

Hardened SCHEMA_CACHE_TTL_SECS parsing by defaulting invalid/non-finite/non-positive values to 300s.
Updated admin collection status/stream endpoints to fall back to direct Qdrant collection listing when auth is disabled (demo mode) or the auth layer reports that it is disabled.
Aligned watcher file hashing with the ingest pipeline by using xxhash, with a hashlib fallback when xxhash is unavailable or raises an error.
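The TTL hardening in the first item could look roughly like the sketch below. The function name `parse_schema_cache_ttl` and the constant `DEFAULT_TTL_SECS` are assumptions for illustration; only the environment variable name and the 300-second default come from the summary.

```python
import math
import os

DEFAULT_TTL_SECS = 300.0  # default stated in the summary


def parse_schema_cache_ttl(raw, default=DEFAULT_TTL_SECS):
    """Parse a TTL in seconds, returning `default` for unusable values."""
    try:
        value = float(raw)
    except (TypeError, ValueError):
        # Unset (None), empty, or non-numeric input.
        return default
    if not math.isfinite(value) or value <= 0:
        # float() accepts "nan" and "inf", so reject them explicitly,
        # along with zero and negative values.
        return default
    return value


ttl_secs = parse_schema_cache_ttl(os.environ.get("SCHEMA_CACHE_TTL_SECS"))
```

The explicit `math.isfinite` check matters because `float("nan")` and `float("inf")` parse without raising, so a `try`/`except` alone would let them through.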


augmentcode

Review completed. 2 suggestions posted.

Comment augment review to trigger a new review at any time.

scripts/indexing_admin.py

scripts/watch_index_core/processor.py

Added validation for SCHEMA_CACHE_TTL_SECS to ensure it is finite and positive, defaulting to 300 seconds if invalid. Enhanced file hash computation to fall back to hashlib.sha1 on any xxhash runtime error, improving robustness.
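A hedged sketch of the fallback behaviour this commit describes follows. The helper name `hash_bytes` is hypothetical, and the actual code in processor.py may differ in what it hashes and how it handles errors.

```python
import hashlib

try:
    import xxhash  # third-party; may not be installed
except ImportError:
    xxhash = None


def hash_bytes(data: bytes) -> str:
    """Hash with xxhash64 when possible, otherwise with SHA-1."""
    if xxhash is not None:
        try:
            return xxhash.xxh64(data).hexdigest()
        except Exception:
            # Any runtime error from xxhash drops through to hashlib.
            pass
    return hashlib.sha1(data).hexdigest()
```

Note that the two paths produce different digest formats: an xxhash64 hex digest is 16 characters, a SHA-1 hex digest is 40. Hashes computed on the fallback path are therefore not comparable with hashes computed by xxhash.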

m1rl0k · 2026-01-26T04:07:02Z

augment review

augmentcode

Review completed. 1 suggestion posted.


scripts/watch_index_core/processor.py

Removed the hashlib fallback; xxhash is now required for hashing file contents in _read_text_and_sha1. This ensures consistency with other scripts and simplifies the code by eliminating error handling for a missing xxhash.

Renamed _read_text_and_sha1 to _read_text_and_hash and updated its implementation and usage to compute xxhash64 instead of SHA1 for consistency with the pipeline. Updated the related test to check for the correct hash length.
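As a rough illustration of the renamed helper, the sketch below reads a file and returns its text with an xxhash64 digest. It is an assumption-laden stand-in, not the function from processor.py: the name `read_text_and_hash`, hashing the raw bytes rather than the decoded text, and the `errors="replace"` decoding policy are all choices made for the example.

```python
from pathlib import Path


def read_text_and_hash(path):
    """Return (text, xxhash64 hex digest) for the file at `path`."""
    # Imported here so a missing dependency fails at the call site
    # with ImportError; there is deliberately no hashlib fallback.
    import xxhash

    data = Path(path).read_bytes()
    text = data.decode("utf-8", errors="replace")
    return text, xxhash.xxh64(data).hexdigest()
```

A test for the hash length would expect 16 hexadecimal characters from xxhash64, where a SHA-1 hex digest would have had 40.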

Introduces optional LZ4 compression for JSON data stored in Redis, reducing memory usage with minimal CPU overhead. Updates requirements.txt to include lz4 as a dependency and modifies workspace_state.py to compress data before storing and decompress on retrieval, falling back gracefully if lz4 is unavailable.
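The optional compression could be sketched as below. This is not the code in workspace_state.py: `encode_state`, `decode_state`, and the `LZ4:` marker prefix are hypothetical, and the Redis calls are omitted. The resulting bytes would simply be the value written to and read from Redis.

```python
import json

try:
    import lz4.frame as lz4_frame  # optional dependency
except ImportError:
    lz4_frame = None

# Hypothetical marker so readers can tell compressed values from plain JSON.
_PREFIX = b"LZ4:"


def encode_state(obj) -> bytes:
    """Serialize to JSON bytes, compressing when lz4 is installed."""
    raw = json.dumps(obj, separators=(",", ":")).encode("utf-8")
    if lz4_frame is None:
        return raw
    return _PREFIX + lz4_frame.compress(raw)


def decode_state(blob: bytes):
    """Inverse of encode_state; also accepts uncompressed JSON bytes."""
    if blob.startswith(_PREFIX):
        if lz4_frame is None:
            raise RuntimeError("value is LZ4-compressed but lz4 is not installed")
        blob = lz4_frame.decompress(blob[len(_PREFIX):])
    return json.loads(blob.decode("utf-8"))
```

Because valid JSON text cannot begin with `LZ4:`, the prefix check lets values written before compression was enabled, or written by a process without lz4, still be read back.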

augmentcode bot reviewed Jan 26, 2026


scripts/indexing_admin.py (resolved)

scripts/watch_index_core/processor.py (outdated, resolved)

Improve error handling for cache TTL and file hashing

d463bdf


augmentcode bot reviewed Jan 26, 2026


scripts/watch_index_core/processor.py (outdated, resolved)

m1rl0k added 3 commits January 25, 2026 23:10

Refactor to use xxhash exclusively for file hashing

01709b3


Refactor file hash function to use xxhash64

4151cb4


m1rl0k merged commit 5053491 into test Jan 26, 2026
1 check passed

m1rl0k merged 5 commits into test from bubble-bullshit
