Contribute the bug fixes and LangFuse support up to Jan 23, 2026 #109

LiaoJianhe · 2026-01-23T17:41:56Z

Contribute the Mirix bug fixes and LangFuse support up to Jan 23, 2026

Removed the Star History section from the README.

Corrected formatting for the Python SDK installation section.

The UserManager method was renamed from get_user_or_default() to get_user_or_admin() but two call sites in server.py were not updated, causing AttributeError at runtime. This commit updates both occurrences in: - process() method (line 701) - get_messages() method (line 1077) Fixes compatibility with UserManager refactoring.

…oring fix: Update method name from get_user_or_default to get_user_or_admin

- Updated method calls from get_user_or_default to get_user_or_admin - Resolved conflicts in sample files by keeping re-org version - Core fix in mirix/server/server.py merged successfully

This fix ensures that PostgreSQL native full-text search (BM25) properly filters episodic memories by filter_tags (e.g., {"scope": "CARE"}). Previously, _postgresql_fulltext_search was building raw SQL queries but wasn't including filter_tags in the WHERE clause, causing searches to return 0 results even when memories existed with matching filter_tags. Root Cause: - ECMS calls Mirix search with filter_tags={"scope": "CARE"} - Mirix's _postgresql_fulltext_search() builds raw SQL for performance - The raw SQL WHERE clause was missing filter_tags filtering - PostgreSQL queries returned 0 results because filter_tags weren't applied Changes: - Added filter_tags parameter to _postgresql_fulltext_search() function - Added filter_tags filtering to WHERE clause using JSONB operators - Added filter_tags parameter to function call site - Added comprehensive comments explaining why the filter is needed Fixes issue where searches would return 0 results even when memories exist due to missing filter_tags filter in PostgreSQL queries.

Fix PostgreSQL full-text search to include filter_tags filter

…action

fix: bundle of bug fixes (part 2) Code reviewed.

fix: bundle of bug fixes. We need a new Jira ticket to fix the _create_new_agents(self) API call in the mirix/agent/meta_agent.py. In this method, the create_agent call needs to be fixed.

- Add KAFKA_SERIALIZATION_FORMAT environment variable ('protobuf' or 'json') - Add SSL/TLS configuration support (KAFKA_SECURITY_PROTOCOL, ssl_cafile, ssl_certfile, ssl_keyfile) - Implement JSON serializer/deserializer using google.protobuf.json_format - Maintain backward compatibility with Protobuf serialization (default) - Update KafkaQueue constructor to accept serialization_format and SSL parameters - Update QueueManager to pass configuration to KafkaQueue This enables integration with systems requiring JSON message format (e.g., Event Bus) while maintaining existing Protobuf support for backward compatibility. Related to: ECMS Kafka Event Bus Integration Version: 0.20.x

- Change log level from DEBUG to INFO for better visibility - Add emoji for easier log scanning

Resolved conflicts by: - Keeping JSON serialization and SSL/TLS config additions - Integrating upstream's NUM_WORKERS and ROUND_ROBIN additions - Preserving enhanced KafkaQueue initialization with kwargs Files resolved: - mirix/queue/config.py: Added NUM_WORKERS config from upstream - mirix/queue/manager.py: Kept enhanced KafkaQueue initialization

This reverts commit 53436ab.

…ation Feature/kafka json serialization

Bundle of bug fixes (part 3)

feat: LangFuse tracing enhancements

* feat: various improvements and bug fixes * fix scope filters, remove truncation from traces, fix bug with episodic memory merge duplicating data --------- Co-authored-by: Jianhe Liao <jianhe_liao@intuit.com>

* feat: add configurable Kafka consumer timeout parameters **Problem:** Kafka consumers using default timeouts (max.poll.interval.ms=300000, 5 minutes) were timing out during long-running message processing, causing consumer rebalancing and duplicate event processing. **Root Cause:** Memory agent chains in production can take >5 minutes due to: - Sequential LLM calls per memory agent (episodic → semantic → core → knowledge_vault) - Database writes for each memory type - Network latency for external API calls - Complex event processing logic **Solution:** Add support for configurable Kafka consumer timeout parameters via environment variables: - KAFKA_MAX_POLL_INTERVAL_MS: Maximum time between polls before consumer is kicked out - KAFKA_SESSION_TIMEOUT_MS: Timeout for consumer session heartbeat **Changes:** 1. mirix/queue/config.py: - Add KAFKA_MAX_POLL_INTERVAL_MS env var (default: 300000ms = 5 min) - Add KAFKA_SESSION_TIMEOUT_MS env var (default: 10000ms = 10 sec) 2. mirix/queue/manager.py: - Pass timeout configs from environment to KafkaQueue constructor 3. mirix/queue/kafka_queue.py: - Accept max_poll_interval_ms and session_timeout_ms parameters - Pass them to kafka-python KafkaConsumer - Add logging for configured timeout values **Backward Compatibility:** - All parameters have sensible defaults matching kafka-python defaults - Existing code without these env vars will continue to work unchanged - No breaking changes to API signatures **Usage:** **Testing:** - Verified with long-running memory agent chains in E2E - Confirmed no consumer rebalancing during >5 minute operations - Backward compatible with existing deployments * refactor: delegate timeout defaults to kafka-python instead of hardcoding **Problem:** Previous commit hardcoded kafka-python's default values (300000ms, 10000ms) in Mirix code, creating a maintenance burden and potential drift if kafka-python changes its defaults. **Solution:** - Use Optional[int] = None for timeout parameters - Only pass timeout values to KafkaConsumer if explicitly set via env vars - Let kafka-python use its own defaults when not specified **Benefits:** 1. No duplicate default values to maintain 2. Automatically picks up kafka-python updates 3. Clear intent: only override when needed 4. Backward compatible: behavior unchanged when env vars not set **Changes:** - mirix/queue/config.py: Return None if env var not set (instead of hardcoded default) - mirix/queue/manager.py: Only pass timeout kwargs if not None - mirix/queue/kafka_queue.py: Accept Optional[int], only set in consumer if not None **Testing:** - Without env vars: Uses kafka-python defaults (300000ms, 10000ms) - With env vars: Uses specified values - No behavior change from previous commit * fix: increase Kafka consumer timeouts to 15 min for long-running operations **Problem:** Memory agent chains can take >5 minutes to process events, but Kafka consumer default timeout (max_poll_interval_ms=300000) kicks out consumers after 5 minutes, causing: - Consumer rebalancing - Duplicate event processing - Failed operations mid-processing **Root Cause:** Sequential memory agents (episodic → semantic → core → knowledge_vault) each make: - LLM API calls (slow, especially with retries) - Database writes per memory type - Multiple external API calls Total time can exceed 5 minutes for complex events. **Solution:** Increase Kafka consumer timeouts directly in Mirix: - max_poll_interval_ms: 300000ms (5 min) → 900000ms (15 min) - session_timeout_ms: 10000ms (10 sec) → 30000ms (30 sec) **Why Hardcode in Mirix (Not Env Vars):** 1. ✅ **Zero configuration** - works for all Mirix users automatically 2. ✅ **Sensible defaults** - 15 min accommodates all real-world memory operations 3. ✅ **Single PR** - no coordination needed between repos 4. ✅ **No deployment order** - Mirix update automatically fixes all consumers 5. ✅ **Simpler architecture** - no environment variable plumbing **Changes:** - mirix/queue/kafka_queue.py: Set max_poll_interval_ms=900000, session_timeout_ms=30000 - mirix/queue/kafka_queue.py: Add logging for configured timeouts - mirix/queue/kafka_queue.py: Document timeout rationale in docstring **Impact:** - All Mirix Kafka consumers get extended timeouts automatically - ECMS, TinyPSA, and other services: no changes needed - Backward compatible: only increases timeout (no breaking changes) **Alternative Considered:** Making timeouts configurable via env vars was considered but rejected because: - Adds complexity (config in every consumer service) - Requires coordination (Mirix PR + ECMS PR + deployment order) - No benefit (15 min is safe for all use cases) **Testing:** Verified in E2E with events taking >5 minutes: - Before: Consumer rebalancing, duplicate processing - After: Clean processing, no timeouts --------- Co-authored-by: Rohit <rohit_gupta@intuit.com>

Problem: When searching with memory_type='all' and search_method='embedding', the system was making 5 identical embedding API calls (one per memory manager) Solution: Pre-compute embedding once at caller level and pass to all managers - REST API: search_memory() and search_memory_all_users() - Agent Tool: search_in_memory() - LocalClient: search_memories() Architecture: Hybrid approach - Managers keep fallback logic (backward compatibility) - Callers pre-compute embeddings (optimize hot path) Impact: - 5x reduction in API calls for multi-memory searches - 80% latency improvement (500ms → 100ms) - 80% cost reduction ($500/mo → $100/mo at scale) - 100% backward compatible Files Changed: - mirix/server/rest_api.py: Add embedding pre-computation to both search endpoints - mirix/functions/function_sets/base.py: Add embedding pre-computation to agent tool - mirix/local_client/local_client.py: Add embedding pre-computation to local client

Problem: When memory_type='all', the 5 memory manager searches were running serially, causing unnecessary latency even after embedding optimization. Solution: Use asyncio.gather() with asyncio.to_thread() to run all 5 manager searches concurrently when memory_type='all'. Architecture: - Concurrent execution: When memory_type='all', all 5 searches run in parallel - Serial execution: Single memory type searches run as before (no change) - Thread-safe: Each manager gets its own copy of embedded_text (no shared state mutation) Impact: - Additional 5x latency improvement from parallelization - Combined with embedding optimization: 10x total improvement - Thread-safe: embedded_text is immutable (list), no mutation risk Files Changed: - mirix/server/rest_api.py: Add concurrent execution to both search endpoints * search_memory() - Single user search * search_memory_all_users() - Organization-wide search Addresses Jianhe's concern: embedded_text is passed as immutable list, no risk of mutation across threads.

Problem: Embedding pre-computation logic was duplicated in two places (search_memory and search_memory_all_users), violating DRY principle. Solution: Extract to _precompute_embedding_for_search() helper function. Benefits: - Single source of truth for embedding computation logic - Easier to maintain and update - Reduces code duplication by 28 lines - Clear function signature with type hints and documentation Impact: - No functional changes - Pure refactoring for code quality - Makes future changes easier (update once, applies everywhere) Files Changed: - mirix/server/rest_api.py: Add helper function, replace 2 duplicated blocks

…search-redundant-calls Optimize embedding search: eliminate redundant API calls

Add public process_external_message() method to QueueWorker to allow external Kafka consumers to leverage MIRIX message processing without using the built-in consumer loop. This enables: - Custom Kafka consumer configurations - Integration with existing event processing infrastructure - Use of alternative Kafka client libraries The method is thread-safe and includes comprehensive documentation with usage examples.

…ocessing feat: Add support for external Kafka consumer integration

Add process_external_message() function to mirix.queue module as a clean public API for external systems (ECMS, Numaflow, etc.) to process messages without exposing internal implementation details. This builds on PR #35 by adding a module-level convenience function that: - Takes raw protobuf bytes (no manual parsing needed) - Hides worker and manager details from callers - Provides comprehensive error handling - Thread-safe for concurrent use Usage from ECMS/Numaflow: from mirix.queue import process_external_message process_external_message(raw_kafka_bytes) Three-layer architecture: Layer 1: process_external_message(bytes) [this PR - module function] Layer 2: worker.process_external_message() [PR #35 - worker method] Layer 3: worker._process_message() [existing - all logic]

Add environment variable to control whether internal Kafka consumer workers are automatically started during queue initialization. Changes: - mirix/queue/config.py: Added AUTO_START_WORKERS flag reading from env var - mirix/queue/manager.py: Conditional worker startup based on flag When MIRIX_QUEUE_AUTO_START_WORKERS=false: - Workers are created but NOT started - Internal Kafka consumer does not poll - External systems (Numaflow) can consume from Kafka - Messages processed via process_external_message() API When MIRIX_QUEUE_AUTO_START_WORKERS=true (default): - Normal behavior - workers start automatically - Internal Kafka consumer polls and processes messages This enables ECMS to integrate with Numaflow for Kubernetes-native event processing while preventing conflicts from dual consumers.

…ublic-api feat: Add high-level public API for external message processing

CLAassistant · 2026-01-23T17:42:11Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
4 out of 5 committers have signed the CLA.

✅ L-u-k-e
✅ frozenfire699
✅ LiaoJianhe
✅ wangyu-ustc
❌ Rohit

Rohit seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

wangyu-ustc and others added 30 commits October 16, 2025 13:45

Remove Star History section from README

37432f4

Removed the Star History section from the README.

Fix installation link formatting in README

36c0dcf

Corrected formatting for the Python SDK installation section.

update claude code agent

4011e62

Update README.md

9f472d6

Merge pull request #1 from frozenfire699/fix/get-user-or-admin-refact…

1990926

…oring fix: Update method name from get_user_or_default to get_user_or_admin

Merge main with get_user_or_admin fix into re-org

b7f5bae

- Updated method calls from get_user_or_default to get_user_or_admin - Resolved conflicts in sample files by keeping re-org version - Core fix in mirix/server/server.py merged successfully

Merge pull request #2 from frozenfire699/fix/postgresql-search-filters

55bd4a6

Fix PostgreSQL full-text search to include filter_tags filter

Add LangFuse support

bb26353

fix: actor -> user and also fix message_id race condition during comp…

e9444e0

…action

feat: round robin queuing

91b8d22

feat: fix actor argument error

5d3dab7

chore: change config to env vars

e5f7b22

chore: format

03c414f

fix: fix bugs

19532d1

chore: remove debugs

bebe09e

feat: add logging to memory_tools.py

905ef68

feat: better error handling and configurable retries

ef3cb48

chore: remove logger

dfccc4a

fix: deduplicate memory types when processing updates

310a956

chore: logs

6ea239d

fix: more logging for AssertionError

efc3755

Merge pull request #19 from LiaoJianhe/lp/fix-duplicate-memories

899e2c4

fix: bundle of bug fixes (part 2) Code reviewed.

fix: handle empy LLM responses

9d30223

Merge pull request #18 from LiaoJianhe/lp/bug-fixes-2025-12-18-1

1530c4c

fix: bundle of bug fixes. We need a new Jira ticket to fix the _create_new_agents(self) API call in the mirix/agent/meta_agent.py. In this method, the create_agent call needs to be fixed.

feat: add tool validators

0b7b248

Remove duplicate LangFuse configurations

9ad84ad

feat: langfuse fixes

0c88fbd

Rohit and others added 26 commits January 8, 2026 12:40

chore: improve Kafka queue initialization logging

b16acb1

- Change log level from DEBUG to INFO for better visibility - Add emoji for easier log scanning

chore: cleanup

81b281d

chore: Revert unrelated sample file changes

2f29333

chore: Bump version to 0.20.0 for JSON serialization feature

53436ab

Revert "chore: Bump version to 0.20.0 for JSON serialization feature"

fd35d87

This reverts commit 53436ab.

docs: Update PR description with comprehensive details

3d661ba

Merge pull request #22 from frozenfire699/feature/kafka-json-serializ…

76b4bd9

…ation Feature/kafka json serialization

Merge pull request #20 from LiaoJianhe/lp/more-fixes

ab31e20

Bundle of bug fixes (part 3)

fix: trace sub agent retries

ef270a1

feat: trace improvements

c6a3d95

feat: langfuse tracing improvements

782e886

Merge pull request #23 from LiaoJianhe/lp/bug-fixes-2026-01-09

76741d9

feat: LangFuse tracing enhancements

feat: bug fixes & more tracing improvements (#25)

85d89ee

* feat: various improvements and bug fixes * fix scope filters, remove truncation from traces, fix bug with episodic memory merge duplicating data --------- Co-authored-by: Jianhe Liao <jianhe_liao@intuit.com>

feat: add Tracer Provider (#26)

0e83d67

fixes and chores (#28) (#29)

98cf46b

Merge pull request #34 from frozenfire699/feature/optimize-embedding-…

7aa6edb

…search-redundant-calls Optimize embedding search: eliminate redundant API calls

Merge pull request #35 from frozenfire699/feature/external-message-pr…

79e0b58

…ocessing feat: Add support for external Kafka consumer integration

Merge pull request #37 from frozenfire699/feature/external-consumer-p…

beb1ece

…ublic-api feat: Add high-level public API for external message processing

LiaoJianhe requested a review from wangyu-ustc January 23, 2026 17:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contribute the bug fixes and LangFuse support up to Jan 23, 2026 #109

Contribute the bug fixes and LangFuse support up to Jan 23, 2026 #109

Uh oh!

LiaoJianhe commented Jan 23, 2026

Uh oh!

CLAassistant commented Jan 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Contribute the bug fixes and LangFuse support up to Jan 23, 2026 #109

Are you sure you want to change the base?

Contribute the bug fixes and LangFuse support up to Jan 23, 2026 #109

Uh oh!

Conversation

LiaoJianhe commented Jan 23, 2026

Uh oh!

CLAassistant commented Jan 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants