@starryendymion
Description

This PR implements robust JSON parsing for LLM-generated responses in the backend agents. Currently, the system calls json.loads() directly on the raw response text. If the LLM wraps the JSON in a Markdown code block (e.g., ```json ... ```) or adds conversational filler, json.loads() fails with a JSONDecodeError, potentially crashing the chat session.

The Issue: Fragile Parsing

The existing implementation in call_gemini_for_keywords and call_gemini_detect_intents assumes the LLM response is a perfectly formatted JSON string.

Examples of Failure Modes:

  • Markdown Formatting: The LLM wraps the JSON in a ```json ... ``` code block.
  • Conversational Filler: The LLM responds with `Sure, here are the keywords: {"keywords": ["brain"]}`.
  • Stray Characters: Leading/trailing whitespace or other unexpected characters outside the JSON object.
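A minimal reproduction of the first failure mode (the response text here is illustrative):

```python
import json

# A typical "noisy" Gemini response: valid JSON wrapped in a Markdown fence.
raw = '```json\n{"keywords": ["brain"]}\n```'

try:
    json.loads(raw)
except json.JSONDecodeError as exc:
    # json.loads chokes on the leading backticks before ever reaching the JSON.
    print(f"JSONDecodeError: {exc}")
```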

The Fix: Robust Extraction

I introduced a private helper function, _parse_llm_json, that uses regular expressions and string slicing to extract the JSON object from a "noisy" response before parsing it.

Changes Made

  • backend/agents.py:
    • Implemented _parse_llm_json(text: str) -> dict to handle Markdown blocks and conversational text.
    • Updated call_gemini_for_keywords to use the new robust parser.
    • Updated call_gemini_detect_intents to use the new robust parser.

Verification

Verified with a test script covering:

  1. Clean JSON strings.
  2. JSON wrapped in Markdown code blocks.
  3. JSON embedded within conversational sentences.
