refactor: Restructure Gadget prompt to reduce tech hallucination #63

KaiserRuben · 2025-09-30T11:57:07Z

Problem

The Gadget service was hallucinating technologies and missing actual dependencies, despite an
extensive 500+ line prompt. This led to inaccurate and unreliable outputs.

Solution

This pull request replaces the monolithic prompt with a structured 4-step verification workflow.
This new approach makes it structurally difficult for the model to claim a technology without
verifying its existence in the codebase.

The new workflow is as follows:

1. Map project structure: The model starts by listing the directory contents to understand the
   project layout.
2. Verify technologies: It then reads specific files to verify the presence of each technology.
3. Extract configuration: All relevant configuration is extracted from the verified files.
4. Validate build requirements: Finally, it validates the build requirements based on the
   extracted configuration.
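
To illustrate the sequential dependency between the four steps, here is a minimal sketch in Python. It is not the Gadget implementation: the marker-file map, the function names, and the "build script" check are all assumptions made up for this example. The point it shows is that a technology name can only enter the result after its file has actually been read.

```python
import json
from pathlib import Path

# Hypothetical marker files; the real Gadget prompt may check different ones.
MARKERS = {
    "package.json": "Node.js",
    "Cargo.toml": "Rust",
    "requirements.txt": "Python",
}


def map_structure(root: Path) -> list[str]:
    """Step 1: list the files at the top level of the project."""
    return sorted(p.name for p in root.iterdir() if p.is_file())


def verify_technologies(root: Path, files: list[str]) -> dict[str, str]:
    """Step 2: claim a technology only after reading its marker file."""
    verified = {}
    for name in files:
        if name in MARKERS:
            content = (root / name).read_text(encoding="utf-8")
            if content.strip():  # an empty file is not treated as evidence
                verified[MARKERS[name]] = name
    return verified


def extract_config(root: Path, verified: dict[str, str]) -> dict[str, dict]:
    """Step 3: extract configuration from verified files only."""
    config = {}
    if verified.get("Node.js") == "package.json":
        pkg = json.loads((root / "package.json").read_text(encoding="utf-8"))
        config["Node.js"] = {"scripts": pkg.get("scripts", {})}
    return config


def validate_build(config: dict[str, dict]) -> list[str]:
    """Step 4: report build requirements the extracted configuration misses."""
    problems = []
    for tech, cfg in config.items():
        if "build" not in cfg.get("scripts", {}):
            problems.append(f"{tech}: no build script found")
    return problems


def analyze(root: Path) -> dict:
    """Run the four steps in order; each consumes the previous step's output."""
    files = map_structure(root)
    verified = verify_technologies(root, files)
    config = extract_config(root, verified)
    return {
        "technologies": sorted(verified),
        "config": config,
        "problems": validate_build(config),
    }
```

Because `extract_config` and `validate_build` only ever see what `verify_technologies` returned, a technology that is merely mentioned in a README never reaches the output in this sketch.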

Key Changes

- Reduced the prompt from 500+ to ~100 lines by removing redundant examples.
- Introduced verification tables to structure the process.
- Made file reading mandatory before a technology can be claimed.
- Implemented a sequential workflow where each step depends on the results of the previous one.
- Raised the default reasoning effort (Medium minimum) to support higher-quality analysis.
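
The PR does not show what a verification table looks like, so the following Python sketch is only one plausible shape. The column names, the `Row` type, and the substring-based evidence check are assumptions for illustration. It shows the gating idea: a row counts as verified only when it carries a line quoted from a file that was actually read.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Row:
    technology: str
    file_checked: str
    evidence: Optional[str] = None  # line quoted from the file, if found

    @property
    def verified(self) -> bool:
        return self.evidence is not None


def check(technology: str, file_checked: str,
          content: Optional[str], needle: str) -> Row:
    """Fill one row. `content` is None when the file is missing or unread."""
    if content is not None:
        for line in content.splitlines():
            if needle in line:
                return Row(technology, file_checked, line.strip())
    return Row(technology, file_checked)


def render(rows: list[Row]) -> str:
    """Render the rows as a pipe-delimited table."""
    lines = [
        "| Technology | File checked | Evidence | Verified |",
        "|---|---|---|---|",
    ]
    for r in rows:
        lines.append(
            f"| {r.technology} | {r.file_checked} | "
            f"{r.evidence or '-'} | {'yes' if r.verified else 'no'} |"
        )
    return "\n".join(lines)


def claimed(rows: list[Row]) -> list[str]:
    """Only verified rows may be reported as technologies."""
    return [r.technology for r in rows if r.verified]
```

A row without evidence stays in the table as an explicit "no", which makes an unverified claim visible instead of silently dropped or silently accepted.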

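With this change, a technology can only be included if actual file content verifies it. This
grounds the Gadget service's analysis in the repository itself and makes hallucination
structurally difficult rather than merely discouraged by instructions.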

PLEASE TEST BEFORE MERGING!

… verification workflow

Problem: Gadget service was hallucinating technologies and missing actual dependencies despite
500+ line prompt with extensive instructions.

Solution: Replace monolithic prompt with structured 4-step verification workflow:
- Step 1: Map project structure via list_dir
- Step 2: Verify each technology by reading specific files
- Step 3: Extract all configuration from verified files
- Step 4: Validate build requirements

Key changes:
- Reduce prompt from 500 to ~100 lines by removing redundant examples
- Introduce verification tables
- Make file reading mandatory before technology claims
- Use sequential steps where each depends on previous results
- Higher default reasoning effort (Medium minimum)

Result: Technologies can only be included if verified by actual file content, making
hallucination structurally difficult rather than behaviorally discouraged.

KaiserRuben and others added 2 commits September 30, 2025 13:53

Merge branch 'lttle-cloud:master' into master

ec9cfc2
