Code Debug Agent

An automated agent for debugging and fixing bugs in Python code using LLM (Qwen) and LangGraph.

Description

This project implements an automated code debugging system based on "Qwen3-0.6B" LLM. The agent analyzes buggy code from the HumanEvalPack dataset and generates fixed code. After generation, the code is tested in isolated Docker environment.

Project Structure

buggy_agent/
├── main.py              # Entry point
├── code_agent.py        # LangGraph agent for code fixing
├── code_model.py        # Qwen model wrapper
├── code_intepretor.py   # Docker runner for code execution
├── evaluation.py        # Evaluation system
├── prompts.py           # LLM prompts
└── requirements.txt     # Project dependencies

Install Dependencies

pip install -r requirements.txt

Run the agent

python main.py

Workflow

Load Data: Load HumanEvalPack dataset with buggy code
Prepare Prompt: Create prompt with task description, error type, and examples
Generate Fixed code: LLM analyzes code and generates fixed version
Testing: Fixed code runs with unit tests in Docker
Evaluation: Calculate Pass@1 metric and detailed statistics

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code Debug Agent

Description

Project Structure

Install Dependencies

Run the agent

Workflow

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
README.md		README.md
code_agent.py		code_agent.py
code_intepretor.py		code_intepretor.py
code_model.py		code_model.py
evaluation.py		evaluation.py
main.py		main.py
prompts.py		prompts.py
requirements.txt		requirements.txt

Max0072/buggy_agent

Folders and files

Latest commit

History

Repository files navigation

Code Debug Agent

Description

Project Structure

Install Dependencies

Run the agent

Workflow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages