AI-Powered Prompt Chaining System

Overview

This solution implements a structured approach to prompt chaining using Azure OpenAI's GPT-4o models. The goal is to decompose complex user prompts into manageable stages, leveraging the capabilities of a smaller model (gpt-4o-mini) for planning and the full-scale model (gpt-4o) for generating comprehensive, high-quality responses. The process is designed to maximize reasoning, clarity, and efficiency by dividing the task into two distinct stages:

Planning Stage:
- Uses gpt-4o-mini to create a structured plan based on the user's initial prompt.
- Breaks down the task into actionable components, each described with its importance and key considerations.
Final Output Stage:
- Utilizes gpt-4o to synthesize a detailed and holistic response, grounded in the structured plan generated during the planning stage.

Process Flow

The following Mermaid diagram illustrates the detailed process flow of the AI-Powered Prompt Chaining System:

graph TD
    A[User Input] --> B[Validate User Prompt]
    B --> C{Valid?}
    C -->|Yes| D[Planning Stage]
    C -->|No| Z[Return Error]
    D --> E[gpt-4o-mini Model]
    E --> F[Generate Structured Plan]
    F --> G[Validate Plan]
    G --> H{Valid?}
    H -->|Yes| I[Final Output Stage]
    H -->|No| J[Retry Planning]
    J --> E
    I --> K[gpt-4o Model]
    K --> L[Generate Comprehensive Response]
    L --> M[Validate Response]
    M --> N{Valid?}
    N -->|Yes| O[Return Final Output]
    N -->|No| P[Retry Final Output]
    P --> K
    O --> Q[Performance Metrics]
    Q --> R[End]

    subgraph Planning Stage
    D
    E
    F
    G
    H
    J
    end

    subgraph Final Output Stage
    I
    K
    L
    M
    N
    P
    end

    subgraph Error Handling
    Z
    end

    subgraph Validation
    B
    C
    G
    H
    M
    N
    end

    subgraph Performance Tracking
    Q
    end

This diagram showcases the following key aspects of the system:

User input validation
Two-stage processing (Planning and Final Output)
Use of different models (gpt-4o-mini and gpt-4o)
Validation steps throughout the process
Retry mechanisms for both planning and final output stages
Error handling
Performance tracking

Goals

Enhance Reasoning Quality:
- By breaking down the problem into smaller, well-defined sub-tasks, the solution encourages the AI to reason through each component and produce a more thoughtful and coherent final output.
Optimize Resource Utilization:
- Offloading the planning to gpt-4o-mini, a less computationally expensive model, reduces costs and response times while reserving the full model's power for the detailed response.
Modular and Reusable Design:
- The solution is designed with clear modularity, enabling easy customization and reuse for a wide range of use cases, from strategic planning to report generation.
Improve Prompt Effectiveness:
- Through structured, multi-step prompting, the approach overcomes the limitations of single-shot prompts, enhancing the relevance, depth, and accuracy of responses.
Scalability and Extendibility:
- The architecture supports scalable deployment and integration with additional models or features, such as interactive feedback loops, dynamic token management, or response validation mechanisms.
Transparency and Traceability:
- Logging and displaying the prompts and responses at each stage ensures traceability and enables better understanding of the AI's decision-making process.

Key Features

Two-Stage Processing: Utilizes gpt-4o-mini for planning and gpt-4o for final output generation.
Robust Error Handling: Implements custom exceptions, retry mechanisms, and validation checks for improved reliability.
Configurable Settings: Centralized configuration management through YAML files for easy parameter adjustments.
Input and Output Validation: Ensures high-quality responses through thorough validation and quality assessment.
Performance Tracking: Detailed execution statistics for monitoring and optimization.
Modular Design: Well-structured code for easy maintenance and extensibility.
Async Processing: Support for processing multiple prompts concurrently with configurable batch sizes.

Requirements

Python 3.7+
Azure OpenAI API access
Required Python packages:
```
openai
python-dotenv
pyyaml
aiohttp
```

Setup

Clone the repository:

git clone https://github.com/terilios/reasoning-iteration.git

Install required packages:
```
pip install -r requirements.txt
```

Create a .env file in the root directory and add your Azure OpenAI credentials:

AZURE_OPENAI_API_KEY=your_api_key
AZURE_OPENAI_API_BASE=your_api_base
AZURE_OPENAI_API_VERSION=your_api_version
AZURE_OPENAI_DEPLOYMENT_NAME=your_deployment_name
AZURE_OPENAI_MINI_DEPLOYMENT_NAME=your_mini_deployment_name

Note: The .env file is included in .gitignore to prevent sensitive information from being uploaded to the repository.

(Optional) Create a config.yaml file to customize system settings:

# Token limits
planning_max_tokens: 2000
output_max_tokens: 4000

# Model parameters
temperature: 0.7

# Retry settings
max_retries: 3
retry_base_delay: 1.0

# Validation thresholds
min_plan_length: 50
min_response_length: 200
min_paragraphs: 2

# Quality thresholds
quality_thresholds:
  high: 0.8
  medium: 0.6
  low: 0.4

# Async processing settings
concurrent_requests: 3
async_batch_size: 5

Usage

The script supports several modes of operation:

Single Prompt Processing:

python main.py --prompt "Your prompt here"

Multiple Prompts from File:

python main.py --prompts-file your_prompts.txt

Multiple Prompts with Async Processing:

python main.py --prompts-file your_prompts.txt --use-async

Custom Configuration:

python main.py --prompt "Your prompt here" --config custom_config.yaml

The script will:

Process your prompt(s) through the planning stage using gpt-4o-mini
Generate detailed response(s) using gpt-4o
Write the results to results.md with:
- Original prompt
- Generated plan
- Enhanced response
- Execution statistics including:
  - Total tokens used
  - Stage durations
  - Token usage per stage
  - Retry counts
  - Validation results

Code Structure and Approach

Strengths:

Clear Separation of Concerns: The code is well-organized into distinct functions for planning and output generation.
Robust Error Handling: Try-except blocks are properly implemented in each stage.
Good Configuration Management: Environment variables and YAML configuration for flexible settings.
Helpful Debugging Features: Comprehensive logging of prompts and responses, and performance metrics tracking.
Async Support: Efficient processing of multiple prompts with configurable batch sizes.

Areas for Improvement:

Response Validation: Enhance quality checks for the planning stage output and final response.
Configuration Enhancement: Add more configurable parameters for flexibility.
Enhanced Error Recovery: Implement more sophisticated retry logic with exponential backoff.
Extensibility: Add support for custom prompt templates.
Documentation: Expand docstrings and add more type hints for better code clarity.

Contributing

Contributions to enhance the functionality or efficiency of the system are welcome. Please submit a pull request with your proposed changes.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Output Format

The script generates a markdown file (results.md) with the following structure:

# AI Response Analysis

## Original Prompt

[Your input prompt]

## Generated Plan

[Structured plan from gpt-4o-mini]

## Enhanced Response

[Detailed response from gpt-4o]

## Execution Statistics

[JSON object containing performance metrics]

The execution statistics include:

Total duration of execution
Total tokens used across all stages
Duration of each stage (planning and output)
Token usage per stage
Number of retries if any
Validation results for response quality

Summary

This solution leverages Azure OpenAI's models to tackle complex tasks with greater efficiency and thoughtfulness. By employing a structured, two-stage approach with comprehensive token tracking, performance monitoring, and async processing capabilities, it enhances both the quality of the AI's output and the scalability of its application in various domains, such as strategic planning, content generation, and process optimization.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI-Powered Prompt Chaining System

Overview

Process Flow

Goals

Key Features

Requirements

Setup

Usage

Code Structure and Approach

Strengths:

Areas for Improvement:

Contributing

License

Output Format

Summary

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
main.py		main.py
requirements.txt		requirements.txt

License

terilios/reasoning-iteration

Folders and files

Latest commit

History

Repository files navigation

AI-Powered Prompt Chaining System

Overview

Process Flow

Goals

Key Features

Requirements

Setup

Usage

Code Structure and Approach

Strengths:

Areas for Improvement:

Contributing

License

Output Format

Summary

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages