# Chain of Agents (CoA)

A Chain of Agents (CoA) implementation in Python and Swift: a framework for long-context tasks using Large Language Models (LLMs).

This repository implements the Chain-of-Agents framework as described in:

- Chain of Agents: Large language models collaborating on long-context tasks (Google Research Blog)
- Chain of Agents (research paper)
## Overview

The Chain of Agents framework enables efficient processing of long-context tasks by:

- Breaking large inputs into manageable chunks
- Using worker agents to process individual chunks
- Employing a manager agent to synthesize the workers' results
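A minimal sketch of that chunk/worker/manager flow, with a stubbed `call_llm` standing in for a real model call (all names here are illustrative, not the repository's actual API):

```python
def call_llm(prompt: str) -> str:
    """Stub for an LLM call; swap in a real client (e.g. Together AI) here."""
    return f"<response to {len(prompt)} chars of prompt>"

def chunk_text(text: str, chunk_size: int) -> list[str]:
    """Split the input into fixed-size character chunks."""
    return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

def chain_of_agents(text: str, query: str, chunk_size: int = 2000) -> str:
    # Each worker reads one chunk plus the running summary from the previous worker.
    summary = ""
    for chunk in chunk_text(text, chunk_size):
        summary = call_llm(
            f"Previous summary:\n{summary}\n\nChunk:\n{chunk}\n\nQuestion: {query}\n"
            "Update the summary with anything relevant to the question."
        )
    # The manager turns the final accumulated summary into an answer.
    return call_llm(f"Summary:\n{summary}\n\nAnswer the question: {query}")
```

The key property of the chain is that no single call ever sees the full document: each worker sees one chunk, and the manager sees only the final summary.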
## Features

- PDF document analysis
- Configurable chunk sizes for processing
- Real-time progress tracking
- Streaming responses from both worker and manager agents
- Clean, native macOS interface
- Dual processing modes:
  - Cloud-based processing using Together AI's LLaMA models
  - On-device processing using the MLX framework
- Offline inference with MLX after the initial model download
## Installation

```bash
git clone https://github.com/rudrankriyam/chain-of-agents.git
cd chain-of-agents
pip install -r requirements.txt
```
### Requirements

- Python 3.x
- Xcode 15+ (for the macOS app)
- A Together API key (sign up at Together.ai)
Create a `.env` file in the root directory and add your Together API key:

```bash
echo "TOGETHER_API_KEY=your_api_key_here" >> .env
```
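The `.env` file holds simple `KEY=VALUE` pairs. A minimal sketch of how the Python side might read the key (the function names are illustrative; the repository may use a library such as `python-dotenv` instead):

```python
import os

def parse_env_file(text: str) -> dict:
    """Parse simple KEY=VALUE lines, ignoring blanks and # comments."""
    env = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        env[key.strip()] = value.strip()
    return env

def load_together_key(env_text: str) -> str:
    """Prefer a real environment variable, fall back to the .env contents."""
    return os.environ.get("TOGETHER_API_KEY") or \
        parse_env_file(env_text).get("TOGETHER_API_KEY", "")
```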
## Usage

- Run the example script:

```bash
./run.sh
```
This will:
- Set up a Python virtual environment
- Install required dependencies
- Process a sample PDF document using the Chain of Agents framework
### macOS App

- Start the API server:

```bash
./run_api.sh
```
This will:
- Activate the Python virtual environment
- Start the API server that the macOS app communicates with
- The server will run on `localhost:8000`
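The app consumes the server's streaming output as Server-Sent Events (see Technical Details below). A hedged sketch of parsing such a stream on the client side; the endpoint path in the comment is hypothetical:

```python
def parse_sse(stream: str) -> list[str]:
    """Extract data payloads from a raw SSE stream (events separated by blank lines)."""
    events = []
    for block in stream.split("\n\n"):
        data_lines = [
            line[len("data:"):].strip()
            for line in block.splitlines()
            if line.startswith("data:")
        ]
        if data_lines:
            events.append("\n".join(data_lines))
    return events

# A real client would stream from the local server, e.g. with `requests`:
# for line in requests.get("http://localhost:8000/...", stream=True).iter_lines(): ...
```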
- Open the Xcode project:

```bash
open ChainOfAgents.xcodeproj
```

- Build and run the app (⌘R)
## Processing Modes

- **Cloud Processing**: uses Together AI's hosted models through the API server
- **On-Device Processing**: uses the MLX framework for local inference
  - Automatically downloads and caches the required model
  - No internet connection required after the initial model download
  - Lower latency, though performance characteristics may differ from the cloud models
The macOS app provides:
- PDF document selection
- Custom query input
- Real-time processing visualization
- Worker agent progress tracking
- Final synthesis display
- Toggle between cloud and on-device processing
## Models

### Cloud Processing

Uses Together AI's hosted models:

- Worker model: `meta-llama/Llama-3.3-70B-Instruct-Turbo-Free`
- Manager model: `meta-llama/Llama-3.3-70B-Instruct-Turbo-Free`

### On-Device Processing

Uses MLX-optimized models:

- Default model: `llama-3.1-8B` (quantized 8-bit version)
- Automatically handles model downloading and caching
- Optimized for Apple Silicon processors
## Technical Details

- Built with SwiftUI for the macOS interface
- Uses the MLX framework for efficient on-device inference
- Implements Server-Sent Events (SSE) for real-time progress updates
- Supports concurrent processing of document chunks
- Automatic memory management for large documents
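The concurrent chunk processing mentioned above can be sketched on the Python side with `asyncio`; the worker call here is a stand-in, not the repository's actual API:

```python
import asyncio

async def process_chunk(index: int, chunk: str) -> str:
    """Stand-in for a worker-agent call; a real version would query an LLM."""
    await asyncio.sleep(0)  # yield control, as a real network call would
    return f"summary[{index}]: {chunk[:20]}"

async def process_document(chunks: list[str]) -> list[str]:
    # Fan the chunks out to concurrent worker tasks; gather preserves chunk order.
    tasks = [process_chunk(i, c) for i, c in enumerate(chunks)]
    return await asyncio.gather(*tasks)
```

`asyncio.gather` keeps results in submission order, so chunk summaries can be concatenated for the manager without re-sorting.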
## Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

## License

This project is licensed under the MIT License. See the LICENSE file for details.
## Citation

```bibtex
@article{zhang2024chain,
  title={Chain of Agents: Large Language Models Collaborating on Long-Context Tasks},
  author={Zhang, Yusen and Sun, Ruoxi and Chen, Yanfei and Pfister, Tomas and Zhang, Rui and Arık, Sercan Ö.},
  journal={arXiv preprint arXiv:2406.02818},
  year={2024}
}
```