Multi Agent System Benchmarking #6
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This pull request introduces a comprehensive framework for managing and interacting with a multi-agent system in a benchmarking environment. Key changes include the implementation of core classes for defining agents and their interactions, tools for creating and configuring agent systems, and utilities for handling input/output operations with enhanced user interactivity.
Multi-Agent System Framework
benchmarking/agents/AgentSystem.py: Added core classesAgent,Command, andAgentSystemto model agents, their commands, and their interactions. Includes methods for loading configurations from JSON, retrieving agents, and generating prompts for large language models (LLMs).Agent System Configuration and Creation
benchmarking/agents/create_agent_system.py: Introduced an interactive script for defining agents, connecting them, and saving configurations. Includes user-friendly prompts and error handling for creating agent systems.benchmarking/agents/system_blueprint.json: Added a sample JSON blueprint for a multi-agent system, including amaster_agentand two specialist agents (coder_agentandresearch_agent).Input/Output Enhancements
benchmarking/core/io_helpers.py: Added utilities for rich-text terminal interactions, dataset selection, and resource collection. Includes a function to extract Python code from text and format execution responses with detailed output.Miscellaneous
benchmarking/.gitignore: Updated to ignoreagent_systems/directory, ensuring generated agent configurations are not tracked.