A distributed observability system that uses AI agents to analyze and respond to system events.
The system consists of several specialized agents:
- Log Agent: Analyzes log data to identify patterns and anomalies
- Metric Agent: Processes metrics to detect performance issues
- Tracing Agent: Analyzes distributed traces to identify bottlenecks
- Root Cause Agent: Determines the root cause of issues
- Postmortem Agent: Generates postmortem reports
- Notification Agent: Handles alerting and notifications
- Deployment Agent: Manages deployments and rollbacks
- Install dependencies:
pip install -r requirements.txt
- Set up environment variables:
cp .env.example .env
# Edit .env with your configuration
- Run the agents:
python main.py
pytest tests/
- Create a new tool class in
common/tools/
- Implement the tool using crewai's BaseTool
- Add the tool to the appropriate agent
- Update tests
- Create a new agent class in
agents/
- Implement the agent using crewai's Agent
- Add the agent to main.py
- Create tests
- Fork the repository
- Create a feature branch
- Make your changes
- Run tests
- Submit a pull request
MIT