-
Notifications
You must be signed in to change notification settings - Fork 1
Operations
Garot Conklin edited this page Apr 29, 2025
·
1 revision
CloudOpsAI operates as a serverless NOC agent using AWS Lambda and Bedrock for intelligent incident management. This guide covers key operational aspects of the system.
- Function Name:
cloudopsai-agent
- Memory: 1024MB
- Timeout: 15 minutes
- VPC: Private subnets with VPC endpoints
# View Lambda logs
aws logs tail /aws/lambda/cloudopsai-agent --follow
# Check alarm status
aws cloudwatch describe-alarms \
--alarm-name-prefix "CloudOpsAI"
- Lambda invocations (~$0.20/million)
- Bedrock API calls (~$0.01/1K tokens)
- CloudWatch logs ($0.50/GB)
- S3 storage (minimal)
- Check Lambda execution logs
- Review AI decisions
- Verify remediation success rates
- Review cost metrics
- Update YAML rules if needed
- Check for configuration drift
- Security review
- Performance optimization
- Rule effectiveness analysis
-
High Lambda Latency
- Check VPC endpoints
- Review memory usage
- Verify Bedrock availability
-
Failed Remediation
- Check IAM permissions
- Verify target resource state
- Review action logs
-
AI Decision Quality
- Review Bedrock prompts
- Check historical data
- Adjust thresholds
Access the operational dashboard at:
https://console.aws.amazon.com/cloudwatch/home?region=us-east-1#dashboards:name=CloudOpsAI
- Lambda success rate
- AI decision accuracy
- Remediation effectiveness
- Cost per incident
# Backup YAML rules
aws s3 sync s3://cloudopsai-config/ backup/
# Backup DynamoDB
aws dynamodb create-backup \
--table-name cloudopsai-incidents \
--backup-name "backup-$(date +%Y%m%d)"
- Rotate IAM access keys
- Review VPC security groups
- Check CloudTrail logs
- Update KMS key policy
- Maintain audit logs
- Review access patterns
- Update documentation
Report Bug | Request Feature | Documentation | Contributing Guidelines
© 2025 fleXRPL. All Rights Reserved.