Production-ready scripts and utilities for DevOps engineers and SysAdmins
A curated collection of battle-tested automation scripts, monitoring utilities, and system maintenance tools designed for modern infrastructure management. Built by the community, for the community.
# Clone the repository
git clone https://github.com/bubobot-team/devops-toolkit.git
cd devops-toolkit
# Make scripts executable
chmod +x scripts/system-health/system-doctor.sh
# Run a basic system health check
./scripts/system-health/system-doctor.sh --verbosesystem-doctor.sh- Comprehensive system health monitoring and auto-recoveryresource-monitor.sh- Real-time resource monitoring with alertingcleanup-agent.sh- Intelligent log and cache cleanup utility
audit-scanner.sh- Security audit and vulnerability scanningssl-checker.sh- SSL certificate monitoring and expiry alerts
zero-downtime-deploy.sh- Blue-green deployment automationhealth-validator.sh- Post-deployment health validation
service-watchdog.sh- Intelligent service monitoring and restartalert-processor.sh- Alert aggregation and intelligent routing
Our flagship script provides comprehensive system health monitoring and automated recovery:
# Basic health check with auto-fixes
./scripts/system-health/system-doctor.sh
# Check-only mode (no changes)
./scripts/system-health/system-doctor.sh --check-only
# Aggressive cleanup with service restarts
./scripts/system-health/system-doctor.sh --aggressive-clean --service-restart
# Generate JSON report for monitoring systems
./scripts/system-health/system-doctor.sh --report-json- β CPU, Memory, Disk monitoring with intelligent thresholds
- β Service health checks with automatic restart capabilities
- β Smart log cleanup with size-based triggers
- β Network connectivity validation
- β JSON output for integration with monitoring platforms
- β Zero-dependency - runs on any Linux system
name: System Health Check
on: [push, schedule]
jobs:
health-check:
runs-on: self-hosted
steps:
- uses: actions/checkout@v3
- name: Run System Doctor
run: ./scripts/system-health/system-doctor.sh --check-only --report-json- name: Run system health check
command: ./system-doctor.sh --aggressive-clean
register: health_result
changed_when: health_result.rc != 0Our scripts are designed to integrate seamlessly with popular monitoring platforms:
- Prometheus - Metrics export via JSON output
- Grafana - Dashboard templates included
- Bubobot - Native integration for comprehensive monitoring
- Datadog, New Relic - Compatible output formats
- PagerDuty, OpsGenie - Alert routing capabilities
- Monitoring Stack (Prometheus + Grafana + AlertManager)
- Logging Stack (ELK/EFK stack with automation)
- Security Stack (Fail2ban + ModSecurity + monitoring)
- Health Check Service - Automated background monitoring
- Cleanup Timer - Scheduled maintenance tasks
- Alert Service - Real-time alerting daemon
- Daily Health Reports - Automated system reporting
- Weekly Cleanup - Maintenance automation
- Monthly Security Audit - Automated security scans
- Set appropriate thresholds based on your workload patterns
- Monitor trends, not just current values
- Implement gradual alerting (warning β critical)
- Document your monitoring strategy
- Always test in staging before production deployment
- Implement rollback mechanisms
- Use check-only mode for validation
- Monitor automation effectiveness
- Automate common fixes but escalate complex issues
- Log all automated actions for audit trails
- Implement circuit breakers to prevent automation loops
- Regular review of automation effectiveness
- Discord: Join our DevOps community
- GitHub Discussions: Share experiences and get help
- Blog: DevOps automation insights
- Email: support@bubobot.com
We welcome contributions! Check out our Contributing Guide to get started.
- Documentation: Check our docs folder
- GitHub Issues: Report bugs or request features
- Community Discord: Ask questions and share knowledge
- Stack Overflow: Tag questions with
devops-toolkit
This toolkit is trusted by teams at:
- Startups building their first monitoring infrastructure
- Scale-ups automating operations for growth
- Enterprises standardizing DevOps practices
- MSPs managing multiple client environments
- β System health monitoring (system-doctor.sh)
- β³ Security audit tools
- β³ Container health monitoring
- β³ Database monitoring utilities
- β³ Kubernetes integration scripts
- β³ Cloud provider specific tools (AWS, GCP, Azure)
- β³ Advanced ML-based anomaly detection
- β³ Terraform integration modules
- β³ Mobile app for monitoring dashboards
- β³ Advanced alerting with ML
- β³ Compliance automation tools
- β³ Enterprise SSO integration
This project is licensed under the MIT License - see the LICENSE file for details.
If this toolkit helps you, please give it a star! It helps others discover these tools and motivates us to keep improving.
Built with β€οΈ by the DevOps community
Looking for comprehensive monitoring solutions? Check out Bubobot - the monitoring platform that integrates perfectly with these tools!