CP-8403 Adding Telegraf-based metric collection. #81
Very happy to see this. Thanks for adding it.
ab-pre-push Success: Cloned a VM from the build and:
Output files are being logged under /var/log/telegraf:
I also pulled a Support Bundle and verified these are included. Enabling the performance playbook scripts:
and disabling them again:
I also tested streaming to InfluxDB directly by uncommenting the relevant config, and this works.
Initial addition of configuration and control files to enable performance metric collection using the Telegraf agent.
See also IDEA-2835 : Improving Support Bundle Performance Metrics
Includes:
- Service definition and startup script for "delphix-telegraf"
- Modified version of "estat" adding JSON output via a "-j" option
- A "perf_playbook" wrapper script to enable/disable enhanced collection
- Configuration file sections (combined on startup)
- Simple wrappers to facilitate parsing of "nfs_threads", "zpool iostat -o", and "zcache stats -a" outputs
The service starts with a "base" set of metrics. It also includes Object Storage
metrics when Object Storage is detected, and Performance Playbook commands
when the playbook has been enabled (manually). The config is reassembled on each startup.
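The reassembly described above can be sketched roughly as follows. This is an illustration only, not the actual `delphix-telegraf-service` script: the function name `assemble_config`, its arguments, and the marker-file convention for the playbook are assumptions, and the Object Storage detection step is left out.

```shell
#!/bin/sh
# Hypothetical sketch: rebuild the Telegraf config from its sections.
# The function name, arguments, and marker-file approach are assumptions,
# not taken from the real startup script.

assemble_config() {
    conf_dir=$1       # directory holding telegraf.base and the input sections
    out_file=$2       # assembled config file to hand to telegraf
    dose_detected=$3  # "yes" if Object Storage was detected (detection not shown)
    playbook_flag=$4  # marker file assumed to exist while the playbook is enabled

    # Always start from the base section, overwriting any previous config.
    cat "$conf_dir/telegraf.base" > "$out_file" || return 1

    # Append Object Storage inputs only when detected.
    if [ "$dose_detected" = "yes" ]; then
        cat "$conf_dir/telegraf.inputs.dose" >> "$out_file" || return 1
    fi

    # Append Performance Playbook inputs only when enabled.
    if [ -e "$playbook_flag" ]; then
        cat "$conf_dir/telegraf.inputs.playbook" >> "$out_file" || return 1
    fi
}
```

Because the output file is rewritten from scratch on every call, disabling the playbook and restarting the service would be enough to drop its inputs again under this scheme.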
File paths intended:
/opt/delphix/server/bin/delphix-telegraf-service
/lib/systemd/system/delphix-telegraf.service
/usr/bin/estat
/etc/telegraf/nfs-threads.sh
/opt/delphix/server/bin/perf_playbook
/etc/telegraf/telegraf.base
/etc/telegraf/telegraf.inputs.dose
/etc/telegraf/telegraf.inputs.playbook
/etc/telegraf/zcache-stats.sh
/etc/telegraf/zpool-iostat-o.sh
This configuration writes 3 output files (rotated on size): one for main metrics,
one for aggregate statistics (min, max, mean, stddev), and one for Playbook outputs,
so that each can have an independent retention period.