Skip to content

Latest commit

 

History

History

elasticsearch

DATABASE ELASTICSEARCH DataDog monitors

How to use this module

module "datadog-monitors-database-elasticsearch" {
  source      = "claranet/monitors/datadog//database/elasticsearch"
  version     = "{revision}"

  environment = var.environment
  message     = module.datadog-message-alerting.alerting-message
}

Purpose

Creates DataDog monitors with the following checks:

  • Elasticsearch average index flushing to disk latency
  • Elasticsearch average indexing latency by document
  • Elasticsearch average Old-generation garbage collections latency
  • Elasticsearch average search fetch latency
  • Elasticsearch average search query latency
  • Elasticsearch average Young-generation garbage collections latency
  • Elasticsearch change alert on the average time spent by tasks in the queue on {{cluster_name}}
  • Elasticsearch change alert on the number of currently active queries on {{cluster_name}}
  • Elasticsearch change alert on the number of query cache evictions on {{node_name}}
  • Elasticsearch change alert on the number of request cache evictions on {{node_name}}
  • Elasticsearch change alert on the number of search fetches currently running on {{cluster_name}}
  • Elasticsearch change alert on the total number of evictions from the fielddata cache on {{node_name}}
  • ElasticSearch Cluster has maximum shards open on {{cluster_name}}
  • ElasticSearch Cluster has unassigned shards on {{cluster_name}}
  • ElasticSearch Cluster is initializing shards on {{cluster_name}}
  • ElasticSearch Cluster is relocating shards on {{cluster_name}}
  • ElasticSearch Cluster status not green on {{cluster_name}}
  • ElasticSearch does not respond on {{server}}:{{port}}
  • ElasticSearch free space < 10% on {{node_name}}
  • Elasticsearch JVM HEAP memory usage
  • Elasticsearch JVM memory Old usage
  • Elasticsearch JVM memory Young usage
  • Elasticsearch number of current open HTTP connections anomaly detected on {{node_name}}
  • Elasticsearch {{policy}} snapshot deletion failure on {{cluster_name}}
  • Elasticsearch {{policy}} snapshot failed on {{cluster_name}}

Requirements

Name Version
terraform >= 0.12.31
datadog >= 3.1.2

Providers

Name Version
datadog >= 3.1.2

Modules

Name Source Version
filter-tags ../../common/filter-tags n/a

Resources

Name Type
datadog_monitor.cluster_initializing_shards resource
datadog_monitor.cluster_maximum_shards resource
datadog_monitor.cluster_relocating_shards resource
datadog_monitor.cluster_status_not_green resource
datadog_monitor.cluster_unassigned_shards resource
datadog_monitor.fetch_change resource
datadog_monitor.fetch_latency resource
datadog_monitor.field_data_evictions_change resource
datadog_monitor.flush_latency resource
datadog_monitor.http_connections_anomaly resource
datadog_monitor.indexing_latency resource
datadog_monitor.jvm_gc_old_collection_latency resource
datadog_monitor.jvm_gc_young_collection_latency resource
datadog_monitor.jvm_heap_memory_usage resource
datadog_monitor.jvm_memory_old_usage resource
datadog_monitor.jvm_memory_young_usage resource
datadog_monitor.node_free_space resource
datadog_monitor.not_responding resource
datadog_monitor.query_cache_evictions_change resource
datadog_monitor.request_cache_evictions_change resource
datadog_monitor.search_query_change resource
datadog_monitor.search_query_latency resource
datadog_monitor.slm_snapshot_deletion_failures resource
datadog_monitor.slm_snapshots_failed resource
datadog_monitor.task_time_in_queue_change resource

Inputs

Name Description Type Default Required
cluster_initializing_shards_enabled Flag to enable Cluster Initializing Shards monitor string "true" no
cluster_initializing_shards_extra_tags Extra tags for Cluster Initializing Shards monitor list(string) [] no
cluster_initializing_shards_message Custom message for the Cluster Initializing Shards monitor string "" no
cluster_initializing_shards_threshold_critical Cluster Initializing Shards critical threshold string 2 no
cluster_initializing_shards_threshold_warning Cluster Initializing Shards warning threshold string 1 no
cluster_initializing_shards_time_aggregator Time aggregator for the Cluster Initializing Shards monitor string "avg" no
cluster_initializing_shards_timeframe Timeframe for the Cluster Initializing Shards monitor string "last_5m" no
cluster_maximum_shards_enabled Flag to enable Cluster Maximum Shards monitor string "true" no
cluster_maximum_shards_extra_tags Extra tags for Cluster Maximum Shards monitor list(string) [] no
cluster_maximum_shards_message Custom message for the Cluster Maximum Shards monitor string "" no
cluster_maximum_shards_per_node Maximum shards per node for the Cluster Maximum Shards monitor number 1000 no
cluster_maximum_shards_threshold_critical Cluster Maximum Shards critical threshold string 95 no
cluster_maximum_shards_threshold_warning Cluster Maximum Shards warning threshold string 90 no
cluster_maximum_shards_time_aggregator Time aggregator for the Cluster Maximum Shards monitor string "avg" no
cluster_maximum_shards_timeframe Timeframe for the Cluster Maximum Shards monitor string "last_5m" no
cluster_relocating_shards_enabled Flag to enable Cluster Relocating Shards monitor string "true" no
cluster_relocating_shards_extra_tags Extra tags for Cluster Relocating Shards monitor list(string) [] no
cluster_relocating_shards_message Custom message for the Cluster Relocating Shards monitor string "" no
cluster_relocating_shards_threshold_critical Cluster Relocating Shards critical threshold string 2 no
cluster_relocating_shards_threshold_warning Cluster Relocating Shards warning threshold string 1 no
cluster_relocating_shards_time_aggregator Time aggregator for the Cluster Relocating Shards monitor string "avg" no
cluster_relocating_shards_timeframe Timeframe for the Cluster Relocating Shards monitor string "last_5m" no
cluster_status_not_green_enabled Flag to enable Cluster Status monitor string "true" no
cluster_status_not_green_extra_tags Extra tags for Cluster Status monitor list(string) [] no
cluster_status_not_green_message Custom message for the Cluster Status monitor string "" no
cluster_status_not_green_threshold_critical Cluster Status critical threshold string 0 no
cluster_status_not_green_threshold_warning Cluster Status warning threshold string 1 no
cluster_status_not_green_time_aggregator Time aggregator for the Cluster Status monitor string "avg" no
cluster_status_not_green_timeframe Timeframe for the Cluster Status monitor string "last_5m" no
cluster_unassigned_shards_enabled Flag to enable Cluster Unassigned Shards monitor string "true" no
cluster_unassigned_shards_extra_tags Extra tags for Cluster Unassigned Shards monitor list(string) [] no
cluster_unassigned_shards_message Custom message for the Cluster Unassigned Shards monitor string "" no
cluster_unassigned_shards_threshold_critical Cluster Unassigned Shards critical threshold string 2 no
cluster_unassigned_shards_threshold_warning Cluster Unassigned Shards warning threshold string 1 no
cluster_unassigned_shards_time_aggregator Time aggregator for the Cluster Unassigned Shards monitor string "avg" no
cluster_unassigned_shards_timeframe Timeframe for the Cluster Unassigned Shards monitor string "last_5m" no
environment Architecture Environment string n/a yes
evaluation_delay Delay in seconds for the metric evaluation number 15 no
fetch_change_enabled Flag to enable Fetch Change monitor string "true" no
fetch_change_extra_tags Extra tags for Fetch Change monitor list(string) [] no
fetch_change_message Custom message for the Fetch Change monitor string "" no
fetch_change_threshold_critical Fetch Change critical threshold string 100 no
fetch_change_threshold_warning Fetch Change warning threshold string 75 no
fetch_change_time_aggregator Time aggregator for the Fetch Change monitor string "avg" no
fetch_change_timeframe Timeframe for the Fetch Change monitor string "last_10m" no
fetch_change_timeshift Timeshift for the Fetch Change monitor string "last_10m" no
fetch_latency_enabled Flag to enable Fetch Latency monitor string "true" no
fetch_latency_extra_tags Extra tags for Fetch Latency monitor list(string) [] no
fetch_latency_message Custom message for the Fetch Latency monitor string "" no
fetch_latency_threshold_critical Fetch Latency critical threshold string 20 no
fetch_latency_threshold_warning Fetch Latency warning threshold string 10 no
fetch_latency_time_aggregator Time aggregator for the Fetch Latency monitor string "min" no
fetch_latency_timeframe Timeframe for the Fetch Latency monitor string "last_15m" no
field_data_evictions_change_enabled Flag to enable Field Data Evictions monitor string "true" no
field_data_evictions_change_extra_tags Extra tags for Field Data Evictions monitor list(string) [] no
field_data_evictions_change_message Custom message for the Field Data Evictions monitor string "" no
field_data_evictions_change_threshold_critical Field Data Evictions critical threshold string 120 no
field_data_evictions_change_threshold_warning Field Data Evictions warning threshold string 60 no
field_data_evictions_change_time_aggregator Time aggregator for the Field Data Evictions monitor string "avg" no
field_data_evictions_change_timeframe Timeframe for the Field Data Evictions monitor string "last_15m" no
field_data_evictions_change_timeshift Timeframe for the Field Data Evictions monitor string "last_15m" no
filter_tags_custom Tags used for custom filtering when filter_tags_use_defaults is false string "*" no
filter_tags_custom_excluded Tags excluded for custom filtering when filter_tags_use_defaults is false string "" no
filter_tags_use_defaults Use default filter tags convention string "true" no
flush_latency_enabled Flag to enable Flush Latency monitor string "true" no
flush_latency_extra_tags Extra tags for Flush Latency monitor list(string) [] no
flush_latency_message Custom message for the Flush Latency monitor string "" no
flush_latency_threshold_critical Flush Latency critical threshold string 150 no
flush_latency_threshold_warning Flush Latency warning threshold string 100 no
flush_latency_time_aggregator Time aggregator for the Flush Latency monitor string "avg" no
flush_latency_timeframe Timeframe for the Flush Latency monitor string "last_15m" no
http_connections_anomaly_alert_window Alert window. string "last_15m" no
http_connections_anomaly_count_default_zero Count default zero. string "true" no
http_connections_anomaly_detection_algorithm Anomaly Detection Algorithm used string "agile" no
http_connections_anomaly_deviations Deviations to detect the anomaly string 2 no
http_connections_anomaly_direction Direction of the anomaly. It can be both, below or above. string "above" no
http_connections_anomaly_enabled Flag to enable Open HTTP Connections Anomaly monitor string "true" no
http_connections_anomaly_extra_tags Extra tags for Open HTTP Connections Anomaly monitor list(string) [] no
http_connections_anomaly_interval Interval. string 60 no
http_connections_anomaly_message Custom message for the Open HTTP Connections Anomaly monitor string "" no
http_connections_anomaly_seasonality Seasonality of the algorithm string "hourly" no
http_connections_anomaly_threshold_critical Open HTTP Connections Anomaly critical threshold string 1 no
http_connections_anomaly_threshold_warning Open HTTP Connections Anomaly warning threshold string 0.75 no
http_connections_anomaly_time_aggregator Time aggregator for the Open HTTP Connections Anomaly monitor string "avg" no
http_connections_anomaly_timeframe Timeframe for the Open HTTP Connections Anomaly monitor string "last_4h" no
indexing_latency_enabled Flag to enable Indexing Latency monitor string "true" no
indexing_latency_extra_tags Extra tags for Indexing Latency monitor list(string) [] no
indexing_latency_message Custom message for the Indexing Latency monitor string "" no
indexing_latency_threshold_critical Indexing Latency critical threshold string 30 no
indexing_latency_threshold_warning Indexing Latency warning threshold string 15 no
indexing_latency_time_aggregator Time aggregator for the Indexing Latency monitor string "avg" no
indexing_latency_timeframe Timeframe for the Indexing Latency monitor string "last_10m" no
jvm_gc_old_collection_latency_enabled Flag to enable JVM Garbace Collector Old Collection Latency monitor string "true" no
jvm_gc_old_collection_latency_extra_tags Extra tags for JVM Garbace Collector Old Collection Latency monitor list(string) [] no
jvm_gc_old_collection_latency_message Custom message for the JVM Garbace Collector Old Collection Latency monitor string "" no
jvm_gc_old_collection_latency_threshold_critical JVM Garbace Collector Old Collection Latency critical threshold string 300 no
jvm_gc_old_collection_latency_threshold_warning JVM Garbace Collector Old Collection Latency warning threshold string 200 no
jvm_gc_old_collection_latency_time_aggregator Time aggregator for the JVM Garbace Collector Old Collection Latency monitor string "avg" no
jvm_gc_old_collection_latency_timeframe Timeframe for the JVM Garbace Collector Old Collection Latency monitor string "last_15m" no
jvm_gc_young_collection_latency_enabled Flag to enable JVM Garbace Collector Young Collection Latency monitor string "true" no
jvm_gc_young_collection_latency_extra_tags Extra tags for JVM Garbace Collector Young Collection Latency monitor list(string) [] no
jvm_gc_young_collection_latency_message Custom message for the JVM Garbace Collector Young Collection Latency monitor string "" no
jvm_gc_young_collection_latency_threshold_critical JVM Garbace Collector Young Collection Latency critical threshold string 40 no
jvm_gc_young_collection_latency_threshold_warning JVM Garbace Collector Young Collection Latency warning threshold string 20 no
jvm_gc_young_collection_latency_time_aggregator Time aggregator for the JVM Garbace Collector Young Collection Latency monitor string "avg" no
jvm_gc_young_collection_latency_timeframe Timeframe for the JVM Garbace Collector Young Collection Latency monitor string "last_15m" no
jvm_heap_memory_usage_enabled Flag to enable JVM Heap Memory Usage monitor string "true" no
jvm_heap_memory_usage_extra_tags Extra tags for JVM Heap Memory Usage monitor list(string) [] no
jvm_heap_memory_usage_message Custom message for the JVM Heap Memory Usage monitor string "" no
jvm_heap_memory_usage_threshold_critical JVM Heap Memory Usage critical threshold string 90 no
jvm_heap_memory_usage_threshold_warning JVM Heap Memory Usage warning threshold string 80 no
jvm_heap_memory_usage_time_aggregator Time aggregator for the JVM Heap Memory Usage monitor string "avg" no
jvm_heap_memory_usage_timeframe Timeframe for the JVM Heap Memory Usage monitor string "last_5m" no
jvm_memory_old_usage_enabled Flag to enable JVM Memory Old Usage monitor string "true" no
jvm_memory_old_usage_extra_tags Extra tags for JVM Memory Old Usage monitor list(string) [] no
jvm_memory_old_usage_message Custom message for the JVM Memory Old Usage monitor string "" no
jvm_memory_old_usage_threshold_critical JVM Memory Old Usage critical threshold string 90 no
jvm_memory_old_usage_threshold_warning JVM Memory Old Usage warning threshold string 80 no
jvm_memory_old_usage_time_aggregator Time aggregator for the JVM Memory Old Usage monitor string "avg" no
jvm_memory_old_usage_timeframe Timeframe for the JVM Memory Old Usage monitor string "last_10m" no
jvm_memory_young_usage_enabled Flag to enable JVM Memory Young Usage monitor string "true" no
jvm_memory_young_usage_extra_tags Extra tags for JVM Memory Young Usage monitor list(string) [] no
jvm_memory_young_usage_message Custom message for the JVM Memory Young Usage monitor string "" no
jvm_memory_young_usage_threshold_critical JVM Memory Young Usage critical threshold string 90 no
jvm_memory_young_usage_threshold_warning JVM Memory Young Usage warning threshold string 80 no
jvm_memory_young_usage_time_aggregator Time aggregator for the JVM Memory Young Usage monitor string "avg" no
jvm_memory_young_usage_timeframe Timeframe for the JVM Memory Young Usage monitor string "last_10m" no
message Message sent when a monitor is triggered any n/a yes
new_group_delay Delay in seconds before monitor new resource number 300 no
new_host_delay Delay in seconds before begin to monitor new host number 300 no
node_free_space_enabled Flag to enable Free Space in nodes monitor string "true" no
node_free_space_extra_tags Extra tags for Free Space in nodes monitor list(string) [] no
node_free_space_message Custom message for the Free Space in nodes monitor string "" no
node_free_space_threshold_critical Free Space in nodes critical threshold string 10 no
node_free_space_threshold_warning Free Space in nodes warning threshold string 20 no
node_free_space_time_aggregator Time aggregator for the Free Space in nodes monitor string "sum" no
node_free_space_timeframe Timeframe for the Free Space in nodes monitor string "last_5m" no
not_responding_enabled Flag to enable Elasticsearch does not respond monitor string "true" no
not_responding_extra_tags Extra tags for Elasticsearch does not respond monitor list(string) [] no
not_responding_message Custom message for Elasticsearch does not respond monitor string "" no
not_responding_no_data_timeframe Elasticsearch not responding monitor no data timeframe string 10 no
not_responding_threshold_warning Elasticsearch not responding limit (warning threshold) number 3 no
notify_no_data Will raise no data alert if set to true bool true no
prefix_slug Prefix string to prepend between brackets on every monitors names string "" no
query_cache_evictions_change_enabled Flag to enable Query Cache Evictions monitor string "true" no
query_cache_evictions_change_extra_tags Extra tags for Query Cache Evictions monitor list(string) [] no
query_cache_evictions_change_message Custom message for the Query Cache Evictions monitor string "" no
query_cache_evictions_change_threshold_critical Query Cache Evictions critical threshold string 120 no
query_cache_evictions_change_threshold_warning Query Cache Evictions warning threshold string 60 no
query_cache_evictions_change_time_aggregator Time aggregator for the Query Cache Evictions monitor string "avg" no
query_cache_evictions_change_timeframe Timeframe for the Query Cache Evictions monitor string "last_15m" no
query_cache_evictions_change_timeshift Timeframe for the Query Cache Evictions monitor string "last_15m" no
request_cache_evictions_change_enabled Flag to enable Request Cache Evictions monitor string "true" no
request_cache_evictions_change_extra_tags Extra tags for Request Cache Evictions monitor list(string) [] no
request_cache_evictions_change_message Custom message for the Request Cache Evictions monitor string "" no
request_cache_evictions_change_threshold_critical Request Cache Evictions critical threshold string 120 no
request_cache_evictions_change_threshold_warning Request Cache Evictions warning threshold string 60 no
request_cache_evictions_change_time_aggregator Time aggregator for the Request Cache Evictions monitor string "avg" no
request_cache_evictions_change_timeframe Timeframe for the Request Cache Evictions monitor string "last_15m" no
request_cache_evictions_change_timeshift Timeshift for the Request Cache Evictions monitor string "last_15m" no
search_query_change_enabled Flag to enable Search Query Change monitor string "true" no
search_query_change_extra_tags Extra tags for Search Query Change monitor list(string) [] no
search_query_change_message Custom message for the Search Query Change monitor string "" no
search_query_change_threshold_critical Search Query Change critical threshold string 100 no
search_query_change_threshold_warning Search Query Change warning threshold string 75 no
search_query_change_time_aggregator Time aggregator for the Search Query Change monitor string "avg" no
search_query_change_timeframe Timeframe for the Search Query Change monitor string "last_10m" no
search_query_change_timeshift Timeshift for the Search Query Change monitor string "last_10m" no
search_query_latency_enabled Flag to enable Query Latency monitor string "true" no
search_query_latency_extra_tags Extra tags for Query Latency monitor list(string) [] no
search_query_latency_message Custom message for the Query Latency monitor string "" no
search_query_latency_threshold_critical Query Latency critical threshold string 20 no
search_query_latency_threshold_warning Query Latency warning threshold string 10 no
search_query_latency_time_aggregator Time aggregator for the Query Latency monitor string "avg" no
search_query_latency_timeframe Timeframe for the Query Latency monitor string "last_15m" no
slm_snapshot_deletion_failures_enabled Flag to enable SLM Snapshot deletion failures monitor string "true" no
slm_snapshot_deletion_failures_extra_tags Extra tags for SLM Snapshot deletion failures monitor list(string) [] no
slm_snapshot_deletion_failures_message Custom message for SLM Snapshot deletion failures monitor string "" no
slm_snapshot_deletion_failures_threshold_critical SLM Snapshot deletion failures critical threshold string 0 no
slm_snapshot_deletion_failures_time_aggregator Time aggregator for SLM Snapshot deletion failures monitor string "avg" no
slm_snapshot_deletion_failures_timeframe SLM Snapshot deletion failures timeframe string "last_15m" no
slm_snapshot_deletion_failures_timeshift SLM Snapshot deletion failures timeshift string "last_2h" no
slm_snapshots_failed_enabled Flag to enable SLM Snapshots Failed monitor string "true" no
slm_snapshots_failed_extra_tags Extra tags for SLM Snapshots Failed monitor list(string) [] no
slm_snapshots_failed_message Custom message for SLM Snapshots Failed monitor string "" no
slm_snapshots_failed_threshold_critical SLM Snapshots Failed critical threshold string 0 no
slm_snapshots_failed_time_aggregator Time aggregator for SLM Snapshots Failed monitor string "avg" no
slm_snapshots_failed_timeframe SLM Snapshots Failed timeframe string "last_15m" no
slm_snapshots_failed_timeshift SLM Snapshots Failed timeshift string "last_2h" no
tags Global variables list(string)
[
"type:database",
"provider:elasticsearch",
"resource:elasticsearch"
]
no
task_time_in_queue_change_enabled Flag to enable Task Time in Queue monitor string "true" no
task_time_in_queue_change_extra_tags Extra tags for Task Time in Queue monitor list(string) [] no
task_time_in_queue_change_message Custom message for the Task Time in Queue monitor string "" no
task_time_in_queue_change_threshold_critical Task Time in Queue critical threshold string 200 no
task_time_in_queue_change_threshold_warning Task Time in Queue warning threshold string 100 no
task_time_in_queue_change_time_aggregator Time aggregator for the Task Time in Queue monitor string "avg" no
task_time_in_queue_change_timeframe Timeframe for the Task Time in Queue monitor string "last_10m" no
task_time_in_queue_change_timeshift Timeshift for the Task Time in Queue monitor string "last_10m" no
team n/a string "claranet" no
timeout_h Default auto-resolving state (in hours) number 0 no

Outputs

Name Description
cluster_initializing_shards_id id for monitor cluster_initializing_shards
cluster_maximum_shards_id id for monitor cluster_maximum_shards
cluster_relocating_shards_id id for monitor cluster_relocating_shards
cluster_status_not_green_id id for monitor cluster_status_not_green
cluster_unassigned_shards_id id for monitor cluster_unassigned_shards
fetch_change_id id for monitor fetch_change
fetch_latency_id id for monitor fetch_latency
field_data_evictions_change_id id for monitor field_data_evictions_change
flush_latency_id id for monitor flush_latency
http_connections_anomaly_id id for monitor http_connections_anomaly
indexing_latency_id id for monitor indexing_latency
jvm_gc_old_collection_latency_id id for monitor jvm_gc_old_collection_latency
jvm_gc_young_collection_latency_id id for monitor jvm_gc_young_collection_latency
jvm_heap_memory_usage_id id for monitor jvm_heap_memory_usage
jvm_memory_old_usage_id id for monitor jvm_memory_old_usage
jvm_memory_young_usage_id id for monitor jvm_memory_young_usage
node_free_space_id id for monitor node_free_space
not_responding_id id for monitor not_responding
query_cache_evictions_change_id id for monitor query_cache_evictions_change
request_cache_evictions_change_id id for monitor request_cache_evictions_change
search_query_change_id id for monitor search_query_change
search_query_latency_id id for monitor search_query_latency
slm_snapshot_deletion_failures_id id for monitor slm_snapshot_deletion_failures
slm_snapshots_failed_id id for monitor slm_snapshots_failed
task_time_in_queue_change_id id for monitor task_time_in_queue_change

Related documentation