Get metrics from the mapreduce service in real time to:
- Visualize and monitor mapreduce states.
- Be notified about mapreduce failovers and events.
The Mapreduce check is included in the Datadog Agent package, so you don't need to install anything else on your servers.
To configure this check for an Agent running on a host:
- Edit the `mapreduce.d/conf.yaml` file, in the `conf.d/` folder at the root of your Agent's configuration directory, to point to your server and port and to set the masters to monitor. See the sample mapreduce.d/conf.yaml for all available configuration options.
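
As a rough illustration of the step above, a minimal `mapreduce.d/conf.yaml` might look like the sketch below. The `resourcemanager_uri` and `cluster_name` keys are the ones this integration's instance configuration uses. The `http` scheme, the host placeholder, and port 8088 are assumptions to adapt to your cluster. The sample mapreduce.d/conf.yaml remains the authoritative list of options.

```yaml
# Minimal sketch, not a complete configuration.
init_config:

instances:
    # Assumed address: replace the placeholder with your ResourceManager host,
    # and adjust the scheme and port if your cluster differs.
  - resourcemanager_uri: http://<RESOURCEMANAGER_HOST>:8088
    # Name used to identify this cluster.
    cluster_name: <MAPREDUCE_CLUSTER_NAME>
```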
For containerized environments, see the Autodiscovery Integration Templates for guidance on applying the parameters below.
Parameter | Value |
---|---|
`<INTEGRATION_NAME>` | `mapreduce` |
`<INIT_CONFIG>` | blank or `{}` |
`<INSTANCE_CONFIG>` | `{"resourcemanager_uri": "https://%%host%%:8088", "cluster_name":"<MAPREDUCE_CLUSTER_NAME>"}` |
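
One possible way to apply these parameters is through Kubernetes pod annotations. The sketch below assumes a Kubernetes deployment and the `ad.datadoghq.com/<CONTAINER_IDENTIFIER>.*` annotation format. The pod name, container name, and image are hypothetical placeholders. Other platforms (Docker labels, ECS) use a different syntax, described in the Autodiscovery Integration Templates.

```yaml
# Sketch only: assumes Kubernetes and Autodiscovery pod annotations.
apiVersion: v1
kind: Pod
metadata:
  name: mapreduce                 # hypothetical pod name
  annotations:
    # <INTEGRATION_NAME>
    ad.datadoghq.com/mapreduce.check_names: '["mapreduce"]'
    # <INIT_CONFIG>
    ad.datadoghq.com/mapreduce.init_configs: '[{}]'
    # <INSTANCE_CONFIG>
    ad.datadoghq.com/mapreduce.instances: |
      [{"resourcemanager_uri": "https://%%host%%:8088", "cluster_name": "<MAPREDUCE_CLUSTER_NAME>"}]
spec:
  containers:
    - name: mapreduce             # must match the identifier used in the annotation keys
      image: <IMAGE>              # placeholder, not a real image name
```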
- Collecting logs is disabled by default in the Datadog Agent. Enable it in your `datadog.yaml` file: `logs_enabled: true`
- Uncomment and edit the logs configuration block in your `mapreduce.d/conf.yaml` file. Change the `type`, `path`, and `service` parameter values based on your environment. See the sample mapreduce.d/conf.yaml for all available configuration options.

      logs:
        - type: file
          path: <LOG_FILE_PATH>
          source: mapreduce
          service: <SERVICE_NAME>
          # To handle multi line that starts with yyyy-mm-dd use the following pattern
          # log_processing_rules:
          #   - type: multi_line
          #     pattern: \d{4}\-\d{2}\-\d{2} \d{2}:\d{2}:\d{2},\d{3}
          #     name: new_log_start_with_date
See Datadog's documentation for additional information on how to configure the Agent for log collection in Docker environments.
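
For a Docker setup, one common approach is to attach the log configuration to the container as a label. The Docker Compose sketch below assumes that the `com.datadoghq.ad.logs` label is used and that the container writes its logs to stdout or stderr. The service name and image are hypothetical placeholders.

```yaml
# Sketch only: assumes Docker Compose and label-based log configuration.
services:
  mapreduce:                      # hypothetical service name
    image: <IMAGE>                # placeholder, not a real image name
    labels:
      # Same source and service values as in the host configuration.
      com.datadoghq.ad.logs: '[{"source": "mapreduce", "service": "<SERVICE_NAME>"}]'
```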
Run the Agent's status subcommand and look for `mapreduce` under the Checks section.
See metadata.csv for a list of metrics provided by this integration.
The Mapreduce check does not include any events.
mapreduce.resource_manager.can_connect
Returns `CRITICAL` if the Agent is unable to connect to the Resource Manager. Returns `OK` otherwise.
mapreduce.application_master.can_connect
Returns `CRITICAL` if the Agent is unable to connect to the Application Master. Returns `OK` otherwise.
Need help? Contact Datadog support.