davidcrossland/StorageQueueSink

StorageQueueSink

Description

An Apache Spark metrics sink that sends a notification message to an Azure Storage Queue when it detects that a job has completed.

Usage

Add the following lines to conf/metrics.properties:

*.sink.queue.class=org.apache.spark.metrics.sink.StorageQueueSink
*.sink.queue.period=1
*.sink.queue.unit=seconds
*.sink.queue.connection=DefaultEndpointsProtocol=http;AccountName=%storageName%;AccountKey=%storageKey%
*.sink.queue.queue=%queueName%

Note:
Substitute %storageName% and %storageKey% as appropriate for the storage account that hosts the queue, and replace %queueName% with the name of the queue.
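
As a rough illustration of the substitution, the following Python sketch fills the three placeholders in a single pass. The function name `render_sink_config` and the sample values are hypothetical and not part of the sink; only the property lines are taken from the configuration above.

```python
import re

# Property lines from the Usage section, with %name% placeholders.
TEMPLATE = """\
*.sink.queue.class=org.apache.spark.metrics.sink.StorageQueueSink
*.sink.queue.period=1
*.sink.queue.unit=seconds
*.sink.queue.connection=DefaultEndpointsProtocol=http;AccountName=%storageName%;AccountKey=%storageKey%
*.sink.queue.queue=%queueName%
"""


def render_sink_config(storage_name: str, storage_key: str, queue_name: str) -> str:
    """Hypothetical helper: return the properties text with placeholders filled in."""
    values = {
        "storageName": storage_name,
        "storageKey": storage_key,
        "queueName": queue_name,
    }
    # One pass over the template, so characters inside a substituted value
    # (for example '=' padding in a key) are never re-interpreted.
    return re.sub(r"%(\w+)%", lambda m: values[m.group(1)], TEMPLATE)


if __name__ == "__main__":
    # Sample values are made up for illustration only.
    print(render_sink_config("myaccount", "a2V5PT0=", "job-events"))
```

The output can be appended to conf/metrics.properties.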

Bugs

If a job completes very quickly, so that the metrics system is not invoked between job submission and completion, the sink does not detect that the job has finished. The metrics system reports allJobs > 0 before it reports runningJobs > 0, so the sink cannot report success or failure unless it has first observed the job in a running state.
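
The limitation can be illustrated with a small Python simulation. This is a sketch of the behaviour described above, not the sink's actual implementation; the class `JobCompletionDetector`, the helper `count_detected`, and the sample sequences are assumptions made for illustration. Each tuple stands for one poll of (allJobs, runningJobs).

```python
from dataclasses import dataclass


@dataclass
class JobCompletionDetector:
    """Hypothetical poll-based detector modelled on the description above."""

    seen_running: bool = False

    def poll(self, all_jobs: int, running_jobs: int) -> bool:
        """Return True when this poll shows that a running job has finished."""
        # all_jobs is deliberately ignored: (allJobs > 0, runningJobs == 0)
        # is seen both before a job starts running and after it finishes,
        # so that state alone cannot signal completion.
        if running_jobs > 0:
            self.seen_running = True
            return False
        if self.seen_running:
            # Running was observed earlier and is now back to zero.
            self.seen_running = False
            return True
        return False


def count_detected(samples) -> int:
    """Feed a sequence of (allJobs, runningJobs) polls; count completions."""
    detector = JobCompletionDetector()
    return sum(detector.poll(all_jobs, running) for all_jobs, running in samples)


# A job that is still running during at least one poll is detected.
slow_job = [(0, 0), (1, 0), (1, 1), (1, 1), (1, 0)]
# A job that starts and finishes between two polls is never seen running.
fast_job = [(0, 0), (1, 0), (1, 0)]

if __name__ == "__main__":
    print(count_detected(slow_job), count_detected(fast_job))
```

In the second sequence no poll ever observes runningJobs > 0, so no completion is reported, which matches the bug described above.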
