Spark with probabilistic algorithms - Bloom filter, HLL, QTree and Count-min sketch
Updated Aug 24, 2017 - Scala
Big data analytics application built with Apache Spark and Scala. It compares query performance on 1) the entire dataset, 2) a sampled dataset, and 3) a CountMinSketch summary of the data.
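For readers unfamiliar with the third option, the following is a minimal, dependency-free sketch of a count-min sketch in plain Scala. It is not taken from the repository above: the class name `CountMinSketch`, the `depth` and `width` parameters, and the use of `MurmurHash3` seeded by row index are all illustrative assumptions. It shows only the core idea, that each item increments one counter per row and the estimate is the minimum of those counters.

```scala
import scala.util.hashing.MurmurHash3

// Hypothetical minimal count-min sketch: `depth` rows of `width` counters.
// Names and hashing scheme are assumptions made for illustration only.
final class CountMinSketch(depth: Int, width: Int) {
  private val table = Array.ofDim[Long](depth, width)

  // Row-specific bucket: MurmurHash3 seeded with the row index.
  private def bucket(item: String, row: Int): Int =
    Math.floorMod(MurmurHash3.stringHash(item, row), width)

  // Increment one counter in every row.
  def add(item: String, count: Long = 1L): Unit =
    for (row <- 0 until depth) table(row)(bucket(item, row)) += count

  // The minimum over rows; collisions can only inflate a counter,
  // so the estimate never falls below the true count.
  def estimate(item: String): Long =
    (0 until depth).map(row => table(row)(bucket(item, row))).min
}

object CountMinSketchDemo {
  def main(args: Array[String]): Unit = {
    val words = Seq("spark", "scala", "spark", "sketch", "spark")
    val cms = new CountMinSketch(depth = 4, width = 256)
    words.foreach(w => cms.add(w))

    // Exact counts for comparison with the sketch estimates.
    val exact = words.groupBy(identity).map { case (w, ws) => w -> ws.size.toLong }
    exact.foreach { case (w, n) =>
      println(s"$w exact=$n estimate=${cms.estimate(w)}")
    }
  }
}
```

A larger `width` lowers the chance of collisions and a larger `depth` lowers the chance that every row collides at once; both trade memory for accuracy, which is the trade-off such a comparison against full and sampled datasets would measure.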
🗂️ Collection of immutable data structures implemented in idiomatic, functional Scala. Some are well known, some are homemade.