Kafka / Cassandra / Spark Structured Streaming Example

Stream the number of times Drake is broadcast on each radio station. The example also shows how easy Spark Structured Streaming is to use with Spark SQL.
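As a rough sketch of the Spark SQL side, the count could be expressed as a plain SQL query over a temporary view. The object name `DrakeCounts`, the view name `broadcasts`, and the input column names (`radio`, `title`, `artist`, borrowed from the Cassandra tables further down) are assumptions for illustration, not necessarily what the project's code uses.

```scala
import org.apache.spark.sql.DataFrame

object DrakeCounts {
  // Assumes the input has at least `radio`, `title` and `artist` columns.
  // The same query works on a static or a streaming DataFrame.
  def apply(broadcasts: DataFrame): DataFrame = {
    broadcasts.createOrReplaceTempView("broadcasts") // hypothetical view name
    broadcasts.sparkSession.sql(
      """SELECT radio, title, artist, COUNT(*) AS `count`
        |FROM broadcasts
        |WHERE artist = 'Drake'
        |GROUP BY radio, title, artist""".stripMargin)
  }
}
```

On a streaming DataFrame the result is a streaming aggregation, so it has to be written with the "complete" or "update" output mode.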

Input data

The input is radio station broadcast data stored in a Parquet file. The stream is emulated with the .option("maxFilesPerTrigger", 1) option, which makes Spark read at most one file per trigger.
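A minimal sketch of how such a file-based stream can be set up is shown here. The object name `RadioFileStream` is hypothetical, and the default path is an assumption based on the repository layout.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}

object RadioFileStream {
  // Assumed location of the Parquet data; adjust to your checkout.
  val ParquetPath = "data/allRadioPartitionByRadioAndDate.parquet"

  def read(spark: SparkSession, path: String = ParquetPath): DataFrame = {
    // Streaming file sources need an explicit schema, so infer it once with a batch read.
    val schema = spark.read.parquet(path).schema
    spark.readStream
      .schema(schema)
      .option("maxFilesPerTrigger", 1) // at most one new file per micro-batch
      .parquet(path)
  }
}
```

Because each micro-batch picks up a single file, a directory that holds many Parquet files behaves like a slow, steady stream of records.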

The stream is then read and written to Kafka, and then from Kafka to Cassandra.
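The first hop (stream to Kafka) might look roughly like the sketch below. It assumes the spark-sql-kafka-0-10 package is on the classpath, a broker at localhost:9092, a `radio` column to use as the message key, and a JSON-encoded value. The object name `KafkaRadioSink`, the checkpoint directory, and the key/value layout are illustrative choices, not taken from the project.

```scala
import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.streaming.StreamingQuery

object KafkaRadioSink {
  // The Kafka sink expects a `value` column (and optionally `key`).
  // Here the whole row is serialized to JSON; this encoding is an assumption.
  def toKafkaRecords(stream: DataFrame): DataFrame =
    stream.selectExpr("CAST(radio AS STRING) AS key", "to_json(struct(*)) AS value")

  def start(stream: DataFrame,
            bootstrapServers: String = "localhost:9092",                  // assumed broker address
            checkpointDir: String = "/tmp/checkpoints/radio-to-kafka"     // hypothetical path
           ): StreamingQuery =
    toKafkaRecords(stream)
      .writeStream
      .format("kafka")
      .option("kafka.bootstrap.servers", bootstrapServers)
      .option("topic", "test")
      .option("checkpointLocation", checkpointDir) // required by the Kafka sink
      .start()
}
```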

Output data

The output is stored in Kafka and Cassandra, for example purposes only. The Cassandra sinks use both the ForeachWriter and the StreamSinkProvider so that the two sink implementations can be compared.
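For orientation, a ForeachWriter-based Cassandra sink could be sketched as follows. This assumes the DataStax Java driver 3.x (`com.datastax.driver.core`) and a node on 127.0.0.1. The class name `CassandraRadioWriter` is hypothetical and the project's actual writer may be organized differently.

```scala
import com.datastax.driver.core.{Cluster, Session}
import org.apache.spark.sql.{ForeachWriter, Row}

object CassandraRadioWriter {
  val InsertCql = "INSERT INTO test.radio (radio, title, artist, count) VALUES (?, ?, ?, ?)"
}

class CassandraRadioWriter(host: String = "127.0.0.1") extends ForeachWriter[Row] {
  // Connections are not serializable, so they are created in open() on the executor.
  @transient private var cluster: Cluster = _
  @transient private var session: Session = _

  override def open(partitionId: Long, epochId: Long): Boolean = {
    cluster = Cluster.builder().addContactPoint(host).build()
    session = cluster.connect()
    true // always process this partition
  }

  override def process(row: Row): Unit = {
    // An INSERT on an existing primary key overwrites the row, so the latest count wins.
    session.execute(
      CassandraRadioWriter.InsertCql,
      row.getAs[String]("radio"),
      row.getAs[String]("title"),
      row.getAs[String]("artist"),
      row.getAs[java.lang.Long]("count"))
  }

  override def close(errorOrNull: Throwable): Unit = {
    if (session != null) session.close()
    if (cluster != null) cluster.close()
  }
}
```

It would be attached with something like counts.writeStream.outputMode("update").foreach(new CassandraRadioWriter()).start(). Opening a cluster connection per partition and per trigger is simple but costly; a shared, lazily created connection is a common refinement.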

Kafka topic

topic:test
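Reading the topic back as a stream can be sketched as below. The broker address, the `startingOffsets` choice, and the JSON value layout (string fields `radio`, `title`, `artist`) are assumptions, and `KafkaRadioSource` is a hypothetical name.

```scala
import org.apache.spark.sql.{DataFrame, SparkSession}
import org.apache.spark.sql.functions.{col, from_json}
import org.apache.spark.sql.types.{StringType, StructType}

object KafkaRadioSource {
  // Assumed shape of the JSON stored in the Kafka message value.
  val SongSchema: StructType = new StructType()
    .add("radio", StringType)
    .add("title", StringType)
    .add("artist", StringType)

  // Kafka delivers `value` as bytes: cast to string, then parse the JSON into columns.
  def parse(raw: DataFrame): DataFrame =
    raw.select(from_json(col("value").cast("string"), SongSchema).as("song"))
      .select("song.*")

  def read(spark: SparkSession, bootstrapServers: String = "localhost:9092"): DataFrame =
    parse(
      spark.readStream
        .format("kafka")
        .option("kafka.bootstrap.servers", bootstrapServers)
        .option("subscribe", "test")
        .option("startingOffsets", "earliest") // replay the topic from the beginning
        .load())
}
```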

Cassandra Table

A table for the ForeachWriter

CREATE TABLE test.radio (
  radio varchar,
  title varchar,
  artist varchar,
  count bigint,
  PRIMARY KEY (radio, title, artist)
);

A second sink to test the other writer.

CREATE TABLE test.radioOtherSink (
  radio varchar,
  title varchar,
  artist varchar,
  count bigint,
  PRIMARY KEY (radio, title, artist)
);
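The other writer, based on StreamSinkProvider, might be sketched as below. It relies on Spark's internal `Sink` trait (`org.apache.spark.sql.execution.streaming`), which is not a stable public API, and again assumes the DataStax Java driver 3.x. The class names and the `host` option are hypothetical.

```scala
import com.datastax.driver.core.Cluster
import org.apache.spark.sql.{DataFrame, SQLContext}
import org.apache.spark.sql.execution.streaming.Sink
import org.apache.spark.sql.sources.StreamSinkProvider
import org.apache.spark.sql.streaming.OutputMode

object CassandraRadioSink {
  // Unquoted CQL identifiers are case-insensitive, so this matches the table created above.
  val InsertCql = "INSERT INTO test.radioOtherSink (radio, title, artist, count) VALUES (?, ?, ?, ?)"
}

class CassandraRadioSink(host: String) extends Sink {
  override def addBatch(batchId: Long, data: DataFrame): Unit = {
    // collect() keeps the sketch simple; it only suits small, aggregated batches.
    val rows = data.collect()
    val cluster = Cluster.builder().addContactPoint(host).build()
    try {
      val session = cluster.connect()
      rows.foreach { row =>
        session.execute(
          CassandraRadioSink.InsertCql,
          row.getAs[String]("radio"),
          row.getAs[String]("title"),
          row.getAs[String]("artist"),
          row.getAs[java.lang.Long]("count"))
      }
    } finally cluster.close() // also closes the session
  }
}

class CassandraRadioSinkProvider extends StreamSinkProvider {
  override def createSink(sqlContext: SQLContext,
                          parameters: Map[String, String],
                          partitionColumns: Seq[String],
                          outputMode: OutputMode): Sink =
    new CassandraRadioSink(parameters.getOrElse("host", "127.0.0.1"))
}
```

A provider like this is selected by passing its fully qualified class name to writeStream.format(...). Compared with the ForeachWriter, the sink sees a whole micro-batch at once, which makes batch-level handling easier at the cost of depending on an internal interface.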

cqlsh> SELECT * FROM test.radio;

 radio   | title                    | artist | count
---------+--------------------------+--------+-------
 skyrock |                Controlla |  Drake |     1
 skyrock |                Fake Love |  Drake |     9
 skyrock | Hold On We’Re Going Home |  Drake |    35
 skyrock |            Hotline Bling |  Drake |  1052
 skyrock |  Started From The Bottom |  Drake |    39
    nova |         4pm In Calabasas |  Drake |     1
    nova |             Feel No Ways |  Drake |     2
    nova |                From Time |  Drake |    34
    nova |                     Hype |  Drake |     2

Useful links

Requirements

Cassandra 3.10
Kafka 0.10+ (with Zookeeper)

polomarcus/Spark-Structured-Streaming-Examples