This repository contains the code for the book Stream Processing: Hands-on with Apache Flink.
In order to run the code samples, you will need a Kafka and a Flink cluster up and running. You can also run the Flink examples from within your favorite IDE, in which case you don't need a Flink cluster.
If you want to run the examples inside a Flink cluster, run the following command to start the services.
docker-compose up
When the cluster is up and running, run the following command to set up Redpanda:
./redpanda-setup.sh
or this command to set up Kafka:
./kafka-setup.sh
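The SQL statements that follow are meant to be run from the Flink SQL client. Assuming the compose file starts a jobmanager container (the same name used later when deploying the job), you can open the client with:
docker exec -it jobmanager ./bin/sql-client.sh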
CREATE FUNCTION maskfn AS 'io.streamingledger.udfs.MaskingFn' LANGUAGE JAVA USING JAR '/opt/flink/jars/spf-0.1.0.jar';
CREATE FUNCTION splitfn AS 'io.streamingledger.udfs.SplitFn' LANGUAGE JAVA USING JAR '/opt/flink/jars/spf-0.1.0.jar';
CREATE FUNCTION lookup AS 'io.streamingledger.udfs.AsyncLookupFn' LANGUAGE JAVA USING JAR '/opt/flink/jars/spf-0.1.0.jar';
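These statements register the UDFs packaged in the project jar. For reference, a scalar function such as maskfn is implemented as a Flink ScalarFunction; the snippet below is only an illustrative sketch with assumed masking logic, not the book's actual implementation.
package io.streamingledger.udfs;

import org.apache.flink.table.functions.ScalarFunction;

// Illustrative sketch of a masking scalar UDF; the real MaskingFn ships with the book's source.
public class MaskingFn extends ScalarFunction {
    public String eval(String value) {
        // Assumed behavior: keep the last four characters and mask the rest.
        if (value == null || value.length() <= 4) {
            return value;
        }
        return "*".repeat(value.length() - 4) + value.substring(value.length() - 4);
    }
}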
CREATE TEMPORARY VIEW sample AS
SELECT *
FROM transactions
LIMIT 10;
SELECT transactionId, maskfn(UUID()) AS maskedCN FROM sample;
SELECT * FROM transactions, LATERAL TABLE(splitfn(operation));
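splitfn is a table function, which is why it is invoked through LATERAL TABLE. A minimal sketch of such a function, assuming it splits the operation string into one row per token (the actual SplitFn may differ):
package io.streamingledger.udfs;

import org.apache.flink.table.annotation.DataTypeHint;
import org.apache.flink.table.annotation.FunctionHint;
import org.apache.flink.table.functions.TableFunction;
import org.apache.flink.types.Row;

// Illustrative sketch of a flatmap-style table UDF; the real SplitFn ships with the book's source.
@FunctionHint(output = @DataTypeHint("ROW<token STRING>"))
public class SplitFn extends TableFunction<Row> {
    public void eval(String input) {
        if (input == null) {
            return;
        }
        // Emit one row per whitespace-separated token.
        for (String token : input.split("\\s+")) {
            collect(Row.of(token));
        }
    }
}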
SELECT
    transactionId,
    serviceResponse,
    responseTime
FROM sample, LATERAL TABLE(lookup(transactionId));
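If you prefer running the examples from your IDE (as mentioned above) rather than the SQL client, the same functions can also be registered programmatically. A minimal sketch, assuming the UDF classes from the project jar are on the classpath; the RegisterUdfs class name is just for illustration:
import org.apache.flink.table.api.EnvironmentSettings;
import org.apache.flink.table.api.TableEnvironment;

import io.streamingledger.udfs.AsyncLookupFn;
import io.streamingledger.udfs.MaskingFn;
import io.streamingledger.udfs.SplitFn;

public class RegisterUdfs {
    public static void main(String[] args) {
        // Create a streaming TableEnvironment and register the UDFs under the same names used in the SQL DDL.
        TableEnvironment tableEnv = TableEnvironment.create(EnvironmentSettings.inStreamingMode());
        tableEnv.createTemporarySystemFunction("maskfn", MaskingFn.class);
        tableEnv.createTemporarySystemFunction("splitfn", SplitFn.class);
        tableEnv.createTemporarySystemFunction("lookup", AsyncLookupFn.class);
    }
}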
- Package the application and create an executable jar file
mvn clean package
- Copy it into the jars directory so that it is included in the custom Flink images
- Start the cluster to build the new images by running
docker-compose up
- Deploy the Flink job
docker exec -it jobmanager ./bin/flink run \
--class io.streamingledger.datastream.BufferingStream \
jars/spf-0.1.0.jar
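To verify that the job was submitted successfully, you can list the running jobs from the same container (assuming the jobmanager container name from the compose file):
docker exec -it jobmanager ./bin/flink list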