Kafka JSON Processor

A processor that reads JSON messages from Kafka topics, processes them, and writes the results to another selected Kafka topic.

Kafka topic #1 -> JSON message -> kafka-json-processor -> new JSON message -> Kafka topic #2

The processors used by kafka-json-processor are configured in a template.yaml file. You can customize how messages are processed or extend the built-in processors with your own.

Under the hood, this utility generates Rust code from a YAML template. The compiled result is generally faster than a processor driven by configuration at runtime, because it does not need to interpret the config and runs the instructions directly.
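To illustrate the difference between interpreting a config and running generated code, here is a minimal sketch. It is not the code this project actually generates: the `Message` type (a flat map standing in for a JSON object), the `Step` enum, and the field names `user` and `owner` are all assumptions made for the example.

```rust
use std::collections::HashMap;

// Simplified stand-in for a JSON object: flat string fields only
// (an assumption for this sketch, not the project's message type).
type Message = HashMap<String, String>;

/// Hypothetical config step, as an interpreter would hold it at runtime.
enum Step {
    CopyField { from: String, to: String },
}

/// Interpreted approach: every message walks the step list and
/// dispatches on each step before doing any work.
fn run_interpreted(steps: &[Step], msg: &mut Message) {
    for step in steps {
        match step {
            Step::CopyField { from, to } => {
                if let Some(value) = msg.get(from).cloned() {
                    msg.insert(to.clone(), value);
                }
            }
        }
    }
}

/// Generated approach: the same step is already written out as plain
/// code, so there is no step list to walk and nothing to dispatch on.
fn run_generated(msg: &mut Message) {
    if let Some(value) = msg.get("user").cloned() {
        msg.insert("owner".to_string(), value);
    }
}
```

Both functions produce the same message; the generated variant simply skips the per-message lookup of what to do.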

This project is split into the following subprojects:

The generator - generates a project based on template.yaml and available processor generators.
The processor generators - a set of code-generating scripts that provide predefined functions for your custom processor.
The plugin framework - base for your custom code generator (if you want to write it in Rust, not as a script).
The core dependency - used in generated projects, prevents boilerplate.

How to use?

In short, the steps to run your custom processor are the following:

Prepare your template.yaml with the desired processor configuration (e.g. copy a field, extract a date from a message, etc. - see example).
Generate the JSON processor with the generator (and the processor generators; you can also use your own), then compile the generated project.
Prepare processor.properties with the rdkafka (Kafka client) configuration - see example (put this file in the same directory as your executable).
Run your executable (to see logs, set the environment variable RUST_LOG=info; e.g. in bash you can just run RUST_LOG=info ./your_executable).

Test your processor

You may want to test if the generated processor is correct before deploying it. To test it, in the generated project, create a simulations directory. In this directory, create another directory (or directories, depending on the template.yaml) with the name ${input_topic}_${output_topic}, where ${input_topic} is the name of the input_topic from template.yaml and ${output_topic} is the name of the output_topic from template.yaml.

For example, if you have the following stream in your template.yaml:

streams:
  - input_topic: sometopic
    output_topic: target

Then create the following directory structure:

<project_directory>
 > simulations
 | > sometopic_target
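The naming rule above can be sketched in a few lines of Rust. This is only an illustration of the `${input_topic}_${output_topic}` convention; the helper names `simulation_dir` and `create_simulation_dir` are hypothetical and not part of this project.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Builds the path <project_dir>/simulations/<input_topic>_<output_topic>.
/// Hypothetical helper that only mirrors the naming convention.
fn simulation_dir(project_dir: &Path, input_topic: &str, output_topic: &str) -> PathBuf {
    project_dir
        .join("simulations")
        .join(format!("{input_topic}_{output_topic}"))
}

/// Creates the simulation directory (and any missing parents) and returns its path.
fn create_simulation_dir(
    project_dir: &Path,
    input_topic: &str,
    output_topic: &str,
) -> io::Result<PathBuf> {
    let dir = simulation_dir(project_dir, input_topic, output_topic);
    fs::create_dir_all(&dir)?;
    Ok(dir)
}
```

For the stream shown earlier, `simulation_dir(Path::new("."), "sometopic", "target")` yields `./simulations/sometopic_target`.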

In the ${input_topic}_${output_topic} directory (in this case - sometopic_target), create text files with the input message and expected output. For example, given the template all_processors.yaml (see template-examples), I have prepared some test data in simulations/in_out. The test files always have two headers - [Input] (for input JSON) and [Expected] (for expected processed message).
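As a rough picture of how such a test file is organized, the sketch below splits file contents into the input and expected parts. It assumes each header sits on its own line and is followed by the JSON it introduces; the exact layout the project accepts may differ, and `split_simulation` is a hypothetical helper, not the project's own parser.

```rust
/// Which part of the file the current line belongs to.
#[derive(Clone, Copy)]
enum Section {
    Preamble,
    Input,
    Expected,
}

/// Splits a simulation file into (input, expected).
/// Returns None if either header is missing.
fn split_simulation(contents: &str) -> Option<(String, String)> {
    let mut section = Section::Preamble;
    let (mut seen_input, mut seen_expected) = (false, false);
    let mut input: Vec<&str> = Vec::new();
    let mut expected: Vec<&str> = Vec::new();

    for line in contents.lines() {
        match line.trim() {
            "[Input]" => {
                section = Section::Input;
                seen_input = true;
            }
            "[Expected]" => {
                section = Section::Expected;
                seen_expected = true;
            }
            _ => match section {
                Section::Input => input.push(line),
                Section::Expected => expected.push(line),
                // Anything before the first header is ignored.
                Section::Preamble => {}
            },
        }
    }

    if seen_input && seen_expected {
        Some((
            input.join("\n").trim().to_string(),
            expected.join("\n").trim().to_string(),
        ))
    } else {
        None
    }
}
```

A file under this assumed layout would contain `[Input]`, the input JSON, `[Expected]`, and then the JSON the processor should produce.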

To run the simulation, run cargo test in the generated project. See kjp-generator/tests/integration_test.rs for a complete example.
