Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a topology DSL.
-
Updated
Aug 9, 2024 - Python
Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a topology DSL.
A scalable, mature and versatile web crawler based on Apache Storm
[PROJECT IS NO LONGER MAINTAINED] Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
[PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
News crawling with StormCrawler - stores content as WARC
Fast Advanced Spam Analysis Tool
A curated list of Pulsar tools, integrations and resources.
Battle-tested Apache Storm Multi-Lang implementation for Python
Docker image packaging for Apache Storm
This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.
A framework for building spouts for Apache Storm and a Kafka based spout for dynamically skipping messages to be processed later.
a suite of benchmark applications for distributed data stream processing systems
Apache Pulsar Adapters
Apache Storm cluster on Docker
Storm Debian Packaging with dpkg-buildpackage
Resources for running StormCrawler with Docker services
Process web archives (WARC format) with StormCrawler and index content into Elasticsearch or Solr
A dockerized image of Apache Storm (Zookeeper, Nimbus, Supervisor, Ui, Logviewer.)
My Talk at IoT Fusion 2018 Philadelphia, PA
Real time computation system with Apache Storm, Apache Kafka and Google Guice
Add a description, image, and links to the apache-storm topic page so that developers can more easily learn about it.
To associate your repository with the apache-storm topic, visit your repo's landing page and select "manage topics."