Skip to content
#

dataframes-api

Here are 13 public repositories matching this topic...

A sandbox environment designed to simulate a pseudo-distributed Hadoop cluster with integrated Apache Spark and Kafka components. It allows developers to prototype and experiment with big data workflows, test distributed computing patterns, and explore cluster behavior in a contained virtual setup.

  • Updated Jun 10, 2025
  • Java

Improve this page

Add a description, image, and links to the dataframes-api topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dataframes-api topic, visit your repo's landing page and select "manage topics."

Learn more