Open source stack lakehouse
-
Updated
Mar 2, 2024 - Python
Open source stack lakehouse
Command line interface to the services provided by Oslo Origo's Dataplatform
Data platform for building batch and real-time ETL flows using only open-source technologies
A complete Open-Source Data Platform with ETL, Datawarehouse and Viz
How to build a complete Data Platform -> Here
Dataplatform hosted events
This project goal is to design a Data Platform for retail Data Analytics.
Big Data Platform on MongoDB Atlas and Heroku PostgreSQL
OpenSource dataplatform Integrating frameworks
Shopkeeper is a python package that implements Data Platform Resources -- data producers, data consumers and datasets -- as custom infrastructure components
This ETL project was designed to demonstrate the development of a scalable data pipeline for customer sales analysis. It covers all essential steps, from data extraction to transformation and loading into a database, with Apache Airflow used.
divith-raju-big-data-tools
REST API for creating and managing event streams and sending data to event streams. Obsolete as of 2022-05-20.
The Spark Memory Configuration Calculator is designed to help data engineers and Spark developers quickly determine the optimal memory and core configurations for their Spark clusters. With this tool, you can avoid common pitfalls and ensure your cluster resources are used efficiently, leading to better performance and lower costs.
REST API for managing clients and keys in Maskinporten and synchronization with AWS SSM
Add a description, image, and links to the dataplatform topic page so that developers can more easily learn about it.
To associate your repository with the dataplatform topic, visit your repo's landing page and select "manage topics."