A simplified, lightweight ETL Framework based on Apache Spark
Updated Jan 24, 2024 - Scala
A Spark plugin for reading and writing Excel files
A simple Spark-powered ETL framework that just works 🍺
Declarative, text-based tool for data analysts and engineers to extract, load, transform, and orchestrate their data pipelines.
Provides remote multi-language invocation (ThriftServer, HttpServer) and distributed execution (DataX on YARN) for DataX (https://github.com/alibaba/DataX)
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
A schema-aware Scala library for data transformation
Powerful, whiteboard-style ETL
EtlFlow is an ecosystem of functional Scala libraries based on ZIO for running complex, auditable workflows that can interact with Google Cloud Platform, AWS, Kubernetes, databases, SFTP servers, on-prem systems, and more.
Intelligent Data Exploration Service, a one-stop Data + AI data solution!
Write ETL using your favorite SQL dialects
NebulaGraph Exchange is an Apache Spark application to parse data from different sources to NebulaGraph in a distributed environment. It supports both batch and streaming data in various formats and sources including other Graph Databases, RDBMS, Data warehouses, NoSQL, Message Bus, File systems, etc.