Automated Lakeflow Declarative Pipelines/Workflows --- Bank Project

This project uses the Databricks platform to build an automated Lakeflow Declarative Pipeline (previously DLT). It demonstrates how Databricks Workflows makes data engineering pipelines declarative and scalable while powering dashboards for real-time insights.

Project Details

Landing_Layer.py: Defines landing_customers_incremental and landing_accounts_incremental as streaming tables using Auto Loader, each with an explicit schema. These tables are the inputs to the bronze layer.
bronze_layer.py: Reads from the landing tables, cleans and transforms the data, then writes bronze_customers_clean and bronze_accounts_clean. The expectations reference the column names produced by the transformations.
silver_layer.py: Reads bronze_customers_clean and bronze_accounts_clean directly as streams, using spark.readStream.table("bronze_customers_clean") and spark.readStream.table("bronze_accounts_clean"), then applies additional transformations and CDC flows.
gold_layer.py: Batch-reads the silver tables using dp.read (deprecated, but still works) for joins and aggregations.
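The silver and gold steps above can be illustrated with a small sketch in plain Python. This is not the Databricks pipeline API and not the code in this repository. It only mimics what a CDC flow that keeps the latest record per key does, followed by a gold-style join and aggregation. The column names (customer_id, account_id, name, balance, updated_at) and the function names are assumptions made for illustration.

```python
from collections import defaultdict


def apply_cdc_latest(changes, key, sequence_by):
    """Keep only the newest row per key, similar in spirit to a
    CDC flow that overwrites older versions of a record."""
    latest = {}
    for row in changes:
        current = latest.get(row[key])
        # A row replaces the stored one only if its sequence value is newer.
        if current is None or row[sequence_by] > current[sequence_by]:
            latest[row[key]] = row
    return latest


def balance_per_customer(customers, accounts):
    """Gold-style step: inner-join accounts to customers and sum balances."""
    totals = defaultdict(float)
    for account in accounts.values():
        customer_id = account["customer_id"]
        if customer_id in customers:  # inner join on customer_id
            totals[customer_id] += account["balance"]
    return {
        cid: {"name": customers[cid]["name"], "total_balance": total}
        for cid, total in totals.items()
    }


# Hypothetical change feeds, standing in for the bronze tables.
customer_changes = [
    {"customer_id": 1, "name": "Asha", "updated_at": 1},
    {"customer_id": 2, "name": "Ben", "updated_at": 1},
    {"customer_id": 1, "name": "Asha K.", "updated_at": 2},  # later update
]
account_changes = [
    {"account_id": 10, "customer_id": 1, "balance": 100.0, "updated_at": 1},
    {"account_id": 11, "customer_id": 2, "balance": 50.0, "updated_at": 1},
    {"account_id": 10, "customer_id": 1, "balance": 250.0, "updated_at": 3},
    {"account_id": 12, "customer_id": 1, "balance": 25.0, "updated_at": 2},
]

silver_customers = apply_cdc_latest(customer_changes, "customer_id", "updated_at")
silver_accounts = apply_cdc_latest(account_changes, "account_id", "updated_at")
gold = balance_per_customer(silver_customers, silver_accounts)
```

In the actual pipeline the same roles are played by the declarative table definitions in silver_layer.py and gold_layer.py. The sketch only shows why the silver layer needs a key and an ordering column before the gold layer can aggregate safely.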

