Welcome to my GitHub! I'm a Databricks Certified Data Engineer Professional with over 6 years of experience designing and implementing scalable data solutions. I specialize in turning raw data into actionable insights with modern data engineering tools and advanced analytics.
- Experience: 6+ years in data engineering, focusing on scalable data pipelines and advanced analytics.
- Mission: Build robust, efficient, and scalable data solutions to empower data-driven decision-making.
- Certifications: Databricks Certified Data Engineer Professional
- Languages: Python, SQL, JavaScript (basics)
- Data Engineering & Processing: Apache Spark (PySpark), Databricks, Delta Lake, Apache Airflow, Pandas, Polars
- Databases & Query Engines: PostgreSQL, MySQL, MongoDB, ClickHouse, CrateDB
- Cloud & Infrastructure: AWS (S3, EC2, Glue, Athena, Redshift), Docker, FastAPI
- Visualization & Tools: Tableau, Streamlit, Databricks SQL Dashboard, Great Expectations, Unity Catalog
- Platforms & Environments: Jupyter Notebook, Visual Studio Code, Linux/Unix Shell
- Data Modeling & Architecture: Medallion Architecture, Star Schema, Fact/Dimension Tables, ETL/ELT Design
- Soft Skills: Agile collaboration, Problem-solving, Cross-functional teamwork
Explore my repositories to see examples of my work in:
- Building scalable ETL/ELT pipelines with Apache Spark and Databricks.
- Implementing data models using Medallion Architecture and Delta Lake.
- Creating interactive dashboards with Streamlit and Tableau.
- Automating workflows with Apache Airflow and AWS services.
- Ensuring data quality with Great Expectations and data governance with Unity Catalog.
- LinkedIn: https://www.linkedin.com/in/akshayboddhul/
- Email: akshayboddhul45@gmail.com
Feel free to explore my repositories and reach out for collaboration or inquiries!


