Skip to content

PacktPublishing/Practical-Machine-Learning-on-Databricks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Machine Learning Summit 2025

Machine Learning Summit 2025

Bridging Theory and Practice: ML Solutions for Today’s Challenges

3 days, 20+ experts, and 25+ tech sessions and talks covering critical aspects of:

  • Agentic and Generative AI
  • Applied Machine Learning in the Real World
  • ML Engineering and Optimization

👉 Book your ticket now >>


Join Our Newsletters 📬

DataPro

The future of AI is unfolding. Don’t fall behind.

DataPro QR

Stay ahead with DataPro, the free weekly newsletter for data scientists, AI/ML researchers, and data engineers.
From trending tools like PyTorch, scikit-learn, XGBoost, and BentoML to hands-on insights on database optimization and real-world ML workflows, you’ll get what matters, fast.

Stay sharp with DataPro. Join 115K+ data professionals who never miss a beat.


BIPro

Business runs on data. Make sure yours tells the right story.

BIPro QR

BIPro is your free weekly newsletter for BI professionals, analysts, and data leaders.
Get practical tips on dashboarding, data visualization, and analytics strategy with tools like Power BI, Tableau, Looker, SQL, and dbt.

Get smarter with BIPro. Trusted by 35K+ BI professionals, see what you’re missing.

Practical Machine Learning on Databricks

This is the code repository for Practical Machine Learning on Databricks, published by Packt.

Seamlessly transition ML models and MLOps on Databricks

What is this book about?

Unleash the potential of databricks for end-to-end machine learning with this comprehensive guide, tailored for experienced data scientists and developers transitioning from DIY or other cloud platforms. Building on a strong foundation in Python, Practical Machine Learning on Databricks serves as your roadmap from development to production, covering all intermediary steps using the databricks platform.

This book covers the following exciting features:

  • Transition smoothly from DIY setups to databricks
  • Master AutoML for quick ML experiment setup
  • Automate model retraining and deployment
  • Leverage databricks feature store for data prep
  • Use MLflow for effective experiment tracking
  • Gain practical insights for scalable ML solutions
  • Find out how to handle model drifts in production environments

If you feel this book is for you, get your copy today!

https://www.packtpub.com/

Instructions and Navigations

All of the code is organized into folders. For example, Chapter02.

The code will look like the following:

iris = load_iris()
 
X = iris.data # Features
 
y = iris.target # Labels

Following is what you need for this book: This book is for experienced data scientists, engineers, and developers proficient in Python, statistics, and ML lifecycle looking to transition to databricks from DIY clouds. Introductory Spark knowledge is a must to make the most out of this book, however, end-to-end ML workflows will be covered. If you aim to accelerate your machine learning workflows and deploy scalable, robust solutions, this book is an indispensable resource.

With the following software and hardware list you can run all code files present in the book (Chapter 1-10).

Software and Hardware List

Chapter Software required OS required
1-10 Databricks Runtime Windows and Mac OS
1-10 Python proficiency (3.x) Windows and Mac OS
1-10 Statistics and ML basics Windows and Mac OS
1-10 Spark knowledge (3.0 or above) Windows and Mac OS
1-10 Delta Lake features (optional) Windows and Mac OS

Related products

Get to Know the Author

Debu Sinha is an experienced data science and engineering leader with deep expertise in software engineering and solutions architecture. With over 10 years in the industry, Debu has a proven track record in designing scalable software applications and big data, and machine learning systems. As lead ML specialist on the Specialist Solutions Architect team at Databricks, Debu focuses on AI/ML use cases in the cloud and serves as an expert on LLMs, ML, and MLOps. With prior experience as a start-up co-founder, Debu has demonstrated skills in team-building, scaling, and delivering impactful software solutions. An established thought leader, Debu has received multiple awards and regularly speaks at industry events.

About

Practical Machine Learning on Databricks, published by packt

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 7

Languages