
Introduction to Machine Learning with Python

This repository holds the code for the book "Introduction to Machine Learning with Python" by Andreas Mueller and Sarah Guido. You can find details about the book on the O'Reilly website.

The book requires the current stable version of scikit-learn. Most of the book can also be used with previous versions of scikit-learn, though you need to adjust the imports for everything from the model_selection module, mostly cross_val_score, train_test_split, and GridSearchCV.

This repository provides the notebooks from which the book is created, together with the mglearn library of helper functions to create figures and datasets.

All datasets are included in the repository, with the exception of the aclImdb dataset, which you can download from the page of Andrew Maas. See the book for details.

If you get ImportError: No module named mglearn, you can install mglearn into your Python environment by running pip install mglearn in your terminal or !pip install mglearn in a Jupyter Notebook cell.

Here are the chapters:

Introduction

The code in this chapter can be accessed in this notebook.

scikit-learn

  • scikit-learn is an open source project, meaning that it is free to use and distribute, and anyone can easily obtain the source code to see what is going on behind the scenes. The scikit-learn project is constantly being developed and improved, and it has a very active user community. It contains a number of state-of-the-art machine learning algorithms, as well as comprehensive documentation about each algorithm.
  • scikit-learn is a very popular tool, and the most prominent Python library for machine learning. It is widely used in industry and academia, and a wealth of tutorials and code snippets are available online. scikit-learn works well with a number of other scientific Python tools.

back to current section

Jupyter Notebook

  • Jupyter Notebook is an interactive environment for running code in the browser. It is a great tool for exploratory data analysis and is widely used by data scientists.
  • While the Jupyter Notebook supports many programming languages, we only need the Python support. The Jupyter Notebook makes it easy to incorporate code, text, and images, and all of this book was in fact written as a Jupyter Notebook.

back to current section

NumPy

  • NumPy is one of the fundamental packages for scientific computing in Python. It contains functionality for multidimensional arrays, high-level mathematical functions such as linear algebra operations and the Fourier transform, and pseudorandom number generators.
  • In scikit-learn, the NumPy array is the fundamental data structure. scikit-learn takes in data in the form of NumPy arrays. Any data you’re using will have to be converted to a NumPy array. The core functionality of NumPy is the ndarray class, a multidimensional (n-dimensional) array. All elements of the array must be of the same type.
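
A minimal sketch, not taken from the book's notebooks, of creating an ndarray from a nested Python list; note that every element is stored with a single shared dtype:

import numpy as np

# a 2x3 array; all elements share one dtype (here a 64-bit integer)
x = np.array([[1, 2, 3], [4, 5, 6]])
print("x:\n{}".format(x))
print("dtype: {}".format(x.dtype))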

back to current section

SciPy

  • SciPy is a collection of functions for scientific computing in Python. It provides, among other functionality, advanced linear algebra routines, mathematical function optimization, signal processing, special mathematical functions, and statistical distributions.
  • scikit-learn draws from SciPy’s collection of functions for implementing its algorithms. The most important part of SciPy for us is scipy.sparse: this provides sparse matrices, which are another representation that is used for data in scikit-learn (a small sketch follows below).
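
As a small illustration of scipy.sparse (a sketch in the spirit of the book's introduction), a dense NumPy identity matrix can be converted to a CSR sparse matrix that stores only the nonzero entries:

import numpy as np
from scipy import sparse

# convert a dense 4x4 identity matrix into a sparse CSR representation
eye = np.eye(4)
sparse_matrix = sparse.csr_matrix(eye)
print("SciPy sparse CSR matrix:\n{}".format(sparse_matrix))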

back to current section

Matplotlib

  • matplotlib is the primary scientific plotting library in Python. It provides functions for making publication-quality visualizations such as line charts, histograms, scatter plots, and so on. Visualizing your data and different aspects of your analysis can give you important insights.
  • When working inside the Jupyter Notebook, you can show figures directly in the browser by using the %matplotlib notebook and %matplotlib inline commands.
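
A minimal plotting sketch (the sine-curve example used in the book's introduction):

import numpy as np
import matplotlib.pyplot as plt

# inside Jupyter, run %matplotlib inline (or %matplotlib notebook) first
# plot a sine curve with "x" markers at each of the 100 sample points
x = np.linspace(-10, 10, 100)
y = np.sin(x)
plt.plot(x, y, marker="x")
plt.show()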

back to current section

Pandas

  • pandas is a Python library for data wrangling and analysis. It is built around a data structure called the DataFrame that is modeled after the R DataFrame. Simply put, a pandas DataFrame is a table, similar to an Excel spreadsheet.
  • pandas provides a great range of methods to modify and operate on this table; in particular, it allows SQL-like queries and joins of tables. In contrast to NumPy, which requires that all entries in an array be of the same type, pandas allows each column to have a separate type (for example, integers, dates, floating-point numbers, and strings).
  • Another valuable tool provided by pandas is its ability to ingest from a great variety of file formats and databases, like SQL, Excel files, and comma-separated values (CSV) files.
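
A small sketch of a DataFrame with columns of different types (the names are made up for illustration):

import pandas as pd

# each column can have its own type: strings for Name and Location, integers for Age
data = {'Name': ["John", "Anna", "Peter", "Linda"],
        'Location': ["New York", "Paris", "Berlin", "London"],
        'Age': [24, 13, 53, 33]}
data_pandas = pd.DataFrame(data)
# SQL-like query: select all rows with an age above 30
print(data_pandas[data_pandas.Age > 30])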

back to current section

back to top

Supervised Learning

The code in this chapter can be accessed in this notebook.

  • Classification and Regression
  • Generalization, Overfitting, and Underfitting
  • k-Nearest-Neighbors: For small datasets, good as a baseline, easy to explain.
  • Linear Models: Go-to as a first algorithm to try, good for very large datasets, good for very high-dimensional data.
  • Naive Bayes: Only for classification. Even faster than linear models, good for very large datasets and high-dimensional data. Often less accurate than linear models.
  • Decision Trees: Very fast, don’t need scaling of the data, can be visualized and easily explained.
  • Random Forests: Nearly always perform better than a single decision tree, very robust and powerful. Don’t need scaling of data. Not good for very high-dimensional sparse data.
  • Gradient Boosting Machines: Often slightly more accurate than random forests. Slower to train but faster to predict than random forests, and smaller in memory. Need more parameter tuning than random forests.
  • Kernelized Support Vector Machines: Powerful for medium-sized datasets of features with similar meaning. Require scaling of data, sensitive to parameters.
  • Neural Networks: Can build very complex models, particularly for large datasets. Sensitive to scaling of the data and to the choice of parameters. Large models need a long time to train.

Classification and Regression

  • In classification, the goal is to predict a class label, which is a choice from a predefined list of possibilities. Classification is sometimes separated into binary classification, which is the special case of distinguishing between exactly two classes, and multiclass classification, which is classification between more than two classes.
  • For regression tasks, the goal is to predict a continuous number, or a floating-point number in programming terms (or real number in mathematical terms). Predicting a person’s annual income from their education, their age, and where they live is an example of a regression task. When predicting income, the predicted value is an amount, and can be any number in a given range. Another example of a regression task is predicting the yield of a corn farm given attributes such as previous yields, weather, and number of employees working on the farm. The yield again can be an arbitrary number.

back to current section

Generalization, Overfitting, and Underfitting

  • In supervised learning, we want to build a model on the training data and then be able to make accurate predictions on new, unseen data that has the same characteristics as the training set that we used. If a model is able to make accurate predictions on unseen data, we say it is able to generalize from the training set to the test set. We want to build a model that is able to generalize as accurately as possible.
  • Building a model that is too complex for the amount of information we have is called overfitting. Overfitting occurs when you fit a model too closely to the particularities of the training set and obtain a model that works well on the training set but is not able to generalize to new data. On the other hand, if your model is too simple, then you might not be able to capture all the aspects of and variability in the data, and your model will do badly even on the training set. Choosing too simple a model is called underfitting.
  • The tradeoff between overfitting and underfitting is illustrated below:

overfitting-underfitting

back to current section

k-Nearest-Neighbors

from sklearn.neighbors import KNeighborsClassifier, KNeighborsRegressor
clf = KNeighborsClassifier(n_neighbors=3)   # classification
reg = KNeighborsRegressor(n_neighbors=3)    # regression
  • In principle, there are two important parameters to the KNeighbors classifier: the number of neighbors and how you measure distance between data points. In practice, using a small number of neighbors like three or five often works well, but you should certainly adjust this parameter. Choosing the right distance measure is somewhat beyond the scope of this book. By default, Euclidean distance is used, which works well in many settings.
  • One of the strengths of k-NN is that the model is very easy to understand, and often gives reasonable performance without a lot of adjustments. Using this algorithm is a good baseline method to try before considering more advanced techniques. Building the nearest neighbors model is usually very fast, but when your training set is very large (either in number of features or in number of samples) prediction can be slow.
  • When using the k-NN algorithm, it’s important to preprocess your data. This approach often does not perform well on datasets with many features (hundreds or more), and it does particularly badly with datasets where most features are 0 most of the time (so-called sparse datasets).
  • So, while the k-nearest neighbors algorithm is easy to understand, it is not often used in practice, due to prediction being slow and its inability to handle many features. A minimal usage sketch is shown below.
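
A minimal usage sketch on synthetic data (make_blobs and the chosen n_neighbors are illustrative, not a recommendation):

from sklearn.datasets import make_blobs
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# two-class synthetic data, split into training and test sets
X, y = make_blobs(centers=2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# prediction looks at the 3 closest training points and takes a majority vote
clf = KNeighborsClassifier(n_neighbors=3).fit(X_train, y_train)
print("Test set accuracy: {:.2f}".format(clf.score(X_test, y_test)))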

kNN

back to current section

Linear Models

# Linear models for regression
from sklearn.linear_model import LinearRegression
lr = LinearRegression().fit(X_train, y_train)

from sklearn.linear_model import Ridge
ridge = Ridge().fit(X_train, y_train)

from sklearn.linear_model import Lasso
lasso = Lasso().fit(X_train, y_train)

# Linear models for classification
from sklearn.linear_model import LogisticRegression
logreg = LogisticRegression().fit(X_train, y_train)

from sklearn.svm import LinearSVC
linear_svm = LinearSVC().fit(X, y)
  • The main parameter of linear models is the regularization parameter, called alpha in the regression models and C in LinearSVC and LogisticRegression. Large values for alpha or small values for C mean simple models. In particular for the regression models, tuning these parameters is quite important. Usually C and alpha are searched for on a logarithmic scale.
  • The other decision you have to make is whether you want to use L1 regularization or L2 regularization. If you assume that only a few of your features are actually important, you should use L1. Otherwise, you should default to L2.
  • L1 can also be useful if interpretability of the model is important. As L1 will use only a few features, it is easier to explain which features are important to the model, and what the effects of these features are.
  • Linear models are very fast to train, and also fast to predict. They scale to very large datasets and work well with sparse data. If your data consists of hundreds of thousands or millions of samples, you might want to investigate using the solver='sag' option in LogisticRegression and Ridge, which can be faster than the default on large datasets. Other options are the SGDClassifier class and the SGDRegressor class, which implement even more scalable versions of the linear models described here.
  • Another strength of linear models is that they make it relatively easy to understand how a prediction is made. Unfortunately, it is often not entirely clear why coefficients are the way they are. This is particularly true if your dataset has highly correlated features; in these cases, the coefficients might be hard to interpret.
  • Linear models often perform well when the number of features is large compared to the number of samples. They are also often used on very large datasets, simply because it’s not feasible to train other models. However, in lower-dimensional spaces, other models might yield better generalization performance.
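
A hedged sketch of tuning the regularization strength, reusing the X_train and y_train names from the snippet above; the alpha and C values are arbitrary illustrations:

from sklearn.linear_model import Ridge, Lasso, LogisticRegression

# for Ridge and Lasso, larger alpha means more regularization (a simpler model)
ridge10 = Ridge(alpha=10).fit(X_train, y_train)
ridge01 = Ridge(alpha=0.1).fit(X_train, y_train)

# Lasso's L1 penalty drives many coefficients to exactly zero (feature selection)
lasso001 = Lasso(alpha=0.01, max_iter=100000).fit(X_train, y_train)

# for LogisticRegression (and LinearSVC), larger C means less regularization
logreg100 = LogisticRegression(C=100).fit(X_train, y_train)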

linear-models

back to current section

Naive Bayes

  • MultinomialNB and BernoulliNB have a single parameter, alpha, which controls model complexity. The way alpha works is that the algorithm adds to the data alpha many virtual data points that have positive values for all the features. This results in a “smoothing” of the statistics. A large alpha means more smoothing, resulting in less complex models. The algorithm’s performance is relatively robust to the setting of alpha, meaning that setting alpha is not critical for good performance. However, tuning it usually improves accuracy somewhat.
  • GaussianNB is mostly used on very high-dimensional data, while the other two variants of naive Bayes are widely used for sparse count data such as text. MultinomialNB usually performs better than BernoulliNB, particularly on datasets with a relatively large number of nonzero features (i.e., large documents).
  • The naive Bayes models share many of the strengths and weaknesses of the linear models. They are very fast to train and to predict, and the training procedure is easy to understand. The models work very well with high-dimensional sparse data and are relatively robust to the parameters. Naive Bayes models are great baseline models and are often used on very large datasets, where training even a linear model might take too long.
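
This section has no snippet above, so here is a hedged sketch of the three variants on a tiny made-up count dataset:

import numpy as np
from sklearn.naive_bayes import GaussianNB, BernoulliNB, MultinomialNB

# four samples with three non-negative count features and two classes
X = np.array([[0, 1, 0], [1, 0, 1], [0, 0, 2], [3, 0, 0]])
y = np.array([0, 1, 0, 1])

# GaussianNB stores per-class feature means/variances, MultinomialNB average counts,
# BernoulliNB counts of nonzero entries per class
for nb in [GaussianNB(), BernoulliNB(), MultinomialNB()]:
    nb.fit(X, y)
    print(nb.__class__.__name__, nb.predict(X))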

back to current section

Decision Trees

from sklearn.tree import DecisionTreeClassifier
tree = DecisionTreeClassifier(max_depth=4, random_state=0)
tree.fit(X_train, y_train)
tree.feature_importances_

from sklearn.tree import DecisionTreeRegressor
tree = DecisionTreeRegressor(max_depth=3).fit(X_train, y_train)
  • The parameters that control model complexity in decision trees are the pre-pruning parameters that stop the building of the tree before it is fully developed. Usually, picking one of the pre-pruning strategies—setting either max_depth, max_leaf_nodes, or min_samples_leaf—is sufficient to prevent overfitting.
  • Decision trees have two advantages over many of the algorithms we’ve discussed so far: the resulting model can easily be visualized and understood by nonexperts (at least for smaller trees), and the algorithms are completely invariant to scaling of the data. As each feature is processed separately, and the possible splits of the data don’t depend on scaling, no preprocessing like normalization or standardization of features is needed for decision tree algorithms. In particular, decision trees work well when you have features that are on completely different scales, or a mix of binary and continuous features.
  • The main downside of decision trees is that even with the use of pre-pruning, they tend to overfit and provide poor generalization performance. Therefore, in most applications, the ensemble methods we discuss next are usually used in place of a single decision tree.
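
A sketch of visualizing the fitted tree with export_graphviz, assuming the classifier above was trained on the breast cancer data loaded as cancer; the resulting tree.dot file can be rendered with the graphviz package:

from sklearn.tree import export_graphviz

# write the tree structure to a .dot file for rendering with graphviz
export_graphviz(tree, out_file="tree.dot", class_names=["malignant", "benign"],
                feature_names=cancer.feature_names, impurity=False, filled=True)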

decision-trees

back to current section

Random Forests

from sklearn.ensemble import RandomForestClassifier
forest = RandomForestClassifier(n_estimators=5, random_state=2)
forest.fit(X_train, y_train)
  • Random forests for regression and classification are currently among the most widely used machine learning methods. They are very powerful, often work well without heavy tuning of the parameters, and don’t require scaling of the data.
  • Essentially, random forests share all of the benefits of decision trees, while making up for some of their deficiencies. One reason to still use decision trees is if you need a compact representation of the decision-making process. It is basically impossible to interpret tens or hundreds of trees in detail, and trees in random forests tend to be deeper than decision trees (because of the use of feature subsets). Therefore, if you need to summarize the prediction making in a visual way to nonexperts, a single decision tree might be a better choice.
  • While building random forests on large datasets might be somewhat time consuming, it can be parallelized across multiple CPU cores within a computer easily. If you are using a multi-core processor (as nearly all modern computers do), you can use the n_jobs parameter to adjust the number of cores to use. Using more CPU cores will result in linear speed-ups (using two cores, the training of the random forest will be twice as fast), but specifying n_jobs larger than the number of cores will not help. You can set n_jobs=-1 to use all the cores in your computer.
  • You should keep in mind that random forests, by their nature, are random, and setting different random states (or not setting the random_state at all) can drastically change the model that is built. The more trees there are in the forest, the more robust it will be against the choice of random state. If you want to have reproducible results, it is important to fix the random_state.
  • Random forests don’t tend to perform well on very high dimensional, sparse data, such as text data. For this kind of data, linear models might be more appropriate. Random forests usually work well even on very large datasets, and training can easily be parallelized over many CPU cores within a powerful computer. However, random forests require more memory and are slower to train and to predict than linear models. If time and memory are important in an application, it might make sense to use a linear model instead.
  • The important parameters to adjust are n_estimators, max_features, and possibly pre-pruning options like max_depth. For n_estimators, larger is always better. Averaging more trees will yield a more robust ensemble by reducing overfitting. However, there are diminishing returns, and more trees need more memory and more time to train. A common rule of thumb is to build “as many as you have time/memory for.”
  • max_features determines how random each tree is, and a smaller max_features reduces overfitting. In general, it’s a good rule of thumb to use the default values: max_features=sqrt(n_features) for classification and max_features=log2(n_features) for regression. Adding max_features or max_leaf_nodes might sometimes improve performance. It can also drastically reduce space and time requirements for training and prediction.
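
A hedged sketch of the parameters discussed above (the values and the X_train, y_train names are illustrative):

from sklearn.ensemble import RandomForestClassifier

# more trees, sqrt(n_features) per split, and all available CPU cores
forest = RandomForestClassifier(n_estimators=100, max_features="sqrt",
                                n_jobs=-1, random_state=0)
forest.fit(X_train, y_train)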

random-forest

back to current section

Gradient Boosting Machines

from sklearn.ensemble import GradientBoostingClassifier
gbrt = GradientBoostingClassifier(random_state=0, max_depth=1, learning_rate=0.01)
gbrt.fit(X_train, y_train)
  • Gradient boosted decision trees are among the most powerful and widely used models for supervised learning. Their main drawback is that they require careful tuning of the parameters and may take a long time to train. Similarly to other tree-based models, the algorithm works well without scaling and on a mixture of binary and continuous features. As with other tree-based models, it also often does not work well on high-dimensional sparse data.
  • The main parameters of gradient boosted tree models are the number of trees, n_estimators, and the learning_rate, which controls the degree to which each tree is allowed to correct the mistakes of the previous trees. These two parameters are highly interconnected, as a lower learning_rate means that more trees are needed to build a model of similar complexity. In contrast to random forests, where a higher n_estimators value is always better, increasing n_estimators in gradient boosting leads to a more complex model, which may lead to overfitting. A common practice is to fit n_estimators depending on the time and memory budget, and then search over different learning_rates.
  • Another important parameter is max_depth (or alternatively max_leaf_nodes), to reduce the complexity of each tree. Usually max_depth is set very low for gradient boosted models, often not deeper than five splits.
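
Two hedged ways of reducing the complexity of a gradient boosted model, following the bullets above (values are illustrative):

from sklearn.ensemble import GradientBoostingClassifier

# option 1: stronger pre-pruning of the individual trees
gbrt_shallow = GradientBoostingClassifier(random_state=0, max_depth=1)
gbrt_shallow.fit(X_train, y_train)

# option 2: lower learning rate, so each tree corrects the previous ones less strongly
gbrt_slow = GradientBoostingClassifier(random_state=0, learning_rate=0.01)
gbrt_slow.fit(X_train, y_train)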

gradient-boosting-machines

back to current section

Kernelized Support Vector Machines

# Linear Models and Non-linear Features
from sklearn.svm import LinearSVC
linear_svm = LinearSVC().fit(X, y)

# Kernel Trick
from sklearn.svm import SVC
svm = SVC(kernel='rbf', C=10, gamma=0.1).fit(X, y)
sv = svm.support_vectors_
  • Kernelized support vector machines are powerful models and perform well on a variety of datasets. SVMs allow for complex decision boundaries, even if the data has only a few features. They work well on low-dimensional and high-dimensional data (i.e., few and many features), but don’t scale very well with the number of samples. Running an SVM on data with up to 10,000 samples might work well, but working with datasets of size 100,000 or more can become challenging in terms of runtime and memory usage.
  • Another downside of SVMs is that they require careful preprocessing of the data and tuning of the parameters. This is why, these days, most people instead use tree-based models such as random forests or gradient boosting (which require little or no preprocessing) in many applications. Furthermore, SVM models are hard to inspect; it can be difficult to understand why a particular prediction was made, and it might be tricky to explain the model to a nonexpert.
  • Still, it might be worth trying SVMs, particularly if all of your features represent measurements in similar units (e.g., all are pixel intensities) and they are on similar scales.
  • The important parameters in kernel SVMs are the regularization parameter C, the choice of the kernel, and the kernel-specific parameters. Although we primarily focused on the RBF kernel, other choices are available in scikit-learn. The RBF kernel has only one parameter, gamma, which is the inverse of the width of the Gaussian kernel. gamma and C both control the complexity of the model, with large values in either resulting in a more complex model. Therefore, good settings for the two parameters are usually strongly correlated, and C and gamma should be adjusted together.
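
Because of the sensitivity to feature scales, a common recipe is to rescale all features to [0, 1] before fitting the SVM; a hedged sketch (the C and gamma values are arbitrary):

from sklearn.preprocessing import MinMaxScaler
from sklearn.svm import SVC

# compute the scaling on the training set only, then apply it to both sets
scaler = MinMaxScaler().fit(X_train)
X_train_scaled = scaler.transform(X_train)
X_test_scaled = scaler.transform(X_test)

svc = SVC(kernel='rbf', C=10, gamma=0.1).fit(X_train_scaled, y_train)
print("Test set accuracy: {:.2f}".format(svc.score(X_test_scaled, y_test)))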

svm

back to current section

Neural Networks

from sklearn.neural_network import MLPClassifier
# example values for the two hidden layer sizes and the regularization strength alpha
mlp = MLPClassifier(solver='lbfgs', activation='tanh', random_state=0, hidden_layer_sizes=[10, 10], alpha=0.1)
  • Neural networks have reemerged as state-of-the-art models in many applications of machine learning. One of their main advantages is that they are able to capture information contained in large amounts of data and build incredibly complex models. Given enough computation time, data, and careful tuning of the parameters, neural networks often beat other machine learning algorithms (for classification and regression tasks).
  • This brings us to the downsides. Neural networks—particularly the large and powerful ones—often take a long time to train. They also require careful preprocessing of the data, as we saw here. Similarly to SVMs, they work best with “homogeneous” data, where all the features have similar meanings. For data that has very different kinds of features, tree-based models might work better. Tuning neural network parameters is also an art unto itself. In our experiments, we barely scratched the surface of possible ways to adjust neural network models and how to train them.
  • The most important parameters are the number of layers and the number of hidden units per layer. You should start with one or two hidden layers, and possibly expand from there. The number of nodes per hidden layer is often similar to the number of input features, but rarely higher than in the low to mid-thousands.
  • A helpful measure when thinking about the model complexity of a neural network is the number of weights or coefficients that are learned. If you have a binary classification dataset with 100 features, and you have 100 hidden units, then there are 100 * 100 = 10,000 weights between the input and the first hidden layer. There are also 100 * 1 = 100 weights between the hidden layer and the output layer, for a total of around 10,100 weights. If you add a second hidden layer with 100 hidden units, there will be another 100 * 100 = 10,000 weights from the first hidden layer to the second hidden layer, resulting in a total of 20,100 weights. If instead you use one layer with 1,000 hidden units, you are learning 100 * 1,000 = 100,000 weights from the input to the hidden layer and 1,000 x 1 weights from the hidden layer to the output layer, for a total of 101,000. If you add a second hidden layer you add 1,000 * 1,000 = 1,000,000 weights, for a whopping total of 1,101,000—50 times larger than the model with two hidden layers of size 100.
  • A common way to adjust parameters in a neural network is to first create a network that is large enough to overfit, making sure that the task can actually be learned by the network. Then, once you know the training data can be learned, either shrink the network or increase alpha to add regularization, which will improve generalization performance.
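
A hedged sketch of that workflow: rescale the data, fit a reasonably large network, then increase alpha to regularize (all sizes and values are illustrative):

from sklearn.preprocessing import StandardScaler
from sklearn.neural_network import MLPClassifier

# neural networks work best with roughly zero-mean, unit-variance features
scaler = StandardScaler().fit(X_train)
X_train_scaled = scaler.transform(X_train)
X_test_scaled = scaler.transform(X_test)

# larger alpha means stronger L2 regularization of the weights (a simpler model)
mlp = MLPClassifier(hidden_layer_sizes=[100, 100], alpha=1,
                    max_iter=1000, random_state=0)
mlp.fit(X_train_scaled, y_train)
print("Test set accuracy: {:.2f}".format(mlp.score(X_test_scaled, y_test)))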

mlp

back to current section

When working with a new dataset, it is in general a good idea to start with a simple model, such as a linear model or a naive Bayes or nearest neighbors classifier, and see how far you can get. After understanding more about the data, you can consider moving to an algorithm that can build more complex models, such as random forests, gradient boosted decision trees, SVMs, or neural networks.

comparison

back to top

Unsupervised Learning

The code in this chapter can be accessed in this notebook.

Types of Unsupervised Learning

  • Unsupervised transformations of a dataset are algorithms that create a new representation of the data which might be easier for humans or other machine learning algorithms to understand compared to the original representation of the data. A common application of unsupervised transformations is dimensionality reduction, which takes a high-dimensional representation of the data, consisting of many features, and finds a new way to represent this data that summarizes the essential characteristics with fewer features. A common application for dimensionality reduction is reduction to two dimensions for visualization purposes.
  • Another application for unsupervised transformations is finding the parts or components that “make up” the data. An example of this is topic extraction on collections of text documents. Here, the task is to find the unknown topics that are talked about in each document, and to learn what topics appear in each document. This can be useful for tracking the discussion of themes like elections, gun control, or pop stars on social media.
  • Clustering algorithms, on the other hand, partition data into distinct groups of similar items. Consider the example of uploading photos to a social media site. To allow you to organize your pictures, the site might want to group together pictures that show the same person. However, the site doesn’t know which pictures show whom, and it doesn’t know how many different people appear in your photo collection. A sensible approach would be to extract all the faces and divide them into groups of faces that look similar. Hopefully, these correspond to the same person, and the images can be grouped together for you.

back to current section

Challenges in Unsupervised Learning

  • A major challenge in unsupervised learning is evaluating whether the algorithm learned something useful. Unsupervised learning algorithms are usually applied to data that does not contain any label information, so we don’t know what the right output should be. Therefore, it is very hard to say whether a model “did well.” For example, our hypothetical clustering algorithm could have grouped together all the pictures that show faces in profile and all the full-face pictures. This would certainly be a possible way to divide a collection of pictures of people’s faces, but it’s not the one we were looking for. However, there is no way for us to “tell” the algorithm what we are looking for, and often the only way to evaluate the result of an unsupervised algorithm is to inspect it manually.
  • As a consequence, unsupervised algorithms are used often in an exploratory setting, when a data scientist wants to understand the data better, rather than as part of a larger automatic system. Another common application for unsupervised algorithms is as a preprocessing step for supervised algorithms. Learning a new representation of the data can sometimes improve the accuracy of supervised algorithms, or can lead to reduced memory and time consumption.

back to current section

Principal Component Analysis

from sklearn.datasets import load_breast_cancer
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA

# load and scale the breast cancer data so every feature has unit variance
cancer = load_breast_cancer()
X_scaled = StandardScaler().fit_transform(cancer.data)

# keep the first two principal components of the data
pca = PCA(n_components=2)
# fit PCA model to the breast cancer data and transform onto the first two components
pca.fit(X_scaled)
X_pca = pca.transform(X_scaled)
  • Principal component analysis is a method that rotates the dataset in a way such that the rotated features are statistically uncorrelated. This rotation is often followed by selecting only a subset of the new features, according to how important they are for explaining the data.

The following example illustrates the effect of PCA on a synthetic two-dimensional dataset:

pca

  • The first plot (top left) shows the original data points, colored to distinguish among them. The algorithm proceeds by first finding the direction of maximum variance, labeled “Component 1.” This is the direction (or vector) in the data that contains most of the information, or in other words, the direction along which the features are most correlated with each other. Then, the algorithm finds the direction that contains the most information while being orthogonal (at a right angle) to the first direction. In two dimensions, there is only one possible orientation that is at a right angle, but in higher-dimensional spaces there would be (infinitely) many orthogonal directions. Although the two components are drawn as arrows, it doesn’t really matter where the head and the tail are; we could have drawn the first component from the center up to the top left instead of down to the bottom right. The directions found using this process are called principal components, as they are the main directions of variance in the data. In general, there are as many principal components as original features.
  • The second plot (top right) shows the same data, but now rotated so that the first principal component aligns with the x-axis and the second principal component aligns with the y-axis. Before the rotation, the mean was subtracted from the data, so that the transformed data is centered around zero. In the rotated representation found by PCA, the two axes are uncorrelated, meaning that the correlation matrix of the data in this representation is zero except for the diagonal.
  • We can use PCA for dimensionality reduction by retaining only some of the principal components. In this example, we might keep only the first principal component, as shown in the third panel in the bottom left. This reduces the data from a two-dimensional dataset to a one-dimensional dataset. Note, however, that instead of keeping only one of the original features, we found the most interesting direction (top left to bottom right in the first panel) and kept this direction, the first principal component.
  • Finally, we can undo the rotation and add the mean back to the data. This will result in the data shown in the last panel in the bottom right. These points are in the original feature space, but we kept only the information contained in the first principal component. This transformation is sometimes used to remove noise effects from the data or visualize what part of the information is retained using the principal components.

back to current section

Non-Negative Matrix Factorization

from sklearn.decomposition import NMF
nmf = NMF(n_components=15, random_state=0)
nmf.fit(X_train)
X_train_nmf = nmf.transform(X_train)
X_test_nmf = nmf.transform(X_test)
  • Non-negative matrix factorization is another unsupervised learning algorithm that aims to extract useful features. It works similarly to PCA and can also be used for dimensionality reduction. As in PCA, we are trying to write each data point as a weighted sum of some components. But whereas in PCA we wanted components that were orthogonal and that explained as much variance of the data as possible, in NMF, we want the components and the coefficients to be nonnegative; that is, we want both the components and the coefficients to be greater than or equal to zero. Consequently, this method can only be applied to data where each feature is non-negative, as a non-negative sum of non-negative components cannot become negative.
  • The process of decomposing data into a non-negative weighted sum is particularly helpful for data that is created as the addition (or overlay) of several independent sources, such as an audio track of multiple people speaking, or music with many instruments. In these situations, NMF can identify the original components that make up the combined data. Overall, NMF leads to more interpretable components than PCA, as negative components and coefficients can lead to hard-to-interpret cancellation effects.

The following example shows the results of NMF on the two-dimensional toy data:

nmf

  • For NMF with two components, as shown on the left, it is clear that all points in the data can be written as a positive combination of the two components. If there are enough components to perfectly reconstruct the data (as many components as there are features), the algorithm will choose directions that point toward the extremes of the data.
  • If we only use a single component, NMF creates a component that points toward the mean, as pointing there best explains the data. You can see that in contrast with PCA, reducing the number of components not only removes some directions, but creates an entirely different set of components! Components in NMF are also not ordered in any specific way, so there is no “first non-negative component”: all components play an equal part.
  • NMF uses a random initialization, which might lead to different results depending on the random seed. In relatively simple cases such as the synthetic data with two components, where all the data can be explained perfectly, the randomness has little effect (though it might change the order or scale of the components). In more complex situations, there might be more drastic changes.

back to current section

Manifold Learning with t-SNE

from sklearn.datasets import load_digits
from sklearn.manifold import TSNE

digits = load_digits()
tsne = TSNE(random_state=42)
# use fit_transform instead of fit, as TSNE has no transform method
digits_tsne = tsne.fit_transform(digits.data)
  • Manifold learning algorithms are mainly aimed at visualization, and so are rarely used to generate more than two new features. Some of them, including t-SNE, compute a new representation of the training data, but don’t allow transformations of new data. This means these algorithms cannot be applied to a test set: rather, they can only transform the data they were trained for. Manifold learning can be useful for exploratory data analysis, but is rarely used if the final goal is supervised learning.
  • The idea behind t-SNE is to find a two-dimensional representation of the data that preserves the distances between points as best as possible. t-SNE starts with a random two-dimensional representation for each data point, and then tries to make points that are close in the original feature space closer, and points that are far apart in the original feature space farther apart.
  • t-SNE puts more emphasis on points that are close by, rather than preserving distances between far-apart points. In other words, it tries to preserve the information indicating which points are neighbors to each other.

t-SNE

back to current section

k-Means Clustering

from sklearn.datasets import make_blobs
from sklearn.cluster import KMeans

# synthetic two-dimensional data with three blobs
X, y = make_blobs(random_state=1)
# build the clustering model
kmeans = KMeans(n_clusters=3)
kmeans.fit(X)
kmeans.predict(X)
  • k-means clustering is one of the simplest and most commonly used clustering algorithms.
  • It tries to find cluster centers that are representative of certain regions of the data.
  • The algorithm alternates between two steps: assigning each data point to the closest cluster center, and then setting each cluster center as the mean of the data points that are assigned to it.
  • The algorithm is finished when the assignment of instances to clusters no longer changes.

The following example illustrates the algorithm on a synthetic dataset:

k-Means

  • k-means is a very popular algorithm for clustering, not only because it is relatively easy to understand and implement, but also because it runs relatively quickly. k-means scales easily to large datasets, and scikit-learn even includes a more scalable variant in the MiniBatchKMeans class, which can handle very large datasets.
  • One of the drawbacks of k-means is that it relies on a random initialization, which means the outcome of the algorithm depends on a random seed. By default, scikit-learn runs the algorithm 10 times with 10 different random initializations, and returns the best result.
  • Further downsides of k-means are the relatively restrictive assumptions made on the shape of clusters, and the requirement to specify the number of clusters you are looking for (which might not be known in a real-world application).

back to current section

Agglomerative Clustering

from sklearn.cluster import AgglomerativeClustering
agg = AgglomerativeClustering(n_clusters=3)
assignment = agg.fit_predict(X)
  • Agglomerative clustering refers to a collection of clustering algorithms that all build upon the same principles: the algorithm starts by declaring each point its own cluster, and then merges the two most similar clusters until some stopping criterion is satisfied.
  • The stopping criterion implemented in scikit-learn is the number of clusters, so similar clusters are merged until only the specified number of clusters are left.
  • There are several linkage criteria that specify how exactly the “most similar cluster” is measured. This measure is always defined between two existing clusters.

The following plot illustrates the progression of agglomerative clustering on a two-dimensional dataset, looking for three clusters:

agglomerative-clustering

back to current section

DBSCAN

from sklearn.cluster import DBSCAN
dbscan = DBSCAN()
clusters = dbscan.fit_predict(X)
  • Another very useful clustering algorithm is DBSCAN (which stands for “density-based spatial clustering of applications with noise”). The main benefits of DBSCAN are that it does not require the user to set the number of clusters a priori, it can capture clusters of complex shapes, and it can identify points that are not part of any cluster.
  • DBSCAN is somewhat slower than agglomerative clustering and k-means, but still scales to relatively large datasets.
  • DBSCAN works by identifying points that are in “crowded” regions of the feature space, where many data points are close together. These regions are referred to as dense regions in feature space. The idea behind DBSCAN is that clusters form dense regions of data, separated by regions that are relatively empty.

DBSCAN

back to current section

back to top

Representing Data and Engineering Features

The code in this chapter can be accessed in this notebook.

Categorical Variables

  • By far the most common way to represent categorical variables is using the one-hot-encoding or one-out-of-N encoding, also known as dummy variables. The idea behind dummy variables is to replace a categorical variable with one or more new features that can have the values 0 and 1. The values 0 and 1 make sense in the formula for linear binary classification, and we can represent any number of categories by introducing one new feature per category.
  • The get_dummies function automatically transforms all columns that have object type or are categorical.
data_dummies = pd.get_dummies(data)
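
A small sketch with a made-up DataFrame, showing that only the string column is expanded into dummy columns while the integer column is left untouched:

import pandas as pd

demo = pd.DataFrame({'Integer Feature': [0, 1, 2, 1],
                     'Categorical Feature': ['socks', 'fox', 'socks', 'box']})
# each distinct category becomes its own 0/1 column
demo_dummies = pd.get_dummies(demo)
print(list(demo_dummies.columns))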

back to current section

Univariate Nonlinear Transformations

  • Adding squared or cubed features can help linear models for regression. There are other transformations that often prove useful for transforming certain features: in particular, applying mathematical functions like log, exp, or sin. While tree-based models only care about the ordering of the features, linear models and neural networks are very tied to the scale and distribution of each feature, and if there is a nonlinear relation between the feature and the target, that becomes hard to model — particularly in regression. The functions log and exp can help by adjusting the relative scales in the data so that they can be captured better by a linear model or neural network.
  • Most models work best when each feature (and in regression also the target) is loosely Gaussian distributed—that is, a histogram of each feature should have something resembling the familiar “bell curve” shape. Using transformations like log and exp is a hacky but simple and efficient way to achieve this. A particularly common case when such a transformation can be helpful is when dealing with integer count data. By count data, we mean features like “how often did user A log in?” Counts are never negative, and often follow particular statistical patterns.
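
A hedged sketch of the log transform for count features, assuming X_train and X_test hold non-negative counts:

import numpy as np

# log(X + 1) avoids log(0) and compresses large counts,
# making the feature distributions more Gaussian-like
X_train_log = np.log(X_train + 1)
X_test_log = np.log(X_test + 1)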

back to current section

Automatic Feature Selection

  • In univariate statistics, we compute whether there is a statistically significant relationship between each feature and the target. Then the features that are related with the highest confidence are selected. In the case of classification, this is also known as analysis of variance (ANOVA). A key property of these tests is that they are univariate, meaning that they only consider each feature individually. Consequently, a feature will be discarded if it is only informative when combined with another feature. Univariate tests are often very fast to compute, and don’t require building a model. On the other hand, they are completely independent of the model that you might want to apply after the feature selection.
from sklearn.feature_selection import SelectPercentile
# use f_classif (the default) and SelectPercentile to select 50% of features
select = SelectPercentile(percentile=50)
select.fit(X_train, y_train)
# transform training set
X_train_selected = select.transform(X_train)
  • Model-based feature selection uses a supervised machine learning model to judge the importance of each feature, and keeps only the most important ones. The supervised model that is used for feature selection doesn’t need to be the same model that is used for the final supervised modeling. The feature selection model needs to provide some measure of importance for each feature, so that they can be ranked by this measure.
from sklearn.feature_selection import SelectFromModel
from sklearn.ensemble import RandomForestClassifier
select = SelectFromModel(RandomForestClassifier(n_estimators=100, random_state=42), threshold="median")
select.fit(X_train, y_train)
X_train_l1 = select.transform(X_train)
  • In iterative feature selection, a series of models are built, with varying numbers of features. There are two basic methods: starting with no features and adding features one by one until some stopping criterion is reached, or starting with all features and removing features one by one until some stopping criterion is reached. Because a series of models are built, these methods are much more computationally expensive than the methods we discussed previously. One particular method of this kind is recursive feature elimination (RFE), which starts with all features, builds a model, and discards the least important feature according to the model. Then a new model is built using all but the discarded feature, and so on until only a pre-specified number of features are left. For this to work, the model used for selection needs to provide some way to determine feature importance, as was the case for the model-based selection.
from sklearn.feature_selection import RFE
select = RFE(RandomForestClassifier(n_estimators=100, random_state=42), n_features_to_select=40)
select.fit(X_train, y_train)
X_train_rfe = select.transform(X_train)
X_test_rfe = select.transform(X_test)

back to current section

back to top

Model Evaluation and Improvement

The code in this chapter can be accessed in this notebook.

Cross Validation

from sklearn.model_selection import cross_val_score
scores = cross_val_score(logreg, iris.data, iris.target)
  • Cross-validation is a statistical method of evaluating generalization performance that is more stable and thorough than using a split into a training and a test set. In cross-validation, the data is instead split repeatedly and multiple models are trained.
  • The most commonly used version of cross-validation is k-fold cross-validation, where k is a user-specified number, usually 5 or 10.
  • When performing five-fold cross-validation, the data is first partitioned into five parts of (approximately) equal size, called folds.
  • Next, a sequence of models is trained. The first model is trained using the first fold as the test set, and the remaining folds (2–5) are used as the training set. The model is built using the data in folds 2–5, and then the accuracy is evaluated on fold 1.
  • Then another model is built, this time using fold 2 as the test set and the data in folds 1, 3, 4, and 5 as the training set. This process is repeated using folds 3, 4, and 5 as test sets.
  • For each of these five splits of the data into training and test sets, we compute the accuracy. In the end, we have collected five accuracy values.

The process is illustrated below:

cross-validation

Benefits of Cross-Validation:

  • First, remember that train_test_split performs a random split of the data. Imagine that we are “lucky” when randomly splitting the data, and all examples that are hard to classify end up in the training set. In that case, the test set will only contain “easy” examples, and our test set accuracy will be unrealistically high. Conversely, if we are “unlucky,” we might have randomly put all the hard-to-classify examples in the test set and consequently obtain an unrealistically low score. However, when using cross-validation, each example will be in the test set exactly once: each example is in one of the folds, and each fold is the test set once. Therefore, the model needs to generalize well to all of the samples in the dataset for all of the cross-validation scores (and their mean) to be high.
  • Another benefit of cross-validation as compared to using a single split of the data is that we use our data more effectively. When using train_test_split, we usually use 75% of the data for training and 25% of the data for evaluation. When using five-fold cross-validation, in each iteration we can use four-fifths of the data (80%) to fit the model. When using 10-fold cross-validation, we can use nine-tenths of the data (90%) to fit the model. More data will usually result in more accurate models.
  • The main disadvantage of cross-validation is increased computational cost. As we are now training k models instead of a single model, cross-validation will be roughly k times slower than doing a single split of the data.
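
A hedged sketch of five-fold cross-validation on the iris dataset, reporting the five scores and their mean (max_iter is raised only to avoid convergence warnings):

from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

iris = load_iris()
logreg = LogisticRegression(max_iter=1000)

# five models are trained; each fold serves as the test set exactly once
scores = cross_val_score(logreg, iris.data, iris.target, cv=5)
print("Cross-validation scores: {}".format(scores))
print("Average cross-validation score: {:.2f}".format(scores.mean()))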

back to current section

Metrics in Model Selection

explicit_accuracy = cross_val_score(SVC(), digits.data, digits.target == 9, scoring="accuracy", cv=5)
roc_auc = cross_val_score(SVC(), digits.data, digits.target == 9, scoring="roc_auc", cv=5)
  • We often want to use metrics like AUC in model selection using GridSearchCV or cross_val_score. Luckily scikit-learn provides a very simple way to achieve this, via the scoring argument that can be used in both GridSearchCV and cross_val_score. You can simply provide a string describing the evaluation metric you want to use.
  • The most important values for the scoring parameter for classification are accuracy (the default); roc_auc for the area under the ROC curve; average_precision for the area under the precision-recall curve; and f1, f1_macro, f1_micro, and f1_weighted for the binary f1-score and its different weighted variants. For regression, the most commonly used values are r2 for the R^2 score, neg_mean_squared_error for (negated) mean squared error, and neg_mean_absolute_error for (negated) mean absolute error.
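
The same scoring argument works in GridSearchCV; a hedged sketch, assuming X_train and y_train come from a train/test split of the digits data with the “nine vs. rest” target used above (the parameter grid is illustrative):

from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

# select gamma by area under the ROC curve instead of accuracy
param_grid = {'gamma': [0.0001, 0.001, 0.01, 0.1, 1]}
grid = GridSearchCV(SVC(), param_grid=param_grid, scoring="roc_auc", cv=5)
grid.fit(X_train, y_train)
print("Best parameters:", grid.best_params_)
print("Best cross-validation AUC: {:.3f}".format(grid.best_score_))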

back to current section

Key Takeaways

  • Cross-validation or the use of a test set allow us to evaluate a machine learning model as it will perform in the future. However, if we use the test set or cross-validation to select a model or select model parameters, we “use up” the test data, and using the same data to evaluate how well our model will do in the future will lead to overly optimistic estimates. We therefore need to resort to a split into training data for model building, validation data for model and parameter selection, and test data for model evaluation. Instead of a simple split, we can replace each of these splits with cross-validation. The most commonly used form is a training/test split for evaluation, and using cross-validation on the training set for model and parameter selection.
  • It is rarely the case that the end goal of a machine learning task is building a model with a high accuracy. Make sure that the metric you choose to evaluate and select a model for is a good stand-in for what the model will actually be used for. In reality, classification problems rarely have balanced classes, and often false positives and false negatives have very different consequences. Make sure you understand what these consequences are, and pick an evaluation metric accordingly.

back to current section

back to top

Algorithm Chains and Pipelines

The code in this chapter can be accessed in this notebook.

  • The Pipeline class is a general-purpose tool to chain together multiple processing steps in a machine learning workflow. Using pipelines allows us to encapsulate multiple steps into a single Python object that adheres to the familiar scikit-learn interface of fit, predict, and transform.
  • In particular when doing model evaluation using cross-validation and parameter selection using grid search, using the Pipeline class to capture all the processing steps is essential for proper evaluation. The Pipeline class also allows writing more succinct code, and reduces the likelihood of mistakes that can happen when building processing chains without the pipeline class (like forgetting to apply all transformers on the test set, or not applying them in the right order).
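
A hedged sketch of chaining scaling and an SVM with make_pipeline and grid-searching over the whole pipeline, so the scaler is refit on the training portion of every cross-validation split (parameter values and the X_train, y_train names are illustrative):

from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV

# make_pipeline names the steps after their lowercased class names
pipe = make_pipeline(StandardScaler(), SVC())

# parameters of a step are addressed as <step name>__<parameter name>
param_grid = {'svc__C': [0.01, 0.1, 1, 10, 100],
              'svc__gamma': [0.01, 0.1, 1, 10, 100]}
grid = GridSearchCV(pipe, param_grid=param_grid, cv=5)
grid.fit(X_train, y_train)
print("Best parameters:", grid.best_params_)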

pipeline

back to top

Wrapping Up

Approaching a Machine Learning Problem

  • To make effective use of machine learning, we need to take a step back and consider the problem at large. First, you should think about what kind of question you want to answer. Do you want to do exploratory analysis and just see if you find something interesting in the data? Or do you already have a particular goal in mind? If you have such a goal, before building a system to achieve it, you should first think about how to define and measure success, and what the impact of a successful solution would be on your overall business or research goals.
  • The next steps are usually acquiring the data and building a working prototype. While trying out models, keep in mind that this is only a small part of a larger data science workflow, and model building is often part of a feedback circle of collecting new data, cleaning data, building models, and analyzing the models. Analyzing the mistakes a model makes can often be informative about what is missing in the data, what additional data could be collected, or how the task could be reformulated to make machine learning more effective. Collecting more or different data or changing the task formulation slightly might provide a much higher payoff than running endless grid searches to tune parameters.

From Prototype to Production

  • Many companies have complex infrastructure, and it is not always easy to include Python in these systems. That is not necessarily a problem. In many companies, the data analytics teams work with languages like Python and R that allow the quick testing of ideas, while production teams work with languages like Go, Scala, C++, and Java to build robust, scalable systems. Data analysis has different requirements from building live services, and so using different languages for these tasks makes sense. A relatively common solution is to reimplement the solution that was found by the analytics team inside the larger framework, using a high-performance language. This can be easier than embedding a whole library or programming language and converting from and to the different data formats.
  • Regardless of whether you can use scikit-learn in a production system or not, it is important to keep in mind that production systems have different requirements from one-off analysis scripts. If an algorithm is deployed into a larger system, software engineering aspects like reliability, predictability, runtime, and memory requirements gain relevance. Simplicity is key in providing machine learning systems that perform well in these areas. Critically inspect each part of your data processing and prediction pipeline and ask yourself how much complexity each step creates, how robust each component is to changes in the data or compute infrastructure, and if the benefit of each component warrants the complexity.

Testing Production Systems

  • In this book, we covered how to evaluate algorithmic predictions based on a test set that we collected beforehand. This is known as offline evaluation. If your machine learning system is user-facing, this is only the first step in evaluating an algorithm, though. The next step is usually online testing or live testing, where the consequences of employing the algorithm in the overall system are evaluated. Changing the recommendations or search results users are shown by a website can drastically change their behavior and lead to unexpected consequences.
  • To protect against these surprises, most user-facing services employ A/B testing, a form of blind user study. In A/B testing, without their knowledge a selected portion of users will be provided with a website or service using algorithm A, while the rest of the users will be provided with algorithm B. For both groups, relevant success metrics will be recorded for a set period of time. Then, the metrics of algorithm A and algorithm B will be compared, and a selection between the two approaches will be made according to these metrics. Using A/B testing enables us to evaluate the algorithms “in the wild,” which might help us to discover unexpected consequences when users are interacting with our model. Often A is a new model, while B is the established system.

Building Your Own Estimator

from sklearn.base import BaseEstimator, TransformerMixin

class MyTransformer(BaseEstimator, TransformerMixin):
    def __init__(self, first_parameter=1, second_parameter=2):
        # all parameters must be specified in the __init__ function
        # and stored, unmodified, as attributes of the same name
        self.first_parameter = first_parameter
        self.second_parameter = second_parameter

    def fit(self, X, y=None):
        # fit should only take X and y as parameters
        # even if your model is unsupervised, you need to accept a y argument!

        # Model fitting code goes here
        print("fitting the model right here")
        # fit returns self
        return self

    def transform(self, X):
        # transform takes as parameter only X

        # apply some transformation to X:
        X_transformed = X + 1
        return X_transformed
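
A brief usage sketch: because MyTransformer follows the scikit-learn interface, it can be fit and applied like any built-in transformer (and used inside a Pipeline):

import numpy as np

X = np.array([[0.0, 1.0], [2.0, 3.0]])
transformer = MyTransformer(first_parameter=1, second_parameter=2)
# fit prints its message and returns self; transform returns X + 1
X_new = transformer.fit(X).transform(X)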

back to top

Setup

To run the code, you need the packages numpy, scipy, scikit-learn, matplotlib, pandas, and pillow. Some of the visualizations of decision trees and neural network structures also require graphviz. The chapter on text processing also requires nltk and spacy.

The easiest way to set up an environment is by installing Anaconda.

Installing packages with conda:

If you already have a Python environment set up, and you are using the conda package manager, you can get all packages by running

conda install numpy scipy scikit-learn matplotlib pandas pillow graphviz python-graphviz

For the chapter on text processing you also need to install nltk and spacy:

conda install nltk spacy

Installing packages with pip

If you already have a Python environment and are using pip to install packages, you need to run

pip install numpy scipy scikit-learn matplotlib pandas pillow graphviz

You also need to install the graphviz C library, which is easiest using a package manager. If you are using OS X and homebrew, you can brew install graphviz. If you are on Ubuntu or Debian, you can apt-get install graphviz. Installing graphviz on Windows can be tricky, so using conda / anaconda is recommended.

For the chapter on text processing you also need to install nltk and spacy:

pip install nltk spacy

Downloading the English language model

For the text processing chapter, you need to download the English language model for spacy using

python -m spacy download en

cover