Skip to content

izmendi/CursoML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Tutorial: Machine Learning with scikit-learn

Presented by Izaskun Mendia at TECNALIA on Nov 29-30, Dec 1-13, 2016.

Hours Allocation Code: 058585_20200

Description

Although numeric data is easy to work with in Python, most knowledge created by humans is actually raw, unstructured text. By learning how to transform text into data that is usable by machine learning models, you drastically increase the amount of data that your models can learn from. In this tutorial, we'll build and evaluate predictive models from real-world text using scikit-learn.

Objectives

By the end of this tutorial, attendees will be able to confidently build a predictive model, including feature extraction, model building and model evaluation.

Required Software

Attendees will need scikit-learn and pandas (and their dependencies) already installed. Installing the Anaconda distribution of Python is an easy way to accomplish this. Both Python 2 and 3 are welcome.

I will be leading the tutorial using the IPython/Jupyter notebook, and have added a pre-written notebook to this repository. I have also created a Python script that is identical to the notebook, which you can use in the Python environment of your choice.

Tutorial Files

Prerequisite Knowledge

Attendees to this tutorial should be comfortable working in Python, should understand the basic principles of machine learning, and should have at least basic experience with both pandas and scikit-learn. However, no knowledge of advanced mathematics is required.

  • If you need a refresher on pandas, I recommend reviewing the notebook of this 3-part tutorial, and also 00_pandas.ipynb.

Recommended Resources

Resources Machine Learning Intro

Resources for Learning Python

IPython and Markdown resources:

Resources Getting started

Resources Training a learning model

Resources Comparing models

Resources Linear Regression

Pandas:

Seaborn:

Confusion Matrix Resources

ROC and AUC Resources

Other Resources

About

Curso sobre Machine Learning y scikit-learn

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published