-
Couldn't load subscription status.
- Fork 26
7. Python Data Analysis
Learn Python modules that make data analysis and visualization easy.
NumPy is the fundamental package for scientific computing in Python. It is a Python library that provides a multidimensional array object, various derived objects (such as masked arrays and matrices), and an assortment of routines for fast operations on arrays, including mathematical, logical, shape manipulation, sorting, selecting, I/O, discrete Fourier transforms, basic linear algebra, basic statistical operations, random simulation and much more.
Read through the first 2 sections of the NumPy User Guide
matplotlib is a python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive environments across platforms. matplotlib can be used in python scripts, the python and ipython shell, web application servers, and six graphical user interface toolkits.
matplotlib tries to make easy things easy and hard things possible. You can generate plots, histograms, power spectra, bar charts, errorcharts, scatterplots, etc, with just a few lines of code.
Get Started - Nicolas P. Rougier matplotlib tutorial
If you enjoy charting, explore Seaborn and their official tutorial. It is an advanced charting library built on top of matplotlib.
pandas is an open source library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
We will use pandas extensively throughout the rest of the Foundations and main curriculum. It's use in data science is comparable to Microsoft Excel in the business world.
Get Started - 10 Minutes to pandas
Optional - pandas Cookbook
- Complete the tutorials for NumPy, matplotlib and pandas.
- For more background, go through this set of Jupyter Notebooks
Complete the notebooks in order of difficulty.
- College Majors - Notebook and Data
- Police Killings - Notebook and Data (encoded in
ISO-8859-1)
- Bangalore Weather - Notebook and Data (tab separated file)
- Thanksgiving Dinner - Notebook and Data (encoded in
Latin-1)