Skip to content

Latest commit

 

History

History
30 lines (18 loc) · 1.41 KB

README.md

File metadata and controls

30 lines (18 loc) · 1.41 KB

Python Data Exploration

This is an example of how to use Python and Jupyter Notebook to explore a given dataset with Pandas and Seaborn.

Requirements

To get started, create a virtual environment and install the requirements in it.

$ python3 -m venv venv
$ pip install pandas seaborn jupyterlab

After that, you can launch your notebook server by running

$ jupyter-lab

Of course you can use your own environment as well. Obviously you need Python. I'd recommend installing Anaconda, which provides you with all the fancy Python data-science libraries.

If you have python installed, make sure to install Jupyter Notebook, as well as the necessary python packages for Pandas and Seaborn via pip.

You can see the steps in Exploring Datasets.ipynb. Github will render the Jupyter Notebooks so you don't have to download it. If you want to try it yourself, feel free to do it inside Exploring Datasets Yourself.ipynb.

Source

Fair, Ray. 1978. “A Theory of Extramarital Affairs,” Journal of Political Economy, February, 45-61.

The data is available at http://fairmodel.econ.yale.edu/rayfair/pdf/2011b.htm