Data analysis
The actual dataset (they also have documentation and a separate api, also contains csv datasets) https://collegescorecard.ed.gov/data/
You can find the full report here
http://nbviewer.jupyter.org/github/jayeshkv/Scalable-Data-Analysis/blob/master/Report.ipynb