As data analysts or data scientists, we find that we don't always get the data we need, but the data that we deserve. Many times it is up to us to extract meaningful information from our data. It could be done by transforming the data using derived columns, grouping the data and using aggregated information, or cleaning and reformatting the data. We will explore these techniques in this lab.
Before working on today's main lab, we'd like you to review some Pandas functions. Open the Pandas-concat-merge-join.ipynb file in the your-code directory, review the information there, and warm up yourself by doing some coding.
Then open the main.ipynb file in the your-code directory. Follow the instructions and add your code and explanations as necessary. By the end of this lab, you will have learned how to prepare a dataset for most scikit-learn algorithms.
Pandas-concat-merge-join.ipynbwith your responses.main.ipynbwith your responses.
Upon completion, add your deliverables to git. Then commit git and push your branch to the remote.
