Python notebook for salary prediction with Kaggle dataset + machine learning models.
This project explores data science salaries using Kaggle’s dataset with Python and machine learning models.
👉 Full annotated notebook available here: Buy on Gumroad
- Data Science Salaries Python Notebook.ipynb: The main Python notebook
- ds_salaries.csv: The underlying dataset used for analysis and model training
To run the .ipynb file, you’ll need:
- Python 3.7+
- Jupyter Notebook or JupyterLab
- Go to Anaconda Download
- Jupyter Notebook is part of Anaconda.
- Click Get Started
- Sign in using your Google account, if required.
- Download the installer for your operating system (Windows, Mac, or Linux).
- After installation, open Anaconda Navigator from your Start Menu or Applications folder.
- Additional installations required: xgboost, scipy, and sklearn.tree are not included with Anaconda. You can install them using the following command in the command prompt: pip install xgboost scipy scikit-learn
- In Anaconda Navigator, click “Launch” under Jupyter Notebook
- Your browser will open with the Jupyter interface. Navigate to the file Data Science Salaries Python Notebook.ipynb and start exploring. No additional installations needed - required libraries like pandas, numpy, and scikit-learn are already included with Anaconda.