The objective of this project is to analyse the New York city job posting dataset.
- This dataset contains current job postings available on the City of New York’s official jobs site (http://www.nyc.gov/html/careers/html/search/search.shtml).
- The data can be found here: https://www.kaggle.com/new-york-city/new-york-city-current-job-postings
- The jobs are either internal postings available to city employees or external postings available to the general public are included.
I am running a data science project on the dataset to answer below questions:
Highest paid Skills in the US market? Job categories for niche skills (from previous step? Applying clustering concepts to show different salary ranges based on job category and years of experience.
- Data exploratry analysis
- Data wrangling
- feature engineering
- Clustering algorithm
- Machine learning algorithms for salary prediction using linear regression, decision tree & random forest
- Analysing the models performance