The main purpose of this project is to create Polars script to analyze the job application data, including generating summary statistics and visualization chart.
The data used in this analysis comes from a dataset published by data.lacity.org on Data.Gov website, and the data version is as of September 15, 2023. You can find more information about data source via link here: https://catalog.data.gov/dataset/job-applicants-by-gender-and-ethnicity
- Read job applicant csv file into dataframe.
- Create statistic functions to summarize key indicators, such as mean, median, standard deviation, etc.
- Visualize data using matplotlib.
- Ethnicity distribution shows Hispanic and Black applicants as the largest groups, with significant representation from Caucasian applicants as well. Other ethnic groups have much smaller representation.
- These findings suggest potential areas for our future study to increase diversity in the applicant pool.