Created And Manipulated Pandas DataFrames To Analyze School, Student's Budgets And Standardized Test Data.
As You are the new Chief Data Scientist for your city's school district. In this capacity, you'll be helping the school board and mayor make strategic decisions regarding future school budgets and priorities. As a first task, you've been asked to analyze the district-wide standardized test results. You'll be given access to every student's math and reading scores, as well as various information on the schools they attend. Your task is to aggregate the data to showcase obvious trends in school performance.
- Created a new repository for this project called
pandas-challenge,Clonedthe new repository to local byterminal. - Inside my local Git repository, created a folder for
"PyCitySchools" - Added
Jupyter notebook"(PyCitySchools_starter_Roshni.ipynb)" to this folder. This is the main script to run this analysis. - A
Resourcesfolder that contains the csv files(Raw Data) i have used. - Also this folder that contains "pdf" file that has the results from the conducted analysis.
- Pushed these changes to
GitHubprofile bybash terminal.
Note
Useful information that users should know, even when skimming content.
Using Pandas and Jupyter Notebook, created a report that includes the following data. Report is included a written description of at least two observable trends based on the data.
Check out the sample solution called "PyCitySchools_starter.ipynb" located in the "PyCitySchools" folder
to review the desired edges for this analysis.
Important
Key information users need to know to achieve their goal.
Performed the necessary calculations and then created a high-level snapshot of the district's key metrics in a DataFrame.
Include the following:
- Total number of unique schools
- Total students, Total budget
Averagemath scoreAveragereading score% passingmath (the percentage of students who passed math)% passingreading (the percentage of students who passed reading)% overall passing(the percentage of students who passed math AND reading)
Performed the necessary calculations and then created a DataFrame that summarizes key metrics about each school.
Include the following:
- School name
- School type
- Total students
- Total school
budget - Per student
budget Averagemath scoreAveragereading score% passing math(the percentage of students who passed math)% passing reading(the percentage of students who passed reading)% overall passing(the percentage of students who passed math AND reading)
Sorted the schools by % Overall Passing in descending order and display the top 5 rows.
Saved the results in a DataFrame called "top_schools".
Sorted the schools by % Overall Passing in ascending order and displaied the top 5 rows.
Saved the results in a DataFrame called "bottom_schools".
Performed the necessary calculations to create a DataFrame that lists the average math score for students of each grade level (9th, 10th, 11th, 12th) at each school.
Created a DataFrame that lists the average reading score for students of each grade level (9th, 10th, 11th, 12th) at each school.
- Created a
tablethat breaks down school performance based onaveragespending ranges (per student). - Used required code to create four
binswith reasonable cutoff values togroupschool spending.
- Used
pd.cutto categorize spending based on thebins. - Created the following
codeto calculatemeanscores per spending range
spending_math_scores = school_spending_df.groupby(["Spending Ranges (Per Student)"])["Average Math Score"].mean()
spending_reading_scores = school_spending_df.groupby(["Spending Ranges (Per Student)"])["Average Reading Score"].mean()
spending_passing_math = school_spending_df.groupby(["Spending Ranges (Per Student)"])["% Passing Math"].mean()
spending_passing_reading = school_spending_df.groupby(["Spending Ranges (Per Student)"])["% Passing Reading"].mean()
overall_passing_spending = school_spending_df.groupby(["Spending Ranges (Per Student)"])["% Overall Passing"].mean()
- Used the scores above to create a
DataFramecalled "spending_summary".
Averagemath scoreAveragereading score% passing math(the percentage of students who passed math)% passing reading(the percentage of students who passed reading)% overall passing(the percentage of students who passed math AND reading)
- Created the required code to bin the
per_school_summary.
- Used "
pd.cut" on the "Total Students" column of the "per_school_summary" DataFrame. - Created a DataFrame called "
size_summary" that breaks down school performance based onschoolsize (small, medium, or large).
Used the "per_school_summary" DataFrame from the previous step to create a new DataFrame called "type_summary".
Important
Key information users need to know to achieve their goal.
The analysis focused on evaluating the academic performance of schools based on various factors such as school type, size, and per student budget. It examined average math and reading scores, as well as the percentage of students passing math, reading, and overall. The data revealed significant differences between Charter and District schools, with Charter schools consistently performing better across all metrics. Additionally, the analysis explored the relationship between per student budget and academic performance, noting that higher budget allocation did not always correlate with better outcomes.
- This suggests that how factors beyond financial resources, such as teaching quality and school culture, may influence student achievements.
Overall, the analysis highlights the importance of considering multiple factors when assessing school performance and emphasizes the need for further investigation into effective educational practices.
Note
Useful information that users should know, even when skimming content.
I. In every criterion, charter schools did better than district schools. Overall passing rates, percentages of children passing math and reading, and average math and reading scores were all higher in charter schools.
II. The average math and reading scores for charter schools were 83.47 and 83.90, respectively, considerably higher than the average scores of 76.96 and 80.97 for district schools.
III.Additionally, the passing percentages in arithmetic (93.62% vs. 66.55%) and reading (96.59% vs. 80.80%) were much better in Charter schools, leading to an overall passing rate of 90.43% compared to 53.67% for District schools.
I.The investigation also looked into how each student's budget affected their academic achievement. Unexpectedly, schools with smaller funds per student frequently fared better than those with larger budgets.
II.91.33% was the highest overall passing percentage for schools in the lowest budget category (<$585), while 54.64% was the highest for schools in the highest budget group ($645–680).
III. This shows that a number of variables, including school administration, resource distribution, and the caliber of instruction, are more important in predicting academic performance than financial constraints.
The research emphasizes how crucial school style and efficient resource use are in affecting student success. Charter schools routinely outperform District schools, while sometimes having smaller funding per student. This shows how important the instructional ideas and approaches used by these schools are.
The results also cast doubt on the idea that larger budgets equate to improved academic performance, emphasizing the need for a more complex understanding of the variables influencing school achievement outside of financial resources. This suggests that spending more on student support services, curriculum development, and teacher training might improve student accomplishment more than just raising funding levels.
Important
Key information users need to know to achieve their goal.
In conclusion, while financial resources are important for educational institutions, the analysis underscores the significance of other factors such as school type and teaching methodologies in driving academic excellence. Moreover, it highlights the need for further investigation into the specific practices and approaches employed by Charter schools that contribute to their superior performance compared to District schools.






