Using data manipulation to explore and analyse the statistics of test scores in public schools in New York. This quick project is mainly focused on training to sort and subset data, rounding values and group summary statistics.
The goal was to analyse data to answer on these questions:
- The best math results are at least 80% of the maximum possible score of 800 for math.
- Save the results by creating a new column named "total_SAT", with results are the sum of math, reading, and writing scores.
- The DataFrame should contain one row, with the name of the NYC borough with the largest standard deviation of "total_SAT", the number of schools in the borough, the mean of "total_SAT", the standard deviation of "total_SAT". Also, round all numeric values to two decimal places.
notebook.ipynb– main notebookschools.xlsx– source dataschoolbus.jpg– visual cover photo
- Clone the repo
- Open
notebook.ipynbin Jupyter Notebook