Path: blob/master/lessons/lesson_12/python-notebooks-data-wrangling/Visualization--School-Scores-and-Poverty.ipynb
1904 views
Analyzing California high school SAT performance vs. poverty level
In this example, we use pandas and its merge function to analyze the relationship betweem SAT performance and percentage of students eligible for free and reduced-price lunch (a common proxy for poverty).
You can read more about the data collection at this notebook: Data-Extraction--CDE-XLS.ipynb
Warmup: single dataset analysis
The SAT dataset includes a breakdown of scores by reading, writing, and math.
Are high SAT reading scores correlated with high SAT writing scores?
Looks like there's a correlation between reading and writing scores. What about writing and math?
Relationship between poverty and SAT scores
Bigger questions require more datasets. Let's test the relationship between poverty and SAT scores.
The relationship is not as highly correlated as reading vs. writing scores, but with few exceptions, schools that have a high percentage of students eligible for free/reduced-price lunch have a lower percentage of students who score above 1500.