Path: blob/master/lessons/lesson_12/python-notebooks-data-wrangling/Data-Extraction--CDE-XLS.ipynb
1904 views
Kernel: Python 3
Data extraction -- California Dept. of Education Spreadsheets
A set of examples on how to extract machine-readable data from the raw, official sources. No pandas needed, just requests and regex and xlrd (for Excel spreadsheets)
(in progress)
File system setup
In [1]:
In [2]:
California SAT score results, 2014
TK
http://www.cde.ca.gov/ds/sp/ai/
Parsing and wrangling SAT data file
In [3]:
Out[3]:
Wrote 2478 rows to data/schools/extracted/sat-2014.csv
California student poverty/free-or-reduced-price-lunch data, 2014
TKTK
http://www.cde.ca.gov/ds/sd/sd/filessp.asp
Parsing and wrangling free-and-reduced lunch data file
In [4]:
Out[4]:
Wrote 10361 rows to data/schools/extracted/frpm-2014.csv
In [ ]: