Jupyter notebook 1-Filtering for only legal cardiologists.ipynb
Notebook #1: Filtering for only legal cardiologists
This notebook...
deduplicates the given 100 NPIs (down to 97)
join in demographic information from data source #1 (data.medicare.gov physician compare)
filter out non-cardiologists
check to make sure none of the cardiologists are on the HHS OIG Exclusions list
Checkpoint (⚑) 1: Dataset of 97 NPIs with basic demographic info
Basic demographic information from source #1 merged in with the list of unique NPIs provided; a sample of the data is below
If the ACO is looking for only cardiologists, we should probably filter non-cardiologists out.
First let's see what specialties are represented in our given list of providers -- perhaps multiple specialities fall within cardiology and/or cardiology shows up in secondary specialties
CARD* shows up in primary and secondary specialty #1 -- let's see if the instance of secondary specialty is assigned to a provider with the primary specialty.
Note: there were 5 providers with no specialty which we will rule out per a documented assumption
==> there are 9 providers with a specialty related to the heart
8x 'CARDIOVASCULAR DISEASE (CARDIOLOGY)'
1x 'CARDIAC SURGERY'
The provider with a secondary specialty 1 of cardiac surgery has a primary specialty of cardiology.
Per the previously documented assumption, we are going to discard the cardiac surgeon bringing us to 8 potential providers to potentially invite to the ACO
⚑ 2: Narrowed given NPIs to 8 Cardiologists:
⚑ 8 cardiologists in the running (none are in the HHS OIG exclusion list)
So.. now we have 8 cardiologists but how do we prioritize them? Even if we could invite all eight, we should focus our energy on top performers.