Kernel: Python 3
Analyze given data set
Check data shape
Describe data
Check for null values and drop null values or replace it
Drop colums that you find unnecessary
What is revenue distribution for Bekins in 2005
What is the growth% for Data/Technology in 2006
What is the Revenue distribution for Education in 2005
What is number of companies in Each industry
Draw a correlation plot to check on feature realtionship.
In [1]:
In [2]:
In [ ]:
In [3]:
Out[3]:
In [4]:
Out[4]:
In [5]:
Out[5]:
(529, 18)
In [6]:
Out[6]:
In [7]:
In [8]:
Out[8]:
Index(['ID', 'Name', 'Industry', 'Year Founded', 'Employees', 'State', 'City',
'Zip Code', '2005 Revenue', '2005 Expenses', '2005 Growth%',
'2004 Revenue', '2004 Expenses', '2004 Growth%', '2003 Revenue',
'2003 Expenses'],
dtype='object')
In [9]:
Out[9]:
In [10]:
Out[10]:
ID 0
Name 0
Industry 0
Year Founded 1
Employees 0
State 0
City 33
Zip Code 37
2005 Revenue 0
2005 Expenses 0
2005 Growth% 0
2004 Revenue 0
2004 Expenses 0
2004 Growth% 41
2003 Revenue 41
2003 Expenses 394
dtype: int64
In [11]:
In [12]:
In [13]:
In [14]:
In [15]:
In [16]:
What is the growth% for Data/Technology in 2005
In [17]:
Out[17]:
2.952745423823052
What is the Revenue distribution for Education in 2005
In [36]:
Out[36]:
What is number of companies in Each industry
In [19]:
Out[19]:
Industry
Aerospace and Defense 1
Business & Legal Services 45
Data/Technology 98
Education 19
Energy 28
Environment & Weather 12
Finance & Investment 75
Food & Agriculture 6
Geospatial/Mapping 30
Governance 43
Healthcare 40
Housing/Real Estate 21
Insurance 11
Lifestyle & Consumer 25
Media 1
Research & Consulting 28
Scientific Research 17
Software 1
Transportation 28
Name: ID, dtype: int64
Draw a correlation plot to check on feature realtionship
In [20]:
Out[20]:
In [21]:
Out[21]:
Index(['ID', 'Employees', '2005 Revenue', '2005 Expenses', '2005 Growth%',
'2004 Revenue', '2004 Expenses'],
dtype='object')
In [22]:
In [23]:
In [24]:
Out[24]:
In [29]:
Out[29]:
<matplotlib.axes._subplots.AxesSubplot at 0x21dd9cec288>
In [ ]:
In [ ]:
In [ ]: