Ex - GroupBy

Check out the Alcohol Consumption Exercises Video Tutorial to watch a data scientist go through the exercises.

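GroupBy can be summarized as Split-Apply-Combine: split the rows into groups by a key, apply a function to each group, and combine the per-group results into a single object.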
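
The three steps can also be spelled out by hand. The sketch below is only an illustration: `toy` is a small made-up frame (not the drinks data), and the manual version is compared against the `groupby` one-liner.

```python
import pandas as pd

# Made-up values for illustration; this is not the real drinks dataset
toy = pd.DataFrame({
    "continent": ["EU", "EU", "AS", "AS", "AS"],
    "beer_servings": [200, 100, 30, 60, 90],
})

# Split: one sub-frame per continent
pieces = {name: group for name, group in toy.groupby("continent")}

# Apply: compute the mean within each piece
means = {name: group["beer_servings"].mean() for name, group in pieces.items()}

# Combine: gather the per-group results into one Series
manual = pd.Series(means, name="beer_servings").sort_index()
manual.index.name = "continent"

# The one-liner performs all three steps at once
shortcut = toy.groupby("continent")["beer_servings"].mean()

print(manual)
print(shortcut)
```

Both printed Series hold the same per-continent means, which is why the exercises below use the one-liner form.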

Special thanks to: https://github.com/justmarkham for sharing the dataset and materials.

In [2]:

import pandas as pd

In [4]:

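# keep_default_na=False keeps the continent code 'NA' (North America) as a string instead of parsing it as missing
drinks = pd.read_csv('https://raw.githubusercontent.com/justmarkham/DAT8/master/data/drinks.csv', keep_default_na=False)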
drinks.head()

Out[4]:
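
Why `keep_default_na=False` matters here: by default, `read_csv` treats the string `NA` as a missing value, and `groupby` drops missing keys, so one continent would silently disappear from every result. A minimal sketch, using an inline two-row CSV that is made up for illustration (`csv_text`, `with_default` and `kept` are hypothetical names):

```python
import io
import pandas as pd

# Tiny made-up CSV; 'NA' stands for a continent code, as in the drinks data
csv_text = "country,continent\nA,NA\nB,EU\n"

# Default parsing: the string 'NA' becomes a missing value (NaN)
with_default = pd.read_csv(io.StringIO(csv_text))

# keep_default_na=False: 'NA' stays an ordinary string
kept = pd.read_csv(io.StringIO(csv_text), keep_default_na=False)

print(with_default["continent"].isna().sum())       # number of missing continents
print(kept["continent"].tolist())                   # continent codes as read
print(with_default.groupby("continent").size())     # the NaN group is dropped
```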

In [5]:

drinks.groupby('continent').beer_servings.mean()

Out[5]:

continent
AF     61.471698
AS     37.045455
EU    193.777778
NA    145.434783
OC     89.687500
SA    175.083333
Name: beer_servings, dtype: float64

In [6]:

drinks.groupby('continent').wine_servings.describe()

Out[6]:
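
For a numeric column, `describe()` on a grouped Series returns one row per group and one column per summary statistic. A small sketch on made-up values (the `toy` frame is hypothetical, not the drinks data) shows the shape of the result:

```python
import pandas as pd

# Made-up values for illustration only
toy = pd.DataFrame({
    "continent": ["EU", "EU", "AS", "AS"],
    "wine_servings": [300, 100, 0, 10],
})

summary = toy.groupby("continent")["wine_servings"].describe()

# One row per continent; columns are the summary statistics
print(list(summary.columns))
print(summary.loc["EU", "mean"])
```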

In [7]:

drinks.groupby('continent').mean(numeric_only=True)

Out[7]:

In [8]:

drinks.groupby('continent').median(numeric_only=True)

Out[8]:
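
Both cells above pass `numeric_only=True` because the frame also contains a text column (`country`), which has no meaningful mean or median. The sketch below, on a made-up frame with the same kind of columns, shows that only the numeric columns appear in the result; in recent pandas versions, omitting the argument with a text column present is expected to raise an error rather than skip it.

```python
import pandas as pd

# Made-up values for illustration; 'country' is a text column
toy = pd.DataFrame({
    "country": ["A", "B", "C", "D"],
    "continent": ["EU", "EU", "AS", "AS"],
    "beer_servings": [120, 80, 70, 80],
    "wine_servings": [370, 230, 10, 10],
})

# numeric_only=True restricts the aggregation to numeric columns
means = toy.groupby("continent").mean(numeric_only=True)
medians = toy.groupby("continent").median(numeric_only=True)

print(means)
print(medians)
```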

In [9]:

drinks.groupby('continent').spirit_servings.agg(['mean', 'min', 'max'])

Out[9]:
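
Passing a list to `agg` names the output columns after the functions. If custom column names are preferred, pandas also supports named aggregation via keyword arguments. A minimal sketch on made-up values (the names `avg`, `lowest` and `highest` are arbitrary choices, not part of the exercise):

```python
import pandas as pd

# Made-up values for illustration only
toy = pd.DataFrame({
    "continent": ["EU", "EU", "AS", "AS"],
    "spirit_servings": [100, 200, 0, 50],
})

# Keyword form: output_column_name="aggregation function"
stats = toy.groupby("continent")["spirit_servings"].agg(
    avg="mean", lowest="min", highest="max"
)

print(stats)
```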
