GitHub Repository: YStrano/DataScience_GA
Path: blob/master/lessons/lesson_08/assets/dataset/titanic.txt
VARIABLE DESCRIPTIONS:
survival        Survival
                (0 = No; 1 = Yes)
pclass          Passenger Class
                (1 = 1st; 2 = 2nd; 3 = 3rd)
name            Name
sex             Sex
age             Age
sibsp           Number of Siblings/Spouses Aboard
parch           Number of Parents/Children Aboard
ticket          Ticket Number
fare            Passenger Fare
cabin           Cabin
embarked        Port of Embarkation
                (C = Cherbourg; Q = Queenstown; S = Southampton)
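
The coded variables above (survival, pclass, embarked) can be turned back into readable labels with simple lookup tables. The following is a minimal sketch, not part of the dataset: the names SURVIVAL, PCLASS, EMBARKED and decode_row are hypothetical, and it assumes each record is a dict keyed by the variable names exactly as listed above (a given CSV copy may spell or capitalise its headers differently).

```python
# Lookup tables copied from the variable descriptions above.
# All names in this block are illustrative assumptions.
SURVIVAL = {0: "No", 1: "Yes"}
PCLASS = {1: "1st", 2: "2nd", 3: "3rd"}
EMBARKED = {"C": "Cherbourg", "Q": "Queenstown", "S": "Southampton"}


def decode_row(row):
    """Return a copy of `row` with coded fields replaced by their labels.

    `row` is assumed to be a dict keyed by the variable names listed above,
    with values as strings (as csv.DictReader would produce). Missing or
    unrecognised codes are left unchanged.
    """
    decoded = dict(row)
    for key, table, cast in (
        ("survival", SURVIVAL, int),
        ("pclass", PCLASS, int),
        ("embarked", EMBARKED, str),
    ):
        raw = decoded.get(key)
        if raw is None or raw == "":
            continue  # nothing to decode
        try:
            decoded[key] = table.get(cast(raw), raw)
        except ValueError:
            pass  # value could not be cast to the code type; keep it as-is
    return decoded


example = {"survival": "1", "pclass": "3", "embarked": "Q", "sex": "female"}
print(decode_row(example))
```

Fields without a code table (name, sex, age, and so on) pass through untouched, so the function can be applied to whole records.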

SPECIAL NOTES:
Pclass is a proxy for socio-economic status (SES)
 1st ~ Upper; 2nd ~ Middle; 3rd ~ Lower

Age is in Years; Fractional if Age is less than One (1)
 If the Age is Estimated, it is in the form xx.5
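
The two age conventions above can be checked mechanically. The sketch below is one possible reading, not an official rule: describe_age is a hypothetical helper, and it assumes that any value below one is a fractional infant age (so 0.5 is treated as an infant, not as an estimate) and that only values of one or more ending in .5 are estimates.

```python
def describe_age(age):
    """Classify an age value using the conventions in the notes above.

    Returns "missing" when no age is given, "infant" for a fractional age
    under one year, "estimated" for an age of one or more in the form xx.5,
    and "recorded" otherwise. The category names are illustrative.
    """
    if age is None or age == "":
        return "missing"
    value = float(age)
    if value < 1:
        return "infant"
    # xx.5 pattern: the fractional part is exactly one half
    if value % 1 == 0.5:
        return "estimated"
    return "recorded"


for sample in ("28.5", "0.42", "30", ""):
    print(repr(sample), "->", describe_age(sample))
```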

With respect to the family relation variables (i.e. sibsp and parch),
some relations were ignored. The following are the definitions used
for sibsp and parch.

Sibling:  Brother, Sister, Stepbrother, or Stepsister of Passenger Aboard Titanic
Spouse:   Husband or Wife of Passenger Aboard Titanic (Mistresses and Fiances Ignored)
Parent:   Mother or Father of Passenger Aboard Titanic
Child:    Son, Daughter, Stepson, or Stepdaughter of Passenger Aboard Titanic

Other family relatives excluded from this study include cousins,
nephews/nieces, aunts/uncles, and in-laws. Some children travelled
only with a nanny, therefore parch=0 for them. Some passengers also
travelled with very close friends or neighbors in a village; however,
the definitions do not support such relations.
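
To make the sibsp and parch definitions concrete, the sketch below counts both values from a list of relation labels for the people travelling with one passenger. This is only an illustration of the definitions: the dataset stores the counts, not the underlying relations, and the label strings and the count_relations helper are hypothetical.

```python
# Relations that count toward each variable, per the definitions above.
SIBSP_RELATIONS = {"brother", "sister", "stepbrother", "stepsister",
                   "husband", "wife"}
PARCH_RELATIONS = {"mother", "father",
                   "son", "daughter", "stepson", "stepdaughter"}


def count_relations(companions):
    """Return (sibsp, parch) for one passenger.

    `companions` is an iterable of relation labels describing the people
    aboard with the passenger. Any label outside the two sets above
    (cousin, nanny, friend, fiance, in-law, ...) is ignored.
    """
    sibsp = 0
    parch = 0
    for relation in companions:
        label = relation.strip().lower()
        if label in SIBSP_RELATIONS:
            sibsp += 1
        elif label in PARCH_RELATIONS:
            parch += 1
    return sibsp, parch


# A child travelling only with a nanny has no counted relations.
print(count_relations(["nanny"]))
# A wife and a son count; a cousin and a fiance do not.
print(count_relations(["wife", "son", "cousin", "fiance"]))
```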