Book a Demo!
CoCalc Logo Icon
StoreFeaturesDocsShareSupportNewsAboutPoliciesSign UpSign In
rasbt
GitHub Repository: rasbt/machine-learning-book
Path: blob/main/ch07/wine.names.txt
1245 views
1
1. Title of Database: Wine recognition data
2
Updated Sept 21, 1998 by C.Blake : Added attribute information
3
4
2. Sources:
5
(a) Forina, M. et al, PARVUS - An Extendible Package for Data
6
Exploration, Classification and Correlation. Institute of Pharmaceutical
7
and Food Analysis and Technologies, Via Brigata Salerno,
8
16147 Genoa, Italy.
9
10
(b) Stefan Aeberhard, email: [email protected]
11
(c) July 1991
12
3. Past Usage:
13
14
(1)
15
S. Aeberhard, D. Coomans and O. de Vel,
16
Comparison of Classifiers in High Dimensional Settings,
17
Tech. Rep. no. 92-02, (1992), Dept. of Computer Science and Dept. of
18
Mathematics and Statistics, James Cook University of North Queensland.
19
(Also submitted to Technometrics).
20
21
The data was used with many others for comparing various
22
classifiers. The classes are separable, though only RDA
23
has achieved 100% correct classification.
24
(RDA : 100%, QDA 99.4%, LDA 98.9%, 1NN 96.1% (z-transformed data))
25
(All results using the leave-one-out technique)
26
27
In a classification context, this is a well posed problem
28
with "well behaved" class structures. A good data set
29
for first testing of a new classifier, but not very
30
challenging.
31
32
(2)
33
S. Aeberhard, D. Coomans and O. de Vel,
34
"THE CLASSIFICATION PERFORMANCE OF RDA"
35
Tech. Rep. no. 92-01, (1992), Dept. of Computer Science and Dept. of
36
Mathematics and Statistics, James Cook University of North Queensland.
37
(Also submitted to Journal of Chemometrics).
38
39
Here, the data was used to illustrate the superior performance of
40
the use of a new appreciation function with RDA.
41
42
4. Relevant Information:
43
44
-- These data are the results of a chemical analysis of
45
wines grown in the same region in Italy but derived from three
46
different cultivars.
47
The analysis determined the quantities of 13 constituents
48
found in each of the three types of wines.
49
50
-- I think that the initial data set had around 30 variables, but
51
for some reason I only have the 13 dimensional version.
52
I had a list of what the 30 or so variables were, but a.)
53
I lost it, and b.), I would not know which 13 variables
54
are included in the set.
55
56
-- The attributes are (dontated by Riccardo Leardi,
57
[email protected] )
58
1) Alcohol
59
2) Malic acid
60
3) Ash
61
4) Alcalinity of ash
62
5) Magnesium
63
6) Total phenols
64
7) Flavanoids
65
8) Nonflavanoid phenols
66
9) Proanthocyanins
67
10)Color intensity
68
11)Hue
69
12)OD280/OD315 of diluted wines
70
13)Proline
71
72
5. Number of Instances
73
74
class 1 59
75
class 2 71
76
class 3 48
77
78
6. Number of Attributes
79
80
13
81
82
7. For Each Attribute:
83
84
All attributes are continuous
85
86
No statistics available, but suggest to standardise
87
variables for certain uses (e.g. for us with classifiers
88
which are NOT scale invariant)
89
90
NOTE: 1st attribute is class identifier (1-3)
91
92
8. Missing Attribute Values:
93
94
None
95
96
9. Class Distribution: number of instances per class
97
98
class 1 59
99
class 2 71
100
class 3 48
101
102