A Preliminary Exploration of The Data To Better Understand Its Characteristics
[Figure residue: percentile labels (10th, 25th, 50th, 75th); axis labels: Celsius, standard deviation]
© Tan, Steinbach, Kumar Introduction to Data Mining 8/05/2005
Visualization of the Iris Correlation Matrix
• Parallel Coordinates
– Used to plot the attribute values of high-dimensional data
– Instead of using perpendicular axes, use a set of parallel axes
– The attribute values of each object are plotted as a point on each corresponding coordinate axis, and the points are connected by a line
– Thus, each object is represented as a line
– Often, the lines representing a distinct class of objects group together, at least for some attributes
– Ordering of attributes is important in seeing such groupings
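
The construction described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to produce the slides' figures. The six rows are a small sample of Iris measurements (two per class), and the names `to_polylines` and `plot_parallel` are hypothetical helpers. Min-max scaling of each attribute to [0, 1] is an added assumption so that all axes share one vertical range. The `order` argument controls the left-to-right ordering of the parallel axes.

```python
from typing import List, Sequence, Tuple

# Attribute names and a small sample of Iris rows (cm), two per class.
NAMES = ["sepal length", "sepal width", "petal length", "petal width"]
ROWS = [
    (5.1, 3.5, 1.4, 0.2),  # setosa
    (4.9, 3.0, 1.4, 0.2),  # setosa
    (7.0, 3.2, 4.7, 1.4),  # versicolor
    (6.4, 3.2, 4.5, 1.5),  # versicolor
    (6.3, 3.3, 6.0, 2.5),  # virginica
    (5.8, 2.7, 5.1, 1.9),  # virginica
]
LABELS = ["setosa", "setosa", "versicolor", "versicolor", "virginica", "virginica"]


def to_polylines(
    rows: Sequence[Sequence[float]], order: Sequence[int]
) -> List[List[Tuple[int, float]]]:
    """Turn each object into one polyline of (axis position, scaled value).

    Assumption: each attribute is min-max scaled to [0, 1] so that the
    parallel axes are comparable; the slides do not prescribe a scaling.
    """
    columns = list(zip(*rows))
    lo = [min(c) for c in columns]
    hi = [max(c) for c in columns]
    lines = []
    for row in rows:
        line = []
        for x, j in enumerate(order):  # x = position of the axis, j = attribute
            span = hi[j] - lo[j]
            y = (row[j] - lo[j]) / span if span else 0.5
            line.append((x, y))
        lines.append(line)
    return lines


def plot_parallel(rows, labels, names, order):
    """Draw one line per object, colored by class label."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for drawing

    colors = {}
    fig, ax = plt.subplots()
    for line, label in zip(to_polylines(rows, order), labels):
        color = colors.setdefault(label, f"C{len(colors)}")
        xs, ys = zip(*line)
        ax.plot(xs, ys, color=color)
    ax.set_xticks(list(range(len(order))))
    ax.set_xticklabels([names[j] for j in order])
    ax.set_ylabel("min-max scaled value")
    return ax


if __name__ == "__main__":
    import matplotlib.pyplot as plt

    # Try a different attribute ordering to see how the class groupings change.
    plot_parallel(ROWS, LABELS, NAMES, order=[0, 1, 2, 3])
    plt.show()
```

Passing a different `order`, for example `[2, 3, 0, 1]`, redraws the same objects with the axes rearranged, which is a quick way to check how much the visible groupings depend on attribute ordering.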
Parallel Coordinates Plots for Iris Data