Graphical Presentation - 2017
Graphical Presentation - 2017
Graphical Presentation - 2017
Graphical presentation
presentation
OBJECTIVES
OBJECTIVES
1. Distributions
2. Graphical Presentation of data
2.1 Histogram
2.2 Dotplot
2.3 Stem-and-Leaf Display (Stemplot)
3. Quartiles and Interquartile Range
4. Five-number Summary (Boxplot)
6. Describing main features of a distribution
Looking
Looking at
at Data
Data
Raw Graph
RawData
Data Graph
Question
Question Collect Present
Collect Organise
Organise Present Draw
Draw
to
tobe
be data data
data data
data data conclusion
conclusion
addressed
addressed
3
1.
1. Distributions
Distributions
4
2.
2. Frequency
Frequency Distribution
Distribution
5
2.1
2.1 Pie-chart
Pie-chart
A graph for the relative frequencies for a categorical
variable.
•It is a circle, whose sectors constitute the different
categories of the variable.
•The relative frequency determines the size of the sector
17%
23%
single
married
others
60%
6
2.2
2.2 Grouped
Grouped Frequency
Frequency
Distribution
Distribution
Useful for measurement data (continuous mainly).
Example: Systolic blood pressure of nonsmokers (mm/Hg)
for 20 persons from the Honolulu Heart Study
102 190 122 116 116 136 118 134 178 162
120 138 126 176 104 140 102 142 146 112
8
44 Graphical
Graphical Presentation
Presentation of
of Data
Data --
Dotplot
Dotplot
Non-smokers
138 128 112 128 134 104 152 134 132 130
118 108 108 128 134 162 98 144 118 118
10
Comparing two samples for differences
• List the stems vertically (consists of all but the final digit)
• Attach the leaves to the stems (leaves are the final digit)
• Rewrite the stems and put leaves in ascending order from left
to right
Stem-and-leaf of Smokers N = 20
Leaf Unit = 1.0
STEM LEAF
10 224
11 2668
12 026
13 468
14 026
15
16 2
17 68
18
19 0 13
5.2 Back-to-back Stem & Leaf plots
sorted smokers
102 102 104 112 116 116 118 120 122 126
134 136 138 140 142 146 162 176 178 190
sorted non-smokers
98 104 108 108 112 118 118 118 128 128
128 130 132 134 134 134 138 144 152 162
8 9
10 224
884 11 2668
8882 12 026
888 13 468
84420 14 026
4 15
2 16 2
2 17 68
18
19 0
No-smokers Smokers
14
66 Boxplot
Boxplot
Based on the Five Number Summary (What are these numbers)
Q1 Q3
Minimum Maximum
Value Value
Median
15
6.1.
6.1. Boxplot
Boxplot
• Strengths of Box-plots
• The distribution of the data is easily seen
• Outliers can easily be picked
• Powerful in comparing distribution of a
variable across different groups (e.g. the
gene expressions)
16
6.2. Constructing
6.2. Constructing Box-plots
Box-plots
.
• A box-plot is a visual description of the
distribution based on the five number
summary, which are the
– Minimum
– Q1
– Median
– Q3
– Maximum
6.3. .
6.3. Example 11
Example
Q168.5, Q3=77.5
whisker
Pulse rate
outliers
7.
7. Two
Two continuous
continuous
variables-scatter
variables-scatter plot
plot
• Displays the relationship between two continuous
variables
22
Age versus Systolic Blood Pressure
in a Clinical Trial
23
9.
9. Main
Main features
features of
of aa
distribution
distribution
24
Distribution shapes
25
EXERCISE
EXERCISE 11
What can you say about the distribution?
27