Sbe10 02
Descriptive Statistics: Tabular and Graphical Presentations
• Summarizing Qualitative Data: Frequency Distribution, Relative Frequency Distribution, Bar Graph, Pie Chart
• Summarizing Quantitative Data
• Exploratory Data Analysis
• Crosstabulations and Scatter Diagrams
Relative Frequency Distributions
Bar Graph
Marada Inn Quality Ratings
[Pie chart: Poor 10%, Below Average 15%, Average 25%, Above Average 45%, Excellent 5%]

Insights Gained from the Preceding Pie Chart
• One-half of the customers surveyed gave Marada a quality rating of “above average” or “excellent” (looking at the left side of the pie). This might please the manager.
• For each customer who gave an “excellent” rating, there were two customers who gave a “poor” rating (looking at the top of the pie). This should displease the manager.
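Both insights can be checked directly from the slice percentages. The following is a minimal sketch that uses only the percentages shown on the pie chart; the variable names are illustrative.

```python
# Percent frequencies read off the Marada Inn pie chart.
ratings = {
    "Poor": 10,
    "Below Average": 15,
    "Average": 25,
    "Above Average": 45,
    "Excellent": 5,
}

# The slices of a pie chart must account for the whole sample.
assert sum(ratings.values()) == 100

# Insight 1: share of customers rating "above average" or "excellent".
favorable = ratings["Above Average"] + ratings["Excellent"]

# Insight 2: number of "poor" ratings for each "excellent" rating.
poor_per_excellent = ratings["Poor"] / ratings["Excellent"]

print(f"Above average or excellent: {favorable}%")
print(f"Poor ratings per excellent rating: {poor_per_excellent:.0f}")
```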
Summarizing Quantitative Data
Example: Hudson Auto Repair
Sample of Parts Cost for 50 Tune-ups
91 78 93 57 75 52 99 80 97 62
71 69 72 89 66 75 79 75 72 76
104 74 62 68 97 105 77 65 80 109
85 97 88 68 83 68 71 69 67 74
62 82 98 101 79 105 79 69 62 73

Guidelines for Selecting Number of Classes
• Use between 5 and 15 classes.
• Data sets with a larger number of elements usually require a larger number of classes.
• Smaller data sets usually require fewer classes.
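As an illustration of the guidelines, the sketch below tabulates the 50 parts costs into classes. The choice of six classes of width 10 starting at 50 is an assumption made for this example (it falls within the 5 to 15 range); the text above does not state which classes were used.

```python
from collections import Counter

# The 50 tune-up parts costs from the sample above.
parts_cost = [
    91, 78, 93, 57, 75, 52, 99, 80, 97, 62,
    71, 69, 72, 89, 66, 75, 79, 75, 72, 76,
    104, 74, 62, 68, 97, 105, 77, 65, 80, 109,
    85, 97, 88, 68, 83, 68, 71, 69, 67, 74,
    62, 82, 98, 101, 79, 105, 79, 69, 62, 73,
]

# Assumption: six classes. The approximate width is the range divided by the
# number of classes, rounded up to a convenient value.
num_classes = 6
approx_width = (max(parts_cost) - min(parts_cost)) / num_classes  # 9.5
class_width = 10  # assumed convenient rounding of 9.5

# Map each value to the lower limit of its class (50, 60, ..., 100) and count.
counts = Counter((x // class_width) * class_width for x in parts_cost)

n = len(parts_cost)
frequency_table = []
for lower in range(50, 110, class_width):
    freq = counts[lower]
    label = f"{lower}-{lower + class_width - 1}"
    frequency_table.append((label, freq, freq / n))

# Columns: class, frequency, relative frequency, percent frequency.
for label, freq, rel in frequency_table:
    print(f"{label:>7}  {freq:>2}  {rel:.2f}  {rel:.0%}")
```

The relative frequency column is each class frequency divided by n, and the percent frequency is the same value multiplied by 100.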
Relative Frequency Distributions
Histogram
Ogive
An ogive is a graph of a cumulative distribution.
The data values are shown on the horizontal axis.

Ogive with Cumulative Percent Frequencies
[Figure: ogive titled "Tune-up Parts Cost"; horizontal axis: Parts Cost ($), 50 to 110]
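The points of such an ogive can be computed from the raw data. The sketch below assumes class upper limits of 59, 69, ..., 109 (a hypothetical choice, not stated in the text) and reports the cumulative percent frequency at each limit; plotting these points against parts cost would give the ogive.

```python
# The 50 tune-up parts costs from the Hudson Auto Repair sample.
parts_cost = [
    91, 78, 93, 57, 75, 52, 99, 80, 97, 62,
    71, 69, 72, 89, 66, 75, 79, 75, 72, 76,
    104, 74, 62, 68, 97, 105, 77, 65, 80, 109,
    85, 97, 88, 68, 83, 68, 71, 69, 67, 74,
    62, 82, 98, 101, 79, 105, 79, 69, 62, 73,
]

# Assumed class upper limits (width-10 classes starting at 50).
upper_limits = [59, 69, 79, 89, 99, 109]

# Cumulative frequency: number of observations at or below each upper limit.
cumulative = [sum(1 for x in parts_cost if x <= u) for u in upper_limits]

# Cumulative percent frequency: cumulative count as a percentage of n.
n = len(parts_cost)
cumulative_percent = [100 * c / n for c in cumulative]

for u, c, p in zip(upper_limits, cumulative, cumulative_percent):
    print(f"<= {u:>3}: {c:>2} observations, {p:5.1f}%")
```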
The techniques of exploratory data analysis consist of simple arithmetic and easy-to-draw pictures that can be used to summarize data quickly.
One such technique is the stem-and-leaf display.
◼ A stem-and-leaf display shows both the rank order and shape of the distribution of the data.
◼ It is similar to a histogram on its side, but it has the advantage of showing the actual data values.
◼ The first digits of each data item are arranged to the left of a vertical line.
◼ To the right of the vertical line we record the last digit for each item in rank order.
◼ Each line in the display is referred to as a stem.
◼ Each digit on a stem is a leaf.
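The construction described above can be sketched in a few lines. This example applies it to the Hudson Auto Repair parts costs, taking the leading digit(s) as the stem and the last digit as the leaf; the `|` character stands in for the vertical line.

```python
from collections import defaultdict

# The 50 tune-up parts costs from the Hudson Auto Repair sample.
parts_cost = [
    91, 78, 93, 57, 75, 52, 99, 80, 97, 62,
    71, 69, 72, 89, 66, 75, 79, 75, 72, 76,
    104, 74, 62, 68, 97, 105, 77, 65, 80, 109,
    85, 97, 88, 68, 83, 68, 71, 69, 67, 74,
    62, 82, 98, 101, 79, 105, 79, 69, 62, 73,
]

# Sorting first puts the leaves on each stem in rank order.
leaves_by_stem = defaultdict(list)
for value in sorted(parts_cost):
    stem, leaf = divmod(value, 10)  # first digit(s), last digit
    leaves_by_stem[stem].append(leaf)

# One line per stem: stem, vertical line, then the leaves.
display = []
for stem in sorted(leaves_by_stem):
    leaves = " ".join(str(leaf) for leaf in leaves_by_stem[stem])
    display.append(f"{stem:>2} | {leaves}")

print("\n".join(display))
```

Turned on its side, the rows have the same shape as a histogram with classes of width 10, but every original value can still be read back from its stem and leaf.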
Stem-and-Leaf Display
Crosstabulations and Scatter Diagrams
Crosstabulation
Column totals (the frequency distribution for the home style variable): 30 20 35 15; grand total: 100
Crosstabulation: Row Percentages
(Colonial and > $99K)/(All > $99K) x 100 = (12/45) x 100

Crosstabulation: Column Percentages
(Colonial and > $99K)/(All Colonial) x 100 = (12/30) x 100
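The two calculations differ only in the denominator: the row total for a row percentage, the column total for a column percentage. A minimal sketch using the three counts quoted above (the helper name `percent_of` is illustrative):

```python
def percent_of(count, total):
    """Return count as a percentage of total."""
    return 100 * count / total

colonial_over_99k = 12  # cell count: Colonial homes priced above $99K
all_over_99k = 45       # row total: all homes priced above $99K
all_colonial = 30       # column total: all Colonial homes

row_percentage = percent_of(colonial_over_99k, all_over_99k)
column_percentage = percent_of(colonial_over_99k, all_colonial)

print(f"Row percentage:    {row_percentage:.2f}%")
print(f"Column percentage: {column_percentage:.2f}%")
```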
Data in two or more crosstabulations are often aggregated to produce a summary crosstabulation.
We must be careful in drawing conclusions about the relationship between the two variables in the aggregated crosstabulation.
Simpson's Paradox: In some cases the conclusions based upon an aggregated crosstabulation can be completely reversed if we look at the unaggregated data.

Example: Kidney stone treatment
The table below shows the success rates and numbers of treatments for treatments involving both small and large kidney stones.

                Treatment A              Treatment B
Small Stones    Group 1: 93% (81/87)     Group 2: 87% (234/270)
Large Stones    Group 3: 73% (192/263)   Group 4: 69% (55/80)
Both            78% (273/350)            83% (289/350)
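The reversal can be reproduced from the counts in the table. The sketch below computes the success rate for each stone size and for the aggregated data; the dictionary layout and function names are illustrative.

```python
# (successes, treatments) for each stone size and treatment, from the table.
results = {
    ("Small", "A"): (81, 87),
    ("Small", "B"): (234, 270),
    ("Large", "A"): (192, 263),
    ("Large", "B"): (55, 80),
}

def rate(successes, treatments):
    """Success rate as a fraction."""
    return successes / treatments

def aggregate(treatment):
    """Sum successes and treatments over both stone sizes."""
    successes = sum(s for (_, t), (s, _) in results.items() if t == treatment)
    treatments = sum(n for (_, t), (_, n) in results.items() if t == treatment)
    return successes, treatments

# Unaggregated: Treatment A has the higher rate for each stone size.
for size in ("Small", "Large"):
    a = rate(*results[(size, "A")])
    b = rate(*results[(size, "B")])
    print(f"{size} stones: A {a:.0%} vs B {b:.0%}")

# Aggregated: the comparison reverses and Treatment B looks better.
overall_a = rate(*aggregate("A"))
overall_b = rate(*aggregate("B"))
print(f"Both sizes:   A {overall_a:.0%} vs B {overall_b:.0%}")
```

The reversal arises because the two treatments were applied to very different mixes of cases: most Treatment A cases involved large stones (263 of 350), which have lower success rates under either treatment, while most Treatment B cases involved small stones (270 of 350).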
Scatter Diagram
Example: Panthers Football Team