iQRM Warm Up Week 5 February 17 Corrected
iQRM Warm Up Week 5 February 17 Corrected
iQRM Warm Up Week 5 February 17 Corrected
1
iQRM Warm-Up: Last Week Review
• Probability
o Conditional probability of A given B is the probability of event A given that
event B occurred: P(A)=P(A&B)/P(B)
o Law of Total Probability: 𝑃 𝐴 = σ 𝑃 𝐴 𝐵 𝑃(𝐵)
o Independent events A and B: P(A)=P(A|B)
o Tree diagrams are useful to visualize conditional probability
• Random Variables and Probability Distributions
o Discreet RV: PMF
o Continuous RV: PDF
o Cumulative Distribution Function CMF: 𝐹𝑋 𝑥 = 𝑃(𝑋 ≤ 𝑥)
o Distributions: Normal, Poisson, Lognormal, Binomial correspond to RV of
different nature
2
iQRM Warm-Up: Today’s Topics
3
Measures of Central Tendency – Arithmetic Mean
4
What if the values are not equally likely?
3 + 1 + 1 +0 + 0 + ⋯+ 0
𝜇= ≈ 0.178
28
5
Measures of Central Tendency – Expected Value
3 + 1 + 1 +0 + 0 + ⋯+ 0 1 2 25
𝜇= = ∙3+ ∙1+ ∙ 0 ≈ 0.178
28 28 28 28
• Weighted average!
6
Measures of Central Tendency – Expected Value
𝐸 𝑋 = න 𝑥𝑓𝑋 𝑥 𝑑𝑥
7
Measures of Central Tendency – Median and Mode
• Median - the value separating the upper half from the lower half
of a data sample, a population, or a probability distribution.
• Examples:
• 5, 6, 8, 10, 12, 15, 20 – in case of odd number of data points, the
median is the middle number
• 5, 6, 8, 10, 12, 15, 20, 24 – in case of even number of data
points the median is the average between the two middle
numbers: 11
8
Measures of Central Tendency – Median and Mode
• Examples:
• 5, 6, 6, 10, 12, 15, 20 – mode is 6
• Data sets can be multi-modal: 5, 6, 6, 10, 10, 15, 20 – modes
are 6 and 10
9
Measures of Variation
10
Measures of Variation – Variance
11
Measures of Variation - Variance
• Question 1:
• Why does the distribution
with smaller variance
have higher peak than the
one with larger variance?
• Answer:
• Because the entire area
(integral) should be equal
to 1
12
Measures of Variation - Variance
• Question 2:
• Why can’t we simply find
the average deviation?
• Answer:
• Because the average
deviation is 0!
13
Properties of Expected Value and Variance
• 𝐸 𝑎𝑋 + 𝑏𝑌 = 𝑎𝐸 𝑋 + 𝑏𝐸(𝑌)
• Example: expected value of the sum of two dice
• Var 𝑎𝑋 = 𝑎2 𝑉𝑎𝑟(𝑋)
(if all values of the distribution are multiplied by 𝑎, squared devitions
will be multiplied by 𝑎2 )
15
Population vs Sample
16
Population vs Sample
17
Measures of Variation – Standard Deviation
18
Parameters vs Statistics
20
Other Measures – Skewness and Kurtosis
21
Other Measures – Skewness and Kurtosis
• Positive skew:
mean > median
• Negative skew:
mean < median
22
Other Measures – Skewness and Kurtosis
23
Excel Functions
24
Probability Distributions – Normal Distribution
26
Normal Distribution - Definition
27
Normal Distribution – Probability Density Function
1 1 𝑥−𝜇 2
−
𝑓 𝑥 = 𝑒 2 𝜎
𝜎 2𝜋
28
Normal Distribution
30
What percent of the data lies within 1, 2 or 3
standard deviations of the mean?
31
What percent of the data lies within 1, 2 or 3
standard deviations of the mean?
32
Normally Distributed Data
33
Applying Normal Distribution
• Example:
• Although there is no law for IQ cutoff to become a police officer, in
some states the police departments are allowed not to hire people
who are too intelligent. Suppose in the Happytown Police
Department the cutoff IQ score is 2 standard deviations above the
population mean. Among 1000 people chosen at random,
approximately how many will NOT get hired because they are too
smart?
36
Next Time
• Hypothesis testing
37
Thank you!
38
Thank you! Any questions?