Descriptive Statistics
What do the numbers tell?
Data
  Categorical (Qualitative)
    E.g. Gender, Location of store, Preference
  Numerical (Quantitative)
    Discrete
      E.g. Family size, Number of rooms in a hotel, Number of credit cards issued
    Continuous
      E.g. Waiting time, Length of a part produced
https://economictimes.indiatimes.com/articleshow/52450273.cms?utm_source=contentofinterest&utm_medium=text&utm_campaign=cppst
This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
US median household income climbs to new high
If you're a member of the middle class, chances are things are looking up. Median
household income reached a record $61,372 in 2017, up 1.8 percent from $60,309
in 2016. This marks the third year in a row that median household income has
gone up, according to the U.S. Census Bureau, which compiled the data.
https://www.cnbc.com/2018/09/12/median-household-income-climbs-to-new-high-of-61372.html
• 340 340 350 350 340 340 320 340 330 330
• Mean: affected by extreme values.
• Median: not affected by extreme values.
• Mode: not affected by extreme values.
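As a rough sketch, the three measures of central tendency can be computed for the ten values listed above with Python's standard `statistics` module. The extra value 3400 is a hypothetical outlier added only to illustrate sensitivity to extreme values; it is not part of the source data.

```python
import statistics

# The ten values from the slide
values = [340, 340, 350, 350, 340, 340, 320, 340, 330, 330]

mean_value = statistics.mean(values)
median_value = statistics.median(values)
mode_value = statistics.mode(values)
print("mean:", mean_value, "median:", median_value, "mode:", mode_value)

# Hypothetical extreme value, added for illustration only
with_outlier = values + [3400]

mean_with_outlier = statistics.mean(with_outlier)
median_with_outlier = statistics.median(with_outlier)
mode_with_outlier = statistics.mode(with_outlier)
print("with outlier -> mean:", round(mean_with_outlier, 1),
      "median:", median_with_outlier, "mode:", mode_with_outlier)
```

In this sketch the mean shifts noticeably once the extreme value is added, while the median and mode stay at 340.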
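• IQR = Q3 – Q1
• Data set: 12, 14, 11, 18, 11.5, 12, 14, 11, 9
• Arranging in ascending order, the data set becomes
9, 11, 11, 11.5, 12, 12, 14, 14, 18
• The median is 12 (the 5th of the 9 values). Q1 = 11 is the median of the four values below it, and Q3 = 14 is the median of the four values above it.
IQR = Q3 – Q1 = 14 – 11 = 3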
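The IQR calculation above can be reproduced with a short sketch. It assumes the "exclusive" quartile convention of `statistics.quantiles` (quartiles located at positions (n + 1) × p in the ordered data), which agrees with the slide's result for this data set; other quartile conventions can give slightly different values on other data.

```python
import statistics

data = [12, 14, 11, 18, 11.5, 12, 14, 11, 9]
ordered = sorted(data)

# n=4 splits the data into quartiles; the three cut points are Q1, Q2 (median), Q3
q1, q2, q3 = statistics.quantiles(ordered, n=4, method="exclusive")
iqr = q3 - q1

print("ordered:", ordered)
print("Q1:", q1, "median:", q2, "Q3:", q3, "IQR:", iqr)
```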
Proprietary content. ©Great Learning. All Rights Reserved. Unauthorized use or distribution prohibited.
Standard deviation
• Standard deviation forms the cornerstone of inferential
statistics.
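As a minimal sketch, the standard deviation of the ten values listed earlier in this section (340, 340, 350, ...) can be computed with Python's `statistics` module. The choice of that data set is an assumption made for illustration. `stdev` divides by n − 1 (sample data) and `pstdev` divides by n (population data).

```python
import statistics

values = [340, 340, 350, 350, 340, 340, 320, 340, 330, 330]

# Sample standard deviation: squared deviations divided by (n - 1)
sample_sd = statistics.stdev(values)

# Population standard deviation: squared deviations divided by n
population_sd = statistics.pstdev(values)

print("sample SD:", round(sample_sd, 2))
print("population SD:", round(population_sd, 2))
```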
A histogram (also known as a frequency histogram) is a snapshot of the frequency distribution.
It is a graphical representation of the frequency distribution in which the X-axis represents the
classes and the Y-axis represents the frequencies, drawn as bars.
A histogram depicts the pattern of the distribution emerging from the characteristic being measured.
Coefficient of Variation
• In symbolic form
CV = S/X̄ for the sample data and CV = σ/µ for the population
data.
Example
• Additional information
Sales Team 1
• Mean: 70 units
Sales Team 2
• Mean: 120 units
• Standard Deviation: 12 units
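As a worked sketch of the coefficient of variation, the figures given in this section for Sales Team 2 (mean 120 units, standard deviation 12 units) can be plugged into CV = standard deviation / mean. The function name `coefficient_of_variation` is a hypothetical helper. No standard deviation for Sales Team 1 appears in this section, so its CV is not computed here.

```python
def coefficient_of_variation(standard_deviation, mean):
    """Return the standard deviation as a fraction of the mean."""
    if mean == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return standard_deviation / mean


# Sales Team 2: mean 120 units, standard deviation 12 units
team2_cv = coefficient_of_variation(12, 120)
print(f"Sales Team 2 CV: {team2_cv:.0%}")
```

Because the CV is a ratio, it has no units, which is what allows teams with different average sales to be compared on relative variability.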
• Most companies now recognize the power of data in making crucial
business decisions. For an insurance company, it is especially important to
study various attributes of its customers. Leveraging this customer
information to make business decisions can provide a competitive edge
over other players in the market.
• We are provided with customer data from an insurance company, including age,
gender, BMI and the medical charges billed by the insurance company. We need to
explore this data to see whether we can derive meaningful insights from it.
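A first pass at such an exploration might look like the sketch below. The four records and the field names (`age`, `gender`, `bmi`, `charges`) are hypothetical placeholders for illustration only, not values from the actual data set. The idea is simply that numerical attributes are summarized with measures such as the mean and median, and categorical attributes with counts.

```python
import statistics
from collections import Counter

# Hypothetical records for illustration only; not taken from the real data
customers = [
    {"age": 25, "gender": "female", "bmi": 22.5, "charges": 3200.0},
    {"age": 41, "gender": "male", "bmi": 30.1, "charges": 7800.0},
    {"age": 58, "gender": "female", "bmi": 27.4, "charges": 12500.0},
    {"age": 33, "gender": "male", "bmi": 24.8, "charges": 4100.0},
]


def summarize(records, field):
    """Summarize one numerical field across all records."""
    field_values = [record[field] for record in records]
    return {
        "mean": statistics.mean(field_values),
        "median": statistics.median(field_values),
        "min": min(field_values),
        "max": max(field_values),
    }


# Numerical attribute: summary statistics
charges_summary = summarize(customers, "charges")

# Categorical attribute: frequency counts
gender_counts = Counter(record["gender"] for record in customers)

print("charges:", charges_summary)
print("gender:", dict(gender_counts))
```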