Quartiles
Quartiles
Quartiles
"quartiles" are values that divide a data set into four equal parts. There are three quartiles denoted
as Q1, Q2, and Q3:
Q1 (First Quartile): This is the value below which the lowest 25% of the data falls. It's also
known as the 25th percentile.
Q2 (Second Quartile): This is the median of the data set, which is the middle value when the data
set is ordered. It's also the 50th percentile.
Q3 (Third Quartile): This is the value below which the lowest 75% of the data falls. It's also
known as the 75th percentile.
These quartiles are useful for understanding the distribution of data, particularly in box plots
where they are often represented as points on a number line. They help identify the spread and
central tendency of a dataset.
Example
18,20,22,25,28,30,32,35,38,40,42,45,48,50
Q1 (First Quartile): This divides the lowest 25% of the data. To find Q1, we take the median of
the lower half of the data.
The lower half of the data is: 18,20,22,25,28,30
The median of this lower half is 22+252=23.5222+25=23.5.
So, Q1=23.5.
Q2 (Second Quartile): This is the median of the entire dataset, which is 28.
Q3 (Third Quartile): This divides the lowest 75% of the data. To find Q3, we take the median of
the upper half of the data.
The upper half of the data is: 32,35,38,40,42,45,48,50
The median of this upper half is 40+422=41240+42=41.
So, Q3=41.
In summary, for this dataset:
Q1 = 23.5
Q2 = 28
Q3 = 41
These quartiles give us insights into the spread and central tendency of the ages in this group. For
example, we know that the middle 50% of ages fall between 23.5 and 41, and the median age is
28.
Deciles
a "decile" is a value that divides a dataset into ten equal parts. There are nine deciles denoted as
D1, D2, ..., D9.
D1 (First Decile): This is the value below which the lowest 10% of the data falls. It's also known
as the 10th percentile.
D2 (Second Decile): This is the value below which the lowest 20% of the data falls. It's also
known as the 20th percentile.
D3 (Third Decile): This is the value below which the lowest 30% of the data falls. It's also
known as the 30th percentile.
And so on...
D9 (Ninth Decile): This is the value below which the lowest 90% of the data falls. It's also
known as the 90th percentile.
Deciles are similar to quartiles, but they provide more granularity in dividing the data. They are
used to understand the distribution of data in finer detail than quartiles. Deciles can be helpful in
situations where more specific information about the spread and distribution of the data is
required.
Example
consider a dataset representing the scores of a group of students on a test:
45,50,55,60,65,70,75,80,85,90,9545,50,55,60,65,70,75,80,85,90,95
To find the deciles, we first need to sort the data in ascending order:
45,50,55,60,65,70,75,80,85,90,9545,50,55,60,65,70,75,80,85,90,95
Now, we have 11 data points, which means there will be 10 deciles.
D1 (First Decile): This divides the lowest 10% of the data. To find D1, we take the value at the
10th percentile position.
D1=10%×(n+1) =10%×12=1.2
Since 1.2 is not a whole number, we take an average of the first and second value in the sorted
data.
D1= (245+50)/2=47.5
D2 (Second Decile): This divides the lowest 20% of the data. To find D2, we take the value at
the 20th percentile position.
D2=20%×(n+1 )=20%×12=2.4
Again, since 2.4 is not a whole number, we take an average of the second and third value in the
sorted data.
D2= (250+55)/2=52.5
And so on, until we reach the ninth decile.
D9 (Ninth Decile): This divides the lowest 90% of the data. To find D9, we take the value at the
90th percentile position.
D9=90%×(n+1) =90%×12=10.8
Since 10.8 is not a whole number, we take an average of the tenth and eleventh value in the
sorted data.
D9= (290+95)/2=92.5
In summary, for this dataset:
D1 = 47.5
D2 = 52.5
D3 = 57.5
D4 = 62.5
D5 = 67.5
D6 = 72.5
D7 = 77.5
D8 = 82.5
D9 = 92.5
These deciles provide more detailed information about how the scores are distributed throughout
the dataset.
Percentiles
In statistics, a "percentile" is a measure that indicates the relative standing of a particular value
within a dataset. It represents the percentage of scores that fall below or equal to a given value.
For example, the 50th percentile (also known as the median) is the value below which 50% of
the data falls. Similarly, the 25th percentile (Q1) is the value below which 25% of the data falls,
and the 75th percentile (Q3) is the value below which 75% of the data falls.
step-by-step process to calculate percentiles:
Sort the Data: Arrange the data points in ascending order.
Identify the Position: Determine the position of the percentile value using the following formula:
Position=p/100×(n+1)
where p is the desired percentile (e.g., 25 for the 25th percentile), and n is the total number of
data points.
Interpolation (if necessary): If the position is not a whole number, you may need to use
interpolation to estimate the percentile. This is more common when working with discrete data.
Percentiles are very useful for understanding the spread and distribution of a dataset. They can
help identify outliers, evaluate the central tendency, and compare individual data points to the
overall dataset.
For example, if a student scores in the 80th percentile on a standardized test, it means they
performed better than approximately 80% of the students who took the test.
Example
Let's use a dataset of exam scores to illustrate percentiles. Suppose we have the following scores:
60,65,70,75,80,85,90,95,100,105,11060,65,70,75,80,85,90,95,100,105,110
We'll calculate the 25th percentile (Q1), 50th percentile (median, Q2), and 75th percentile (Q3).
25th Percentile (Q1):
Position = 25100×(11+1)=310025×(11+1)=3
Since the position is a whole number, we take the value at the 3rd position, which is 70.
So, the 25th percentile is 70.
50th Percentile (Median, Q2):
Position = 50100×(11+1)=610050×(11+1)=6
The value at the 6th position is 85.
So, the median is 85.
75th Percentile (Q3):
Position = 75100×(11+1)=910075×(11+1)=9
The value at the 9th position is 100.
So, the 75th percentile is 100.
In summary, for this dataset:
25th Percentile (Q1) = 70
50th Percentile (Median, Q2) = 85
75th Percentile (Q3) = 100
These percentiles give us valuable information about how the scores are distributed and what
percentage of students scored below certain thresholds. For instance, we know that 25% of
students scored 70 or lower, 50% scored 85 or lower (which is also the median), and 75% scored
100 or lower.