8 Quantitative Concepts
d Describe how time and discount rate affect present and future values;
g Explain uses of mean, median, and mode, which are measures of frequency
or central tendency;
1 INTRODUCTION
Knowledge of quantitative (mathematically based) concepts is extremely important to
understanding the world of finance and investing. Quantitative concepts play a role
in financial decisions, such as saving and borrowing, and also form the foundation
for valuing investment opportunities and assessing their risks. The time value of
money and descriptive statistics are two important quantitative concepts. They are
not directly related to each other, but we combine them in this chapter because they
are key quantitative concepts used in finance and investment.
The time value of money is useful in many walks of life: it helps savers to know how
long it will take them to afford a certain item and how much they will have to put
aside each week or month, it helps investors to assess whether an investment should
provide a satisfactory return, and it helps companies to determine whether the profit
from investing will exceed the cost.
Statistics are also used in a wide range of business and personal contexts. As you
attempt to assess the large amount of personal and work-related data that are part of
our everyday lives, you will probably realise that an efficient summary and description
of data is helpful to make sense of it. Most people, for instance, look at summaries of
weather information to make decisions about how to dress and whether to carry an
umbrella or bring rain gear. Summary statistics help you understand and use information
in making decisions, including financial decisions. For example, summary information
about a company's or market's performance can help in investment decisions.
In short, quantitative concepts are fundamental to the investment industry. For any-
one working in the industry, familiarity with the concepts described in this chapter is
critical. As always, you are not responsible for calculations, but the presentation
of formulae and illustrative calculations may enhance your understanding.
2 TIME VALUE OF MONEY
2.1 Interest
Borrowing and lending are transactions with cash flow consequences. Someone who
needs money borrows it from someone who does not need it in the present (a saver)
and is willing to lend it. In the present, the borrower has money and the lender has
given up money. In the future, the borrower will give up money to pay back the lender;
the lender will receive money as repayment from the borrower in the form of interest,
as shown below. The lender will also receive back the money lent to the borrower. The
money originally borrowed, which interest is calculated on, is called the principal.
Interest can be defined as payment for the use of borrowed money.
[Figure: the lender lends money to the borrower; the borrower pays interest to the lender.]
Interest is all about timing: someone needs money now while someone else is willing
and able to give up money now, but at a price. The borrower pays a price for not being
able to wait to have money and to compensate the lender for giving up potential current
consumption or other investment opportunities; that price is interest. Interest is paid
by a borrower and earned by the lender to compensate the lender for opportunity cost
and risk. Opportunity cost, in general, is the value of alternative opportunities that
have been given up by the lender, including lending to others, investing elsewhere,
or simply spending the money. Opportunity cost can also be seen as compensation
for deferring consumption. Lending delays consumption by the term of the loan (the
time over which the loan is repaid). The longer the consumption is deferred, the more
compensation (higher interest) the lender will demand.
The lender also bears risks, such as the risk of not getting the money back if the
borrower defaults (fails to make a promised payment). The riskier the borrower or
the less certain the borrower's ability to repay the loan, the higher the level of interest
demanded by the lender. Another risk is that, as a result of inflation (an increase
in prices of goods and services), the money received may not be worth as much as
expected. In other words, a lender's purchasing power may decline even if the money is
repaid as promised. The greater the expected inflation, the higher the level of interest
demanded by the lender.
From the borrower's perspective, interest is the cost of having access to money that
the borrower would not otherwise have. An interest rate is determined by two factors:
opportunity cost and risk. Even if a loan is viewed as riskless (zero likelihood of default),
there still has to be compensation for the lender's opportunity cost and for expected
inflation. Exhibit 1 shows examples of borrowers and lenders.
Time Value of Money 179
[Exhibit 1: Examples of borrowers and lenders. If people invest in a company and earn interest by buying bonds, they are the lenders and the company is the borrower: they invest money and receive interest.]
The actual amount of interest earned or paid depends on the simple interest rate, the
amount of principal lent or borrowed, and the number of periods over which it is lent
or borrowed. We can show this mathematically as follows:
$$\text{Simple interest} = \text{Simple interest rate} \times \text{Principal} \times \text{Number of periods}$$
If you put money in a bank account and the bank offers a simple interest rate of 10%
per annum (or annually), then for every £100 you put in, you (as a lender to the bank)
will receive £10 in the course of the year (assume at year end to simplify calculations):
$$\text{Interest} = 0.10 \times \text{£}100 \times 1 = \text{£}10$$
If your money is left in the bank for two years, the interest paid will be £20:
$$\text{Interest} = 0.10 \times \text{£}100 \times 2 = \text{£}20$$
Simple interest is not reinvested and is applied only to the original principal, as shown
in Exhibit 2.
[Exhibit 2: Simple interest. Bar chart of the balance in pounds (£) for the original principal of £100 and for Years 1 to 5. Each bar shows the end of previous year balance (principal plus interest) and £10 of interest per year.]
If the interest earned is added to the original principal, the relationship between the
original principal and its future value with simple interest can be described as follows:
$$\text{Future value} = \text{Original principal} \times [1 + (\text{Simple interest rate} \times \text{Number of periods})]$$
To extend our deposit example: £100 × [1 + (0.10 × 2)] = £100 × 1.20 = £120. The
value at the end of two years is £120.
If a deposit of £100 is made and earns 10% and the money is reinvested (remains
on deposit), then additional interest is earned in the course of the second year on
the £10 of interest earned in the first year. The interest is being compounded. Total
interest after two years will now be £21: £10 (= £100 × 0.10) for the first year, plus
£11 (= £110 × 0.10) for the second year. The second year's interest is calculated on the
original £100 principal plus the first year's interest of £10. As shown in Exhibit 3, the
total interest after two years is £21 rather than £20 as in the case of simple interest
shown in Exhibit 2.
[Exhibit 3: Compound interest. Bar chart of the balance in pounds (£) on an original principal of £100. Each bar shows the end of previous year balance (principal plus interest) and the interest per year: £10.00, £11.00, £12.10, £13.31, and £14.64.]
The relationship between the original principal and its future value when interest is
compounded can be described as follows:
$$\text{Future value} = \text{Original principal} \times (1 + \text{Simple interest rate})^{\text{Number of periods}}$$
In the deposit example, £100 × (1 + 0.10)² = £100 × (1.10)² = £121. With compounding,
the value at the end of two years is £121.
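As a minimal sketch of the simple and compound interest formulas (the function names are illustrative and do not come from the text), the deposit example can be checked in Python:

```python
def simple_future_value(principal: float, rate: float, periods: int) -> float:
    """Future value when interest is paid on the original principal only."""
    return principal * (1 + rate * periods)


def compound_future_value(principal: float, rate: float, periods: int) -> float:
    """Future value when each period's interest is reinvested."""
    return principal * (1 + rate) ** periods


# The chapter's deposit: 100 at 10% per year.
for years in (1, 2, 5):
    simple = simple_future_value(100, 0.10, years)
    compound = compound_future_value(100, 0.10, years)
    print(f"After {years} year(s): simple {simple:.2f}, compound {compound:.2f}")
```

After one year the two balances are equal; from the second year onward the compound balance pulls ahead because interest is earned on earlier interest.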
[Figure: balance (£) over 0 to 20 years under simple interest and under compound interest.]
Product         Stated annual rate      Effective annual rate (with compounding)
Credit card     15.24%                  16.46% = $(1 + 0.1524/365)^{365} - 1$
Bank deposit    2.4% (= 0.2% × 12)      2.43% = $(1 + 0.024/12)^{12} - 1$
Loan            6.0%                    6.14% = $(1 + 0.06/4)^{4} - 1$
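The three rows above share one pattern: divide the stated annual rate by the number of compounding periods per year, compound over those periods, and subtract 1. A short sketch, with a hypothetical function name, that reproduces the three figures:

```python
def effective_annual_rate(stated_rate: float, periods_per_year: int) -> float:
    """Annual rate actually earned or paid once within-year compounding is included."""
    return (1 + stated_rate / periods_per_year) ** periods_per_year - 1


examples = {
    "Credit card (daily compounding)": (0.1524, 365),
    "Bank deposit (monthly compounding)": (0.024, 12),
    "Loan (quarterly compounding)": (0.06, 4),
}

for name, (rate, periods) in examples.items():
    print(f"{name}: {effective_annual_rate(rate, periods):.2%}")
```

The more frequent the compounding, the larger the gap between the stated rate and the effective rate.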
[Figure: timeline running from the present to the future.]
Time affects the value of money because delay creates opportunity costs and risk. If
you earn a return of r% for waiting one year, £1 × (1 + r%) is the future value after
one year of £1 invested today. Put another way, £1 is the present value of £1 × (1 +
r%) received in a year's time.
A saver may want to know how much money is needed today to produce a certain
sum in the future given the rate of interest, r. In the example in Exhibit 3, today's value
is £100 and the interest rate is 10%, so the future value after two years is £100 × (1 +
0.10)² = £121. The present value (the equivalent value today) of £121 in two years,
given that the annual interest rate is 10%, is £100.
[Figure: £100 today (present value) grows at an interest rate of 10% to £121 in two years (future value); £121 in two years, discounted at a discount rate of 10%, has a present value of £100 today.]
Before you can calculate present or future values, you must know the appropriate
interest or discount rates to use. The rate will usually depend on the overall level of
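interest rates in the economy, the opportunity cost, and the riskiness of the investments
under consideration. The following equations generalise the calculation of
future and present values:
$$\text{Future value} = \text{Present value} \times (1 + r)^{n}$$
$$\text{Present value} = \frac{\text{Future value}}{(1 + r)^{n}}$$
where
$r$ is the interest or discount rate per period and
$n$ is the number of periods.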
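To see how time and the discount rate affect present and future values, the sketch below applies the standard compounding and discounting relationships. The function names, the illustrative amount of 100, and the rates and horizons in the loop are assumptions chosen for the example.

```python
def future_value(pv: float, rate: float, periods: int) -> float:
    """Value after `periods` of compounding at `rate` per period."""
    return pv * (1 + rate) ** periods


def present_value(fv: float, rate: float, periods: int) -> float:
    """Value today of an amount received after `periods`, discounted at `rate`."""
    return fv / (1 + rate) ** periods


# The chapter's example: 100 today is equivalent to 121 in two years at 10%.
print(f"{future_value(100, 0.10, 2):.2f}")
print(f"{present_value(121, 0.10, 2):.2f}")

# Present value of 100 for different discount rates and waiting times.
for rate in (0.05, 0.10):
    for years in (1, 5, 10):
        print(f"rate {rate:.0%}, {years:>2} years: {present_value(100, rate, years):.2f}")
```

For a fixed future amount, a higher discount rate or a longer wait gives a lower present value.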
Example 2 compares two investments with the same initial outflow (investment) but
with different future cash inflows at different points in time.
1 You are choosing between two investments of equal risk. You believe
that given the risk, the appropriate discount rate to use is 9%. Your initial
investment (outflow) for each is £500. One investment is expected to pay
out £1,000 three years from now; the other investment is expected to pay
out £1,350 five years from now. To choose between the two investments,
you must compare the value of each investment at the same point in time.
2 You are choosing between the same two investments but you have reassessed
their risks. You now consider the five-year investment to be more
risky than the first and estimate that a 15% return is required to justify
making this investment.
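The comparison in Example 2 can be sketched by discounting each payout to today. The helper name is illustrative; the cash flows and rates are those stated in the example.

```python
def present_value(cash_flow: float, rate: float, years: int) -> float:
    """Discount a single future cash flow back to today."""
    return cash_flow / (1 + rate) ** years


# Part 1: both investments discounted at 9%.
three_year_at_9 = present_value(1_000, 0.09, 3)
five_year_at_9 = present_value(1_350, 0.09, 5)

# Part 2: the five-year investment is now judged riskier and discounted at 15%.
five_year_at_15 = present_value(1_350, 0.15, 5)

print(f"Three-year payout at 9%:  {three_year_at_9:.2f}")
print(f"Five-year payout at 9%:   {five_year_at_9:.2f}")
print(f"Five-year payout at 15%:  {five_year_at_15:.2f}")
```

At 9% the five-year payout has the higher present value (about 877 against about 772). Once it is discounted at 15%, its present value falls to about 671, below that of the three-year payout.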
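Example 2 shows three elements that must be considered when comparing investments:
the future cash flows each investment is expected to generate,
the timing of these cash flows, and
the risk associated with each investment, which is reflected in the discount rate.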
Present value considers the joint effect of these three elements and provides an effec-
tive way of comparing investments with different risks that have different future cash
flows at different points in time.
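The net present value (NPV) of the investment in Example 2 that is paying £1,350 in
five years (discounted at 15%) if it initially cost £500 is the present value of the
payout minus the initial cost:
$$\text{NPV} = \frac{\text{£}1{,}350}{(1 + 0.15)^{5}} - \text{£}500 = \text{£}671.19 - \text{£}500 = \text{£}171.19$$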
If costs were to occur at times different from time zero, then they would also be dis-
counted back to time zero for the purposes of comparison and calculation of the NPV.
If the NPV is zero or greater, the investment is earning at least the discount rate. An
NPV of less than zero indicates that the investment should not be made.
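A minimal sketch of an NPV calculation follows. It assumes annual cash flows listed by year, with index 0 standing for today, so costs that occur later are discounted in the same way as inflows. The function name and the list layout are illustrative.

```python
def net_present_value(rate: float, cash_flows: list[float]) -> float:
    """Sum of discounted cash flows; cash_flows[t] occurs t years from today.

    Outflows (costs) are negative and inflows are positive.
    """
    return sum(cf / (1 + rate) ** t for t, cf in enumerate(cash_flows))


# Five-year investment from Example 2: pay 500 today, receive 1,350 in year 5.
five_year = [-500, 0, 0, 0, 0, 1_350]
npv = net_present_value(0.15, five_year)
print(f"NPV at 15%: {npv:.2f}")
print("Earns at least the discount rate" if npv >= 0 else "Earns less than the discount rate")
```

With these inputs the NPV is positive, so the investment is expected to earn more than the 15% discount rate.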
Calculating the NPV allows an investor to compare different investments using their
projected cash flows and costs. The concepts of present value and net present value
have widespread applications in the valuation of financial assets and products. For
example, equities may pay dividends and/or be sold in the future, bonds may pay
interest and principal in the future, and insurance may lead to future payouts.
1 You place £1,000 on deposit at an annual interest rate of 10% and make
regular contributions of £250 at the end of each of the next two years.
How much do you have in your account at the end of two years?
2 You place £1,000 on deposit and withdraw £250 at the end of the first
year. The balance on deposit at the beginning of the year earns an annual
interest rate of 10%. How much do you have in your account at the end of
two years?
At the end of the first year, you have £1,000 × (1 + 0.10) = £1,100
You withdraw £250 and begin the second year with an amount = £850
At the end of the second year, you have £850 × (1 + 0.10) = £935
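Both parts of this example follow the same year-by-year pattern: add interest on the balance held at the start of the year, then apply the contribution or withdrawal at year end. A sketch under that assumption, with an illustrative function name:

```python
def account_balance(opening: float, rate: float, year_end_flows: list[float]) -> float:
    """Balance after applying interest and a year-end cash flow for each year.

    Positive flows are contributions; negative flows are withdrawals.
    """
    balance = opening
    for flow in year_end_flows:
        balance = balance * (1 + rate) + flow
    return balance


# Part 1: contribute 250 at the end of each of the two years.
with_contributions = account_balance(1_000, 0.10, [250, 250])

# Part 2: withdraw 250 at the end of the first year only.
with_withdrawal = account_balance(1_000, 0.10, [-250, 0])

print(f"Part 1 balance: {with_contributions:.2f}")
print(f"Part 2 balance: {with_withdrawal:.2f}")
```

Part 2 reproduces the worked steps (1,100, then 850, then 935). Applying the same steps to part 1 gives 1,350 after the first contribution and 1,735 at the end of the second year.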
Time value of money can also help determine the value of a financial instrument. It
can help you work out the value of an annuity or how long it will take to pay off the
mortgage on your home.
2.2.3.1 Present Value and the Valuation of Financial Instruments
People invest
in financial products and instruments because they expect to get future benefits in
the form of future cash flows. These cash flows can be in the form of income, such
as dividends and interest, from the repayment of an amount lent, or from selling the
financial product or instrument to someone else. An investor is exchanging a sum of
money today for future cash flows, and some of these cash flows are more uncertain
than others. The value (amount exchanged) today of a financial product should equal
the value of its expected future cash flows. This concept is shown in Example 5.
Consider the example of a simple loan that was made three years ago. Two years
from today, the loan will mature and the borrower should repay the principal
value of the loan, which is £100. The investor who buys (or owns) this loan should
also receive from the borrower two annual interest payments at the originally
promised interest rate of 8%. The interest payments will be £8 (= 8% × £100),
with the first interest payment received a year from now and the second two
years from now.
How much would an investor pay today to secure these two years of cash flow
if the appropriate discount rate is 10% (i.e. r = 0.10)? Note that the rate used for
discounting the future cash flows should reflect the risk of the investment and
interest rates in the market. In practice, it is unlikely that the discount rate will
be equal to the loan's originally promised interest rate because the risk of the
investment and interest rates in the market may change over time.
The first year's interest payment is worth $\frac{\text{£}8}{1.10^{1}} = \text{£}7.27$.
The second year's interest payment is worth $\frac{\text{£}8}{1.10^{2}} = \text{£}6.61$.
The repayment of the loan's principal value in two years is
worth $\frac{\text{£}100}{1.10^{2}} = \text{£}82.64$.
So today, the cash flows returned by the loan are worth £7.27 + £6.61 + £82.64 =
£96.52. So this loan is worth £96.52 to the investor. In other words, if the original
lender wanted to sell this loan, an investor would pay £96.52.
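The loan valuation can be sketched as a sum of discounted cash flows. The function name is illustrative. Note that the text rounds each component before adding, which gives 96.52, whereas discounting without intermediate rounding gives roughly 96.53.

```python
def value_of_cash_flows(rate: float, cash_flows: list[float]) -> float:
    """Present value of year-end cash flows; the first flow arrives in one year."""
    return sum(cf / (1 + rate) ** t for t, cf in enumerate(cash_flows, start=1))


# Year 1: interest of 8. Year 2: interest of 8 plus repayment of the 100 principal.
loan_cash_flows = [8, 8 + 100]
unrounded_value = value_of_cash_flows(0.10, loan_cash_flows)

# Rounding each component first, as in the worked example.
components = [8 / 1.10, 8 / 1.10**2, 100 / 1.10**2]
rounded_value = sum(round(c, 2) for c in components)

print(f"Unrounded value: {unrounded_value:.4f}")
print(f"Sum of rounded components: {rounded_value:.2f}")
```

Because the 10% discount rate is above the loan's 8% promised rate, the loan is worth less than its 100 principal.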
Through the understanding of present value and knowing how to calculate it, investors
can assess whether the price of a financial instrument trading in the marketplace is
priced cheaply, priced fairly, or overpriced.
2.2.3.2 Time Value of Money and Regular Payments
Many kinds of financial arrangements
involve regular payments over time. For example, most consumer loans, including
mortgages, involve regular periodic payments to pay off the loan. Each period, some of
the payment covers the interest on the loan and the rest of the payment pays off some
of the principal (the loaned amount). A pension savings scheme or pension plan may
also involve regular contributions.
Most consumer loans result in a final balance of money equal to zero. That is, the
loan is paid off. Two time value of money applications that require the final balance
of money to be zero are annuities and mortgages.
Example 6 illustrates the reduction of an annuity to zero over time and the reduction
of a mortgage to zero over time. To simplify the examples, the assumption is that the
annuity and the mortgage each mature in five years and entail a single withdrawal or
payment each of the five years.
[Table: annuity schedule with columns Year; Annuity Balance at Beginning of Year; Balance at End of Year before Withdrawal; Withdrawal (Payment by Insurance Company).]
2 You borrow £60,000 to buy a small cottage in the country. The interest
rate on the mortgage is 4.60%. Your payment at the end of each year will
be £13,706.
[Table: mortgage schedule with columns Year; Mortgage Outstanding at Beginning of Year; Total Mortgage Payment; Interest Paid; Principal Reduced.]
As you can see in Example 6, both the annuity and mortgage balances decline to zero
over time.
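The mortgage in Example 6 can be rebuilt year by year. The sketch below assumes the standard level-payment (annuity) formula and a five-year term, as stated in the example's set-up. The function names are illustrative, and the payment is computed rather than taken from the text, so the schedule ends at exactly zero.

```python
def level_payment(principal: float, rate: float, years: int) -> float:
    """Equal year-end payment that repays `principal` over `years` at `rate`."""
    return principal * rate / (1 - (1 + rate) ** -years)


def amortization_schedule(principal: float, rate: float, years: int):
    """Return rows of (year, opening balance, payment, interest, principal reduced)
    together with the balance left after the final payment."""
    payment = level_payment(principal, rate, years)
    balance = principal
    rows = []
    for year in range(1, years + 1):
        interest = balance * rate
        principal_reduced = payment - interest
        rows.append((year, balance, payment, interest, principal_reduced))
        balance -= principal_reduced
    return rows, balance


rows, closing_balance = amortization_schedule(60_000, 0.046, 5)
for year, opening, payment, interest, reduced in rows:
    print(f"{year}  {opening:>10,.0f}  {payment:>8,.0f}  {interest:>7,.0f}  {reduced:>8,.0f}")
print(f"Balance after final payment: {closing_balance:,.2f}")
```

Each year the interest portion shrinks and the principal portion grows, because interest is charged on a smaller outstanding balance.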
3 DESCRIPTIVE STATISTICS
As the name suggests, descriptive statistics are used to describe data. Often, you are
confronted by data that you need to organise in order to understand it. For exam-
ple, you get the feeling that the drive home from work is getting slower and you are
thinking of changing your route. How could you assess whether the journey really is
getting slower? Suppose you calculated and compared the average daily commute time
each month over a year. The first question you need to address is, what is meant by
average? There are a number of different ways to calculate averages that are described
in Section 3.1, each of which has advantages and disadvantages.
In general, descriptive statistics are numbers that summarise essential features of a data
set. A data set relates to a particular variable (the time it takes to drive home from
work in our example). The data set includes several observations, that is, observed
values for the variable. For example, if you keep track of your daily commute time
for a year, you will end up with approximately 250 observations. The distribution of
a variable is the values a variable can take and the number of observations associated
with each of these values.
We will discuss two types of descriptive statistics: those that describe the central ten-
dency of a data set (e.g., the average or mean) and those that describe the dispersion
or spread of the data (e.g., the standard deviation). In addition to knowing whether
the drive to work is getting slower (by comparing monthly averages), you might also
want to find a way to measure how much variation there is between journey times
from one day to another (by using standard deviation).
Similar needs to summarise data arise in business. For example, when comparing the
time taken to process two types of trades, a sample of the times required to process
each trade would need to be collected. The average time it takes to process each type of
trade could be calculated and the average times could then be compared. Descriptive
statistics efficiently summarise the information from large quantities of data for the
purpose of making comparisons. Descriptive statistics may also help in predicting
future values and understanding risk. For example, if there was little variation in the
times taken to process a trade, then presumably you would be confident that you had
a good idea of the average time it takes to process a trade and comfortable with that
as an estimate of how long it will take to process future trades. But if the time taken
to process trades was highly variable, you would have less confidence in how long it
would take on average to process future trades.
Measures of central tendency are useful for making comparisons between groups
of individuals or between sets of figures. Such measures reduce a large number of
measurements to a single figure. For instance, the mean or average temperature in
country X in July from 1961 to 2011 is calculated to be 16.1°C. Over the same period
in September, the average temperature is 13.6°C. Because it is a long time series, you
can reasonably conclude that it is usually warmer in July than September in country X.
arithmetic mean,
geometric mean,
median, and
mode.
The appropriate measure for a given data set depends on the features of the data and
the purpose of your calculation. These measures are examined in the following sections.
Exhibit 5 shows the annual returns earned on an investment over a 10-year period. The
information contained in Exhibit 5 will be used in examples throughout this section.
[Exhibit 5: Annual returns (%) by year. Year 1: 1.3%; Year 2: 2.4%; Year 3: 0.8%; Year 4: 3.7%; Year 5: 8.0%; Year 6: 3.7%; Year 7: 7.2%; Year 8: 26.4%; Year 9: 4.2%; Year 10: 5.2%.]
[Figure: annual returns (%) by year from Exhibit 5, with the mean marked.]
$$\text{Mean} = \frac{1.3 + 2.4 + 0.8 + 3.7 + 8.0 + 3.7 + 7.2 + 26.4 + 4.2 + 5.2}{10} = 6.3\%$$
The arithmetic mean return or average annual return over the 10-year period
is 6.3%. The weighted mean return (shown in the following equation) is the same
as the arithmetic mean return because the probability assigned to each return is the
same: 10% or 0.1.
$$\begin{aligned}
\text{Weighted mean annual return} &= (0.1 \times 1.3) + (0.1 \times 2.4) + (0.1 \times 0.8) + (0.1 \times 3.7) + (0.1 \times 8.0) \\
&\quad + (0.1 \times 3.7) + (0.1 \times 7.2) + (0.1 \times 26.4) + (0.1 \times 4.2) + (0.1 \times 5.2) \\
&= 6.3\%
\end{aligned}$$
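The two calculations can be checked with a few lines of Python. The variable names are illustrative, and the returns are entered in percent, in the order used in the calculation above.

```python
from statistics import fmean

# Annual returns in percent, in the order used in the text.
returns = [1.3, 2.4, 0.8, 3.7, 8.0, 3.7, 7.2, 26.4, 4.2, 5.2]

arithmetic_mean = fmean(returns)

# Equal weights of 0.1 each; a weighted mean multiplies each value by its weight.
weights = [1 / len(returns)] * len(returns)
weighted_mean = sum(w * r for w, r in zip(weights, returns))

print(f"Arithmetic mean: {arithmetic_mean:.1f}%")
print(f"Weighted mean:   {weighted_mean:.1f}%")
```

With unequal weights (for example, if some outcomes were considered more likely than others) the weighted mean would generally differ from the arithmetic mean.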
The mean has one main disadvantage: it is particularly susceptible to the influence of
outliers. These are values that are unusual compared with the rest of the data set by
being especially small or large in numerical value. The arithmetic mean is not very
representative of the whole set of observations when there are outliers. Example 8
shows the effect of excluding an outlier from the calculation of the arithmetic mean.
[Figure: annual returns (%) by year from Exhibit 5, with the 26.4% return marked as an outlier and the mean without the outlier shown.]
Including the outlier, the mean is dragged in the direction of the outlier. When there
are one or more outliers in a set of data in one direction, the data are said to be
skewed in that direction. In Example 7, ordering data so larger numbers are to the
right of smaller numbers, 26.4% lies to the right of the other data. Thus, the data are
said to be right skewed (or positively skewed). Other measures of central tendency
may better accommodate outliers.
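A short sketch of the outlier's effect on the mean, using the Exhibit 5 returns in percent. Treating the single largest value as the outlier is an assumption made for this illustration.

```python
from statistics import fmean

returns = [1.3, 2.4, 0.8, 3.7, 8.0, 3.7, 7.2, 26.4, 4.2, 5.2]

outlier = max(returns)  # 26.4, far above the other nine values
without_outlier = [r for r in returns if r != outlier]

mean_all = fmean(returns)
mean_without = fmean(without_outlier)

print(f"Mean with outlier:    {mean_all:.1f}%")
print(f"Mean without outlier: {mean_without:.1f}%")
```

One observation out of ten moves the mean by more than two percentage points, which is what is meant by the mean being dragged in the direction of the outlier.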
[Figure: a three-year investment with annual returns of 8%, 3%, and 7%.]
three years. So, the second step requires moving from three years to one by raising
the accumulation to the power of one over the number of periods held, three in this
particular case; this calculation can also be described as taking the "number of periods
held" root of the value ($1.1903^{1/3} \approx 1.060$). This value of 1.060 includes both the
original investment and the average yearly return on the investment each year (1 plus
the geometric mean return). The last step is, therefore, to subtract 1 from this value
to arrive at the return that would have to be earned on average each year to get to the
total accumulation over the three years (1.060 − 1 ≈ 0.060 or 6.0%). The geometric
mean return is 6.0%, which in this case is the same as the arithmetic mean return
when both are rounded to one decimal place.
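Geometric mean is frequently the preferred measure for the investment industry.
$$\text{Geometric mean return} = [(1 + r_1) \times (1 + r_2) \times \cdots \times (1 + r_n)]^{1/n} - 1$$
where
$r_i$ is the return in period $i$ and
$n$ is the number of periods.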
Example 9 shows the calculation of the geometric mean return for the investment
of Exhibit 5.
If 1 currency unit was invested, you would have 1.8 currency units at the end
of the 10 years.
$$\begin{aligned}
\text{Total accumulation after 10 years} &= (1 + 1.3\%) \times (1 + 2.4\%) \times (1 + 0.8\%) \times (1 + 3.7\%) \times (1 + 8.0\%) \\
&\quad \times (1 + 3.7\%) \times (1 + 7.2\%) \times (1 + 26.4\%) \times (1 + 4.2\%) \times (1 + 5.2\%) \\
&= 1.013 \times 1.024 \times 1.008 \times 1.037 \times 1.08 \times 1.037 \times 1.072 \times 1.264 \times 1.042 \times 1.052 \\
&= 1.8
\end{aligned}$$
$$\text{Average accumulation per year} = \sqrt[10]{1.8} = 1.8^{1/10} = 1.061$$
$$\text{Geometric mean annual return} = 1.061 - 1 = 0.061 = 6.1\%$$
This can also be done as one calculation:
$$\begin{aligned}
\text{Geometric mean annual return} &= [(1 + 1.3\%) \times (1 + 2.4\%) \times (1 + 0.8\%) \times (1 + 3.7\%) \times (1 + 8.0\%) \\
&\quad \times (1 + 3.7\%) \times (1 + 7.2\%) \times (1 + 26.4\%) \times (1 + 4.2\%) \times (1 + 5.2\%)]^{1/10} - 1 \\
&= 6.1\%
\end{aligned}$$
The geometric mean annual return is 6.1%. One currency unit invested for 10
years and earning 6.1% per year would accumulate to approximately 1.8 units.
An important aspect to notice is that the geometric mean is lower than the arithmetic
mean even though the annual returns over the 10-year holding period are identical.
This result is because the returns are compounded when calculating the geometric
mean return. Recall that compounding will result in a higher value over time, so a
lower rate of return is required to reach the same amount. In fact, if the same set of
numbers is used to calculate both means, the geometric mean return is never greater
than the arithmetic mean return and is normally lower.
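The geometric mean calculation can be sketched as follows. Returns are entered as decimals, and the function name is illustrative.

```python
import math
from statistics import fmean


def geometric_mean_return(returns: list[float]) -> float:
    """Constant per-period return that gives the same total accumulation."""
    accumulation = math.prod(1 + r for r in returns)
    return accumulation ** (1 / len(returns)) - 1


# Exhibit 5 returns as decimals.
ten_year = [0.013, 0.024, 0.008, 0.037, 0.080, 0.037, 0.072, 0.264, 0.042, 0.052]
# Three-year example: 8%, 3%, and 7%.
three_year = [0.08, 0.03, 0.07]

for label, data in (("10-year", ten_year), ("3-year", three_year)):
    print(f"{label}: arithmetic {fmean(data):.2%}, geometric {geometric_mean_return(data):.2%}")
```

Printing to two decimal places shows that the geometric mean sits slightly below the arithmetic mean in both cases, even where the two agree after rounding to one decimal place.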
3.1.3 Median
If you put data in ascending order of size from the smallest to the largest, the median
is the middle value. If there is an even number of items in a data set, then you average
the two middle observations. Hence, in many cases (i.e., when the sample size is odd
or when the two middle-ranked items of an even-numbered data set are the same)
the median will be a number that actually occurs in the data set. Example 10 shows
the calculation of the median for the investment of Exhibit 5.
EXAMPLE 10. MEDIAN
When the returns are ordered from low to high, the median value is the arithmetic
mean of the fifth and sixth ordered observations.
0.8% 1.3% 2.4% 3.7% 3.7% 4.2% 5.2% 7.2% 8.0% 26.4%
$$\text{Median} = \frac{3.7 + 4.2}{2} = 3.95\% \approx 4.0\%$$
[Figure: annual returns (%) by year from Exhibit 5, with the median marked.]
An advantage of the median over the mean is that it is not sensitive to outliers. In the
case of the annual returns shown in Exhibit 5, the median of close to 4.0% is more
representative of the data's central tendency than the 6.3% mean. This 4.0% median
return is close to the 4.1% arithmetic mean return when the outlier is excluded. The
median is usually a better measure of central tendency than the mean when the data are skewed.
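A brief sketch comparing the median and the mean for the Exhibit 5 returns (in percent), with and without the 26.4% outlier. It uses Python's standard `statistics` module; the variable names are illustrative.

```python
from statistics import fmean, median

returns = [1.3, 2.4, 0.8, 3.7, 8.0, 3.7, 7.2, 26.4, 4.2, 5.2]
without_outlier = [r for r in returns if r != max(returns)]

# With 10 values, median() averages the fifth and sixth ordered observations.
print(f"All returns:     median {median(returns):.2f}%, mean {fmean(returns):.2f}%")
print(f"Without outlier: median {median(without_outlier):.2f}%, mean {fmean(without_outlier):.2f}%")
```

Removing the outlier changes the median only slightly, whereas the mean falls by more than two percentage points.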
3.1.4 Mode
The mode is the most frequently occurring value in a data set. Example 11 shows how
the mode is determined for the investment of Exhibit 5.
EXAMPLE 11. MODE
Looking at Exhibit 5, we see that one value occurs twice, 3.7%. This value is the
mode of the data.
Mode = 3.7%
The mode can be used as a measure of central tendency for data that have been sorted
into categories or groups. For example, if all the employees in a company were asked
what form of transportation they used to get to work each day, it would be possible
to group the answers into categories, such as car, bus, train, bicycle, and walking. The
category with the highest number would be the mode.
A problem with the mode is that it is often not unique, in which case there is no
mode. If there are two or more values that share the same frequency of occurrence,
there is no agreed method to choose the representative value. The mode may also
be difficult to compute if the data are continuous. Continuous data are data that can
take on an infinite number of values between whole numbers, for example, weights
of people. One person may weigh 62.435 kilos and another 62.346 kilos. By contrast,
discrete data show observations only as distinct values, for example, the number
of people employed at different companies. The number of people employed will be
a whole number. For continuous data, it is less likely that any observation will occur
more frequently than once, so the mode is generally not used for identifying central
tendency for continuous data.
Another problem with the mode is that the most frequently occurring observation may
be far away from the rest of the observations and does not meaningfully represent them.
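The sketch below finds the mode of the Exhibit 5 returns, the most common category in a made-up commuting survey, and the result when two values tie. The survey answers and the tied data set are hypothetical and serve only as illustrations. `statistics.multimode` requires Python 3.8 or later.

```python
from collections import Counter
from statistics import multimode

# Exhibit 5 returns in percent: 3.7 is the only value that occurs twice.
returns = [1.3, 2.4, 0.8, 3.7, 8.0, 3.7, 7.2, 26.4, 4.2, 5.2]
return_modes = multimode(returns)

# Hypothetical survey answers, grouped into categories.
commute = ["car", "bus", "car", "train", "bicycle", "car", "bus", "walking"]
most_common_category, count = Counter(commute).most_common(1)[0]

# Hypothetical data set in which two values share the highest frequency.
tied_modes = multimode([1, 1, 2, 2, 3])

print(return_modes)
print(most_common_category, count)
print(tied_modes)
```

`multimode` returns every value that shares the highest frequency, which makes the non-uniqueness problem visible: the tied data set produces two candidates and no agreed way to choose between them.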
[Figure: salaries ($ thousands) at Company A and Company B, with the average annual salary marked.]
Another reason why measures of dispersion are important in finance is that invest-
ment risk is often measured using some measure of variability. When investors are
considering investing in a security, they are interested in the likely (expected) return
on that investment as well as in the risk that the return could differ from the expected
return (its variability). A risk-averse investor considering two investments that have
similar expected returns but very different measures of variability (risk) around those
expected returns typically prefers the security with the lower variability.
Two common measures of dispersion of a data set are the range and the standard
deviation.
3.2.1 Range
The range is the difference between the highest and lowest values in a data set. It is
the easiest measure of dispersion to calculate and understand, but it is very sensitive
to outliers. Example 12 explains the calculation of the range of returns for the investment
of Exhibit 5.
EXAMPLE 12. RANGE
In Exhibit 5 we see that the highest annual return is 26.4% and the lowest annual
return is 0.8%.
$$\text{Range} = 26.4\% - 0.8\% = 25.6\%$$
If the extreme value at the upper end of the range is excluded, the next highest
value, 8.0%, is used to estimate the range, and the range is reduced significantly.
$$\text{Range without the highest value} = 8.0\% - 0.8\% = 7.2\%$$
Clearly, the range is affected by extreme values and, if there are outliers, it says little
about the distribution of the data between those extremes.
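The range calculation, with and without the highest return, can be sketched in a few lines. The function name is illustrative and the returns are in percent.

```python
def value_range(values: list[float]) -> float:
    """Difference between the highest and lowest values."""
    return max(values) - min(values)


returns = [1.3, 2.4, 0.8, 3.7, 8.0, 3.7, 7.2, 26.4, 4.2, 5.2]
without_highest = sorted(returns)[:-1]  # drop the single largest observation

print(f"Range of all returns:        {value_range(returns):.1f} percentage points")
print(f"Range without highest value: {value_range(without_highest):.1f} percentage points")
```

Dropping one observation cuts the range by more than two-thirds, which illustrates how strongly a single extreme value drives this measure.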
If there are a large number of observations ranked in order of size, the range can be
divided into 100 equal-sized intervals. The dividing points are termed percentiles. The
50th percentile is the median and divides the observations so that 50% are higher and
50% are lower than the median. The 20th percentile is the value below which 20% of
observations in the series fall. So, the dispersion of the observations can be described
in terms of percentiles. Observations can be divided into other equal-sized intervals.
Commonly used intervals are quartiles (the observations are divided into four equal-sized
intervals) and deciles (the observations are divided into 10 equal-sized intervals).
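$$\text{Standard deviation} = \sqrt{\frac{[X_1 - E(X)]^2 + [X_2 - E(X)]^2 + \cdots + [X_n - E(X)]^2}{n}}$$
where
$X_1, X_2, \ldots, X_n$ are the observed values of X,
$E(X)$ is the mean value of X, and
$n$ is the number of observations.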
The differences between the observed values of X and the mean value of X capture
the variability of X. These differences are squared and summed. Note that because
the differences are squared, what matters is the size of the difference not the sign of
the difference. The sum is then divided by the number of observations. Finally, the
square root of this value is taken to get the standard deviation.
The value before the square root is taken is known as the variance, which is another
measure of dispersion. The standard deviation is the square root of the variance. The
standard deviation and the variance capture the same thing: how far away from
the mean the observations are. The advantage of the standard deviation is that it
is expressed in the same unit as the mean. For example, if the mean is expressed as
minutes of journey time, the standard deviation will also be expressed as minutes,
whereas the variance will be expressed as minutes squared, making the standard
deviation an easier measure to use and compare with the mean.
To illustrate the calculation of the standard deviation, let us return to the example of
a three-year investment that returns 8% or 0.08 the first year, 3% or 0.03 the second
year, and 7% or 0.07 the third year. The arithmetic mean return is 6% or 0.06. The
standard deviation is approximately 2.16%.
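$$\text{Standard deviation} = \sqrt{\frac{(0.08 - 0.06)^2 + (0.03 - 0.06)^2 + (0.07 - 0.06)^2}{3}} = \sqrt{\frac{0.0014}{3}} = 0.0216 = 2.16\%$$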
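The steps described above (subtract the mean, square, sum, divide by the number of observations, take the square root) can be written out directly. The function name is illustrative. Because the text divides by n, the result corresponds to `statistics.pstdev` in Python's standard library; `statistics.stdev` divides by n − 1 instead and would give a larger figure.

```python
import math
from statistics import fmean, pstdev


def standard_deviation(values: list[float]) -> float:
    """Square root of the average squared difference from the mean (divides by n)."""
    mean = fmean(values)
    variance = sum((x - mean) ** 2 for x in values) / len(values)
    return math.sqrt(variance)


three_year = [0.08, 0.03, 0.07]
print(f"Three-year example: {standard_deviation(three_year):.2%}")

# The same steps applied to the Exhibit 5 returns, entered in percent.
exhibit_5 = [1.3, 2.4, 0.8, 3.7, 8.0, 3.7, 7.2, 26.4, 4.2, 5.2]
print(f"Exhibit 5 returns:  {standard_deviation(exhibit_5):.2f} percentage points")
```

The standard deviation of the Exhibit 5 returns is larger than their mean, which reflects the influence of the 26.4% outlier.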
Example 13 shows the calculation of the standard deviation for the investment in
Exhibit 5.
Larger values of standard deviation relative to the mean indicate greater variation in
a data set. Also, by using standard deviation, you can determine how likely it is that
any given observation will occur based on its distance from the mean. Example 14
compares the returns of the investment shown in Exhibit 5 and the returns on another
investment over the same period using mean and standard deviation.
                    Number of Employees
Salary ($)          Company X    Company Y
15,000–20,000       5            1
20,001–25,000       8            1
25,001–30,000       20           3
30,001–35,000       30           8
35,001–40,000       22           10
40,001–45,000       12           15
45,001–50,000       6            20
50,001–55,000       2            9
55,001–60,000       1            7
[Figures: two histograms, one for each company, of the number of employees in each salary range ($ thousands), from 15–20 to 55–60.]
Note that the two distributions are not symmetrical. A symmetrical distribution
would have observations falling off fairly evenly on either side of the centre of the
range of salaries ($35,001–$40,000). Instead, in each of these distributions, the bulk
of the observations are stacked towards one end of the range and tail off gradually
towards the other end. The two distributions are different in that each is stacked
towards a different end. Such distributions are considered skewed; the distribution
for Company X is positively skewed (i.e., the majority of the observations are on the
left and the skew or tail is on the right), whereas the distribution for Company Y is
negatively skewed (left skewed).
Although the range of the observations is the same in each case, the mean for each
is very different. Company X's mean is approximately $35,000, whereas Company Y's
mean is approximately $44,000.
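As a rough check on these figures, the mean of each distribution can be estimated from the salary table by treating every employee as earning the midpoint of his or her band. The midpoint treatment is an assumption made here for illustration, because the table does not give individual salaries. The estimates come out at about $33,700 for Company X and $43,500 for Company Y, close to the approximate means quoted above.

```python
# Lower bound of each salary band in dollars; each band is about $5,000 wide.
band_lower_bounds = [15_000, 20_000, 25_000, 30_000, 35_000,
                     40_000, 45_000, 50_000, 55_000]
band_width = 5_000

# Number of employees in each band, taken from the table in the text.
employees = {
    "Company X": [5, 8, 20, 30, 22, 12, 6, 2, 1],
    "Company Y": [1, 1, 3, 8, 10, 15, 20, 9, 7],
}


def grouped_mean(counts):
    """Estimate the mean, assuming every employee earns the band midpoint."""
    midpoints = [low + band_width / 2 for low in band_lower_bounds]
    total_pay = sum(m * c for m, c in zip(midpoints, counts))
    return total_pay / sum(counts)


def modal_band(counts):
    """Return the (low, high) bounds of the band with the most employees."""
    i = counts.index(max(counts))
    low = band_lower_bounds[i]
    return low, low + band_width


estimates = {name: grouped_mean(counts) for name, counts in employees.items()}

for name, counts in employees.items():
    low, high = modal_band(counts)
    print(f"{name}: estimated mean ${estimates[name]:,.0f}, "
          f"most populated band ${low:,} to ${high:,}")
```

Company X's estimated mean lies above the midpoint of its most populated band, whereas Company Y's lies below its most populated band, which is consistent with the positive and negative skew described above.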
A normal distribution has special importance in statistics because many variables have
the approximate shape of a normal distribution (for example, height, blood pressure,
and lengths of objects produced by machines). This distribution is often useful as a
description of data when there are a large number of observations.
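[Figure: normal distribution curve with the horizontal axis marked in standard deviations (SD) from −3 to +3. Areas under the curve: 0.3413 on each side of the mean within 1 SD (68.26% in total), a further 0.1359 on each side between 1 and 2 SD (95.44% within 2 SD), and 0.0228 in each tail beyond 2 SD.]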
The total area under the curve or bell is 100% of the distribution. The area under
the curve that is within one standard deviation of the mean is about 68% of all the
observations. In other words, given a mean of 0 and a standard deviation of 1, about
68% of the observations fall between −1 and +1, and 32% of the observations are more
than one standard deviation from the mean. The area under the curve that is within two
standard deviations of the mean is about 95% of the observations. Given a mean of 0
and a standard deviation of 1, about 95% of the observations fall between −2 and +2,
and 5% of the observations are more than two standard deviations from the mean. The
area under the curve that is within three standard deviations of the mean represents
about 99% of the observations. Given a mean of 0 and a standard deviation of 1, about
99% of the observations fall between −3 and +3, and less than 1% of the observations
occur more than three standard deviations away from the mean.
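These proportions can be reproduced with Python's standard library. The sketch below is illustrative only; the helper name `share_within` is an assumption. It gives shares of roughly 68.3%, 95.4%, and 99.7% within one, two, and three standard deviations, which the text rounds to 68%, 95%, and 99%.

```python
from statistics import NormalDist

# Standard normal distribution: mean of 0 and standard deviation of 1.
standard_normal = NormalDist(mu=0.0, sigma=1.0)


def share_within(k):
    """Share of observations within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)


for k in (1, 2, 3):
    inside = share_within(k)
    print(f"within {k} SD: {inside:.2%}   further away: {1 - inside:.2%}")
```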
The observations that are more than a specified number of standard deviations from
the mean can be described as lying in the tails of the distribution. Assuming that
returns on a portfolio of stocks are normally distributed, the chance of extreme losses
(a return more than three standard deviations lower than the mean return) is relatively
small. The chance of the return being in the left tail more than two standard deviations
from the mean (which would be an extreme loss under typical circumstances) is only
about 2.5%. In other words, out of 200 days, about 5 days are expected to have returns that
are more than two standard deviations below the mean. But during the financial crisis
of 2008, the losses that were incurred by some banks over several days in a row were
25 standard deviations below the mean.
To put this in perspective, if returns are normally distributed, a return that is 7.26
standard deviations below the mean would be expected to occur once every 13.7 billion
years. That is approximately the age of the universe. The frequency of extreme events
during the financial crisis of 2008 was, therefore, much higher than predicted by the
normal distribution. This inconsistency is often referred to as the distribution having
fat tails, meaning that the probability of observing extreme outcomes is higher than
that predicted by a normal distribution.
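To see how quickly the left-tail probability shrinks under a normal distribution, the following sketch computes it directly. It is an illustration under the normality assumption only; the function name is an assumption. The exact two-standard-deviation tail is about 2.3% (roughly 4.6 days out of 200), and the probability of a return 7.26 standard deviations below the mean is of the order of 10^-13 per observation.

```python
import math


def left_tail_probability(k):
    """Probability that a normally distributed return falls more than
    k standard deviations below its mean."""
    # For a standard normal Z, P(Z < -k) = 0.5 * erfc(k / sqrt(2)).
    return 0.5 * math.erfc(k / math.sqrt(2))


# Left tail beyond two standard deviations, and the expected number of
# such days in a 200-day period.
p_two_sd = left_tail_probability(2)
expected_days = 200 * p_two_sd

# Left tail beyond 7.26 and 25 standard deviations.
p_extreme = left_tail_probability(7.26)
p_crisis = left_tail_probability(25)

print(f"P(more than 2 SD below the mean)    = {p_two_sd:.4f}")
print(f"expected days out of 200            = {expected_days:.1f}")
print(f"P(more than 7.26 SD below the mean) = {p_extreme:.2e}")
print(f"P(more than 25 SD below the mean)   = {p_crisis:.2e}")
```

Observing several such days in a row is therefore strong evidence that actual returns have fatter tails than the normal distribution assumes.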
In Exhibit 9, the curve with the solid line represents the normal distribution. The curve
with the dotted line is an example of a distribution with thinner tails than the normal
distribution, indicating a reduced probability of extreme outcomes. By contrast, the
curve with the dashed line is an example of a distribution with fatter tails than the
normal distribution, indicating increased likelihood of extreme outcomes.
3.3 Correlation
Another way of using and understanding data is identifying connections between
data sets. The strength of a relationship between two variables, such as growth in
gross domestic product (GDP) and stock market returns, can be measured by using
correlation. Essentially, two variables are correlated when a change in one variable
helps predict change in another variable.
When both variables change in the same direction, the variables are positively
correlated. If we take the example of traders at an investment bank, salary and age are
positively correlated if salaries increase as age increases. If the variables move in the
opposite direction, then they are negatively correlated. For example, the size of a
transaction and the fees expressed as a percentage of the transaction are negatively
correlated if the larger the transaction, the smaller the associated fees. When there is no
clear tendency for one variable to move in a particular direction (up or down) relative
to changes in the other variable, then the variables are close to being uncorrelated.
In practice, it is difficult to find two variables that have absolutely no relationship,
even if just by chance.
Correlation measures both the direction of the relationship between two variables
(negative or positive) and the strength of that relationship (the closer to +1 or −1, the
stronger the relationship). In practice, it is unusual to find variables that are perfectly
positively or perfectly negatively correlated. The stronger the relationship between two
variables (that is, the higher the degree of correlation), the more confidently one variable can
be predicted given the other variable. For example, there may be a high correlation
between stock market index returns and expected economic growth. In that case, if
economic growth in the future is expected to be high, then returns on the stock market
index are likely to be high too.
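The text does not give a formula for correlation. One common measure is the Pearson correlation coefficient, sketched below as an assumption about what is meant. The age, salary, transaction size, and fee figures are hypothetical and invented purely to illustrate a positive and a negative relationship.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # How the two variables move together around their means.
    co_movement = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # How much each variable moves on its own.
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return co_movement / (spread_x * spread_y)


# Hypothetical data, not taken from the text.
trader_age = [25, 30, 35, 40, 45]            # years
trader_salary = [60, 72, 79, 95, 101]        # $ thousands, rising with age
transaction_size = [1, 5, 10, 50, 100]       # $ millions
fee_percent = [2.0, 1.5, 1.2, 0.6, 0.4]      # fee as % of size, falling with size

age_salary = pearson_correlation(trader_age, trader_salary)
size_fee = pearson_correlation(transaction_size, fee_percent)

print(f"age and salary:            {age_salary:+.2f}")
print(f"transaction size and fees: {size_fee:+.2f}")
```

The first coefficient is close to +1 and the second is negative, matching the direction of each relationship; neither is exactly +1 or −1 because the hypothetical points do not lie on a straight line.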
It is important, however, to realise that correlation does not imply causation. For
example, historically in the United States, stock market returns and snowfall are both
higher in January, and from that you may assume a correlation. But obviously snowfall
does not cause an increase in stock market returns, and an increase in stock market
returns clearly does not cause snowfall. There may be situations in which a
correlation implies some causal relationship. For example, a high correlation has been found
between power production and job growth. It may follow that the more workers there
are, the more power is consumed, but it does not necessarily follow that an increase
in power generation will create jobs.
Correlation is important in investing because the rise or fall in value of a variable may
help predict the rise or fall in value of another variable. It is also important because
when two or more securities that are not perfectly correlated are combined together in
a portfolio, there is normally a reduction in risk (measured by the portfolio's standard
deviation of returns). As long as the returns on the securities do not have a correlation
of +1 (that is, they are less than perfectly correlated), then the risk of the portfolio will
be less than the weighted average of the risks of the securities in the portfolio because
it is not likely that all the securities will perform poorly at the same time.
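The diversification effect can be illustrated with the standard two-security portfolio formula, which is not given in the text and is introduced here as an assumption: portfolio variance = (w1 × σ1)² + (w2 × σ2)² + 2 × w1 × w2 × ρ × σ1 × σ2, where w is the weight, σ the standard deviation, and ρ the correlation. The weights and risk figures below are hypothetical.

```python
import math


def portfolio_std_dev(w1, sd1, sd2, correlation):
    """Standard deviation of a two-security portfolio.

    w1 is the weight of security 1; security 2 receives the remainder.
    """
    w2 = 1.0 - w1
    variance = (
        (w1 * sd1) ** 2
        + (w2 * sd2) ** 2
        + 2 * w1 * w2 * correlation * sd1 * sd2
    )
    return math.sqrt(variance)


# Hypothetical inputs: equal weights, standard deviations of 10% and 20%.
w1, sd1, sd2 = 0.5, 0.10, 0.20
weighted_average_risk = w1 * sd1 + (1 - w1) * sd2

print(f"weighted average of the two risks: {weighted_average_risk:.2%}")
for correlation in (1.0, 0.3, 0.0, -1.0):
    risk = portfolio_std_dev(w1, sd1, sd2, correlation)
    print(f"correlation {correlation:+.1f}: portfolio risk {risk:.2%}")
```

Only at a correlation of +1 does the portfolio's standard deviation equal the weighted average of the individual standard deviations; for every lower correlation it is smaller.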
SUMMARY
The better your understanding of quantitative concepts, the easier it will be for you
to make sense of the financial world. Knowledge of quantitative concepts, such as
time value of money and descriptive statistics, is important to the understanding of
many of the key products in the financial industry. Understanding the time value of
money allows you to interpret cash flows and thus value them. Meanwhile, knowledge
of statistical concepts will help in identifying the important information in a large
amount of data, as well as in understanding what statistical measures reported by
others mean. It is easy to misinterpret or be misled by statistics, such as mean and
correlation, so an understanding of their uses and limitations is crucial.
- Interest is return earned by a lender that compensates for opportunity cost and
  risk. For the borrower, it is the cost of borrowing.
- The simple interest rate is the cost to the borrower or the rate of return to the
  lender, per period, on the original principal borrowed. A commonly quoted
  simple interest rate is the annual percentage rate (APR).
- Compound interest is the return to the lender or the cost to the borrower when
  interest is reinvested and added to the original principal.
- The present value of a future sum of money is found by discounting the future
  sum by an appropriate discount rate. (The present value of multiple cash flows
  is the sum of the present value of each cash flow.)
- All else being equal (in other words, only one of the three elements differs):
  - the higher the cash flows, the higher the present and future values.
  - the earlier the cash flows, the higher the present and future values.
  - the lower the discount rate, the higher the present value.
  - the higher the interest rate, the higher the future value.
- The net present value is the present value of future cash flows net of the
  investment required to obtain them. It is useful when comparing alternatives that
  require different initial investments.
- The arithmetic mean is the most commonly used measure. It represents the
  sum of all the observations divided by the number of observations. It is an easy
  measure to understand but may not be a good representative measure when
  there are outliers.
- The geometric mean return is the average compounded return for each
  period (that is, the average return for each period assuming that returns are
  compounding). It is frequently the preferred measure of central tendency for
  returns in the investment industry.
- When observations are ranked in order of size, the median is the middle value.
  It is not sensitive to outliers and may be a more representative measure than the
  mean when data are skewed.
- The mode is the most frequently occurring value in a data set. A data set may
  have no identifiable unique mode. It may not be a meaningful representative
  measure of central tendency.
- Measures of dispersion are important for describing the spread of the data, or
  its variation around a central value. Two common measures of dispersion are
  range and standard deviation.
- Range is the difference between the highest and lowest values in a data set. It is
  easy to measure, but it is sensitive to outliers.
- Standard deviation measures the variability of a data set around the mean of the
  data set. It is in the same unit of measurement as the mean.
- A distribution is simply the values that a variable can take, showing its observed
  or theoretical frequency of occurrence.
- For a perfectly symmetrical distribution, the mean, median, and mode will be
  identical.
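The time value of money relationships listed in this summary can be checked numerically. The sketch below assumes compounding once per period and cash flows that arrive at the end of each period; the function names, amounts, and rates are hypothetical and chosen only for illustration.

```python
def future_value(present_amount, rate, periods):
    """Value after `periods` periods of a sum invested today at `rate` per period."""
    return present_amount * (1 + rate) ** periods


def present_value(future_amount, rate, periods):
    """Value today of a sum received `periods` periods from now."""
    return future_amount / (1 + rate) ** periods


def net_present_value(initial_investment, cash_flows, rate):
    """Present value of the cash flows minus the investment made today.

    cash_flows[0] arrives one period from now, cash_flows[1] two periods
    from now, and so on.
    """
    total_pv = sum(
        present_value(cash_flow, rate, t)
        for t, cash_flow in enumerate(cash_flows, start=1)
    )
    return total_pv - initial_investment


# Hypothetical figures: 100 received in three years, discounted at 3% or 5%.
pv_low_rate = present_value(100, 0.03, 3)
pv_high_rate = present_value(100, 0.05, 3)

# The same 100 received after one year instead of three, at 5%.
pv_earlier = present_value(100, 0.05, 1)

# 100 invested today for three years at 3% or 5%.
fv_low_rate = future_value(100, 0.03, 3)
fv_high_rate = future_value(100, 0.05, 3)

# Paying 250 today for 100 at the end of each of the next three years, at 5%.
npv = net_present_value(250, [100, 100, 100], 0.05)

print(f"PV at 3%: {pv_low_rate:.2f}   PV at 5%: {pv_high_rate:.2f}")
print(f"PV if received after one year at 5%: {pv_earlier:.2f}")
print(f"FV at 3%: {fv_low_rate:.2f}   FV at 5%: {fv_high_rate:.2f}")
print(f"NPV at 5%: {npv:.2f}")
```

A lower discount rate or an earlier cash flow gives a higher present value, a higher interest rate gives a higher future value, and a positive net present value means the discounted cash flows exceed the initial investment.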