Adams S. Foundations of Physics
Adams S. Foundations of Physics
Adams S. Foundations of Physics
of
Physics
Second Edition
Original title and copyright: Principles of Physics. Copyright ©2022 by The Pantaneto Press. All rights
reserved. Published by The Pantaneto Press.
This publication, portions of it, or any accompanying software may not be reproduced in any way, stored in
a retrieval system of any type, or transmitted by any means, media, electronic display or mechanical display,
including, but not limited to, photocopy, recording, Internet postings, or scanning, without prior permission
in writing from the publisher.
The publisher recognizes and respects all marks used by companies, manufacturers, and developers as a
means to distinguish their products. All brand names and product names mentioned in this book are trade-
marks or service marks of their respective companies. Any omission or misuse (of any kind) of service marks
or trademarks, etc. is not an attempt to infringe on the property of others.
232425321 This book is printed on acid-free paper in the United States of America.
Our titles are available for adoption, license, or bulk purchase by institutions, corporations, etc. For additional
information, please contact the Customer Service Dept. at 800-232-0223(toll free).
All of our titles are available in digital format at authorcloudware.com and other digital vendors. The sole
obligation of Mercury Learning and Information to the purchaser is to replace the book, based on
defective materials or faulty workmanship, but not based on the operation or functionality of the product.
Prefacexxix
CHAPTER 1: THE LANGUAGE OF PHYSICS 1
1.0 Introduction 1
1.1 The SI System of Units 1
1.1.1 Derived Units 2
1.1.2 Energy 2
1.1.3 Viscosity 3
1.2 Dimensions 3
1.2.1 Method of Dimensions 4
1.3 Scientific Notation, Prefixes,
and Significant Figures 5
1.4 Uncertainties 6
1.4.1 Types of Uncertainty 7
1.4.2 Combining Uncertainties 7
1.5 Dealing with Random and Systematic Experimental Errors 8
1.5.1 Random Errors 9
1.5.2 Systematic Errors 9
1.6 Differential Calculus 9
1.6.1 Derivatives and Rates of Change 9
1.6.1.1 Second Derivatives 11
1.6.2 Maximum and Minimum Values 12
The aim of this book is to draw on the essential physical principles that typical
physics courses use to provide a strong conceptual base for the further study
of more advanced topics. As such this book provides support for both intro-
ductory courses (calculus-based) and for readers interested in a basic review
of key topics in physics. It will also be a useful reference work for instructors.
The focus is on physical principles. Applications are used to exemplify the
physics but do not divert attention from the underlying concepts. Mathematics
is the language of physics and a mathematical approach is taken throughout,
drawing mathematical techniques including basic calculus. The approach
here acknowledges this and helps to secure a foundation of relevant math-
ematical skills in the context of real physical problems.
Practical techniques, including the collection, presentation, analysis and eval-
uation of data, are discussed in the context of key experiments linked to the
theoretical spine of the work. There are also sections on testing mathematical
relationships, the analysis of uncertainties, and how to approach, carry out,
and write-up experimental investigations.
Every chapter concludes with a set of exercises and an appendix on Fermi
problems provides an open-ended challenge that allows the reader to practice
their skills in unfamiliar contexts.
topic is studied or revised. The early sections on the language of physics and
representing and analyzing data can be referred back to from any of the other
sections. The appendices contain summary lists of units and useful data, equa-
tions, solutions to problem and a guide to planning, carrying out and writing
up experiments. There is also an extensive glossary of terms.
S. Adams
February 2023
1
The Language of Physics
1.0 INTRODUCTION
NASA’s Mars climate orbiter was launched in 1998 and should have gone
into orbit around Mars 286 days later. Instead, it fell too close to the
planet and broke up in the atmosphere. The mission had cost upwards of
$100 000 000…… Why did this happen? Because Lockheed Martin, who was
calculating the thrust to maneuver the spacecraft, used English units (pound-
seconds) and NASA, who controlled the thrusters, was expecting metric units
(newton-seconds).
The definition of each base unit is related to the experimental method that
is used to establish the unit in the laboratory. All mechanical quantities can
be expressed in terms of just three base units: length (the meter), mass (the
kilogram), and time (the second).
Base unit Definition
Meter The distance traveled by light in a vacuum in a time of 1/299 792 458 seconds
kilogram In 2019 the kilogram was redefined in terms of the fixed value of the Planck constant and
is measured electromagnetically using a Kibble balance.
second The duration is equal to 9 192 631 770 periods of radiation corresponding to the
transition between the two hyperfine levels of the ground state of a cesium 133 atom.
You will notice that the meter is actually defined in terms of the speed of
light. This has been the case since 1983 when the speed of light, which had
been measured with ever-increasing precision, was defined to have the value
299 792 458 m s− 1.
When you encounter a new physical quantity that is unfamiliar you can work
out its SI units by making sure that an equation containing the new quantity
is balanced. All equations in physics must balance in terms of units. Here are
two examples.
1.1.2 Energy
Any equation involving energy will suffice, for example, E = ½ mv2. The units
of the right-hand side are kg m2 s− 2 so these must be the units of energy. The
name “joule” is a convenient alternative for such a common physical unit so 1
J = 1 kg m2 s− 2.
1.1.3 Viscosity
There is an equation for the viscous drag on a sphere moving through a fluid
called “Stokes’ law.” This has the form: F = 6πη rv, where r and v are the radius
and velocity of the sphere, respectively. η is the viscosity of the fluid. What are
the SI units of viscosity? First, rearrange the equation to give η = F/6πrv, and
then balance the units (6π has no units, it is simply a number). The right-hand
side has units of N m− 2 s or Pa s (this is because of 1 Nm− 2 = 1 Pa). This can
be reduced to base units by substituting for N (=kg m s− 2). The base units for
viscosity are therefore kg m− 1 s− 1.
1.2 DIMENSIONS
All mechanical quantities depend on mass (M), length (L), and time (T).
These are the fundamental dimensions of mechanics and are measured in
terms of the base units of kg, m, and s. All equations in physics must balance
in terms of numerical value, units, and dimensions. When they do they are
said to be “homogeneous.” This can be very useful for checking your work
during a derivation (if you made a mistake in the algebra then the dimensions
might not balance) and for testing proposed equations to see if they are viable.
It can also be used to construct possible equations if you have an idea of the
relevant parameters.
To indicate that we are dealing with dimensions we use square brackets
so that [E] means “the dimensions of” energy and [v] means “the dimensions
of” velocity. The fundamental dimensions are related in a similar way to base
units so it is possible to work them out by balancing simple equations. For
example, if we want to find the dimensions for energy, we can use the equa-
tion E = mgh. The dimensions of E will be the same as the dimensions of mgh:
If the dimensions are now replaced by base units, we see that the SI unit for
energy is kg m2 s− 2 as before.
Five basic dimensions can be used to express most quantities in physics.
Dimension Mass Length Time Current Temperature
Symbol M L T I θ
so that the result is a pure number. These often have (or seem to have) great
significance. The fine structure constant α in quantum electrodynamics is a
good example:
e2
=
α = 0.007 297 352 5664
2 0 hc
where e is the charge on an electron, ε0 is the permittivity of free space, h is
the Planck constant, and c is the speed of light. This dimensionless constant
determines the strength of the interactions between electrons and photons. It
is also, approximately, the speed of an electron in units of the velocity of light.
Since α << 1, the motion of electrons in atoms does not have to be treated
relativistically.
Another interesting dimensionless number is the ratio of the mass of a proton
to the mass of an electron:
mp
=
µ = 1 836.152 673 89
me
Physicists think that we should be able to derive numbers such as µ from a
fundamental physical theory. The fact that, so far, we have been unable to do
this suggests that there is more new physics to be discovered.
T = constant × mx × ky
These increase (or decrease) in multiples of 103 but there are some prefixes
that are in common use that do not fit this pattern: “centi” multiples by 10− 2
and “deci” multiples by 10− 1. In chemistry, it is quite common to state volumes
in decimeters cubed. One decimeter is 10 cm or 0.10 m so this is a volume of
1000 cm3 or 1 liter…. In SI base units, it is 0.001 m3.
When presenting data, it is important to use an appropriate number of signifi-
cant figures, even if some of these are zeroes. The number of significant figures
used represents the precision of the data, so a length of 1.20 m is more pre-
cise than 1.2 m. In principle, 1.2 m could have been rounded from anything
between 1.150 m and 1.249 m so has an uncertainty of ±0.05, whereas a length of
1.20 m must really lie between 1.195 m and 1.205 m, and uncertainty of ± 0.005.
Quoting the third significant figure has increased the precision by a factor of 10.
However, it is also important not to quote data to too many significant figures
if this is not justified by the measurements that were used to obtain the data.
For example, if you were trying to calculate the density of a block of wood and
had measured a mass of 40.5 g and a volume of 24.2 cm3 the value for density
is 40.5/24.2 = 1.673553719 g cm− 3. The result must be rounded off so that it
is consistent with the data used to calculate it. In this case, data were given to
three significant figures so an appropriate result is 1.67 g cm− 3.
A useful rule of thumb: quote calculated values to the same number of
significant figures as the least precise piece of data used in the calculation.
1.4 UNCERTAINTIES
In 2012, the discovery of the Higgs boson was announced at the Large Hadron
Collider at CERN. Physicists had tuned the collider to the energy range in
which the particle was expected to be found and sure enough, it appeared.
This allowed physicists to measure the mass of the Higgs: 125.09 ± 0.21 GeV/c2
(the GeV/c2 is a convenient mass unit used in particle physics). The quoted
uncertainty is less than 0.2% of the mass. This pins the mass of the Higgs into
a small enough range so that its properties can be compared with theoretical
predictions.
If the uncertainty in a measured value is too large it is not very useful. For
example, if you were asked to measure the acceleration due to gravity in a
laboratory and you got a value of 9.8 m s− 2 you might be quite happy and feel
that you had done a good job. However, if the uncertainty associated with that
value was ±1.0 m s− 2 then the acceleration due to gravity as measured in your
experiment could lie anywhere between 8.8 m s− 2 and 10.8 m s− 2 so the fact
that the actual value turns out to be close to the true value was probably just
by chance. If you repeated the experiment in the same way you would prob-
ably get a very different value.
Whenever you calculate a value from experimental data you should
include the estimated uncertainty. This is a measure of the reliability of
the measured value.
While a detailed analysis of uncertainties and their effect on calculated values
involves a lot of statistics there are some simple methods that will give a rea-
sonable estimate and that can be used quite easily.
While this method can be used there are also some simple ways in which
uncertainties can be combined mathematically. Here are some simple rules.
Combination Rule
Uncertainty in a sum: Add absolute uncertainties δy = δa + δb
y=a+b
Uncertainty in a product: δy δa δb
Add fractional uncertainties = +
y = ab y a b
δy δa δb
Uncertainty in a quotient: y = a Add fractional uncertainties = +
b y a b
Using the data from the example above (acceleration of free fall) and applying
the rules from the table we have:
g = 2s/t2 = (2 × 2.500)/(0.710)2 = 9.92 m s− 2
δg δs δt 0.005 0.020
= +2 = +2 =0.058
g s t 2.500 0.710
δg = 0.58 ms− 2
This gives a result of g = 9.92 ± 0.58 ms−2 (almost identical to the first
method).
charge
Gradient represents
current at this instant:
I = dQ/dt = gradient
time
Some common derivatives (rates of change) that often occur in physics are
listed in the table below.
dy
Function: y Derivative (rate of change):
dx
Constant value, for example, y = 8 0
dy
Power law, for example, y = Axn = nAx n−1
dx
dy
Exponential function: y = ex = ex
dx
dy
Exponential relationship, for example, y = Aebx = bAe bx
dx
dy
Sine function: y = sin x = cos x
dx
dy
Cosine function: y = cos x = − sin x
dx
dy
Sinusoidal variation, for example, y = A sin (bx) = bAcos ( bx )
dx
dy
Cosinusoidal variation: y = A cos (bx) = − bAsin ( bx )
dx
dv d ds
=
a =
dt dt dt
This is called the “second derivative” of displacement and is written like this:
d2 s
a=
dt 2
Whereas first derivatives are equal to gradients, second derivatives are related
to the sharpness of curvature of a graph (the rate of change of the gradient).
Acceleration is related to the sharpness of curvature of a graph of displace-
ment against time.
The maximum power Pmax occurs at some resistance Rpeak. This is where the
curve has a gradient of zero. However, gradients are equal to derivatives so we
can find the maximum value if we can find where the gradient is zero. To do
this, we need an equation for power in terms of the resistance, P(R). It turns
out that this is easy to find (see Section 18.5.1). Once we have the equation
we differentiate it, set it equal to zero, and solve the resulting equation to find
the value of R:
dP ( R )
=0
dR
To find a maximum or minimum of some function y(x):
dy
Differentiate to find the derivative ;
dx
dy
Set the derivative equal to zero: = 0;
dx
Solve for x.
d2 y
Find ;
dx2
d2 y
If is positive the gradient is increasing: minimum position;
dx2
d2 y
If is negative the gradient is decreasing: maximum position;
dx2
d2 y
If is zero the position is a saddle point, neither a maximum nor a
dx2
minimum.
y
maximum : gradient
decreasing through zero
d2y
negative
dx 2
minimum: gradient
increasing through zero
d2y
positive
dx 2 saddle point: gradient
d2y
stationary zero
dx 2
s = vt
=
s vt + k
Is also a solution.
ds d
= ( vt + k=) v
dt dt
=
s vt + s0
dN
Radioactive decay = −l t N = N0 e − l t N = N0 at t = 0
dt
dQ Q −
t
Capacitor discharge = − Q = Q0 e RC
Q = Q0 at t = 0
dt RC
velocity
t
v(t
time
t1 t t2
The area of the small shaded strip, v(t) δt, approximates to the extra displace-
ment during a short time δt at time t. The area between any two times t1 and
t2 is equal to the displacement during that time interval and is given, approxi-
mately, by the sum of the areas of all such strips between those two times:
t2
=s ∑v ( t ) δt
t1
This becomes a better approximation to the actual area if we take thinner and
thinner strips by making δt smaller and smaller. It would be a precise value
in the limit that δt approached zero: δt→0. In this limit, the sum becomes a
continuous process called an integral:
t2 t2
=s lim ∑
=v ( t ) δt ∫v ( t ) dt
δt → 0
t1 t1
The table below gives some derivatives and related integrals that will be used
in this book.
Context Differential form Integral form
ds
Dynamics v= s = ∫vdt
dt
dv
Dynamics a= v = ∫adt
dt
d ( mv )
Newton’s laws F=
dt ∫Fdt
mv − mu =
dQ
Electric circuits I= Q = ∫Idt
dt
dN dN
Radioactivity
dt
= −l N ∫ N
= − ∫ l dt
dQ Q dQ dt
Capacitors
dt
= −
RC
∫ Q
= −∫
RC
10 m
10 m
14 m
10 m + 10 m
= 14 m
A sin A
x
A cos
F
Scalar product =F.s = Fs cos
s
B B
v
v
The shape of the triangle is “invariant” under these rotations and reflections.
In Newtonian mechanics the laws of physics are the same in all uniformly
moving (inertial) reference frames, so Newton’s laws are invariant under
a change of velocity. Einstein’s theory of special relativity goes further and
includes all of the laws of physics (see Chapter 24). Hermann Minkowski real-
ized that Einstein’s equations for relativity were similar to those for a geo-
metrical rotation and identified physical quantities that are the same for all
inertial observers—these are four-dimensional quantities called invariants
and are constructed from space and time components.
Emmy Noether showed that symmetry principles are linked to conservation
laws. This is not really surprising because a conservation law identifies some
quantity that stays the same (is invariant) when other things change. For
example, the total linear momentum of a collection of colliding bodies is the
same before and after the collisions, and the total energy of the Universe is
the same before and after an explosion. Noether showed that the conservation
of momentum is linked to the laws of physics remaining the same under trans-
lation, the conservation of angular momentum is linked to the laws staying the
same under rotation, and the law of conservation of energy is linked to the
laws staying the same at all times. This link between mathematical symmetries
and conservation laws is a powerful idea in theoretical physics.
=
E2 p2 c2 + m0 2 c 4
When p = 0 we can write:
E2 = m 0 2 c 4
E = ± m0 c 2
1.11 EXERCISES
1. Express:
(a) 267 g in kg, (b) 25 km in mm, (c) 5.0 m3 in cm3,
(d) 80 km/h in m s−1, (e) 45 cm2 in m2
2. Light travels at 3.0 × 108 m s−1 and it takes about 8 minutes for light to
travel from the Sun to the Earth.
(a) How far away is the Sun in km?
(b) How far is a light year? Give your answer in meters.
The distance from the Earth to the Moon is 380 000 km.
(c)
How far is this in light seconds?
3. Round off the following to three significant figures:
(a) 2.000009 (b) 0.0020900 (c) 0.009502 (d) π
4. Newton’s equation for gravitational forces is F = Gm1m2/r2. Use the
method of dimensions to find the correct SI units for the universal gravi-
tational constant G.
5. The lengths of the sides of a rectangular box are measured using a ruler
marked with an mm scale. The measurements give side lengths of 85,
62, and 20 mm, respectively. The uncertainty in each measurement is
± 1 mm.
(a) Calculate the fractional uncertainty in the length of each side.
(b) Calculate the percentage uncertainty in the length of each side.
(c) What is the area of the largest face of the box? Include its absolute
uncertainty.
(d) What is the volume of the box? Include its absolute uncertainty.
6. The time period of a mass-spring oscillator is given by;
m
T = 2p
k
where m is the mass oscillating and k is the spring constant (N m-1)
In an experiment, the time period is measured to be 0.68 s± 0.04 s and the
spring constant is 20 N m-1 ± 2 N m-1
Calculate the mass of the oscillator including its absolute
(a)
uncertainty.
(b) Which value (k or T) contributed most to the uncertainty in m?
Explain the difference between a systematic error and a
7. (a)
random error.
(b) How would you reduce the size of random errors when carrying out
an experiment?
(c) A micrometer screw gauge reads 0.02 mm when it should read zero.
The diameter of a wire is measured to be 0.34 mm using the same
micrometer screw gauge. What value should be recorded?
8. Convert these numbers to scientific notation:
(a) 5500 (b) 0.0000000007
(c) 1 200 000 000 000 000 000 000 000
9. Evaluate the expressions below (do not use a calculator):
(a) 2.0 × 106 × 4.5 × 109 (b) 2.0 × 106 × 4.5 × 10−9
(c) 6.0 × 1012/3.0 × 104 (d) 6.0 × 1012/3.0 × 10−4
2
Representing and Analyzing Data
2.0 INTRODUCTION
Experiments generate data and it is important to know how to select, record
and process this data in order to find out what the experiment has revealed.
This is a huge problem for large experiments. According to the CERN
website, the four experimental stations on the Large Hadron Collider each
generate between 750 Mbs−1 and 4 Gbs−1 of data when the accelerator is oper-
ating! The task of selecting and analyzing relevant data is carried out by the
Worldwide LHC Computing Grid which has to reject the majority of results
in order to focus on those that might reveal interesting new physics. Writing
the algorithms for this system is extremely challenging.
When you carry out experiments in a laboratory you will not face such a
daunting task. However, presenting your data clearly and to an appropriate
precision, rejecting anomalous results, deciding how to process it, and then
extracting information from the processed data, is an essential part of physics.
dependent
variable
independent
variable
Note that, while the time period has been given to four significant figures
(consistent with the raw data) the uncertainty is likely to make one or more of
the final figures meaningless.
Spreadsheets (such as Excel) are often used to record and analyze data. They
have some strong advantages over simple written tables. The main one is that
the data, once it is recorded, can be processed within the spreadsheet. For
example, if you have calculated displacement and time values it is simple to
use the spreadsheet to calculate velocities or accelerations. Mean values and
standard deviations can also be calculated. Once the data has been processed
it can be selected and displayed graphically in a wide variety of ways (scatter
graphs, bar charts, pie charts, etc.). It is also possible to insert error bars, to
extrapolate a line backward or forward, and to fit a line or curve to plotted data
and display an equation for the fit.
Much of the analysis described above is done “behind the scenes” but when
you need to present your data in a report it is important to make sure that you
include the correct table headings (and units) and that you round data to an
appropriate number of significant figures.
y y y
x x x
Straight line through the Straight line with Straight line with
origin: y is directly positive intercept y negative intercept y
proportional to x increases linearly with x decreases linearly with x
Beware—a common mistake is to think that a linear graph implies direct pro-
portion. This is only the case if the graph passes through the origin.
It is also important to realize that a linear graph with negative gradient does
not imply inverse proportion.
Intercept gradient = m
at y = c
0 x
0
The two graphs below display the same data. However, the range
of y-values is from 92 to 98. The use of a false origin makes the
relationship clearer and would make calculation of a gradient
from the graph more accurate.
100 100
0 90
0
0 0
If the origin (0, 0) is included on the graph then mark it with a zero on the
scale on each axis.
Mark values onto each axis in equal simple intervals, for example, 0.0,
10.0, 20.0, 30.0 ….. or 0.02, 0.04, 0.06 …. etc… Avoid unusual or awk-
ward intervals such as 3, 5, 7 ….
Label both axes with the physical quantity and unit in the same way as if
you were putting in a table heading, for example, velocity/ms.
Mark each data point carefully with a cross or a point surrounded by a
circle using a sharp pencil. If you are plotting on millimeter graph paper
the points need to be within half a millimeter of the correct value.
Look carefully at the pattern of the data and decide whether any of the
points are anomalies. This means that they seem “out of place” compared
to the others. If possible, these data points should be checked by repeat-
ing the experiment. If not, they should be labeled and ignored when
drawing a best-fit line.
Look at the pattern of data and decide whether it is best represented by
a straight line or a smooth curve. Remember, this is experimental data, so
each point has a degree of uncertainty—this means that the line does not
have to pass through the points, it is there to represent the relationship
revealed by the data.
If the data is best represented by a straight line then use a ruler to draw it,
trying to balance approximately equal numbers of points on either side of
the line. It is also possible to find the gradient and y-intercept of a linear
graph directly from the data by using an algebraic method called linear
regression (omitting data from anomalous points).
If the data is best represented by a curve then draw a smooth curve, again
trying to balance the distribution of points on either side of the curve.
gradient
y2 y1
x2 x1
Don’t forget units! The units for the gradient are the units of y divided by
the units of x. For example, a graph of charge/C against time/s would have
units of Cs−1 or amps.
Here is an example of a gradient calculation using data for the extension of a
steel spring.
25
(6.0, 20.2)
20
extension /cm
15
10
(1.6, 5.0)
5
0
0 1 2 3 4 5 6 7
load / N
20.2 5.0
gradient of line 3.5 cmN1
6.0 1.6
2.4.3 Using a Spreadsheet Program (e.g., Excel)
Whether you are plotting a graph by hand or using computer software your
aims are the same—to represent the relationship between two variables accu-
rately and clearly and to extract any useful physical information or values from
the graph as accurately and precisely as possible. The great advantage of a
spreadsheet program is that once data has been stored it can be used to gener-
ate a graph at the click of a button. However, this can also lead to problems.
The default settings of the program will determine how the graph appears and
this may not be the best way to display this particular data set. The ease and
rapidity of creating a graph can also mean you forget to do basic things such as
ensuring the axes are labeled or labeled correctly, having sensible scales with
appropriate numbers of significant figures, etc.
Here are some questions to ask yourself when using a spreadsheet to produce
a graph and analyze your data:
Could I do a better job by plotting the graph by hand?
Do I want the program to fit a line or curve to this data or shall I print out
the graph once the points have been plotted and then draw the line in by
hand?
Does the graph have an appropriate heading?
Do I want to include gridlines and if so with what divisions?
Are the axes correctly labeled with quantity and unit?
Are the scales marked correctly and are they easy to read?
Are values shown with the correct number of significant figures?
Do I need to use a false origin?
Should the line be extrapolated back or forward?
Should the line go through the origin?
What kind of fit do I want to use (e.g., linear, exponential, polynomial, etc.)?
Do I want to include error bars?
Do I want the program to display a formula and if so how should it be
formatted?
How large should the graph appear in my report?
Remember: you should be in control, not the computer.
Once all of the error bars have been drawn onto the graph you can add a
trend line. If you are plotting a graph that is linear there will be a range of
possible lines that can be drawn that pass through all of the error bars. The
extreme lines (steepest and shallowest) are called the “worst acceptable” lines.
These can be used to find the range of possible gradients and the range of
possible intercepts.
The graph below uses the same data as in the previous example (for stretch-
ing a steel spring) but error bars (± 1 cm) have been added to the data for
extension. In addition to the original best fit, a “worst acceptable line” (WAL)
has also been drawn. This is slightly steeper than the best fit line so it gives a
greater value for the gradient:
24.5 1.2
gradient of worst acceptable line 3.6 cmN1
7.0 0.6
(7.0, 24.5)
25
20
15
extension /cm
10
5
(0.6, 1.2)
0
-1 0 1 2 3 4 5 6 7 8
-5
load / N
Using this value and the gradient of the best fit line gives:
2.5 LOGARITHMS
2.5.1 Logarithmic Scales and Logarithms
Many quantities in physics have values that spread over an extremely wide
range, so it is often convenient to represent them using a scale that increases
in multiples rather than equal amounts. Such a scale is called logarithmic, for
example,
1, 10, 100, 1000, etc.
1, 2, 4, 8, 16, etc.
Each step raises the power of some base quantity by 1:
100, 101, 102, 103, etc.
20, 21, 23, 24, etc.
The logarithm of any number to a particular base is the power that the base
must be raised to in order to get the number. For example:
Base 10: logarithm to base 10 of 1000 = 3 or log10 (1000) = 3.
Base 2: logarithm to base 2 of 8 = 3 or log2 (8) = 3.
Another common base is the number e. Logarithms to base e are called
“natural logarithms” and are written using the prefix “ln.”
Base e: logarithm to base e of 10 = 2.3026 or ln (10) = 2.3026.
The values of logarithms to base 10 and of natural logarithms can be found
directly from your calculator.
If you are working with logarithms you will also need to be able to find anti-
logarithms or inverse-logarithms. For example, if you know that the logarithm
to base 10 of some physical quantity is 5, you can find the value of the physical
quantity by raising the base to the power 5:
That is, if log10 (x) = 5 then x 10 log10 x 10 5 100 000
if log10 (x) = 2.3 then x 10 log10 x 10 2.3 199.5
Antilogarithms (or inverse logarithms) can be found directly from your
calculator.
1
doubles. To test for this, you can plot a graph of y against . The variables are
x
inversely proportional if the graph is a straight line AND it passes through the
origin (0, 0).
k
The relationship can be represented by y = where k is the gradient of the
1 x
graph of y against .
x
y = Axn
log(y) = log(A) + n log(x)
A graph of log (y) against log (x) should be linear with a gradient equal to n
(the power) and an intercept equal to log (A).
y = Aebx
ln (y) = ln (A) + bx
2.7 EXERCISES
3. When a thin convex lens is used to form the image of a bright object on
a screen the relationship between object distance from the lens u, image
distance from the lens, v, and the focal length of the lens, f is:
1 1 1
u v f
4. The data below shows how the pressure and volume of a gas change when
it is compressed isothermally (at constant temperature).
Pressure (kPa) 100 128 155 180 215 250 280 305 335 345
Volume (cm3) 35.5 28.0 22.7 19.6 16.0 13.5 12.0 11.0 10.0 9.5
Assume that the uncertainty in pressure measurements is ± 10 kPa and the uncertainty in volume
measurements is ± 1.0 cm3.
(a) Use a graphical method to show that Boyle’s law applies. Include
error bars.
(b) Find an equation to relate the pressure to volume and calculate the
value of any constants in this equation (stating their units and the
associated uncertainty).
5. The table below gives some data for the planets in the solar system.
Planet Av. Distance from the Sun Orbit time days (d) years (y)
(million km)
Mercury 58 88 d
Venus 108 ?
Earth 150 365 days
Mars 228 687 days
Jupiter 778 11.9 years
Saturn 1430 29.5 years
Uranus 2870 84 years
Neptune 4500 165 years
Kepler proposed that the orbital period of the planets T is linked to their
mean radius of orbit r by an equation of the form:
T = rn
(a) Use a graphical method to verify that such a power law is valid and
determine the value of n.
(b) Use your graph (or the equation) to predict the orbital period of
Venus (check online to see if your value is acceptable).
where I0 is the initial value of the current and τ is the “time constant” for
the decay (in seconds).
The table below shows how current falls with time when a particular
capacitor is discharged through a resistor.
Current (mA) 48 38 30 24 19 15 12 9 8 6 5
Time (s) 0 10 20 30 40 50 60 70 80 90 100
3
Capturing, Displaying, and
Analyzing Motion
3.0 INTRODUCTION
Kinematics is the study of motion. Dynamics is the study of how forces affect
motion. In this chapter, we focus on how motion is described in space and time
and how we can capture data from moving objects in order to display their
motion graphically. In Chapter 5, we will see how Newton’s laws describe how
motion is affected by unbalanced or resultant forces.
where ∆s is the change in displacement during a time interval ∆t. If the time
interval is small (but finite) this might be written:
s
average velocity
t
To obtain the instantaneous velocity we would need to take the ratio of δs to
δt in the limit that δt→0. This is the same as taking the derivative of displace-
ment with respect to time:
s ds
instantaneous velocity lim =
t 0 t dt
ds
or v =
dt
The table below lists symbols used to describe changes and rates of change.
∆x A change in x
δx A small change in x
d x
The rate of change of x
dt
Differentiate: displacement
v ds/dt
velocity is gradient of
displacement time
graph
time
velocity
Differentiate: Integrate:
a dv/dt
d
acceleration is
gradient of velocity displacement is area
time graph under a velocity time
graph
time
acceleration
Integrate:
d
time
velocity
(v u)
B
u
0 time
0 t
v u
Acceleration is the gradient of a velocity–time graph so: a
t 0
v u
a
t
v u at (suvat equation 1)
Displacement is the area under a velocity–time graph and this can be config-
ured in various ways:
u v t
(i) s =Area = A B ut ½ v u t
2
u v t
s (suvat equation 2)
2
(ii) If we use equation 1 we can eliminate v from equation 2:
1 1
s = Area ut u at u t ut at 2
2 2
1 2
s ut at (suvat equation 3)
2
(iii) Alternatively, we can use equation 1 to eliminate u from equation 2
leading to:
1
s vt at 2 (suvat equation 4)
2
(iv) Finally, we can use equation 1 to eliminate t from equation 2 leading to:
v2 u2 2 as (suvat equation 5)
1
Which gives: s ut at 2 (suvat equation 3).
2
The other equations can be derived from these by a series of substitutions.
These equations are valid for motion at constant acceleration.
25
20
vertical position/m
15
10
0
0 5 10 15 20
horizontal position/m
Each data point represents the position of the projectile at 0.10 s intervals.
Notice that the horizontal displacement in each 0.10 s interval is always the
same (0.80 m).
a projectile launched at any speed or direction the example below is for the
simple case of a projectile launched horizontally from a point at x = 0, y = 0
at t = 0.
Horizontal motion: x = ut where u is the initial horizontal velocity.
Vertical motion: y = vt − ½ gt2 where v is the initial vertical velocity.
Substituting for t in terms of x in the equation for y gives:
g 2 v
y 2 x ux
2u
Which has the form y ax2 bx c (a parabolic curve with c = 0).
The equation above is only valid if g is constant and there are no frictional
forces. This is approximately true for projectile motion over small distances
close to the surface of the Earth. However, when considering the motion of
rockets varying g and frictional forces must be considered.
a kx
d2 x
kx
dt 2
This can be solved to find equations for the displacement x(t) and velocity v(t)
for the oscillator (see Section 11.2.1).
To computer
reflector
Ultrasound
pulses Moving trolley
Datalogger
Ultrasound
transducer
object (e.g., a dynamics trolley) usually has a vertical card attached and the
card travels between the “jaws” of the light gate. The length of the card must
be measured and used as a parameter in the setup to enable to software to
calculate velocities.
The diagram below shows an enlarged view of a light gate and a typical experi-
mental setup.
light gate
to datalogger light gate
Light gate A Light gate B
Card
Light beam
If two light gates (A and B) are used, the following measurements can be
made and recorded:
Time at A and time at B (from starting the experiment)
Time to move between A and B
Velocity at A and velocity at B
Acceleration from A to B.
Bear in mind that if the object is accelerating the velocity measurements at
each light gate will be average values over the time taken for all of the card to
pass through the beam.
for slow-motion playback but more sophisticated cameras can record video at
several thousand frames per second.
Slow-motion cameras can also be used to make accurate measurements. For
example, if you are trying to measure the bounce height of a squash ball it is
difficult to judge this by eye. A slow-motion video of the bounce can be ana-
lyzed to locate the maximum height much more accurately.
A great deal of software is available for the analysis of videos of motion. One
of the best free packages is called Tracker. This is ideal for the analysis of
projectile motion or rotational motion. Once your video file has been loaded
into Tracker you can use the pointer to mark the position of the object you
wish to track and then advance the video a few frames at a time to mark sub-
sequent positions. This positional data is used by the software to generate
displacement, velocity, and acceleration data (and graphs) in two dimensions.
For rotational motion, you are able to track angular displacement and angular
velocity.
Here is a screen grab from an experiment to capture and analyze the motion
of a “magnus glider” a spinning object projected from top left of the image.
You can see the individual tracking points behind the object and the data table
and graph of the motion are on the right.
While these applications enable us to track and display complex motions they
do have limitations, and their accuracy will be limited by the quality of the
information put into them. Displacements in a video file are measured by the
number of pixels across the image, they are not absolute measurements. In
order to measure displacements in meters the user must calibrate the soft-
ware by identifying a known distance in the image. However, parallax effects
across the field of view can distort results so it is important for the experi-
menter to think hard about the measurements that are being taken.
3.7 EXERCISES
7. Two dragsters line up for a 500 m race. Car A accelerates at 4.0 ms−2 and
then maintains a constant maximum speed of 50 ms−1. Car B accelerates
at 5.0 ms−2 and then maintains its maximum speed of 45 ms−1. Which car
wins the race and by how much?
8. A cricket ball is bowled horizontally at a speed of 20 ms−1 from a height of
2.5 m above the ground.
(a) How far from the bowler does it first hit the ground?
(b) What is the maximum distance from the cricketer that the ball hits
the ground if he throws the ball upwards at the same initial speed at
an angle of 45° to the horizontal? Ignore air resistance.
9. Describe an experiment to measure the acceleration of free fall on Earth.
You should include: a labeled diagram of the apparatus; a list of measure-
ments to be taken including the instruments used to take these measure-
ments; an explanation of how you will maximize precision and accuracy;
an explanation of how you will process the data to get an accurate value
of the acceleration.
4
Forces and Equilibrium
N: normal
contact force
Block G: friction with
surface
Slope wi
w
with
th
surface
rough surfa
f ce
W: weight of
block
6.0 N
20
9.0 N
120
8.0 N
8.0 N
6.0 N
9.0 N
y 6.0 N
20º 9.0 N x
120º
8.0 N
Resolving parallel to the x-axis: Fx = 9.0 + 6.0 cos (20°) − 8.0 sin (30°) = 10.64 N
10.64 N x
4.88 N
Resolving parallel to the y-axis: Fy = 6.0 sin (20°) − 8.0 cos (30°) = −4.88 N
Resultant magnitude is found using Pythagoras’s theorem:
The angle between the resultant and the x-axis is found by:
An analytic approach like the one used here is preferable to scale drawing
because it will give a more accurate answer.
F mgravitational g mgravitational
a g g
minertial minertial minertiial
Einstein assumed that inertial and gravitational mass are equivalent. This
helped him to construct the general theory of relativity (see Section 23.6).
4.2.2 Weight
Weight is the gravitational force acting on a body and is a vector quantity. The
weight of a body depends on the strength of the gravitational field in which it
is placed and is given by the formula:
W = mg
where m is the mass in kilograms and g is the gravitational field strength meas-
ured in Nkg−1.
The gravitational field strength near the surface of the Earth is on average
about 9.81 Nkg−1 (standard gravity is 9.80665 Nkg−1) but varies by about 0.7 %
at different locations (from about 9.76 Nkg−1 on a mountain in Peru to about
9.83 Nkg−1 in Oslo).
Here are some values for gravitational field strength elsewhere in the solar
system.
Moon Sun Venus Mars Jupiter Saturn
Surface gravity (Nkg−1) 1.62 275 8.87 3.69 24.8 10.5
Surface gravity (gEarth) 0.165 28.0 0.904 0.376 2.53 1.07
CG CG CG
If an object is suspended from a point, it will align itself with the center of
gravity vertically below the point of suspension. This means that the loca-
tion of the center of gravity can be found by suspending the object sepa-
rately from two different points (A and B) and noting where the two lines
of suspension intersect inside the object. This is shown below for a two-
dimensional object.
Center of
A gravity where
B
both lines cross
A
B
The reason that the center of gravity lies beneath the point of suspension is
that its line of action then passes through the point of suspension and so has
no moment or turning effect at that point. If the object is rotated slightly so
that the center of gravity lies on either side of this line then there would be
a resultant moment causing the body to rotate back toward the equilibrium
position. Equilibrium of moments (see Section 4.4.4) can be used to calculate
the position of a center of gravity for an extended body. The idea is simple.
The resultant moment of the body about any point must be equal to the
moment produced by its weight acting through the center of gravity. From
this equality, we can find the distance of the center of gravity from the point
about which we are taking moments. Here is an example of a uniform rod of
length l and mass m. We take moments about one end of the rod.
A xCG
x x
mg
x
Mass of strip of length δx = m
l
mgxδx
Moment about A of strip of length δx =
l
l
mgxdx mgl
Moment of entire rod about A =
0
l 2
This must equal the moment of the weight, mg, acting through the center of
gravity, about A.
mgl
mgxCG =
2
l
so xCG = , the center of gravity is at the midpoint of the rod, as expected.
2
The terms “center of gravity” and “center of mass” are often used interchange-
ably, but they are only in the same position when the object concerned is placed
in a uniform gravitational field. If the field varies significantly across the object
then they will be in different positions. For example, for the rod above, if the
value of g increases from left to right along the bar then the contributions to
the resultant moment would be greater from strips near the right-hand end.
This would move the position of the center of gravity to the right of the center
of the bar, that is, xCG > l/2. The center of mass on the other hand would not
be affected and would remain at the center of the uniform bar. In practice, the
difference between the two positions is rarely significant.
T1 T2
T1
W
T2
W
If there are more than three forces in equilibrium then they will form a closed
quadrilateral of four or more sides.
Here is an example where the triangle of forces is used to calculate an unknown
force keeping the system in equilibrium. The diagram shows a pendulum held in
equilibrium by a horizontal force F. The problem is to find the magnitude of F.
25º
Triangle of forces
25º
F
8.0 N
8.0 N
From the triangle of forces: tan (25°) = F/8.0 so F = 8.0 tan (25°) = 3.7 N.
P .
Moment about P = Fd
The SI unit for moment is the Nm and the direction of the moment is usually
described as clockwise or counterclockwise about the pivot. In this example,
it is clockwise.
You might be tempted to think that this unit is equivalent to the joule, since 1
J = 1 Nm. However, when calculating a moment, the force and displacement
are perpendicular, whereas for work done they must be parallel.
Notice that d is the perpendicular distance from the pivot, not the distance
between the point where the force acts on the body and the pivot. This means
that we often have to resolve the force to calculate the moment. The example
below shows how to calculate the moment of an inclined force acting on a
uniform rod at a distance l from the pivot P.
This can be regarded as the magnitude of the force multiplied by the compo-
nent of l perpendicular to the line of action of the force (F × l sin θ) OR the
component of force perpendicular to l multiplied by l (F sin θ × l).
The term torque is often used in engineering or when studying rotational
dynamics. This is simply another name for a moment and is calculated in the
same way and measured in the same units.
If the line of action of a force acts through P its moment about P is zero.
0.7 m
0.3 m 0.5 m
5.0 N
8.0 N 3.0 N
12.0 N
4.4.3 Couples
When the line of action of a resultant force on a body acts through its center
of mass (CM) it causes a change in translational motion (e.g., an acceleration).
if it acts through another point in the body, it will change its translational and
rotational motions.
. . . .
F
CM CM CM CM
F
F
y
x
.
CM
d
Moment of a couple = Fd
The perpendicular distance of each line of action from the center of mass is x
and y, respectively. The
Moment of couple = Fx + Fy = F ( x + y ) = Fd
Notice that the moment of the couple is independent of the individual values
of x and y and depends only on their sum, in other words on the separation of
the two lines of action. This means that the moment of a couple has the same
value about any point in the body (or outside it).
Moment of a couple = magnitude of one force × perpendicular separation of
lines of action
300 N
28
hinge strut
The hood of a car is held open in equilibrium by a strut as shown above. The
problem is to find the force from the strut. However, there is also an unknown
force from the hinge. The easiest way to solve this problem is to take moments
about the hinge position because this immediately eliminates the force that
acts through the hinge (because it will have no moment about this position).
Taking moments about the hinge position:
Clockwise moment from weight of hood = 300 cos (28°) × 0.80 = 212 Nm
counterclockwise moment from strut force = S × 0.45 Nm
Applying the principle of moments: 0.45 S = 212 therefore S = 471 N
471 N
H
28
28 300 N
We could continue to solve for the unknown force H from the hinge by con-
sidering the equilibrium of forces. This gives us two more equations—one for
the horizontal components and one for the vertical components:
Resolving horizontally: H sin θ = 471 sin (28°) or H sin θ = 221 (1)
Resolving vertically: H cos θ + 471 cos (28°) = 300 or H cos θ = 116 (2)
Dividing (1) by (2) gives tan θ = 221/116 = 1.91 and θ = 62.3°
Substituting back into (1) gives H = 250 N
When coplanar forces act on an extended structure (as in the example above)
the principle of moments gives one equation and the equilibrium of forces
gives two more, so with three independent simultaneous equations, it is pos-
sible to solve for up to three unknowns.
4.5 STABILITY
4.5.1 Types of Mechanical Equilibrium
When a system is in equilibrium the forces and moments acting on it are bal-
anced. However, if the system is disturbed it might return to equilibrium or
depart from it. The behavior of the system when disturbed is determined by
its stability. Consider the three objects below, all of which are in equilibrium
and all of which are given a small clockwise displacement from equilibrium.
A B C
A B C
A B C
Small Larger
displacement displacement
W W W
The limit of stability is reached when the line of action of the weight passes
beyond the corner of the base which acts as a pivot. The resultant moment
then changes from counterclockwise to clockwise and when the object is
released it continues to rotate and falls over. The potential energy curve looks
something like this:
Potenal
energy
angular
displacement
The dotted lines represent the limits of stability. These will move farther apart
if the object has a wide base and low center of gravity.
(e.g., when a car is parked on a hill frictional forces act up the hill). We will
discuss fluid friction when we consider viscosity (see Section 6.3) but here we
will concentrate on the frictional force between two surfaces in contact with
one another.
Applied
Friconal Force
Force
temporary bonds
Frictional force
limit of static
gradually increasing friction
frictional applied force
dynamic
force friction
Rough surface
Applied force
The “limit of static friction” is the maximum frictional force acting between
the two surfaces when they are at rest. To move the block an applied force
greater than this limit must be used. Once the block moves and the surfaces
are sliding over one another the friction drops to a lower dynamic value.
Frictional fo
fforce
rce F M Pulley
Mg
m
Masses are added to the hanger until the block just begins to slip. The limit of
static friction is then mg. The normal reaction is N = Mg.
Therefore mg = µSMg
so µS = m/M.
M
gradually lifted
until block
starts to slip.
angle at which
w ich block
wh
just begins to slip
Here is a free-body diagram for the block when the limiting angle after which
the block just begins to slip is reached.
Flimit = SN
W = Mg
At the limiting angle, the block is in equilibrium, so we can resolve forces
parallel and perpendicular to the surface:
Resolving parallel to the surface: µSN = Mg sin θ(1)
Resolving perpendicular to the surface: N = Mg cos θ(2)
Dividing (1) by (2) gives: µS = tan θ
The coefficient of static friction is equal to the tangent of the limiting
angle.
4.7 EXERCISES
1. In which of the following situations (if any) are the forces acting on a man
in equilibrium?
Lying still in his bed.
Sitting in a car seat when the car is traveling at constant velocity along
a motorway.
Standing in a lift that is moving upwards at constant velocity.
Floating is apparently weightless inside an orbiting spacecraft.
Floating in a swimming pool.
2. Calculate the magnitude and direction of the resultant force acting on
each block in the free-body diagrams below:
5.0 N
3.0 N 6.0 N
60
30
6.0 N 4.0 N
1.0 N
(a) (b) 3.0 N
40 cm
15 cm
30 cm
0.30 m
0.15 m
1.0 N
5. The diagram below shows a paving slab of mass 300 kg resting on two
wooden supports, A and B. Assume that the center of gravity of the slab is
at its geometric center and that the forces between the supports act at the
centers of their areas of contact with the slab.
2.00 m
0.30 m 0.50 m
A B
(a) Calculate the coefficient of static friction between the block and the
plane.
(b) Calculate the coefficient of kinetic friction between the block and
the plane.
(c) Sketch a graph to show how the frictional force between the block
and the plank varies from the time the plank is first lifted until
the block reaches the bottom of the plank. Include values on the
force axis.
5
Newtonian Mechanics
5.0 INTRODUCTION
Newton’s masterwork, Philosophiæ Naturalis Principia Mathematica
(“Mathematical Principles of Natural Philosophy”) is probably the most
famous and important book (actually three books) in the history of physics. It
sets out the laws of motion and the law of gravitation and then applies these
laws to the motion of planets in the solar system. It also marks the beginning
of mathematical physics. This is not because mathematical arguments had not
been used in physics before Newton but because Newton provided a math-
ematical framework in which to tackle an enormous range of physical prob-
lems. Newton’s work had, and still has, a remarkably wide impact on science
and philosophy and is even used as a model in other apparently unrelated
disciplines (e.g., economics). If you want to be good at physics you need to be
good at mechanics.
the U (A below). Now let the ramp on the far side slope up more gradually.
We would expect the ball to travel further along the ramp until, once again,
it reached its starting height (B below). Now let the ramp on the far side
continue horizontally, what will happen to the ball? Logically it seems it must
continue to move at a constant velocity until such time as the ramp rises back
up again (C below). Galileo argued that this showed that there is no need for
an unbalanced force to keep things moving at constant velocity.
Newton’s First Law:
An object continues to remain at rest or move at constant velocity (in a straight
line) unless acted upon by a resultant force.
Comments
Most moving objects are acted upon by many forces (e.g., thrust, grav-
ity, contact forces, drag, etc.). If they are moving at constant velocity the
resultant of all these forces must be zero.
If we know that the forces acting on a particular object are in equilibrium,
we cannot assume it is at rest, it might be moving at constant velocity.
If an object is accelerating, decelerating, or changing direction the forces
acting on it must be unbalanced, there is a resultant force. An example is
an object moving at constant speed along a curved path: since the speed is
unchanging there is no force component parallel to the motion but there
must be a component of force perpendicular to it in order to cause the
change of direction.
animals. Have a large bowl of water with some fish in it; hang up a bottle that
empties drop by drop into a wide vessel beneath it. With the ship standing
still, observe carefully how the little animals fly with equal speed to all sides
of the cabin. . . . When you have observed all these things carefully (though
doubtless when the ship is standing still everything must happen in this way),
have the ship proceed with any speed you like, so long as the motion is uniform
and not fluctuating this way and that. You will discover not the least change
in all the effects named, nor could you tell from any of them whether the ship
was moving or standing still.
Dialogue Concerning the Two Chief World Systems, translated by Stillman
Drake, University of California Press, 1953, pp. 186–187 (Second Day).
This was a profound observation. Einstein realized that if the laws of phys-
ics are the same in stationary and uniformly moving reference frames
then there is no fundamental difference between rest and motion, it just
depends on what you choose as your reference frame. It led Einstein to
the special theory of relativity (see Chapter 24) but in Galileo’s time, it
served a different purpose. Galileo was convinced that the Earth orbited
the Sun rather than the other way around as was believed by most people
at the time. They thought that it was obvious that the Earth was not moving
because we cannot feel the motion. Galileo’s thought experiment showed
that you would not expect to feel the motion – everything would happen
on Earth as if it was at rest so the argument against the Earth’s motion was
flawed.
Galilean relativity is the idea that:
The laws of mechanics are the same in all uniformly moving reference
frames.
experiment that showed that they should fall at the same rate. It is simple but
compelling and it goes something like this.
?
According to Aristotle the combined object should fall faster than either of
the original objects because it is more massive. However, shouldn’t attach-
ing the slower smaller mass to the larger faster mass slow the larger mass
down and shouldn’t attaching the larger mass to the smaller mass speed the
smaller mass up? This argument suggests that the composite body should
have a speed intermediate between the speeds of the small mass and large
mass alone. Aristotle’s idea leads to a contradiction. On the other hand, if all
objects fall at the same rate then there is no problem.
Galileo is said to have tested this idea by dropping two cannon balls of differ-
ent sizes from the top of the tower of Pisa and showing that they landed at the
same time. Many historians doubt that he actually did this but, in 1971, during
the Apollo 15 mission to the Moon, a hammer and a feather were dropped to
the Moon’s surface. They landed at the same time.
Newton’s second law can explain this. The resultant force on an object of
mass m in a gravitational field of strength g is its weight, mg. Using the equa-
tion above:
F mg
=
a = =g
m m
gravitational field strength is equal to the acceleration of free fall in that field.
Comment
It seems obvious that we can cancel the two m’s in the equation above.
However, if we think about this a little more deeply it is not so obvious that
they are the same thing. The m on the bottom of the equation is the “iner-
tial mass,” the m on the top is related to how strongly the mass responds
to a gravitational field, sometimes called the “gravitational mass.” They
are two different properties of mass, so the fact that all objects fall with
the same acceleration in the same gravitational field shows that they are
at least proportional to one another and possibly identical. This subtle
point was one of the clues that helped Einstein with his general theory of
relativity, a new theory of gravity.
1. The gravitational forces on the Earth and Moon are an “action–reaction” pair.
2. When a ball rests on the ground there are two “action–reaction” pairs.
Notice that while it is true that the weight of the ball and the upward contact
force from the ground have equal magnitudes and act in opposite directions,
they do not form an action-reaction pair. They fail in three respects: they are
not the same type of force, they do not act on different bodies and they are
not part of the same interaction.
3. When a ball rests on an accelerating surface the contact force and the
weight are not equal. While each action-reaction pair remains balanced
there is now a resultant upward force on the ball, as shown by the free-
body diagram on the right in the diagrams below.
Ground Ball
accelerang accelerang
upwards upwards
Gravitaonal
force from
Earth on ball.
Gravitaonal
Gravitaonal force from Earth
force from on ball.
Contact force from ball
ball on Earth.
acng on the ground
v u mv mu change in momentum
F ma m
t t time
The statement and equation (F = ma) in Section 5.1.3 is a special case, for
constant mass, of the general equation given here.
mv mu
F so Ft (mv mu)
t
will exert an equal but opposite force on the other and these forces will act in
opposite directions along the same line and will act for the same time.
Taking positive values in the x-direction the impulse on each body during a
short time dt is given by:
Impulse on A = FB on Adt
Impulse on B = FA on Bdt = - FB on Adt
During the interaction, the impulse given to each body, calculated from the
integral ∫Fdt , is also of equal magnitude but opposite in direction. Since
impulse is equal to change in momentum, the change in momentum of A is
equal and opposite to the change in momentum of B. The momentum change
of the complete system is zero:
t t
Change of momentum of system FA on Bdt FB on A dt 0
0 0
Linear momentum is conserved in an interaction between two bodies:
Newton’s third law tells us that all forces arise as a result of interactions so the
argument above will apply many times over for a complex system of interacting
bodies, as long as we include all the pairs of forces. This can be stated as a law:
The Law of Conservation of Linear Momentum
The linear momentum of a closed system is constant.
Comments
A “closed system” means that we include all pairs of forces within the
system. Another way of saying this is that the linear momentum of a
system is constant if no external resultant force acts on the system.
Linear momentum is a vector quantity, so the conservation of linear
momentum implies a separate conservation of each component of linear
momentum.
An object moving along a curved path has a continuously changing
momentum (it is changing direction even if its magnitude is constant).
The sum of the vector momenta after the collision must equal the total vector
momentum before the collision. Solving the triangle using trigonometry is
equivalent to taking components and deriving the equations (1) and (2) above.
the water could be brought to a boil. This suggested that the motion involved
in boring the cannons had produced thermal energy.
Joule realized that there is a mechanical equivalent of heat, in other words,
the book does work as it comes to rest and this transfers its kinetic energy to
thermal energy in the surroundings. Joule’s work set the stage for the idea that
energy is conserved. However, energy comes in a variety of different forms,
and while, it cannot be created or destroyed it can be transferred. When the
book stops moving, its kinetic energy has been transferred to thermal energy
which spreads into the surroundings.
5.3.1 Work
Work is the transfer of energy when the point of application of a force moves
in the direction of the force:
Work done (J) = Force applied (N) × displacement parallel to force (m)
The SI unit of energy is the joule (J).
1 joule of energy is transferred when a force of 1 N moves through 1 m.
(1 J = 1 Nm).
If the force is at an angle to the displacement then the component of force
parallel to the displacement must be used.
force / N
displacement / m
And if we let the small increments in x tend to zero (dx ® 0) this sum becomes
a continuous integral:
x b
Wab F x dx
x a
force needed to lift the mass is always equal to mg since the applied force
must balance the weight of the mass. In the absence of frictional forces, there
is no horizontal component of force to consider.
Work done lifting mass vertically: W = mgh
Work done along the inclined path: W = mg (h/cos q) cos q = mgh
The work done is independent of the path taken and depends only on the
height through which the mass is lifted. If the mass is allowed to fall back to
the ground it will gain kinetic energy which could be used to do some useful
work (e.g., generating electricity). Lifting it in a gravitational field has given it
the potential to do work.
Gravitational potential energy (GPE) is the potential energy an object has
because of its position in a gravitational field.
In a uniform gravitational field change in GPE is given by: DGPE = mgh
If a mass is moved around a closed loop inside a gravitational field (any gravi-
tational field, not just a uniform field) the work done by an external agent in
lifting it is equal to the work done by the gravitational field as it comes back
down. If it is returned to its original position the work done in the loop is zero.
This is an example of what is called a “conservative field.” Gravitational fields
are conservative fields.
In a short time dt the work done on the mass is given by: dW = Fds = Fvdt
v v
Using: F ma m : W m v t mv v
t t
Taking the limit of dt®0 and integrating between initial and final velocities
gives:
v
1 2 1
W mvdv mv mu2
u
2 2
The work done is equal to the change in kinetic energy so: KE = ½ mv2
It is also possible to derive this relationship by considering a constant result-
ant force acting on a constant mass to produce a constant acceleration. The
displacement during acceleration is given by:
s
v 2
u2
2a
m v2 u 2
mas
2
has been transferred to the kinetic energy of the new nucleus formed in the
decay and the emitted alpha particle.
The efficiency of a transducer is the ratio of the useful output energy to the
total input energy, usually expressed as a percentage. In the case of the com-
pact fluorescent light above this would be an efficiency of 0.40 or 40%:
useful output energy
efficiency= ×100%
total input energy
5.3.7 Power
Power is the rate of transfer of energy.
energy transferred
Power=
time
dE
For continuous energy transfers: P =
dt
The SI unit for work is the watt (W) and 1 W = 1 Js-1.
Be careful not to confuse W for watt (a unit) with W for work (a physical
quantity).
If the energy is transferred as work this is the rate of doing work. If work W is
dW d Fs
done in time t the power will be P
dt dt
ds
=
If the force is constant this becomes: P F= Fv
dt
This is the scalar product of force and velocity: P = F.v
Efficiency can also be expressed in terms of power:
useful output power
efficiency= ×100%
total input power
Fossil and nuclear fuels are “non-renewable energy sources.” Once we use
them, they are not regenerated, and eventually, they will run out. However,
some non-renewable fuels, for example, those used for nuclear fusion, would
be capable of providing energy for millennia. Unfortunately, we have not yet
solved the technical problems associated with building an effective working
nuclear fusion reactor. However, a huge international research reactor, ITER,
is being constructed in France as a major step toward a commercial fusion
reactor and it is hoped that this will produce its first plasmas in the mid-2020s.
Renewable energy resources are those that are naturally regenerated in a short
period of time.
The force exerted on the jet engine will be equal in magnitude to the rate of
change of momentum of the gas passing through the engine. While it is true
that the combustion of fuel inside the engine adds mass to the air passing
through it this is in practice a very small contribution so we can simplify the
analysis by assuming that the mass flow in and out of the system is solely due
to the air. This means that the mass flow rate in and out of the jet is the same.
If u is the air speed of the aircraft and v is the speed of the exhaust gases then
the thrust will be given by a simple formula:
dm
F v u
dt
Clearly, v must be greater than u for the jet to produce a forward thrust.
We can also see that there are two ways to increase the thrust:
Increase the mass flow rate (dm/dt).
Increase the exit velocity of the gases (v - u).
Different types of jet use different methods.
5.5.2 Rockets
A jet could not work in a vacuum because there would be no external material
against which to push. Rockets get around this problem by ejecting a large
amount of matter (in the form of burnt fuel) at very high velocity. The change
in momentum of the ejected matter is equal and opposite to the change in
momentum of the rocket. Once again, we can derive an equation for the
thrust of a rocket using Newton’s second law, but his time we are dealing with
an object of changing mass:
d mv
F
dt
If the rocket is in space with no external forces acting upon it then we can use
conservation of momentum to show that the change of velocity of the rocket
depends on the proportion of its mass expelled as burnt fuel. Consider a short
time dt during which the rocket expels a mass dm at a velocity u relative to the
rocket. At this time the rocket has a forward velocity v and a mass m.
leading to:
m
v f v0 u ln 0
mf
The second condition requires that the final mass of the rocket is small com-
pared to its initial mass, so the majority of the rocket’s mass at launch will be
in its fuel.
where I is the intensity (number of photons per second ´ photon energy). This
will be equal and opposite to the force exerted on the surface (by Newton’s
third law).
When radiation is reflected from a surface (e.g., light from a mirror) the force
is doubled because the momentum change is doubled for each photon (from
a positive value to an equal negative value). While the radiation pressure from
ordinary light sources on human-sized mirrors is tiny (less than 10-6 Pa for 100
W of radiation falling onto a mirror of area 1 m2) it has been suggested that
high-intensity laser beams directed from Earth might be able to accelerate
reflective micro spacecraft up to very high speeds so that they can make trips
to the nearest stars within human lifetimes.
would be very different if they were measured with respect to the Sun or the
Moon. Galileo used a thought experiment to show that the laws of mechanics
are the same in all uniformly moving reference frames:
Shut yourself up with some friend in the main cabin below decks on some
large ship, and have with you there some flies, butterflies, and other small
flying animals. Have a large bowl of water with some fish in it; hang up a
bottle that empties drop by drop into a wide vessel beneath it. With the ship
standing still, observe carefully how the little animals fly with equal speed to
all sides of the cabin. The fish swim indifferently in all directions; the drops
fall into the vessel beneath; and, in throwing something to your friend, you
need throw it no more strongly in one direction than another, the distances
being equal; jumping with your feet together, you pass equal spaces in every
direction. When you have observed all these things carefully (though doubt-
less when the ship is standing still everything must happen in this way), have
the ship proceed with any speed you like, so long as the motion is uniform and
not fluctuating this way and that. You will discover not the least change in all
the effects named, nor could you tell from any of them whether the ship was
moving or standing still. In jumping, you will pass on the floor the same spaces
as before, nor will you make larger jumps toward the stern than toward the
prow even though the ship is moving quite rapidly, despite the fact that during
the time that you are in the air the floor under you will be going in a direction
opposite to your jump. In throwing something to your companion, you will
need no more force to get it to him whether he is in the direction of the bow
or the stern, with yourself situated opposite. The droplets will fall as before
into the vessel beneath without dropping toward the stern, although while the
drops are in the air the ship runs many spans. The fish in their water will swim
toward the front of their bowl with no more effort than toward the back, and
will go with equal ease to bait placed anywhere around the edges of the bowl.
Finally, the butterflies and flies will continue their flights indifferently toward
every side, nor will it ever happen that they are concentrated toward the stern,
as if tired out from keeping up with the course of the ship, from which they
will have been separated during long intervals by keeping themselves in the
air. And if smoke is made by burning some incense, it will be seen going up in
the form of a little cloud, remaining still and moving no more toward one side
than the other. The cause of all these correspondences of effects is the fact that
the ship’s motion is common to all the things contained in it, and to the air
also. That is why I said you should be below decks; for if this took place above
in the open air, which would not follow the course of the ship, more or less
noticeable differences would be seen in some of the effects noted.
If this could occur, the photon would take away the energy and momentum of
the electron-positron pair. However, this is in fact impossible, as can be seen,
if we transform to the center of the mass frame by subtracting u/2 from the
electron and photon:
The momentum before annihilation is zero since the electron and positron
have equal mass but opposite velocities. This means that a single photon would
have to carry away energy but have no momentum. We have seen (Section
5.5.3) that photons have momentum p = E/c so this is impossible. In reality,
This allows energy to be carried away and the total momentum to remain
zero. The creation of a pair of identical photons when an electron and posi-
tron annihilate is utilized in PET scanners (see Section 29.5).
The laws of Newtonian mechanics are invariant (do not change) under
Galilean transformations. These transformations assume that physics takes
place against a background of absolute space and absolute time which are the
same for all observers. However, early in the 20th century, Albert Einstein
realized that the laws of electromagnetism are not invariant under Galilean
transformations. This realization ultimately led him to the special theory of
relativity which postulates that the laws of physics should be the same in all
inertial reference frames. The only way that this could be true was if measure-
ments in space and time were all relative and not absolute. This is explored
further in Chapter 24.
The dot above the symbol represents differentiation with respect to time
dq
(e.g., q = )
dt
Once the Lagrangian for the system is known we can derive the equation of
motion by using the Euler-Lagrange equation:
L L
t q q
1 2 1
1. T = mx and V = mgx so L = mx 2 = - mgx
2 2
L
2. LHS of Euler-Lagrange equation: mx
t x t
L
RHS of Euler-Lagrange equation: mg
x
These are equal: mx mg
t
Leading to: x g
This is hardly surprising. It shows that the object accelerates in the - x direc-
tion with an acceleration of magnitude g and that this acceleration is inde-
pendent of the mass. This is exactly the same result as if we had started with
the forces and used F = ma. In fact, as you can see, the Euler-Lagrange equa-
tion generates this equation in the second line of (2).
5.8 EXERCISES
6
Fluids
6.0 INTRODUCTION
The particles inside a solid vibrate at fixed positions unless the material is
placed under extreme stress. Particles inside a fluid however can move past
each other and change position, this allows them to flow when stresses are
applied to the fluid. Liquids and gases are fluids and their behavior can be
modeled using Newton’s laws.
Here are some key ideas used to describe the behavior of fluids:
Density (kgm-3):
r = m / V where m is the mass of the fluid (kg) and
V is the volume occupied by the fluid (m3).
Pressure (Pa): p = F/A where F is the force (N) exerted by the
fluid perpendicular to an area A (m2). 1 Pa = 1Nm-2.
Pressure in a fluid acts in all directions and the pres-
sure at the same level in a static fluid is constant.
Shear stress (Pa): When two parallel layers of area A are pulled in
opposite directions by a force F the shear stress act-
ing on the layers is s = F/A.
Incompressible fluids: Liquids such as water do not compress easily so a
useful model assumes that they have constant vol-
ume and so their density is constant.
Viscosity: When one layer of a fluid moves over another nearby layer
a frictional force between the layers opposes the flow. The
greater the resistance to flow, the greater the viscosity of
the fluid.
Inviscid fluid: For situations in which the viscous forces are negligi-
ble, or for fluids with very low viscosity, a useful model
assumes that the viscosity is zero. Such a fluid is said to be
“inviscid.”
Ideal fluid: The simplest model of a fluid is one which is incompress-
ible and has zero viscosity, this is called an ideal fluid.
Water can often be treated as an ideal fluid.
Ideal gas: An ideal gas is one that obeys the equation of state, pV =
nRT where p is pressure, V is volume, n is number of moles,
T is temperature in kelvin and R is the molar gas constant.
F = - Adp = rAgdz
The negative sign is because the pressure is greater lower down (p decreases
as z increases).
For thin layers this gives a pressure gradient:
dp
g
dz
If the fluid is considered incompressible its density is constant and the expres-
sion above can be integrated to give the excess pressure at the base of a col-
umn of height h caused by the weight of the column.
p0 0
Excess pressure = p0 ph p gdz gh
ph h
This result can also be derived by simply calculating the total weight of the
column and dividing it by the area of the base: p0 = rAgh/A = rgh. However,
the approach above can be used when the density of the fluid changes with
depth, for example, to derive an expression for the atmospheric pressure at
altitude h.
Using molar mass: M = nNAm (NA is the Avogadro number (6.02´1023) we can
express density in terms of pressure:
r = NAmp/RT = mp/kT k is the Boltzmann constant (k = R/NA).
We can now substitute this expression for density into the equation for hydro-
static pressure gradient in a fluid:
dp mgp
g
dz kT
p mg
ln x x
p0 kT
mg
x
px p0 e kT
this manometer must be equal because these points are at the same level in the
same fluid. However, the pressure at B is also equal to the pressure p0 plus the
pressure caused by the column of fluid BC of height Dh.
pA = p0 + Dp = pB = p0 + rgDh
Dp = rgDh
The lower the density of the liquid used, the greater the sensitivity of the
manometer (i.e., the greater the change in height per unit change in pressure).
6.1.4 Barometers
Imagine a manometer with one end open to the atmosphere and the other
end attached to an effective vacuum pump. The pressure difference is equal
to atmospheric pressure pAt so the height of the column can be used to meas-
ure atmospheric pressure. This is the principle of the barometer. However,
instead of connecting one end of a manometer to a pump, one end is sealed
and the fluid is allowed to fall away from the sealed end so that a vacuum
forms above it.
Pressure at A is equal to the pressure at B because both points are at the same
level in the same liquid. Since A is on the surface of the liquid exposed to the
atmosphere the pressure at both points must be atmospheric pressure.
6.1.5 Dams
The design of a dam must take into account all the forces that act on the struc-
ture. The most important of these is caused by hydrostatic pressure from the
trapped water. In a simple case, we can assume that this is the only force on
the dam and that the containing wall of the dam is vertical.
Pressure increases linearly with depth so the total horizontal force on the con-
taining wall will be equal to the average excess pressure (rgh/2) multiplied by
the area in contact with the water (A = hl, where l is the horizontal length of
the dam wall):
F = ½ rglh2
The line of action of this force is at a height h/3 from the base of the dam.
In reality, the situation is more complicated than this.
There could be a depth of water on the downstream side.
The containing wall might not be vertical.
There will be a hydrostatic pressure gradient under the dam because
water will penetrate the soil and rocks.
In addition to hydrostatic forces engineers must also consider the forces from
wind, seismic activity, and ice (if the water freezes). The dam must remain in
equilibrium under all possible conditions.
This result was derived using a rectangular object but is valid for an object of
any shape. This can be understood by considering the buoyancy force on each
small vertical column of material inside the object. For each column, we can
apply exactly the same reasoning as used for the rectangular block so the total
buoyancy force is always equal to the weight of the fluid displaced regardless
of the shape of the block.
Using the same reasoning as in Section 6.2.1, the contribution to the buoy-
ancy force from one narrow column will be dB = rghdA. This is equal to the
weight of fluid displaced by the volume of the column. The total buoyancy
force on the object will be the sum of forces on all vertical columns:
B gh A
all columns
This is the weight of fluid displaced by the total volume of the block as stated
in Archimedes’ principle.
6.2.3 Flotation
An object will float if the buoyancy force can support its weight. The maxi-
mum buoyancy force is when the object is completely submerged so, for an
object of volume V and average density robj to float in a fluid of density rfluid:
An object will float if its density is less than the density of the fluid in
which it is placed.
If an object’s density is equal to the density of the fluid in which it is sub-
merged it is said to have “neutral buoyancy.” Divers and submarines use neu-
tral buoyancy to remain at the same depth under water.
6.3 VISCOSITY
6.3.1 The Coefficient of Viscosity
A fluid with high viscosity is very resistant to shear. This means that a relatively
large shear stress is required to move one layer over another. This can be
understood by thinking about the flow of a fluid close to a boundary. Particles
in contact with the boundary are assumed to be at rest because of interactions
with the surface whereas those far from the boundary will be flowing with the
same speed as the body of the fluid. There is a velocity gradient close to and
perpendicular to the boundary, as shown in the diagram.
z
Fluid flow
direcon v(z)
moving layers
staonary layer
The greater the viscosity the smaller the velocity gradient dv/dz for the same
shear stress.
The coefficient of viscosity h is defined as the ratio of shear stress to velocity
gradient:
F
A
d
v
dz
Two regimes exist because there are two competing effects in the moving
fluid: inertial forces related to the density and speed of motion of the fluid,
and viscous forces related to the viscosity of the fluid and inversely to the
physical size of the channel in which the fluid is flowing (e.g., the diameter
of the pipe). If viscous forces dominate, eddies cannot form and the flow is
laminar. If inertial forces dominate the flow will be turbulent.
The Reynolds number Re is a dimensionless constant that represents the ratio
of inertial forces (µrv) to viscous forces (µh/L) in a particular flow situation.
It is defined as:
vL
Re
Where v is the flow velocity, r the fluid density, h the coefficient of viscosity
and L is a characteristic length. For flow in a pipe L would be the diameter
of the pipe; for flow between two parallel plates it would be the separation of
the plates.
As a very approximate rule the flow will be turbulent if Re > 1000.
The flow velocity in the wider part of the pipe is v1 and in the narrower part it
is v2. In a short time dt the mass flow through area A1 is r1A1v1dt and the mass
flow through area A2 is r2A2v2dt. These must be equal, so in general:
r1A1v1 = r2A2v2
A1v1 = A2v2
In this case the flow velocity is inversely proportional to the area of the pipe:
v1 A2
=
v2 A1
Viscous forces dominate (Re << 1000): drag force is directly proportional
to velocity.
The drag force arises as a reaction to shearing the layers of the fluid as
they flow around the object. These forces depend on the velocity gradient
and therefore on the velocity of the moving object.
Inertial forces dominate (Re >> 1000): drag force is directly proportional
to velocity-squared.
The inertial force arises as a reaction to the force needed to accelerate the
fluid in front of the moving object up to the velocity of the object. This
is directly proportional to the rate of change of momentum of the fluid
in front of the object which is proportional to the mass of fluid encoun-
tered per second multiplied by the velocity of the object. Since the mass
encountered per second is also proportional to the velocity, the drag force
will be proportional to the velocity-squared.
fluid
streamlines
moon drag
F v r
x y z
MLT 2 ML1T 1 LT 1 L
x y z
2 gr 2
vt
bb f
where rbb is the density of the ball bearing and rf is the density of the fluid.
F v A
x y z
MLT 2 ML3 LT 1 L2
x y z
1 2 1
P1 v1 gh1 P2 v22 gh2
2 2
1 2 1
Using the Bernoulli equation: P1 v1 P2 v22
2 2
The equation of continuity shows that v2 > v1 so P2 < P1 and the static p ressure
falls as the flow velocity increases. The kinetic energy of the fluid has increased
so its potential energy has decreased.
Another way to think about this is by considering Newton’s second law. The
fluid must accelerate as it enters the constriction so there must be a result-
ant force from the wider part of the pipe. This comes from the greater static
pressure.
The derivation has two parts – (i) we use the equation for viscosity to work out
an expression for the velocity of flow at radius r from the center. Then (ii) we
use this to work out the total rate of volume flow inside the pipe by integrating
over cylindrical shells.
(i) The applied force created by the pressure difference must balance the
viscous forces along its surface:
F r2p
F π r 2 p rp
shear stress along surface = = =
(area of surface) 2π rl 2 l
rp dv
This must balance viscous forces so:
2l dr
0 a
pr
dv 2 l dr
v 0
p 2
Which gives: v
4 l
a r2
This shows that the velocity profile in the pipe is parabolic:
Fluid Velocity
flow profile
(ii) Since different layers flow at different velocities the total flow can be
found by integrating the volume flow rates for all thin cylindrical shells
inside the pipe.
0 0
pa4
Q
8 l
This is Poiseuille’s equation. The volume flow rate depends on the fourth
power of the pipe radius a and is directly proportional to the pressure gradi-
ent p/l.
a4 p
8Q l
A
v2 v1
a
2 p
v1
A
2
1
a
2 p
QA
A 2
1
a
Venturi meters are used in many industrial applications, including water flow.
fluid
flow
dynamic stac
pressure pressure
total stac
pressure pressure
pressure transducer
The tube points into the direction of air flow and pressure sensors are used
to measure the difference between the total and static pressure. Since total
pressure is equal to the sum of the static and dynamic pressures the difference
between these is just the dynamic pressure ½ rv2. This value can then be used
to calculate the speed of the fluid relative to the Pitot tube.
Pitot tubes attached to the wings of planes are used to measure the aircraft’s
air speed and under boats to measure their speed in the water.
2 ptot pstat
v
6.6 EXERCISES
1. Scuba divers estimate that the excess pressure they experience when they
dive increases by an amount equal to the atmospheric pressure for every
additional 10 m of depth.
(a) Write down an expression for the excess pressure at depth x below
the surface of the water.
(b) Show that the total horizontal force acting on the wall of the dam
from the water is given by the expression F = ½ rglh2
(c) By considering equilibrium of moments show that the line of action
of this force is 1/3 h above the base of the dam.
3. The laminar flow of a viscous fluid through a narrow capillary tube
depends only on the radius a, the pressure gradient along the tube (p/l),
and the viscosity of the fluid, h.
(a) Use the method of dimensions to show that the volume flow rate Q
is given by an expression of the form:
(b) Explain why the method of dimensions cannot be used to determine
the value of the constant in the expression above.
(c) Explain why this formula is likely to break down for high flow rates
or larger diameter pipes.
4. (a) Show that the flow of air around a car is likely to be turbulent. You
will need to estimate the relevant quantities.
F = ½ CD rAv2
5. (a) Estimate the volume of your own body (HINT: your density is simi-
lar to that of water, about 1000 kgm-3).
(b) Estimate your weight.
(c) Estimate the buoyancy force on your body from the atmosphere.
(d) Discuss whether or not bathroom scales display your actual mass.
6. A steel ball bearing has a diameter of 1.2 mm. It is released from just
below the surface of glycerol inside a wide measuring cylinder. The den-
sity of steel is 7700 kgm-3 and the density of glycerol is 1260 kgm-3.
9. (a) Explain how it is possible to drink water from a glass through a straw.
(b) Discuss whether there is a limit to the length of straw that can be
used to drink water. The density of water is 1000 kgm-3.
10. Show that the SI base units for viscosity are kgms-1.
11. The pressure difference measured by a Pitot tube on an aircraft’s wing is
20 kPa. What is the aircraft’s air speed?
12. When an inflated balloon is connected to one side of a water manom-
eter the height difference between the manometer arms is 14.0 cm. The
atmospheric pressure is 102 kPa.
(a) What is the excess pressure inside the balloon?
(b) What is the total pressure inside the balloon?
Suggest one advantage and one disadvantage of using mercury
(c)
instead of water in a manometer.
13. In a famous experiment the French physicist Pascal placed one mercury
barometer at the base of a mountain and carried a second one to the top
of the mountain. Both barometers had mercury columns of equal height
when they were together at the base of the mountain. However, Pascal
noticed that the mercury column on the barometer he carried with him
fell gradually as he climbed the mountain. Explain this effect as carefully
as you can.
14. The circulation of blood in the body can be considered as a continuous
circuit. Blood is pumped from the heart into the aorta, splits into the
arteries, splits again into the capillaries and then returns to the heart via
the veins and finally the vena cava. This can be represented in the same
way as an electric circuit consisting of series and parallel resistors:
capillariess
cap
The table below gives the total area of each type of blood vessel along with
the average flow speed and volume flow rate.
(b) Blood has a viscosity of between 0.003 and 0.004 Pas. Discuss
whether blood flow in the human circulation is likely to be laminar
or turbulent.
7
Mechanical Properties
7.1 DENSITY
Density is a property of each material and is independent of the amount of
that material. This is in contrast to mass, which depends on the amount of
material present. Density is defined by the equation:
mass
density =
volume
the SI unit for density is kgm-3 but gcm-3 is also in common use. The relation
between these is:
1000 kgm-3 = 1 gcm-3
1 gcm-3 = 0.001 kgm-3
The densities of some common materials are listed below.
Density (kgm−3) Density (gcm−3)
force
repulsion
r
r0 separaon
aracon
The work that must be done to separate two particles from their equilibrium
separation is equal to the area between the negative part of the graph and infin-
ity. This is also equal to the energy released when the bond between the parti-
cles is formed. Since work must be done to push the particles closer together
or to separate them the potential energy is a minimum when their separation
is r0. They are in a bound state and each particle is in a potential well.
The graph below shows how the potential energy varies with particle
separation.
potenal energy
r0 separaon
separaon.
force / N
extension / m
O
permanent plasc
deformaon
Section OP is (almost) a straight line through the origin so in this region the
extension (x) is directly proportional to the force applied to the spring (or the
tension in the spring) (F).
F µ x or F = kx
Parallel Combinations
When n springs are connected in parallel and the system is stretched by a
force F, the tension in each spring must be the same and equal to F/n. The
total extension of each spring (and the system) will be e/n.
Using Hooke’s law for the system of n springs in parallel gives: kparallel = F/(e/n) = nk
The spring constant of a system consisting of several springs in parallel is the
sum of the spring constants of the individual springs. Connecting springs in
parallel increases the stiffness of the system.
Force
work done
extension
x
If the spring obeys Hooke’s law then the work done to stretch it to an exten-
sion x is equal to the shaded area in the graph above:
If we assume that no energy is lost as the spring is stretched, the elastic poten-
tial energy in a stretched spring is given by:
EPE = ½ kx2
In general, the work done to stretch something is calculated from the integral:
W F x dx
1 2
If the spring obeys Hooke’s law this is simply W kxdx kx .
2
giving the same result as before.
A
F F
l
force(N)
stress Nm2
cross-sectional area (m2 )
F
A
e
=
l0
Bone 14
Concrete 30
Copper 120
Steel 280
Diamond 1150
d2E
m e
4 gl0
d2E
A graph of m on the y-axis against e on the x-axis has a gradient
4 gl0
The Young’s modulus is given by:
4 gl
E gradient 20
d
A suitable experimental arrangement to determine the Young’s modulus of a
test wire is shown on the next page.
clamp l
test wire
bench
The diameter of the wire must be measured using a micrometer screw gauge.
This should be done at least three times in different positions and then an
average should be calculated. The images below show how a reading to 0.01
mm is obtained.
wire diameter is
measured by placing it
into this gap
collar: mm/100 scale (0.35 mm)
The reading is taken from the point where the lines on the shaft and collar
are aligned. The last cleared reading in mm from the shaft is added to the
reading from the collar. In the example below the gap is 2.50 mm + 0.35 mm
= 2.85 mm. Any zero error for the micrometer must be subtracted from this.
(You can check for a zero error by closing the micrometer gap completely and
checking the reading, it should be 0.00 mm).
The wire must then be clamped securely at one end.
Two light markers are attached to the wire a distance l0 apart. This distance
should be measured with an unkinked, unloaded but taut wire. Using a larger
value of l0 will increase the extensions and give better results.
The wire is then loaded, adding one mass at a time and recording the mass
added and the length of the wire in a suitable results table. This is repeated for
at least 7 different values of mass, and preferably many more. However, care
must be taken not to exceed the elastic limit for the wire. If this does occur
the graph will begin to curve and the Young’s modulus must only be calculated
using the gradient of the straight part of the graph.
Extensions should be calculated for each measured length and added to the
results table:
extension e l l0
Care must be taken throughout the experiment because there is always a dan-
ger that the wire could snap. Safety glasses should be worn!
Finally, a graph of mass against extension is plotted and the Young’s modulus can
be determined from the gradient of this graph as shown above (Section 7.4.1).
The dip in the curve is a result of the change in diameter of the sample. As it
begins to yield its diameter becomes smaller but in this graph the stress has
been calculated using the original diameter of the sample so while the calcu-
lated stress decreases the actual stress does not.
The area under a stress-strain graph represents energy per unit volume.
F dx 1
Area under curve d Fdx
A l Al
(Here we have assumed that the cross-sectional area of the sample has
remained constant during strain. This is approximately true for small strains.)
The larger the area up to fracture the more energy the material absorbs before
fracture and the “tougher” the material is said to be.
stress
O strain
Ceramics are solid non-metallic materials such as brick, tile, pottery, and china
that are formed by high temperature firing. They usually consist of tiny ionic
crystals bound together by amorphous glassy regions that formed during fir-
ing. Ceramics are usually stiff, hard, and strong and have high melting points
and good chemical resistance, but they can be brittle.
Glasses are closely related to ceramics but are characterized by being com-
pletely amorphous and result from a rapidly cooled melt. Glasses are distin-
guished from ceramics by their microscopic structure. For example, silica glass
and quartz have identical composition consisting of SiO4 units but in the glass
they are arranged randomly while in the ceramic they are arranged regularly.
Composite materials are combinations of two or more different materials
designed to take advantage of the desirable properties of each individual com-
ponent. Examples of composites are fiber glass, concrete, steel, reinforced
concrete, etc. Many composites consist of a matrix and a reinforcement.
Concrete is strong in compression, but weak in tension and brittle, whereas
steel is strong in tension and can prevent crack formation in the concrete,
providing a versatile and economical building material.
7.7 EXERCISES
1. A rectangular wooden block has sides of length 5.0 cm, 8.0 cm and 12 cm.
Its mass is 960 g.
extension/cm Force/N
0 0
2.8 1.0
6.2 2.0
9.6 3.0
13.2 4.0
16.5 5.0
20.1 6.0
23.2 7.0
26.6 8.0
30.2 9.0
extension/cm Force/N
33.0 10
36.3 11
40.2 12
44.4 13
49.4 14
57.1 15
68.5 16
96.5 17
113.2 18
(g) If two springs like the one above were connected in series how much
would they extend when stretched by a force of 14 N?
(h) When the force of 18N was removed the spring did not return to its
original length. Explain this.
(i) D
oes the spring become stiffer or less stiff beyond the limit of propor-
tionality? Explain your answer.
4. The steel cable used to moor a ship has a length of 12.0 m and a cross-
sectional area of 8.2´10-4 m2. The force in the cable is 15 000 N.
stress
A
D
strain
Use the correct terminology to describe the mechanical properties of each
material and to compare them.
8
Thermal Physics
8.0 INTRODUCTION
Thermal energy (often simply referred to as heat energy or heat) is the energy
something has as a result of the random thermal motions of its particles. This
is not the same thing as temperature. For example, the Atlantic Ocean has a
lower temperature (perhaps 15°C) than a freshly made mug of coffee (per-
haps 75°C) but has much more thermal energy because it contains many
more particles. While the formal definition of temperature is quite complex
and involves an understanding of the concept of entropy, a simple way to think
about temperature is to relate it to the mean energy per particle. The mean
energy of the water particles in a mug of hot coffee is greater than the mean
energy of water molecules in the Atlantic Ocean.
When there is no net heat transfer between systems in thermal contact they
are in thermal equilibrium and are at the same temperature.
The “Zeroth law of thermodynamics” states that: If two systems, A and B are
both in thermal equilibrium with a third system, C, then A and B are also in
thermal equilibrium with each other.
This law allows us to use thermometers and to set up formal temperature
scales.
T = q + 273.15 q = T - 273.15
changes. The ideal gas equation, pV = nRT shows that pressure p is propor-
tional to temperature T if volume V and amount of gas n are both constant
(which they are). Pressure values can be calibrated to give temperatures.
8.4.1 Conduction
Conduction involves the transfer of energy between particles as a result of
collisions. It is the most important process for heat transfer inside a solid
because particles in a solid remain in fixed positions and so cannot form con-
vection currents. Conduction also occurs in liquids and gases but here it is
often less important than convection since particles in liquids and gases are
able to move in convection currents transferring heat energy as they do so.
The process of conduction is a dynamic one but for macroscopic objects, the
transfer of energy from more energetic particles to less energetic particles has
a higher probability than transfer in the opposite direction so heat flows from
higher to lower temperatures. Conduction cannot occur in a vacuum because
there are no particles to conduct the heat.
dQ
The rate of heat flow (in watts) through an insulated block of material
dt
depends on the type of material, its cross-sectional area A, and the tempera-
ture gradient between opposite sides of the block.
d 2 1 where q > q .
1 2
dx l
dQ d
A
dt dx
The negative sign occurs because of the direction of heat transfer, from higher
to lower temperature. A constant of proportionality k can be introduced. This
depends on the material of the block. It is called the coefficient of thermal
conductivity. The equation for thermal conductivity (Fourier’s equation) is:
dQ d
kA
dt dx
The units for k are Wm- 1K- 1.
The table below lists typical thermal conductivities for common materials at
around room temperature.
Material Coefficient of thermal conductivity (Wm− 1K− 1)
Air 0.024
Brickwork 0.6–1.0
Window glass 0.96
Ground or soil 0.33 (dry) to 1.4 (very moist)
Insulation materials 0.035–0.16
Expanded polystyrene 0.03
Dry sand 0.15–0.25
Water 0.58
Rock 2–7
Aluminum 205
Diamond 1000
Gold 310
Iron 80
Stainless steel 16
Timber 0.14
The high values for metals are because they contain large numbers of free elec-
trons which rapidly transfer heat. Diamond has an especially high coefficient
of thermal conductivity because its atoms are bonded very tightly together
and this couples their motions, so that when one atom is disturbed it has an
almost immediate effect on its neighbors and so transfers energy quickly.
8.4.2 Convection
Convection can only occur in fluids, that is, liquids or gases. This is because
convection involves the bulk movement of hot matter from one part of the
medium to another. Natural convection currents arise as a result of changes
in density inside the fluid. In most fluids, the material expands as its tempera-
ture increases. This increases its volume and decreases its density so that it is
then displaced by cooler denser material from above it. The overall effect is to
set up a convection current with warmer material rising and cooler material
falling. This is often the dominant mechanism for heat transfer within a fluid.
Heat source
8.4.3 Radiation
All bodies emit and absorb a spectrum of electromagnetic radiation that
depends on the nature of their surface and their temperature. For many of
the hot objects we encounter in everyday life the peak of this spectrum is in
the infra-red region so thermal radiation is often referred to as infra-red radia-
tion although the actual spectrum of thermal radiation is continuous. Thermal
radiation is part of the electromagnetic spectrum so it can be transferred in a
vacuum. While the absorption and emission of radiation is a complex subject
it is usually true that matte black surfaces are good absorbers and emitters
whereas light shiny surfaces are poor absorbers and emitters. Thermal radia-
tion, like light, can be reflected from a silvered surface, for example, on the
inside of a thermos flask.
A thermal imaging camera can be used to measure infra-red radiation emitted
from different objects. The images below show the camera itself and thermal
images of a man’s face and hand (thanks to Al). The image is color-coded
(greyscale here) to show different temperatures, and these can be read off
from the scale at the bottom of the screen.
wavelength
black-body radiaon
spectrum
While 19th century physicists were unable to derive the shape of the spec-
trum from first principles, they did identify two important empirical laws that
are very useful when considering thermal radiation. These laws were later
derived from Planck’s theory.
Wien’s displacement law.
This law states that the product of the wavelength corresponding to the peak
of the spectrum l p and the absolute temperature (in kelvin) is a constant:
T constant 2.9 10 3 mK
Stefan–Boltzmann law.
This law states that the total power P radiated from a black body is propor-
tional to the fourth power of the absolute temperature (in kelvin) T:
P e AT 4
E
c
m
where DE is the energy supplied, m is the mass of the sample, and Dq is the
temperature change.
The SI unit for specific heat capacity is Jkg- 1K- 1.
The table below gives values for the specific heat capacity of a range of common
substances.
Substance Specific heat capacity (Jkg− 1K− 1)
Dry air at sea level 1460
Water (pure at 20°C) 4180
Ice (0°C) 2093
Copper 385
Aluminum 897
Iron 449
Mercury 140
Concrete 880
Water has a particularly high specific heat capacity. This makes it an ideal
coolant because it can absorb a large amount of heat for a relatively small
increase in temperature.
Some Terminology
Heat capacity is the energy needed to increase the temperature of a par-
ticular sample of a substance by 1°C (or 1K).
Specific heat capacity is the energy needed to increase the temperature
of 1 kg of a substance by 1°C (or 1K). That is, this is the heat capacity per
unit mass.
Molar heat capacity is the energy needed to increase the temperature of
1 mole of a substance by 1°C (or 1K). That is, the heat capacity per unit
amount (the SI unit of the amount is the mole).
will be greater than if the gas volume is kept constant. For this reason, there
are two significant heat capacities for a gas:
Heat capacity at constant volume: cV.
Heat capacity at constant pressure: cP.
For the reasons explained above, cP is greater than cV.
V
thermometer
A
insulaon
electrical
heater
substance
under test
temperature / °C
At higher temperatures
heat loss through the
insulaon increases so the
Inially some of the graph curves downwards
energy supplied heats
the heater so the graph In the linear
curves upwards secon almost all
of the heat
supplied is heang
the substance
under test
me / s
From the equation for specific heat capacity we have E mc so we can
write:
E
P mc
t t
P
c
m
t
is equal to the gradient of the linear part of the graph above.
t
m can be measured using a top pan balance.
P is calculated from the ammeter and voltmeter readings (P = IV).
8.8 EXERCISES
1. A brick hut with a flat wooden roof is heated so that the inside tempera-
ture is 22°C when the outside temperature is 12°C. The dimensions of the
hut’s base are 4.5 m by 3.5 m and the walls are 2.6 m high and 10 cm thick.
There is a glass window of area 1.5 m2 in one of the walls. The thickness of
the glass is 6.0 mm and the thickness of the wooden roof is 3.0 cm.
(a) Calculate the minimum power of the heater needed to maintain the
temperature at 22°C when the outside temperature is 12°C.
(b) Explain why, in practice, more powerful heating will be needed.
Thermal conductivities:
Brick: 0.80 Wm- 1K- 1, Wood: 0.16 Wm- 1K- 1, Glass 0.96 Wm- 1K- 1
Calculate the molar heat capacity of each metal. Comment on your results.
4. Calculate the energy needed to change 2.0 liters of water at 20°C into
steam at 120°C:
specific heat capacity of water = 4200 Jkg-1°C-1
specific heat capacity of steam = 2300 Jkg-1°C-1
specific latent heat of water = 2.26 MJkg-1
temperature
pouring
temperature
ambient
temperature
me
5. When molten metal is poured into a cast it cools down and solidifies. The
graph below shows a typical cooling curve for the metal.
Explain the shape of the graph and the significance of the three regions.
9
Gases
colored water. The experiment must be done slowly so that the gas remains
in thermal equilibrium with its surroundings. Changes that take place at con-
stant temperature, like this one, are called “isothermal” changes.
hand pump to
increase pressure
length of air
column: directly
proporonal to
the volume of
the gas Bourdon
gauge to
measure gas
pressure
colored
water, acts Fluid
as a piston to reservoir
compress the
gas
The length of the air column is measured for a range of different pressures.
Since the tube containing the air has a constant cross-sectional area, the vol-
ume of air is directly proportional to the length of the column. The results
of such an experiment show that the pressure p and volume V are inversely
proportional to one another:
This can be expressed algebraically by p ∝ 1 / V or pV = constant. This is called
Boyle’s law:
Boyle’s law: pV = constant for a fixed amount of an ideal gas at a constant
temperature.
pressure pressure
volume 1 / volume
The length of the air column is measured for a range of different tempera-
tures. The tube is open at one end so the pressure remains constant. The
results of such an experiment show that the volume V increases linearly with
temperature q when the temperature is measured in Celsius:
volume
temperature / ° C
Similar experiments using different amounts of gas also produce linear results
but with different gradients. However, if the graphs are extrapolated back, the
intercept on the temperature axis is always the same (-273.15°C) for all (ideal)
gases and all amounts of gas. This is clearly a physically significant temperature
and is used as the zero for the thermodynamic or kelvin scale. Using this
temperature scale, volume, and temperature are directly proportional.
more mass
volume
less mass
temperature / ° C
– 273.15 0
more mass
volume
less mass
temperature / K
0 + 273.15
to pressure
thermometer
sensor
Water bath
Air
Heat
pressure
more mass
less mass
21
0 + 273.15 temperature / K
This is called the ideal gas equation or equation of state for an ideal gas.
Holding any one of the three parameters constant reduces it to one of the
three gas laws, but it also applies if all three parameters change together.
The constant is directly proportional to the amount of gas. The value of the
constant for one mole of an ideal gas is R = 8.314 JK- 1. The ideal gas equation
is often written in the form:
Equation of state for an ideal gas: pV = nRT
Where n is the number of moles.
m v
x-direcon
The molecules are moving randomly so the mean-squared values of vx, vy, and
vz will all be equal:
=
v2x =
vy2 vz2
Using equations (1) and (2) above we can find an expression for the average
force on the wall of the container from a gas consisting of N particles in rapid
random motion:
Nm 2 Nm 2
=F = vx v
a 3a
F Nm 2 Nm v2
=
p = v =
A 3 a3 3V
This is the “kinetic theory equation” for gas pressure and it is usually written
like this:
1
pV = Nm v2
3
Since density r = nm/V we can also write this equation in the form:
1
p v2
3
Note that the rms speed is not equal to the mean speed and that the mean-
squared speed is not the same as the mean speed squared.
1
nNA m v2 = nRT
3
1 3
=
Total KE =N A m v2 RT
2 2
Dividing by NA gives:
1 3 R 3
=
Mean molecular KE = = =
m v2 == T kT
2 2 NA 2
R
The constant k = is called “Boltzmann’s constant”. This is effectively the
NA
gas constant per particle and it is a very important constant in thermodynam-
ics. k = 1.38064852 ´ 10- 23 JK- 1. The mean thermal energy per particle in any
thermodynamic system is of the order of kT; for an ideal gas, it is 3/2 kT.
Notice that, for an ideal gas, the mean kinetic energy depends only on tem-
perature (in kelvin). This implies that molecules of different gases, at the
same temperature, have the same mean kinetic energy but different mean
speeds. For example, air consists of about 20% oxygen and 80% nitrogen.
Oxygen molecules are more massive than nitrogen molecules, so they have
lower mean squared speeds. Hydrogen molecules have much lower mass and
therefore move, on average, at much greater speeds than oxygen or nitrogen
molecules. This results in them escaping from the atmosphere much more
rapidly, so there is very little hydrogen in the Earth’s atmosphere.
The rms speed of a molecule in an ideal gas at temperature T is given by the
equation:
3 kT
v=
rms =
v2
m
The table below shows the rms speeds of oxygen, nitrogen, and hydrogen
molecules at 20°C (293 K):
Gas Mass of molecule (kg) rms speed at 20°C (293 K) (ms−1)
- 26
Oxygen 5.32 × 10 478
- 26
Nitrogen 4.66 × 10 510
Hydrogen 3.34 × 10- 27 1910
1 3
U ==
Total KE =N A m v2 RT
2 2
The molar heat capacity at constant volume cV is the energy required to raise
the temperature of 1 mole of the gas by 1 K. Since all of the heat Q supplied
increases the internal energy:
Q U 3
cV R
T T 2
Note that this is the molar heat capacity at constant volume. If the gas is
instead heated at constant pressure, it will expand and do external work,
increasing the heat needed to raise its temperature and therefore increasing
its heat capacity.
The first law of thermodynamics (see Section 9.6.2) states that the change
in internal energy of a system is equal to the sum of the heat supplied to the
system and the work done on the system. The heat supplied to the system at
constant pressure is Q = cPDT = DU + pDV where cP is the molar heat capacity
at constant pressure and pDV is the work done by the gas as it expands (see
Section 9.6.3). Using the ideal gas equation we can replace DV with RDT/p
(because p is constant) to give:
3
Q RT RT cP T
2
5
Therefore: cP R cV R
2
The ratio of heat capacities is called the adiabatic gas constant g:
cP
cV
energy and potential energy so these two degrees of freedom add 2×(½ kT)
to the overall mean energy per particle. These additional degrees of free-
dom increase the heat capacity of the gas and in this case, if the additional 5
degrees of freedom were all excited the heat capacity should be 4kT.
Classically all of these degrees of freedom should be excited by thermal
motion and physicists expected the molar heat capacities of gases to be in
accord with this model. However, this is not the case, especially at low tem-
peratures. Some degrees of freedom (e.g., vibrational modes) do not contrib-
ute to the heat capacity and only “switch on” at higher temperatures, resulting
in lower heat capacities than those calculated using the equipartition theory.
This apparent anomaly was only explained by quantum theory, which resulted
in the “quenching” of some degrees of freedom at lower temperatures (see
Section 27.1.2).
y
x
This atom is free to vibrate along three independent directions. Since each
vibrational mode has two degrees of freedom, the total mean internal energy
is 3×(2× ½ kT) = 3 kT. For one mole of particles this is c = 3NAkT = 3RT.
1
Rate of diffusion ∝
r
This follows from the kinetic theory equation in the form:
1
p v2
3
Rate ∝ vrms = 3p ∝ 1
r r
p
c
where g is the adiabatic constant (=1.4 for a monatomic gas and 1.67 for a
diatomic gas). The adiabatic constant is the ratio of the specific heat of a gas
at constant pressure to the specific heat at constant volume.
vp vm vrms
fracon of
molecules
between v
and v+v
molecular speed
vp is the most probable speed. This means that there are more molecules
with speeds between vp and vp + Dv than in any other similar-sized range.
vm is the mean speed (vm = 1.13 vp).
As the temperature of the gas increases the peak moves to the right and falls.
The number of molecules with higher speeds increases:
The area under each graph is the same because the total number of molecules
has not changed.
fracon of
molecules
between v
and v+v
higher temperature
molecular speed
This is equal to the ratio of number of particles in the higher energy (E+DE)
state to the number of particles in the lower energy state (E):
∆E
Rate of evaporation ∝ −
e kT
Other activation processes include chemical reactions, conductivity of a semi-
conductor, viscous flow, and creep. In all of these cases, we would expect the
rate R to be directly proportional to the Boltzmann factor:
E
Rate of activation process R ∝ e kT
If this is the case then a graph of ln(R) against 1/T should be a straight line
with a negative gradient and positive y-axis intercept:
ln (R)
1/T
fracon of
molecules
between v
and v+v
Molecules in the
shaded region can
escape from the
liquid surface
molecular speed
Minimum speed
needed to escape
Note that we have defined W as positive when the system does external work.
Sometimes W is defined as work done on the system, in which case the sign of
W in the equation above changes.
Piston moves
outwards
Expanding gas F
Work is done
by gas
x
Piston pushed
slowly into gas
GAS
Adiabatic Changes
In an adiabatic change no heat flows into or out of the system so Q = 0. This
is often because the change takes place very rapidly and there is not enough
time for a significant heat transfer. Using the first law of thermodynamics we
can see that, when Q = 0, DU = - W, so the increase of internal energy is equal
to the work done on the system. An example of this would be a rapid compres-
sion of gas inside an insulated cylinder. Since the internal energy increases the
temperature of the system also increases.
Boyle’s law does not apply when an ideal gas is compressed adiabatically
(because the temperature is not constant). However, adiabatic compressions
obey another rule:
pV g = constant
Isochoric Changes
An isochoric change is one that takes place with no change in the volume of
the system. If the volume is constant there is no work done on or by the sys-
tem so W = 0 and DU = Q. The change in internal energy is equal to the heat
supplied to the system. An example of an (approximately) isochoric change is
the explosive combustion of a petrol and air mixture inside the cylinder of a
petrol engine.
Isobaric Changes
An isobaric change is one that takes place at constant pressure. For a fixed
amount of an ideal gas Charles’s law is obeyed: V/T = constant. An example
of an (approximately) isobaric change is the expansion of a diesel air mixture
as fuel is injected and ignited inside the cylinder of a diesel engine. Another
example is the change of state of a liquid to a vapor at constant temperature
and pressure. The vapor occupies a much larger volume than the liquid and
as it expands against external pressure it does work (e.g., pushing back the
atmosphere):
When an ideal gas expands isobarically work done is given by: W = pDV
Source at higher
temperature T1
Heat
extracted
from
source: Q1
Work done
by heat
engine: W
Heat
dumped
into sink:
Q2
Sink at lower
temperature T2
pressure
P2
C D
P1
B A
volume
V2 V1
DA: The fluid loses pressure at constant volume. This could be the result of a
sudden transfer of heat to the surroundings (e.g., as a result of exhaust). The
fluid loses heat Q2.
If the working fluid is an ideal gas, then the gas laws and the first law of ther-
modynamics can be used to analyze each stage in the cycle in order to cal-
culate the efficiency of the heat engine. While real heat engines do not have
simple indicator diagrams there are good approximations for the way in which
certain heat engines work. For example, the Otto cycle is used to model the
behavior of spark ignition petrol engines and the Diesel cycle is used to model
the behavior of a compression ignition engine such as a diesel engine.
cylinder working
head fluid
piston
cylinder
block
connecng
rod
cranksha
Intake stroke: the inlet valve opens and an explosive mixture of petrol and
air is drawn into the cylinder.
Compression stroke: the mixture is compressed.
Power stroke: following ignition the hot burnt gases push the piston back.
Exhaust stroke: the exhaust valve opens and burnt gases leave the cylinder.
After this the exhaust valve closes and the cycle begins again. In reality, the
fluid inside the cylinder is not constant, it is drawn in at the start of the cycle
and ejected as exhaust at the end of the cycle.
The diagram shows an idealized indicator diagram for the Otto cycle.
pressure
C
pC
pD
D
pB
B
pA A
O
volume
V2 V1
AB: Adiabatic compression of gas (work done on gas but no heat transfer).
Compression stroke.
BC: Isochoric heating of gas (heat is supplied as fuel–air mixture explodes
but no work is done).
CD: Adiabatic expansion of gas (work is done by the gas but no heat is
transferred). Power stroke.
DA: Isochoric cooling of gas (heat is lost from the gas but no work is done
by it).
AB corresponds to the compression stroke and CD to the power stroke. The
closed loop OAO represents the intake and exhaust strokes. The area con-
tained by the loop ABCD represents the net work done in one cycle.
T2
efficiency 1
T1
where T2 is the sink temperature and T1 is the source temperature. In the
idealized diagram above T1 = TC and T2 = TD so the theoretical efficiency of
the Otto cycle is:
TD
efficiency 1
TC
V1
This can be linked to the “compression ratio”, , by using the equation of
V2
state for an ideal gas and the relation pV g = constant for adiabatic changes.
We need to find an expression for TD/TC.
Consider the power stroke CD. This is an adiabatic change so:
pC V2 pD V1
The ideal gas equation also applies: = (2)
TC TD
g g −1
TD pD V1 V2 V1 V2
=
Combining these equations: = =
TC pC V2 V1 V2 V1
The theoretical efficiency of the Otto cycle is therefore:
g −1
V
efficiency ≤ 1 − 2
V1
Using a typical compression ratio V1/V2 = 10 and taking g = 1.4 gives a maxi-
mum theoretical efficiency of (1 - 0.100.4) = 0.60 (or 60%).
A real internal combustion petrol engine is only about 25–35% efficient at
transferring thermal energy from the fuel into useful mechanical energy.
Here are some of the ways in which energy is wasted:
The thermodynamic cycle of a real engine differs from the ideal Otto
cycle – valves take time to open and close and changes in pressure and
temperature cannot take place instantaneously.
The intake and exhaust strokes do not form a closed loop and work must
be done by the engine during these phases.
There is heat transfer from the engine to the surroundings (in addition to
the heat lost during the exhaust stroke).
There is friction between moving parts which transfers energy to heat.
D
PD
pA
O A
V2 V3 V1 volume
AB: Adiabatic compression of air (work done on gas but no heat transfer).
Compression stroke. The fuel has not yet been injected so there is no dan-
ger of pre-ignition (which limits the compression ratio for a petrol engine)
and the compression ratio V1/V2 for a diesel engine can be significantly
higher than for a petrol engine.
BC: Fuel is injected and burnt at constant pressure. Work is done by the
engine. This is the first part of the power stroke.
CD: Adiabatic expansion of gas (work is done by the gas but no heat is
transferred). This completes the power stroke.
DA: Isochoric cooling of gas (heat is lost from the gas but no work is done
by it).
A real diesel engine only approximates the idealized cycle shown above and
additional energy losses occur in much the same way as for a petrol engine.
9.8 EXERCISES
gas
(a) Sketch a graph to show how the pressure inside the syringe var-
ies with volume as the gas is slowly* compressed (e.g., as if some-
one put their finger over the end of the syringe and slowly pushed
the plunger in. *Assume this is slow enough for the temperature to
remain constant.
(b) On the same graph draw a second line to indicate how you would
expect pressure to change with volume if the plunger is pushed in
quickly.
(c) Explain the difference between the lines in (a) and (b) by referring
to work and internal energy.
4. Air at atmospheric pressure and temperature has a pressure of about 105
Pa and a density of about 1.2 kgm− 3.
gp
v=
ρ
where g is a dimensionless constant, p is the air pressure, and r is the air
density. By treating air as an ideal gas show that the speed of sound in the
air is independent of pressure but directly proportional to the square root
of absolute temperature. (Hint: consider one-mole of air and use the ideal
gas equation to relate density to pressure and temperature by introducing
the molar mass M.)
7. Eight molecules in a gas are moving parallel to the x-axis and have
x-components of velocity equal to: 510.0 m/s, - 550.0 /s, - 495.0 m/s,
548.0 m/s, - 498.0 m/s, 502.0 /s, 518.0 m/s, - 535.0 m/s.
Each molecule has a mass of 5.4 × 10− 26 kg. Calculate:
(a) define the terms that appear in the equation and explain the mean-
ing of the Boltzmann factor.
(b) Sketch graphs to show how the Boltzmann factor varies with (i) T,
(ii) DE, and (iii) DE/kT.
(c) Explain why chemical reactions proceed more rapidly at higher tem-
peratures.
11. By what factor, roughly, would you expect the rate of a chemical reaction
to change if the temperature increases by 10 K (assume the reaction is
taking place at around 350 K and that the activation energy is about 1 eV.
12. In human blood oxygen molecules bind to hemoglobin molecules. The
bond energy is about 0.30 eV. Calculate the ratio of free oxygen molecules
to bound oxygen molecules in the blood at body temperature (310 K).
13. (a) Explain why the conductance (G = 1/R) of a semiconductor NTC
increases with temperature. Your answer should refer to the Boltz-
mann factor.
(b) A semiconductor NTC thermistor is connected to a source of con-
stant low voltage V and a currentIpasses through it. I will vary with
temperature. Explain why a graph of ln I versus 1/T will be a straight
line. What is the gradient of this line?
14. Here is the Otto cycle for an ideal four-stroke petrol engine:
pressure
D
pB
B
pA A
O
volume
V2 V1
10
Statistical Thermodynamics and
the Second Law
10.0 INTRODUCTION
The gas laws and the ideal gas equation are all derived empirically, that is from
experimental data. They give a good mathematical description of the behavior
of an ideal gas but they offer no explanation of why the gas behaves in this
way. The reason for this is that they deal with macroscopic properties such as
volume, temperature, and pressure and do not engage with the microscopic
behavior of the molecules that constitute the gas. Kinetic theory (see Section
9.3) provides a more fundamental explanation of the behavior of an ideal gas,
despite being based on the random motion of particles. Statistical thermody-
namics takes this further and provides one of the most important and enig-
matic of all physical laws, the second law of thermodynamics.
direcon of me
Newton’s laws are reversible. So are the laws of electromagnetism and gravity.
They do not distinguish between the past and the future. However, macro-
scopic processes are almost always irreversible. The pale blue water does not
resolve itself back into separate clear water and a dark drop, a broken glass
remains broken, a scrambled egg remains scrambled and, more personally, we
remember the past but not the future and we all eventually grow old and die.
This raises two very important questions.
Seawater at
17 ° C (290K)
Water returned
at 16 ° C (289 K)
While pilot heat engines have shown that this process is physically viable there
is a fundamental limit to the possible efficiency of any heat engine. This is stated
qualitatively in a macroscopic version of the second law of thermodynamics:
It is impossible to construct a heat engine that, operating in a cycle, can
extract heat from a source and transfer it completely into work.
This implies that any heat engine MUST dump some waste heat into the envi-
ronment. It cannot be 100% efficient.
The work generated by a heat engine comes as a by-product of the transfer of
heat from the hot source to the cooler sink. If heat could flow back from the
sink to the source, we could continue to operate the heat engine and increase
its efficiency. However, heat only flows spontaneously from systems at higher
temperatures to systems at lower temperatures, so this is not possible. This
leads to a second, equivalent, statement of the second law of thermodynamics.
It is impossible for heat to flow from a cooler to a hotter body with no
other change taking place.
This is a matter of common experience. If you leave a cup of hot coffee on
the table it will soon transfer heat to its surroundings until it reaches the same
temperature as the room. However long you wait it does not reheat itself. This
seems similar to the example of the ink droplet in water, it is another exam-
ple of an irreversible macroscopic process. Once again the macroscopic one-
way process of approaching thermal equilibrium is actually driven by random
microscopic collisions that continue to take place after a uniform equilibrium
temperature has been reached.
In order to explain these one-way irreversible changes physicists introduced
a new quantity, “entropy.” The second law can then be stated in terms of
entropy:
The entropy of an isolated system tends to a maximum.
A B
Q
T1 T2
dQ
d SA = −
T1
dQ
d SB = +
T2
dQ dQ
d Ssystem =
− + ≥ 0 (second law of thermodynamics)
T1 T2
dQ dQ
therefore: ≥ so T1 ≥ T2
T2 T1
If entropy cannot decrease then either T1 > T2 and heat flow from hot to cold
or T1 = T2 and the system are in thermal equilibrium with maximum entropy.
This shows that the second law expressed in terms of entropy is in agreement
with the statement about the direction of heat transfer. Heat cannot, in isola-
tion, flow spontaneously from a colder to a hotter body because this would
decrease the entropy of the system.
Entropy of source
Source at T1 decreases as heat is
Q1 extracted.
Work done: W
Q2
Entropy of sink
Sink at T2 increases as heat
is dumped
Q1
d Ssource = −
T1
Q2
d Ssink = +
T2
Q Q
d Ssystem =
− 1 + 2 ≥ 0 (second law of thermodynamics)
T1 T2
Rearranging this gives: Q2 T2
≥
Q1 T1
W Q1 − Q2 Q
The efficiency of the engine is: efficiency = = = 1− 2
Q1 Q1 Q1
T2
Therefore: efficiency ≤ 1 −
T1
This result was first derived by a French engineer called Sadi Carnot based
on the operation of an ideal reversible heat engine. It sets a limit to the theo-
retical efficiency of any heat engine and shows that the theoretical efficiency
increases as the ratio of T1 to T2 increases. This is in agreement with our
previous macroscopic statement of the second law and also explains why our
seawater heat engine was doomed to failure. While a heat engine working
on those principles could be constructed, if it worked between a source tem-
perature of 290 K and a sink temperature of 289 K its maximum theoretical
efficiency would be (1 - 289/290) = 0.0034 or 0.34%, and this is before taking
into account the unavoidable additional losses caused by friction, etc. For a
petrol engine, the source temperature (of the hot gases following ignition) is
about 1000°C (about 1300 K), and the sink temperature at which the gases
are exhausted is about 500°C (about 800 K) giving a maximum theoretical
efficiency of (1 - 800/1300) = 0.38 or 38%.
barrier removed
since the particles move randomly. What is the probability that all N particles
are found on the left-hand side of the barrier? It is ½N, which is vanishingly
small for large N. This is because there is just one way in which this can occur –
just one microstate of the system which has all particles on the left (ignoring
rearrangements within the N particles themselves!), in the same way, that if
you toss a coin N times there is just one way that all N coins can land heads.
Now consider the probability that (N - 1) particles are on the left of the barrier
and just 1 is on the right. There are N possible ways in which this can happen
(each of the particles could be the one that is on the right) so the probability is
N times larger. For a distribution with (N - 2):2 the probability is much larger
again (by a factor of ½( N - 1) times. This is because there are (
N ( N − 1) )
2!
ways in which this can occur. In fact, the number of ways continues to increase
up to a distribution with N/2 particles on each side and then falls back again
until there is just one way in which all the particles can be found on the right-
hand side. The “number of ways” we have referred to here is the number of
microstates that represents each macro-state. The final macro-state (a 50:50
distribution) is also the macro-state that can be realized in the largest number
of ways, that is, the macro-state that can be realized in the largest number of
distinct micro-states. The graph below shows how the “number of ways,” W,
changes with the distribution of particles:
number of
micro-states W
macro-state
distribuon L:R
N:0 N/2 : N/2 0:N
are indistinguishable from this 50:50 distribution. Since the particles move
randomly, we can assume that, left alone, the system will explore all of the
microstates available to it. The system we considered started off in a macro-
state which corresponds to only 1 micro-state. As time goes on it explores
other adjacent micro-states, most of which lie closer to the peak because the
number of micro-states increases that way. This means the system tends to
evolve toward the peak. Given the fact that the number of micro-states close
to the peak vastly outweighs all of those even a short distance away from it
then, if we leave the system for any length of time, the overwhelming prob-
ability will be that we will find it in a macro-state close to the peak of the dis-
tribution – that is,, close to equilibrium. And once in equilibrium, it is highly
unlikely to fluctuate far from it because the number of micro-states drops
rapidly away from equilibrium. For the systems we usually interact with, N is
enormous (e.g., >1020), so the probability of a large fluctuation from equilib-
rium is so small that it can be assumed to be zero.
number of
micro-states W
macro-state
distribuon L:R
S = k ln W
one that can be realized in the largest number of micro-states This is the
maximum value of W and therefore maximum entropy. The arrow of time is
the direction from low entropy to high entropy and from small W to large W.
for a while it becomes so wet that it no longer dries you – as you rub yourself
with the towel as much water transfers from you to the towel as transfers from
the towel to you – both the towel and your skin have reached the same degree
of “wetness.” Continuing to use the towel results in no further change in wet-
ness. This is analogous to the approach to thermal equilibrium.
Temperature is therefore related to the rate at which entropy changes when
heat is transferred to or from a system, and we can define the thermodynamic
temperature using this equation:
1 dS
=
T dQ
How does this relate to the statistical description of entropy in terms of micro-
scopic configurations? This can be understood by considering the effect of
adding 1 quantum of energy to a system that already contains N quanta. While
this will always increase the number of configurations of the system it has a
greater effect on the entropy of a system with an initially smaller number of
configurations than on one with an initially larger number. Mathematically
this is a consequence of the natural logarithm in Boltzmann’s formula.
3 3 RT δV
Q= RδT + pδV = RδT + (using the ideal gas equation)
2 2 V
T2 V
3 RdT 2 RδV 3 T V
=
∆S ∫T 2T + V∫ = V 2
R ln 2 + R ln 2
T1 V1
1 1
RT δV
Q = pδV = (using the ideal gas equation)
V
V2
RδV V
=
∆S ∫
V1
= R ln 2
V V1
Qcold Qcold
For a heat engine, the heat reservoirs are burnt fuel (hot) and the environment
(cold). For a refrigerator, the heat reservoirs are the foodstuff that must be
cooled down (cold) and the environment (hot). Typically, a domestic refrig-
erator removes heat from food at about 4°C and dumps heat into a room at
about 20°C, although different types of refrigerator can work in different tem-
perature regimes. The reason you need to plug an electric refrigerator into a
mains supply is so that electricity can supply the work that will ultimately be
dumped as additional heat in the environment.
For a refrigerator to operate it must obey the second law of thermodynamics
so the entropy increase caused by the heat dumped in the environment must
be equal to or greater than the entropy decrease caused by extracting heat
from the stuff inside the refrigerator:
Qhot
∆Senvironment =
Thot
Q
∆Sfood =
− cold
Tcold
Qhot Qcold
∆SUniverse = − ≥0
Thot Tcold
Qhot Qcold
≥
Thot Tcold
Qhot Thot
Which implies that ≥ so Qhot (the heat dumped in the environment)
Qcold Tcold
must be greater than Qcold (the heat extracted from the food). The refrigerator
can only operate if we supply additional energy. This is the work W = Qhot - Qcold.
As far as the Universe is concerned a refrigerator is a heater.
The coefficient of performance (CoP) is a measure of how effective the refrig-
erator is and is the ratio Qcold/W. The higher this is, the more joules of heat are
removed from objects inside the refrigerator per joule of electrical work done,
so it is like the “efficiency” of the refrigerator. The second law sets a limit to
the theoretical thermodynamic efficiency of a heat engine and it also sets a
limit to the theoretical coefficient of performance for a refrigerator:
Q Qcold 1 1
= cold
CoPrefrigerator = = ≤
W Qhot − Qcold Qhot Thot
−1 −1
Qcold Tcold
Q Qhot 1 1
= hot
CoPheat pump= = ≤
W Qhot − Qcold 1 − Qcold 1 − Tcold
Qhot Thot
10.7 IMPLICATIONS OF THE SECOND LAW
10.7.1 The Second Law, the Arrow of Time, and the Universe
When the second law of thermodynamics is applied to the Universe it states:
The entropy of the Universe tends to a maximum value.
This implies that the arrow of time points in the direction of increasing
entropy. The past is a low entropy state and the future is a high entropy state.
Low High
entropy Arrow of Time. entropy
past future
So, have we explained the arrow of time? Not entirely. The microscopic
description of the second law shows that, if a macroscopic system starts off in a
low entropy state it is overwhelmingly likely to move to the high entropy states
that can be realized in a larger number of ways. However, the arrow of time
ceases to exist once the system has reached equilibrium, so one large ques-
tion remains: why did the Universe start in a low entropy state? According to
our analysis, the initial configuration of the Universe must have been one that
can be realized in only a relatively small number of ways (low W). This was a
state of low probability in the sense that if we considered the totality of macro-
states the Universe can have then the actual state in which it began belongs to
a tiny subset of these. This is rather like the first example we considered, the
ink droplet in water – the system starts in a very special low probability, low
entropy state and evolves toward a higher probability, high entropy equilib-
rium. Our Universe is also evolving toward a high entropy equilibrium state.
The assumption that the Universe began in a low entropy state is sometimes
called the “past hypothesis.”
10.8 EXERCISES
T2
efficiency ≤ 1 −
T1
11
Oscillations
11.0 OSCILLATIONS
An oscillator undergoes regular periodic motion about a fixed equilibrium
position (or value). The simple model of a mass oscillating on a spring can be
adapted to explain a wide variety of physical phenomena, from lattice vibra-
tions in a crystalline solid to the effects of seismic waves. The mathemati-
cal analysis developed to describe mechanical oscillators can also be used for
electrical and electromagnetic oscillations and is the starting point for under-
standing all kinds of wave motion.
support
spring
amplitude of
oscillation: A
mass equilibrium position
connection to
oscilloscope
ultrasonic
Data logger
position
sensor
5 displacement / cm
4
3
2
1
me / s
0
-1 0 1 2 3 4 5 6
-2
-3
-4
-5
The graph above shows a simple harmonic oscillation with amplitude of 4.0
cm, time period of 2.0 s, and frequency of 0.50 Hz. At t = 0, the oscillator is
at its maximum positive amplitude so the displacement x of the oscillation can
be represented by cosine function:
x A cos t
The term in the bracket is called the phase of oscillation. It is an angle in
radians.
The oscillator must complete one cycle in a time T so wT= 2π and ω = 2π/T =
2pf. This is called the angular frequency of the oscillation.
x A cos 2ft
x A cost
dx
v A sin t
dt
dv
a 2 A cos t 2 x
dt
There is a π/2 phase difference between v and x and between a and v. There
is a π phase difference between a and x.
The three graphs on the next page show how these graphs are related to one
another. Note the maximum values of velocity and acceleration and when
these occur:
vmax A when displacement is zero
amax 2 A when displacement is maximum and velocity is zero
displacement / cm
displacement
5
4
3
2
1
0 me / s
-1 0 1 2 3 4 5 6
-2
-3
-4
-5
v = dx/dt
1
velocity / cms velocity Max. velocity
5
15
vmax = A = 2fA
10
occurs at zero
5 displacement
0 me / s
0 1 2 3 4 5 6
-5
-10
-15
a = dv/dt
Max. acceleraon
acceleraon / cms
2
acceleraon
50 2 2
amax = A = 4 fA
40
30
occurs at max.
20
displacement
10
0 me / s
-10 0 1 2 3 4 5 6
-20
-30
-40
-50
displacement / cm
5
4
3
2
1
time / s
0
-1 0 1 2 3 4 5 6
-2
-3
-4
-5
acceleration = −
(constant) × (displacement from equilibrium)
a = − w2 x
d2 x
2
2 x
dt
d2 x
x A cos t 2 x
dt 2
As can be easily shown by differentiating this twice. It then follows that:
ω = 2pf = 2π / T
Notice that if we had not used a squared term for the constant in the equation
of motion we would have had a square root in this solution.
k 1 k
so and f
m 2 m
kx0 = mg
k(x + x0) − mg = kx
kx0 k(x0 + x)
equilibrium
x
mg
mg
Resultant force is in the opposite direction to the displacement from equilib-
rium so:
F = − kx
This satisfies our conditions for simple harmonic motion so an ideal mass-
spring oscillator is a simple harmonic oscillator. Now we can use angular fre-
quency to find the frequency and time period:
For the simple harmonic motion: F m2 x
For the mass-spring oscillator: F kx
k
Therefore:
m
1 k m
Giving: f and T 2
2 m k
l Direction of
resultant
force
A B mg
x
The free-body diagram on the right shows that the resultant of the two forces
(tension and weight) acting on the bob will be mg sin θ along the line shown.
If we consider the horizontal motion (along AB) then the resultant force act-
ing horizontally is:
F mg sin cos
It is clear from the diagram on the left that sin θ = x/l so:
mgx
F cos
l
This does not satisfy our condition for simple harmonic motion because the
resultant force is proportional to x cos θ and not just to x (cos θ is itself a func-
tion of x). However, for small angles, cos θ is approximately equal to 1 so the
smaller the angular amplitude of the oscillation the closer it will be to an ideal
simple harmonic oscillator:
mg
As θ → 0 cos θ → 1 and F → x
l
Under these conditions, the constant of proportionality between F and x
is (mg/l).
mg
Therefore:
l
1 g l
Giving: f and T 2
2 l g
Notice that this is independent of the mass of the bob. This is because the
resultant force is directly proportional to the mass so mass will cancel when
calculating accelerations:
a = F/m = (constant×m)/m.
This is for the same reason that objects of different mass fall with the same
acceleration in the same gravitational field.
3 energy / J
one cycle
2.5
2
TE
1.5 PE
KE
1
0.5
time /s
0
0 0.1 0.2 0.3 0.4 0.5 0.6
Notice that both KE and PE reach their maximum values twice per oscilla-
tion, this is because they are related to the square of the sine and cosine.
3
energy / J
2.5
2
TE
1.5
PE
1 KE
0.5
0 position / cm
-6 -4 -2 0 2 4 6
11.5.3 Damping
Real oscillators are subject to frictional forces which oppose their motion.
The oscillator must do mechanical work against these forces so (unless it is
driven by an external energy source) its total energy decreases with time and
so does its amplitude. The oscillator is “damped.” The heavier the damping,
the greater the rate at which the oscillator loses energy and the greater the
rate of decay of its amplitude. In many cases, the amplitude decays approxi-
mately exponentially. This occurs when the oscillator loses the same fraction
of its total energy on each oscillation.
5 displacement / cm
4
3
2
1
time /s
0
-1 0 0.1 0.2 0.3 0.4 0.5 0.6
-2
-3
-4
x Ae t cos t
amplitude
strong
of oscillator
resonance
peak
effect of
increasing
damping
driver
amplitude
driver
frequency fd
fd = f0
phase difference
between driver and
driven
/2
0 driver frequency
fd
11.7 EXERCISES
1. A particle moves with SHM between points A and C in the diagram below.
Where is it at the instants when:
A B C
(a) It is stationary?
(b) It has the maximum velocity to the right?
2. The diagram below shows a trolley tethered between two similar springs.
When it is displaced from equilibrium and released it undergoes periodic
oscillations about an equilibrium position. Neither spring goes slack dur-
ing these oscillations and both springs obey Hooke’s law in compression
and extension. The trolley has a mass of 0.80 kg.
M
A B
equilibrium position
(i) The period of the oscillations will change if the mass is changed or
the stiffness of the springs is changed. Why?
(j) After the trolley is released its oscillations gradually die away. Explain
why this happens.
(k) As the oscillations decay successive amplitudes are:
6.0 cm, 5.4 cm, 4.9 cm, 4.4 cm, 3.9 cm, 3.5 cm, 3.2 cm
x 2.5 cos 10 t
12
Rotational Dynamics
12.0 INTRODUCTION
The circle is a simple geometric shape that is used throughout science to
model cyclic processes and is often the starting point for more complex theo-
ries (e.g., of planetary motion or electron orbits). Simple harmonic motion
can be regarded as a projection of circular motion onto a diameter making the
concept of phase very clear. Rotating vectors, or phasors, are powerful ways to
model oscillations and waves.
12.1 ANGLES
12.1.1 Measuring Angles in Radians
The degree is an arbitrary division of the circle into 360 equal parts. This is a
useful measure of angle but in physics, it is often simpler to work in a different
unit, the radian. The reason for this is that the radian is defined directly from
the geometry of the circle.
Definition: The angle θ at the center of a circle subtended by an arc of
length l is equal to the ratio l / r where r is the radius of the circle:
= l / r
2 radians = 360
radians = 180
(h = radius of
circle)
sin
h
h
o l cos
h
As 0 o l and
a a h (radius)
therefore:
cos 1
sin
tan
cos
The significance of this is that, for small angles, we can replace the sine or
tangent of the angle with the angle itself (in radians). But how small is small?
Like all approximations, this depends on how precise a value is needed. The
table below shows that the approximations work to better than 1% when the
angle is 0.1 radians and better than 2% for 0.2 radians.
Angle in radians Sine of angle Cosine of angle Tangent of angle Angle in degrees
1.00 0.842 0.540 1.56 57.3
0.50 0.479 0.878 0.546 28.6
0.30 0.296 0.955 0.309 17.2
0.20 0.199 0.980 0.203 11.5
0.10 0.0998 0.995 0.100 5.73
Taking t 0 gives:
v
or v r
r
This is a particularly useful relation.
d d 2
dt dt 2
For circular motion the radius is constant so:
d d v 1 dv a
dt dt r r dt r
where a is the tangential acceleration (not to be confused with centripetal
force).
a loose object would behave inside a moving vehicle that is initially moving in
a straight line and then begins to turn in an arc of a circle. The object is free
to slide inside the vehicle and is initially traveling at the same speed as the
vehicle. The upper diagrams show the view from outside the vehicle and the
lower diagrams show the view from inside the vehicle.
v v
v
In the first case (upper diagrams) we are describing the physics from an exter-
nal non-accelerating (inertial) frame of reference. Newton’s laws of motion
apply so the object continues to move in a straight line in the absence of a
resultant force. There is no acceleration or resultant force until the side of the
vehicle has moved inwards and collides with the sliding object.
In the lower case, the apparent acceleration is because the observer is inside
the vehicle and does not take into account his own acceleration. In order to
explain the apparent acceleration of the object he introduces an imaginary
outward force, the centrifugal force.
Centrifugal forces are examples of “inertial forces,” introduced in order to
explain observed physics from a non-inertial (accelerating) frame of refer-
ence. Unlike the centripetal forces in an inertial reference frame, inertial
forces have no physical origin and that is why they are referred to as imagi-
nary. They are helpful if we need to solve physical problems inside a rotating
reference frame but we must always bear in mind that they are an artifact of
our reference frame and do not arise from physical causes.
v
v
a
a
v
a
a
v
a
a
v v
When you swing a stone in a circle on the end of a string the forces acting on
the stone are directed toward the center of the circle. If the string suddenly
breaks the stone has no resultant force acting on it and flies off along a tan-
gent, it does not accelerate outwards because there is no centrifugal force. It
moves in a straight line at constant velocity. (Here we have ignored the effects
of other external forces such as gravity).
A v
Note that centripetal force is not a new kind of force in physics. It is the
magnitude of the resultant force on a body moving with constant uniform
circular motion. It arises as a result of the real physical forces that act on it.
In some cases this is simple. For example, the Moon’s orbit around the Earth
is approximately circular and the centripetal force is provided by the gravita-
tional attraction toward the Earth.
Fgrav
Moon
Earth Moon
later
4 2 mr
Fgrav mr 2
T2
T is the Moon’s orbital period, r is its orbital radius and m is its mass.
A more complex situation involves an object moving in a vertical circle in a
uniform vertical gravitational field. Examples might be a person on a funfair
ride, a plane looping the loop, or just a stone on a string. If the motion is
uniform (constant angular velocity) the resultant force stays constant in mag-
nitude but the forces that contribute to it change. The example below shows
a person standing in a capsule on a fairground ride and indicates the forces in
four different positions. The capsule is rotating at constant angular velocity.
Only two forces act on the man. His weight mg is the same in all positions but
the contact force R from the floor of the capsule changes with position. The
resultant force at all points is the vector sum of weight and reaction force and
must be F = mrw2 toward the center of the circle.
g R
mg
R R
D B
mg mg
C
mg
Position A:
Weight and contact force act in the same direction so, mrw2 = mg + RA
RA = mrw2 − mg
For one particular angular velocity w0, the contact force falls to zero, RA = 0.
g
This is when mrw02 = mg so 0 .
r
Under these conditions, the contact force between the man and the floor is
zero and he feels “weightless.” This is only apparent weightlessness because he
is in fact free falling at this moment and the capsule is accelerating downwards
at g. For higher values of angular velocity, there must be a contact force from
the floor. For lower values of angular velocity, he will lose contact with the
floor and begin to fall downwards toward the roof of the capsule. In practice,
this sets a minimum value for the practical angular velocity in such a fair-
ground ride.
Position C:
Weight and contact force act in opposite directions so, mrw2 = RA − mg
RA = mrw2 + mg
The contact force is greater than his weight. He experiences this through the
reaction force from the floor pushing up on his feet. This makes him feel
heavy at the bottom, as if his weight has increased. Once again this is only an
apparent increase in weight since the gravitational forces have not changed.
RD RB
mg mg
2
Resultant = mr Resultant = mr2
RD sin = mg RD sin = mg
Positions B and D:
In both positions the contact force has a vertical component that balances the
weight and a horizontal component that provides centripetal force:
length of the phasor, A, defines the radius of the circle and is equal to the
amplitude of the simple harmonic motion. The constant angular velocity ω
of rotation of the phasor is equal to the angular frequency of the simple har-
monic motion. The angle the phasor turns through is the phase of the simple
harmonic motion.
displacement
t=0
A
t = 3T/4 me
t = T/4
t = T/2 A
t=0 t = T/4 t = T/2 t = 3T/4 t=T
Phasors are a useful mathematical tool that is particularly useful when analyz-
ing wave superposition.
x = A cos
A A cos
For rotaon at constant angular velocity:
= t
Therefore: x = A cos t
ds d
Velocity v= Angular velocity
dt dt
dv d 2 s d d 2
Acceleration =
a = Angular acceleration
dt dt 2 dt dt 2
s
u v t
→
i
f t
2 2
1 1
s ut at2 → i t t
2 2
1 2 1
s vt at → f t t2
2 2
v u 2 as
2 2
→ f i 2
2 2
These are used in exactly the same way as the original suvat equations.
mi
ri vi
CM
The kinetic energy of the ith particle is: kei = ½ mivi2 = ½ miri2w2
The rotational kinetic energy of the body is:
i N i N
1
RKE kei m i ri 2 2
i1 i1 2
1 i N
RKE m i ri 2 2
2 i1
The term in brackets is called the “moment of inertia” I of the body
i N
I m i ri 2
i1
The SI unit for a moment of inertia is kgm2.
h
2 mgh
v
I
m 2
r
The larger the moment of inertia the lower the final linear velocity. This is
because a larger fraction of the energy goes into rotation. A cylinder has a
larger moment of inertia than a sphere because more of the mass is farther
from the axis. If a cylinder and ball are rolled down the same slope side by side
then the ball will reach the bottom first.
If the surface is completely smooth and the object slides without rolling then
there is no rotational kinetic energy and the expression above reduces to:
v = 2 gh
This shows that a block sliding down a smooth slope will always get to the bot-
tom faster than any round-rolling object on the same slope.
m
Magnitude of linear momentum p = mv
mi
ri vi
CM
i N
L mi ri 2 I
i 1
i N
Notice that the “moment of inertia” I of the body, m i ri 2 , once again plays
a role analogous to mass in linear mechanics i 1
Torque = force
i F perpendicular distance
r from pivot P
P = Fr
sum of torques from these individual forces must equal the total resultant
force on the body.
mi
ri fi
CM
This has the same form as Newton’s second law in linear mechanics:
d I
so I dt
dt
that is, change of angular momentum = angular impulse
They also show that when systems interact the torques experienced by each
system are equal and opposite so while they interact they exert equal and
opposite angular impulses on one another. Consequently, any change of
momentum of system A is equal and opposite to the change of momentum of
system B and the total angular momentum of the combined system does not
change. This can be stated as the law of conservation of angular momentum:
The angular momentum of an isolated system (no external resultant
torque) is constant.
An interesting example of this is when an ice skater pirouettes. She balances
on the point of one skate while rotating relatively slowly with arms and one leg
outstretched. Gradually she draws her arms and leg closer to the axis of rota-
tion, thus reducing her moment of inertia. There is no external torque so her
angular momentum (L = Iω) cannot change. If I falls ω must increase, so she
spins faster. Another similar example is the collapse of the core of a massive
star at the end of its life. Its radius reduces by many orders of magnitude so
the rotation rate can be very high – some neutron stars have rotation periods
of less than one millisecond.
m
r
P
For several point masses at various distances from the center of rotation:
r1 m1
r2
r3 m2
m3
i N
I m i ri 2 m1 r12 m2 r2 2 m3 r3 2
i 1
l
P
x
x
Let the total mass of the rod be m. The mass of the short section of length dx
mx
at distance x from the center of rotation is m and the moment of iner-
l
mx2 x
tia of this section (treated as a point mass) is I .
l
Solid cylinder is
made up of an
r infinite number
of infinitely thin
shells.
The diagram at the top of the next page shows an end view of a solid cylinder
of total mass m and radius r. A thin shell at radius x is shown.
The moment of inertial of a cylindrical shell is δI = dmx2 and the mass of the
2 xmx 2 xmx
shell is a fraction of the mass of the cylinder given by m .
r 2 r2
This is because the area of the shaded strip above is effectively 2pxdx if treated
like a long thin rectangle.
The moment of inertia of the entire cylinder is then:
xr
x r
2 mx3 dx 1 2
Icylinder I r 2 2 mr
x0 x0
rota
on axis is:
I = ½ x2 m
m r 2 sin 2 x
dx
However, x and θ are related by: x = r cos θ so r sin
d
so we can replace dx by −r sin θ dθ
1
Isphere
0
r 5 sin 5 d
2
This can be integrated using standard techniques to give:
8r 5
Isphere
15
3m
Now replace the density with: to give:
4 r 3
2
I sphere = mr 2
5
W Fds → W d
Power: P = Fv → P = Γw
P P
h h
CM CM
mg
Equilibrium
The right-hand diagram above shows how the weight produces a restoring
torque Γ = − mgh sin θ (negative sign indicates that this is directed toward
equilibrium) when the body is displaced through angle θ.
d d2
From Newton’s second law: I I 2
dt dt
The equation of motion for this compound pendulum is therefore:
d2 mgh sin
2
dt I
For small angles sin θ → θ (in radians) so for small angles, we can write
(approximately):
d2 mgh
2
dt I
This has the same mathematical form as the equation of motion for simple
harmonic motion
d2 x
2
2 x
dt
so it will have sinusoidal solutions of the same kind.
mgh
2
I
I 1 mgh
giving: T 2 and f
mgh 2 I
12.9 EXERCISES
1. (a) Use Newton’s laws of motion and a suitable diagram to show that for
an object to move in uniform circular motion there must be a result-
ant force acting toward the center of the circle.
(b) Explain why a centripetal force cannot do work on an object moving
in circular motion.
2. A small ball of mass m is released from a height h on a track that leads to
a looping section as shown below.
h
r
B
(b) Draw free-body diagrams to show the forces acting on the ball at
points A, B, and C.
(c) Discuss whether the resultant force on the ball is toward the center
of the circle at all points as it completes the loop.
(d) Derive an expression for the velocity of the ball at point C.
3. A car tire of radius r is rolling along a flat horizontal surface at constant
speed v. Copy the diagram below and add labeled arrows to show the
velocity and acceleration of a particle fixed to the tire at each of the
points A to D.
D B
40 m
400 m
(a) Estimate the moment of inertia of the space station about its rota-
tion axis by treating the pods as point masses concentrated at their
centers of mass and by treating the connecting tunnel as a rod. The
moment of inertia of a rod of mass m and length l about its CM is
1
given by I = ml 2 .
12
(b) Explain why an astronaut standing on the outer edge of a pod would
experience artificial gravity.
(c) Calculate the angular velocity of rotation that will create an effect of
artificial gravity of strength 9.8 Nkg− 1 at the outer edge of the pods.
(d) Calculate the rotational kinetic energy of the space station when its
angular velocity is 0.20 rads− 1.
7. A small flywheel has a moment of inertia of 2.0×10− 3 kgm2 and it is rotat-
ing at 50 revolutions per second. The frictional torque working against
rotation is 2.4×10− 2 Nm.
(a) Calculate the angular momentum of the flywheel including appro-
priate units.
(b) Calculate the time taken for the flywheel to come to rest.
8. A group of children is sitting on the outside of a merry-go-round that is
rotating at a constant rate. They all move toward the center of the merry-
go-round and it speeds up. Explain why this occurs and discuss the angu-
lar momentum and energy changes that take place in this system. Assume
that external torques can be ignored.
9. The Earth’s rotation is slowing because of its tidal interaction with the
Moon. This causes the day length to gradually increase. Day length
increases by 1 second roughly every 18 months.
(a) Use this information and the data below to calculate the torque act-
ing on the Earth to slow down its rotation. Assume the Earth is of
uniform density.
Mass of Earth = 6.0×1024 kg. Radius of Earth = 6400 km
Number of seconds in a day = 86400
Moment of inertia of a solid sphere of mass m and radius a is
I = 2/5 ma2
(b) The Earth is not uniform, its density increases toward its center.
State and explain how this would affect your answer to (a).
13
Waves
13.0 INTRODUCTION
There are many different types of wave but the underlying physics is com-
mon to all of them, whether they are mechanical vibrations in a medium, like
sound, or vibrations of an electromagnetic field, like light. They all transfer
energy, reflect, refract, diffract, and interfere. The wave model is one of the
most important models in physics and in the next four chapters we will inves-
tigate waves in considerable detail.
This is called the phase velocity of the wave. It is the movement of the wave
disturbance but not the outward movement of matter. The diagram below
shows how this appears from above. The circular lines of constant phase are
called wave fronts and the arrows are called rays. Rays and wave fronts are
perpendicular to one another and are alternative ways to represent the wave
pattern. The separation between two adjacent wave fronts is called the wave-
length λ of the wave.
ray
The time period T for the formation of one wave is equal to the time period of
the source oscillations so all particles in the wave oscillate with the same fre-
quency f as the source. During one oscillation of the source, the wave moves
forward a distance equal to its wavelength so the phase velocity v of the wave
is given by:
v f
T
We can now define some terms:
Traveling or Progressive wave
A wave where all the particles oscillate with the same (or a decaying)
amplitude but with a progressive phase delay in direct proportion to their
distance from the source.
Wavelength λ
Shortest distance between two particles oscillating in phase in the wave.
Time period T
Time for one complete wave to leave the source or time for a particle to
complete one cycle of oscillation.
Frequency f
1
Number of waves leaving the source in 1 second: f = .
T
Phase velocity v
Velocity at which a point of constant phase (e.g., a wave crest) travels away
from the source:
v f
Wave front
Line of constant phase in the wave pattern. Perpendicular to rays.
Ray
Arrow in the direction of energy transfer. Perpendicular to wave fronts.
phase velocity v
A
direction of
oscillations
direction of
energy transfer
A
As the wave moves to the right, the disturbance at each position varies verti-
cally with the same amplitude as the source (assuming no energy dissipation).
All electromagnetic waves are transverse, as are seismic S-waves (secondary
or shear waves).
oscillation
direction
phase
wavelength velocity v
Longitudinal Waves
Vibration directions are parallel to the direction of energy transfer. This
results in regions of compression (shown as dark areas below) and rarefaction
(low density, shown as lighter areas below) in the medium through which the
wave passes.
As the wave moves to the right, the disturbance at each position also varies
horizontally, with the same amplitude as the source (assuming no energy dissi-
pation). Sound and ultrasound are longitudinal waves, so are seismic P-waves
(primary or pressure waves).
Some waves are a combination of longitudinal and transverse waves. This
results in particles undergoing elliptical motions as the wave passes. Surface
water waves are like this.
direction of motion
of water wave
longitudinal transverse
component component
wave disturbance
(e.g. displacement)
versus position at x
one time
A
wave disturbance
(e.g. displacement)
versus time at one t
position
A T
These graphs are similar to one another but represent different things. The
repeat distance in space is the wavelength and the repeat distance in time is
the time period of the wave.
source oscillates with simple harmonic motion, then the displacement at the
source (x = 0) can be written as:
y A cos t
where ω = 2pf.
The oscillation at any other point on the positive x-axis will be given by an
equation of the form:
y A cos t
where δ is a phase delay that depends on x (distance from the source). When
x = λ the phase delay is 2π (two particles separated by one wavelength oscil-
2 x
late in phase). Therefore and so:
2 x
y A cos t
2
The term is called the wavenumber k so we can write the equation for a
1D traveling wave of amplitude A moving in the positive x-direction in the
simple form:
y A cos t kx
If the wave is moving in the negative x-direction the sign in the equation
changes:
y A cos t kx
t kx
d dx
k kv 0 (for the position of constant phase).
dt dt
2 f
v f
k 2
v f
I ∝ A2
13.2 REFLECTION
When a wave strikes a boundary, it can be wholly or partially reflected. The
law of reflection states that:
The incident and reflected rays make equal angles to the normal to the
surface at the point of incidence and both rays and the normal lie in the
same plane.
normal
i r
If two plane reflectors are placed at 90° to one another then a ray striking
either one of them will return parallel to its original path. This is used in car
reflectors and was used by the Apollo astronauts who left an array of corner
reflectors on the surface so that lasers sent from Earth would reflect back and
allow the distance between the Earth and Moon to be measured precisely.
i1 = r1
i2 = r2 = 90 i1
i1 Ori
Original incident ray has turned
r1 through a total angle of:
thro
13.3 REFRACTION
13.3.1 Refraction at a Boundary Between Two Different Media
Wave velocity depends on the medium through which the wave is traveling.
For example, light slows down when it travels from air into glass, and surface
water waves slow down when they travel from deeper to shallower water. If
the wave strikes the boundary between two different media at an angle to the
normal then the wave direction changes. This is called refraction.
medium 1: v1 medium 1: v1
1 1
2 2
medium 2: v2 medium 2: v2
No waves are created or destroyed so the frequency must be the same in both
media. Therefore:
v2 2
v1 f 1 and v2 f 2
v1 1
Sheet of A3 white
paper fixed to
drawing board
ray
box
1
2
Rectangular
glass block
The position of the edge of the block and the normal to this line are marked
onto the white paper and the ray box is used to direct a single fine ray at the
point where these two lines intersect. The ray is traced using optical pins
(shown by X on the diagram) pushed into the board. The block can then be
removed and the path of the ray outside and inside the block can be drawn
using a pencil and ruler. The incident angle θ1 and the refracted angle θ2 are
then measured using a protractor. If this is done for incident angles in the
range 0 to 90° then a graph of sin θ1 against sin θ2 is a straight line.
sin 1
sin 2
This is called Snell’s law and the constant is called the relative refractive
index 1n2 for a ray passing from medium 1 into medium 2.
1 Snell’s law:
sin 1
1 n2
sin 2
2
If medium 1 is the vacuum, then this constant is the absolute refractive index
for medium 2.
In practice, since the speed of light in air is almost the same as the speed of
light in a vacuum, the absolute refractive index is usually used when medium 1
is air.
Refraction occurs because the wave velocity changes at the boundary. This
implies that refractive index must be related to this change in velocity.
The diagram below can be used to derive this relationship.
B
1
Medium 1 1
Medium 2 A 2 C
D
2
AB and DC are adjacent wave fronts so they are separated by one wavelength:
v1
BC= 1 =
f
v2
AD= 2 =
f
It is also clear that with a greater ratio of velocities, more refraction will occur
and that rays of light will bend toward the normal as they enter a medium with
a lower speed of light and away from the normal when they enter a medium
with a higher speed of light.
Air
Glass
c
v1 n1 n2
=
1n = =
2
v2 c n1
n2
This gives us the most useful form of Snell’s law:
n2 sin 1
n1 sin 2
n1 sin 1 n2 sin 2
a ray of light at a semi-circular glass prism. If the ray is aimed along a radius
there is no change of deflection on entering the prism.
partial reflection
TIR
TIR
1 < c 1 = c 1 >c
The condition for the critical angle is that the refracted angle is equal to 90°:
n1 sin 1 n2 sin 2
=
n1 sin c n=
2 sin 90 n2
n2
sin c =
n1
If medium 2 is the vacuum (or air) then:
1
sin c =
n
Where n is the absolute refractive index of the transparent block.
Total internal reflection can only occur when light travels from a material of
higher refractive index to one of lower refractive index.
Each time the ray reaches the boundary its incident angle is greater than the
critical angle so it is repeatedly totally internally reflected. Light has a very
high frequency so it can be modulated to carry a great deal of information.
There are two main types of optical fiber: mono-mode fibers, which effec-
tively only allow a single path for the light (by having an extremely narrow
core of about 10 µm), and multi-mode fibers which are much thicker and
allow multiple light paths. The disadvantage of multi-mode fibers is that, over
a long distance, different parts of the signal have traveled significantly differ-
ent distances and develop time delays. If these become comparable to the
time between ones and zeroes in the digital signal then the information is lost.
The longer the fiber the lower the maximum data transfer rate, so they tend
to be used over shorter distances, for example, within a single building. The
advantage of multi-mode fibers is that they are cheaper and can carry the light
of multiple wavelengths so they are often used with LEDs rather than lasers.
Mono-mode fibers use laser sources working at a single wavelength and can
transmit high data rates over great distances (up to thousands of kilometers).
13.3.6 Dispersion
The amount of refraction at a boundary depends on the refractive index at that
boundary. However, absolute refractive indices depend on the wavelength of
light (because the speed of light in a medium depends on wavelength). This
means that if a polychromatic ray (i.e., one with a range of wavelengths pre-
sent, such as white light) refracts at a boundary, different wavelengths will
refract in different amounts. This is called “dispersion” and is familiar from
the way a triangular glass prism can disperse white light into a “spectrum” of
colors.
For glass the higher frequency, shorter wavelength end (blue/violet end) of
the spectrum travels more slowly than the lower frequency, longer wavelength
end (red/orange) of the spectrum, and so has a higher refractive index. For a
certain type of crown glass nred = 1.509 while nviolet = 1.521, a small difference
but one that is clearly demonstrable.
13.4 POLARIZATION
13.4.1 What Is Polarization?
Longitudinal waves vibrate parallel to the direction in which they transfer
energy, so there is a unique vibration direction. Transverse waves, however,
vibrate at 90° to the direction in which they transfer energy so they can vibrate
in any direction perpendicular to the direction in which the wave is traveling.
If a transverse wave is confined to oscillate only in one plane it is said to be
plane-polarized. Transverse waves can be polarized but longitudinal waves
cannot.
For example, if the only vibration direction is vertical then the wave is verti-
cally plane polarized.
possible
vibration direction of energy transfer
directions
vertical
vertical vibrations
vibrations
direction of
energy transfer
In the diagram below we are looking in the direction of wave travel with the
wave moving away from us. The diagram shows a polarizing filter that will
transmit vertically plane-polarized light. The light incident on the filter is
plane polarized at an angle θ to the vertical.
Transmitted
A cos
A
Absorbed
A sin
fraction of
maximum 1
intensity
0.8
0.6
mean value: 0.5
0.4
0.2 angle of
polarising filter
0
0 30 60 90 120 150 180 210 240 270 300 330 360
be rotated and the angle between the two filters can then be read off from a
fixed scale. With no sample, the maximum intensity will be when the filters
are aligned. If the sample rotates the plane of polarization the second filter
can be rotated until the intensity is once again a maximum and the angle of
rotation can be measured.
direction
of wave
sample rotates
polarization
direction by filter must be
light source:
rotated by to
unpolarized
find maximum
unpolarized polarized
B
B B
2
partially
polarized
We can derive an expression for the Brewster angle θB by using Snell’s law and
the law of reflection. Let the refractive index of the first and second media be
n1 and n2, respectively.
n1 sin θB = n2 sin θ2
but θ2 = 180 − (90 + θB) = 90 − θB
n1 sin θB = n2 sin (90 − θB) = n2 cos θB
n2
tan B
n1
This is “Brewster’s law.”
Photographers use polarizing filters to enhance contrast, for example, to
darken the sky compared to the clouds, or to reduce glare from reflective sur-
faces (e.g., water or windows). Polarizing sunglasses are also used to reduce
reflected glare by absorbing on component of polarization.
13.5 EXERCISES
y 0.25 cos 30 t 2 x
3. The diagram below shows a pin placed symmetrically between two plane
mirrors that are perpendicular to one another.
pin
An observer looking into the mirrors can see three images of the pin.
Draw a careful ray diagram to locate the positions of these images.
4. Here are some absolute refractive indices for different media.
Vacuum n = 1 (exactly)
Air n = 1.000293 (at s.t.p.) .... usually taken to be 1.00
Diamond n = 2.42
Glass n = 1.50
Water n = 1.33
Speed of light in a vacuum c = 3.0×108 ms− 1
The table below refers to a ray of light traveling from medium 1 to
medium 2 as shown in the diagram.
1
2
medium 1 medium 2
(a) Complete the table (you will need to use values from the list above):
Medium 1 Medium 2 v1 v2 θ1 θ2
Glass 3.0 × 10 ms
8 −1
40 0
(b) Explain what is meant by total internal reflection and state the con-
ditions under which it occurs.
(c) Calculate the critical angle for an interface between:
(i) water and air,
(ii) glass and air,
(iii) diamond and air.
Suggest why a real diamond sparkles more than a fake glass
(d)
“diamond.”
5. The refractive index n of glass can be determined by measuring the mini-
mum angle of deviation D when light passes through a triangular prism
of apex angle A:
A
D
AD
sin
n 2
A
sin
2
Derive this formula.
6. Two polarizing filters are placed at 900 to each other. A third polarizing
filter is placed between them and slowly rotated. Unpolarized light is
directed into the system of three filters.
(a) Explain why the intensity of light passing through the system of
three filters reaches a maximum when the third filter is at 450 to the
direction of the first polarizing filter.
(b) Calculate the fraction of the incident intensity that passes through
the system when the middle polarizing filter is at 30° to the first
polarizing filter.
14
Light
14.1.2 Electromagnetism
In the 19th century, Michael Faraday and James Clerk Maxwell tried to make
sense of all the different electromagnetic phenomena that had been discov-
ered at that time. This included:
how charges exert forces on one another (see Section 17.3.1)
how electric currents create magnetic fields (see Section 20.3.1)
how magnetic fields exert forces on moving charges and electric currents
(see Section 20.2.2)
how changing magnetic fields can induce voltages (see Section 21.1.1).
direcon of electric
field vibraons
direcon of energy
transfer
The polarization direction is, by convention, the direction in which the electric
field vibrates. In the example above, the polarization is vertical.
It is important to realize that this diagram represents the variation of the two
fields at a point ON the red line. They do not represent physical motions of
anything off the line. As the wave passes a point the electric and magnetic
fields at that point increase and decrease in strength periodically. If a wave
like the one above strikes a material medium the two fields will cause charges
in the surface to vibrate at the frequency of the wave. This is how a radio
antenna detects radio waves – the vibrating charges in the antenna set up a
weak a.c. signal that can be amplified.
Maxwell’s equations show that electromagnetic waves are emitted when
charges accelerate (e.g., while oscillating) and cause charges to accelerate
when they are absorbed.
Visible light consists of part of the much larger electromagnetic spectrum.
Our eyes respond to wavelengths from about 390 nm (violet) to 700 nm (red)
which corresponds to a frequency range of 4.3 × 1014 Hz to 7.7 × 1014 Hz. The
main regions of the electromagnetic spectrum are shown below.
frequency / Hz
10 24
10 2
10 20
10 18
10 16
10 14
10 1
10 10
10 8
10 6
10 4
102 100
micro
visible
gamma X-rays UV IR
radio waves
rays wave
0 2 4 6 8
10−1 10−14 10−1 10−1 10−8 10−6 10−4 10− 10 10 10 10 10
wavelength / m
You might still be wondering what it is that actually vibrates when an electro-
magnetic wave passes through a vacuum – calling it the electric and magnetic
field does not really answer the question. 19th-century physicists were also
puzzled about this and assumed that there must be some invisible medium
filling all of space so that electric and magnetic fields are distortions of this
medium and electromagnetic waves involve vibrations in the medium. They
called the medium the “luminiferous aether” but no attempt to demonstrate
its existence was ever successful and Einstein’s special theory of relativity
abandoned it (see Section 24.1.2).
J
Maximum
J
separaon
E
Minimum
separaon
The first terrestrial measurement of the speed of light was made by Armand
Fizeau in 1849. He employed an ingenious method using a rapidly spinning
toothed wheel to chop light into short pulses. The pulses then hit a distant mir-
ror and reflected back to the wheel. As the speed of the wheel increased the
returning light hit the next tooth and was blocked. This meant that the time
taken for the light to travel to the mirror and back was equal to the time taken
for the wheel to rotate by the angle between a tooth and the next gap. He meas-
ured the rate of rotation at which this occurred and used this to find the time
of flight t of the light pulse. The speed of light was then calculated from the
distance to the mirror, d, and back divided by the time of flight: c = 2d/t.
spinning
mirror
toothed
wheel
*
d
observer
To oscilloscope to
record emied pulse
Mirror
To oscilloscope to d
record detected pulse
This method was modified and improved by Jean Leon Foucault and then by
Albert Michelson. They replaced the toothed wheel with a rotating mirror but
the principle of the method was similar. Foucault even measured the speed of
light in water by placing a tube of water in the light path between the rotating
mirror and the fixed distant mirror. His results showed that light travels more
slowly in water than in air, a result that reinforced the wave model of light
because it was consistent with the wave explanation of refraction at an air–
water boundary (the particle model predicted that the particles of light would
speed up as they entered the denser medium). Michelson spent most of his
life refining methods for measuring the speed of light. He also carried out
the famous Michelson–Morley experiment, which seemed to show that the
speed of light is independent of the motion of the observer, an effect that was
eventually explained by Einstein’s special theory of relativity (see Chapter 24).
What makes it difficult to measure the speed of light in a laboratory is the fact
that it is so fast. This means that we have to measure extremely short time
intervals. Nowadays, high-speed electronic devices can do this and the meas-
urement can be carried out successfully over a distance of just a few meters.
The principle is very similar to that used by Fresnel, Foucault, and Michelson,
short pulses of light are generated by an LED laser and a fast oscilloscope is
used to detect the emission of the pulse and the detection of its reflection
from a mirror placed a few meters away.
One slight complication with this method is that the time it takes for the elec-
tronic processing of the signals is comparable to the time of flight of the light
pulses, so the time t1 measured on the oscilloscope must be corrected for this.
One way to do this is to place the emitter and detector so that they are facing
each other and the light path is virtually zero. There will still be two separate
peaks on the screen and the time delay t2 is now entirely due to the equip-
ment. The actual time of flight of the light pulse is therefore (t1 - t2) and the
speed of light is 2d/(t1 - t2). Typical oscilloscope traces are shown below.
t1 t2
1
c=
ε0µ0
1
v=
ε0 εr µ0µr
where er is the relative permittivity of the medium and mr is the relative perme-
ability of the medium. Whereas the speed of all electromagnetic waves is the
same in a vacuum the values for the relative permittivity and permeability in
a medium vary with frequency, so different parts of the electromagnetic spec-
trum travel at different speeds inside media causing dispersion. This is respon-
sible for the way white light can be spread into a spectrum by a triangular prism.
c
We have already met the equation v = where n is the absolute refractive
n
index of a medium. It follows that the absolute refractive index is related to
permittivity and permeability by:
n= εr µr
the focal length f of the lens. The reciprocal of this value is called the power
of the lens P and is measured in diopters (1 diopter is equivalent to 1 m- 1).
principal axis F
1
P=
f
Concave, or diverging, lenses are shaped in such a way that light traveling par-
allel to their principal axis diverges from a virtual focal point before the lens.
principal axis F
In a real lens, the light will refract as it enters the lens and refract again as
it leaves on the opposite side. A thin lens is one where both refractions can
be assumed to take place at the same point, on a line perpendicular to the
principal axis and passing through the optical center of the lens. Ray diagrams
involving thin lenses are often drawn with the lens itself shown as a single
vertical line passing through the optical center and a small symbol is used to
show the type of lens.
thin lens
In ray optics, the direction of the ray is reversible so a ray passing through the
focal point before it reaches the lens will then travel parallel to the principal
axis on the far side of the lens.
Convex lens – predictable ray 2: a ray passing through the optical center of
the lens does not change direction.
thin lens
Concave lens – predictable ray 1: a ray parallel to the principal axis diverges
from the principal axis on the other side of the lens along a line that traces
back to the focal point.
thin lens
The direction of the ray is reversible, so a ray heading toward the focal point
on the far side of the lens will then travel parallel to the principal axis after it
passes through the lens.
Concave lens – predictable ray 2: a ray passing through the optical center
of the lens does not change direction.
thin lens
14.2.3 Images
An image is a point-by-point representation of an object. A “real image” is
formed when the rays creating the image pass through the image points,
for example, the image formed on the retina of the eye or the image pro-
jected onto a cinema screen. A virtual image is formed when the rays cre-
ating the image diverge from points on the image but do not pass through
those points, for example, the image in a plane mirror or the image of
the bottom of a swimming pool seen through the surface. Since the rays
responsible for the image do not pass through it, a “virtual image” cannot
be formed on a screen.
The size and position of an image can be determined by drawing two predict-
able rays from a single point on an object and tracing their paths. The place
where they intersect is the position where the image point is formed. Objects
are often shown as vertical arrows and the two rays to be traced are from the
tip of the arrow.
u v
ho
F image
object F hi
thin lens
u=f
F
object F
to image
thin lens at infinity
image u<f
F
object
F
diverging
thin lens rays
This ray diagram can be used to explain how a magnifying glass works. Your
eye would be positioned on the optical axis to the right of the lens and would
look through the lens to see the magnified image beyond the object.
F
object
image
thin lens
The image is erect (the same way up as the object) diminished and virtual.
focal plane
Rays from a single
point on distant object
F F
thin lens
For an extended object at a great distance, for example, the Sun, there will be
an angle between the rays from a point on one side of the Sun and the rays
from a point on the opposite side of the Sun. They will be focused in the focal
plane above and below the focal point of the lens and an image of the Sun will
be formed between them. focal plane
Rays from a
single point
on top of Sun
image
of Sun
Rays from a
single point on
boom of Sun thin lens
This creates an image of the Sun that can be projected onto a screen – for
example, to look for sunspots.
The power of a lens depends on its shape and on the material from which it is
made. The higher the refractive index the more the light is refracted and the
shorter the focal length of the lens. However, the refractive index depends on
wavelength, so objects which have a range of colors are not focused sharply
at any one point. This is called chromatic aberration. Achromatic lenses are
composite lenses that use a range of different lenses to compensate for chro-
matic aberration.
u v
ho
F image
object F
hi
thin lens
hi v
=
m =
ho u
Now look at these two shaded triangles which are also similar:
v
u f
ho
F image
object F
hi
thin lens
Therefore:
hi v f v
ho f u
Which can be rearranged to give the lens equation:
1 1 1
= +
f u v
This can be used for convex and concave lenses using the REAL IS POSITIVE
sign convention:
f is positive for a convex lens (real focus) and negative for a concave lens
(virtual focus)
u is positive for a real object and negative for a virtual object
v is positive for a real image and negative for a virtual image
u v
object image
normal
normal
The two shaded triangles are similar so u = v and m = 1. The object and the
image lie on the same normal to the mirror.
Curved mirrors can be used like lenses to focus and project images. A suitably
shaped concave mirror has a real focus and a convex mirror has a virtual focus.
F F
Concave mirrors are used to focus microwaves and radio waves so that a detec-
tor placed at or near the focal point receives a strong signal (e.g., for a radio
telescope or a satellite TV dish). The way to think about curved mirrors is to
realize that the ray diagrams for a concave mirror are like those for a convex
lens, but with a reflection, and the ray diagrams for a convex mirror are like
those for a concave lens with a reflection.
If a convex mirror is actually part of a sphere rays nearer the edge of the mir-
ror focus on a different position than rays near its center. This results in an
unfocused image and is an effect called spherical aberration. To avoid this the
correct shape for a concave mirror is a paraboloid.
sin α
= nwater
sin β
x x
From the diagram: sin α = and sin β =
happt hreal
hreal
nwater =
therefore:
happt
air x
water
β happ hreal
α
image point
β
object point
rays from
point at top of
f0 fe
distant object
α F β
intermediate image
In the ray diagram above, the rays from the top of the object enter the objec-
tive at an angle α to the principal axis. Rays from the bottom of the object are
assumed to enter along the principal axis but are not shown in the diagram.
The angle α is therefore equal to the angle subtended by the object at the
objective and β is the angle subtended by the image at the eye:
α is the angle subtended by the distant object at the objective lens.
β is the visual angle – the angle subtended at the eye by the rays forming
image.
The angular magnification M of the telescope is the ratio β/α.
tan β fo
Using similar triangles and alternate angles it is easy to see that =
tan α fe
However, the angles involved are small so we can use the approximation tan
q ≈ q (in radians) to obtain and expression for angular magnification:
β fo
M= =
α fe
The image is, as can be seen in the diagram, inverted. For astronomical work,
this is not an issue. However, a terrestrial refracting telescope has a third con-
vex lens in between the intermediate image and the eyepiece and this lens
simply inverts the intermediate image so that the final image is erect.
Objecve: concave
mirror of focal length fo
Secondary mirror
Eyepiece: convex
lens of focal length
Radio telescopes are also reflecting telescopes, using a large concave dish to
reflect radio waves to a detector. Radio waves have much longer wavelengths
than light so radio telescopes must have much larger diameters than optical
telescopes (the resolution of a telescope depends on its diameter – see Section
15.3.4). The world’s largest radio telescope is the Tianyan (Heavenly Eye) tel-
escope in Guizhou Province, China. Its objective reflector has a diameter of
500 m and is built into a natural depression in the landscape. The signal from
a radio telescope is detected by placing a receiver close to the focal point of
the objective mirror.
objecve eyepiece
lens lens
intermediate
image
f0 fe
object
F F F
F
eye
Inverted
magnified
virtual image
frequency heard by
staonary observer
siren
approaching frequency of siren
f0 at rest
siren receding
me
Doppler shifts:
v
source and observer approaching with relative velocity v: ∆λ = − λ 0
c
v
source and observer receding with relative velocity v: ∆λ = + λ 0
c
v
∆λ reflection = −2 λ 0
c
The Doppler effect is often used to measure the velocity of a remote object.
Doppler radar can be used to work out the velocity and rotation rate of an
asteroid. The Doppler effect is used by the police in radar speed guns to
detect speeding motorists. Astronomers can use the Doppler shifts of radio
waves in the spiral arms of the Milky Way to work out the rotation rate of our
own galaxy. Doctors use Doppler ultrasound to measure the rate of flow of
blood inside capillaries.
λ´ −λ 0 v
=z =
λ0 c
In a similar way, when a source moves away from us, the light we receive is
said to have been “blue-shifted” (moved to shorter wavelengths).
In the 1920s, the astronomers Edwin Hubble and Vesto Slipher discovered
that the spectral lines in light received from distant galaxies are all red-shifted.
The implication is that distant galaxies are all moving away from us and from
each other. In addition to this Hubble showed that the red-shift is directly
proportional to the distance of the galaxy, so that very distant galaxies are mov-
ing away from us more rapidly than closer galaxies. Since red-shift is directly
proportional to recession velocity this discovery showed that recession veloc-
ity v is directly proportional to distance d. This is called Hubble’s law:
v = H0 d
14.5 EXERCISES
1. The distance to the Moon has been measured by reflecting light from
arrays of mirrors left on the Moon’s surface by Apollo astronauts. The mir-
rors are arranged as corner reflectors that reflect incoming beams back on
themselves. However, the spreading of the beam means that only about
1 in 1017 photons leaving Earth are detected in the reflection! The uncer-
tainty in the measurements of distance is of the order of 1 cm.
4. Complete the table below, which refers to a concave lens of focal length
20 cm.
Object distance Image distance Linear magnification Nature of image (real/virtual)
10 cm
20 cm
30 cm
infinity
15
Superposition Effects
screen or
detecon plane
maximum
*
Source 1 minimum
SUPERPOSITION maximum
Source 2
* minimum
maximum
In order for stable clear interference effects to be created the two sources
must be coherent.
Coherent sources maintain a constant phase relationship.
This means that they must be the same type of wave, and have the same wave-
length and frequency. The sources do not have to be in phase but the phase
difference between them must be constant. They must also have comparable
amplitudes; if one wave has a much greater amplitude than the other then
variations in intensity will be hard to detect.
microphone
M
connecon
S to
Signal
generator oscilloscope
For a maximum the waves must reach the microphone in phase, this occurs
when the path difference is a whole number of wavelengths. For a minimum,
they must arrive in antiphase (an odd multiple of π phase difference), this
happens when the path difference is an odd number of half wavelengths.
There will be a maximum at the center of the pattern because S2M = S1M
here so the path difference and phase difference are both zero. Minima will
occur when x = λ/2, 3λ/2, 5λ/2, etc. and these minima will be positioned sym-
metrically about the central maximum. In between the minima there will be
further maxima when x = λ, 2λ, 3λ, etc. The intensity between maxima will
vary continuously from maximum to zero and then back to a maximum. The
path difference, phase difference, and effects at maxima and minima are tabu-
lated below.
Path difference Phase difference (rad) Resultant intensity
0 0 Central maximum
½λ π Minimum
λ 2π Maximum
/2 λ
3
3π Minimum
2λ 4π Maximum
5
/2 λ 5π Minimum
LASER SUPERPOSITION
single
slit interference
paern
double screen
slit
The diagram above has exaggerated the angle of spread from each slit. In
practice the angles are small and this allows us to use the small angle approxi-
mation when deriving the relationship between the wavelength of the light
and the structure of the pattern. It is also important to realize that the slit
separation is very small compared to the distance between the double slits
and the screen.
Here is an image of an actual double slit pattern formed using a green laser pen.
The small bright vertical patches are interference maxima (sometimes called
interference fringes). Notice that the pattern itself fades in and out. This is a
secondary effect caused by the diffraction pattern from individual slits rather
than the interference pattern between different slits.
15.1.3 Using the Double Slit Experiment to Find the Wavelength of Light
Young’s double slit experiment was used to measure the wavelength of visible
light. We can derive an equation that relates the wavelength of light to three
parameters: the separation of the double slits s, the distance from the double
slits to the screen d, and the separation of adjacent maxima in the interference
pattern y (fringe separation).
P st
1 max.
y
*
S1
s max.
O
*
S2 T
d
screen or
detecon
plane
P is the position of the first maximum above the central maximum. A line
drawn from O, half-way between the two slits, to P, makes an angle θ with the
central axis. If a line is now drawn from S1 perpendicular to OP then distances
S1T and TP are equal. The distance S2T is therefore equal to the path differ-
ence and this must be equal to one wavelength:
y S2 T l
sin q= = =
d s s
Rearranging:
sy
d
=y1 A sin ( ωt )
y2 A sin ( ωt + φ )
=
Y= y1 + y2= A {sin ( ωt ) + sin ( ωt + φ )}
where φ is the phase difference between the two oscillations. This will depend
on the path difference x between waves from the two sources.
This can be simplified using a well-known trigonometric identity:
R+T R−T
sin R + sin T =
2 sin cos
2 2
to give:
φ φ
=Y 2 A sin w t + cos
2 2
φ φ
=Y 2 A cos sin w t +
2 2
The term in square brackets represents the amplitude. When φ/2 = 0, 2π, 4π,
etc., the cosine term will be one, representing a maximum in the interference
pattern and the amplitude will be 2A. When φ/2 = 0, π, 3π, etc., it will be zero
representing a minimum.
The graphs below show the resultant oscillations when two waves, each of ampli-
tude 2 units, combine with various different phase differences. The two broken
lines represent oscillations from waves 1 to 2 while the solid line is the sum of these.
Another way to determine the resultant of two superposed waves is to add
phasors representing each wave. This approach is illustrated below for the
Superposion effects
6
4
Phase difference = 0
2 Lines for waves 1 and 2
0 are on top of each
Y
-2 other.
-4
-6 Resultant amplitude =
me 2+2=4
3
Phase difference =
2
1 Lines for waves 1 and 2
0 are in anphase.
Y
-1
-2 Resultant amplitude = 0
-3
me
4
Phase difference = /2
2
Lines for waves 1 and 2
0 are /2 out of phase.
Y
-2 Resultant amplitude =
-4 22 cos (/4) = 22
me
final case where the amplitude of each wave is 2 units and the phase differ-
ence is π/2.
Phasors are particularly helpful for visualizing how two or more waves will
superpose.
22
2
2 /2 /4
Second-order maximum
First-order maximum
central (zeroth-
LASER order) maximum
First-order maximum
Second-order maximum
diffracon
maxima (orders
of diffracon)
diffracon
grang
screen
LASER pen
LASER O
central (zeroth
L
order) maximum
Often the grating is described by the number of lines per meter (or per mil-
limeter) N, rather than the slit separation d. These are related by d = 1/N.
The path difference between the ray from S1 and the ray from S2 is S2Q = d
sin θ.
d
Q
S2
R
S3
It is clear from the diagram that the path difference between the ray from
S1 and the ray from S3 is just double this, and that the path difference to a
slit m times further away is m times greater. It follows that if the rays leaving
adjacent slits are in phase, then ALL rays across the entire grating will be in
phase at that angle and an intense maximum will be created. These maxima
are called orders of diffraction.
For rays from adjacent slits to be in phase the path difference between them
must be an integer number of wavelengths. The condition for a maximum is
therefore:
nl
sin q = or n d sin
d
As the angle moves away from a diffraction grating maximum the phasors
from slits across the width of the grating curl up more and more so that the
maximum itself is sharp and has a series of closely packed secondary maxima
on either side of it. The more slits the sharper the maximum. This increases
the precision with which wavelengths can be measured.
+ + + + + + + =
Grang
Number of Orders
The maximum possible path difference between rays leaving adjacent slits
occurs when the rays leave parallel to the grating surface (q = 90°) and is
then equal to the slit separation d. The maximum value of n is therefore d/λ.
However, n can only be an integer so the maximum number of orders must be
the largest integer less than this; for example, if d/λ = 5.7 then 5 orders of dif-
fraction could be formed on either side of the central maximum. In practice,
these might not all be visible. The intensity of light diffracted at large angles
might be too low for them to be seen or they might correspond to missing
orders where there is a minimum of the single slit diffraction pattern (see
Section 15.3.1).
15.2.2 Spectroscopy
Spectroscopy is the analysis of electromagnetic radiation to identify the
wavelengths present in a source. This is one of the most important experi-
mental techniques in science and is used in a vast range of different areas of
research, from cosmology to atomic physics.
connuous
spectrum
line emission
spectrum
line absorpon
spectrum
When atoms or molecules are excited to higher energy states the electrons
can then return to lower energy states by making quantum jumps and emit-
ting photons. The wavelength of the emitted photon is related to the energy
change by the equation λ = hc/E (see Section 27.2.1) so the greater the energy
jump the shorter the wavelength. For isolated atoms, the energy levels are
very distinct so there is a discrete set of emitted wavelengths forming a “line
emission spectrum.” The spectrum of each atom is unique so an analysis of
its spectrum can be used to identify the types of atom present in the source
(e.g., a distant star). Atoms in molecules interact producing a more complex
spectrum that includes “bands” corresponding to small allowed ranges of
energy. The atoms in solid materials are packed closely together and electrons
occupy wide energy bands so the characteristic emission from a hot solid is a
“continuous spectrum.” When electromagnetic radiation is absorbed by atoms
L first
maximum
slit
above center
x
1 central
source* 1 maximum
first
diffracon
maximum
grang with slit
separaon d below center
The angle q1 can be found by measuring x and L with a ruler and then using
tan q1 = x/L.
In practice, it is best to use an average value for q1 by measuring the first
order on both sides of the central maximum. This helps to reduce errors due
to alignment. The wavelength is then calculated using:
=l d sin θ1
If several orders are visible each one can be used and wavelength can be cal-
culated from:
d sin θn
l=
n
Another approach is to plot sin qn against n. The gradient of this graph is l /d.
15.2.3 Spectrometers
For more precise spectroscopy a specialized instrument called a spectrometer
must be used. A traditional spectrometer consists of a collimating tube, which
ensures that rays from the source arrive along a normal to the grating, and a
telescope that is used to detect the orders of diffraction. These are all mounted
on a rotating base so that the angle between the normal and each order can be
measured from a Vernier scale engraved on the base.
diffracon grang
collimator
source
*
The eyepiece of the telescope has a built-in cross-hair so that the position of
the image can be found precisely. The observer sees a fine vertical line in each
wavelength at each order of diffraction. For the greatest precision, the col-
limator slit must be adjusted so that it is very narrow.
Setting up a spectrometer requires great care to ensure that the rays leaving
the collimator are parallel and the diffraction grating is in the plane perpen-
dicular to these rays.
Digital Spectrometers
Digital spectrometers are used to display spectra and to measure wavelengths
but are rarely able to provide the level of precision achievable with a tradi-
tional set-up. Most simple digital spectrometers use an optical fiber to direct
light into the device where it then falls onto a reflection grating. This consists
of a large number of parallel reflecting lines (like on the surface of a CD)
and produces orders of diffraction by reflection. The light is detected by a
CCD detector and the position on the detector corresponds to wavelength so
this can then be recorded digitally and used to generate a graph of intensity
against wavelength.
The intensity of the diffracted waves varies with angle and usually has maxima
and minima.
narrower
secondary
maxima
broad central
LASER
maximum
narrower
Single slit of width w secondary
maxima
intensity
posion on screen
The pattern for a thin wire of diameter d is identical to that for a slit of width
w when w = d.
A
D
w B
E
out of phase with each other and so will also add to zero. In fact, continuing
this argument, all the rays will cancel in pairs across the slit and the resultant
at the screen will be zero. The condition when this first occurs is when the
ray at B is π out of phase with the ray at A and the path difference BE is λ/2.
Consider triangle BCE when this occurs:
B
E BE/BC = sin
sin = /w
C
The condition for the nth minimum is when rays can cancel in pairs n times
across the width of the slit. The path difference between the rays at each edge
of the slit must therefore be nλ so that:
nl
sin θn =
w
This has a similar form to the equation for the maxima of a diffraction grating –
don’t forget that this gives the positions of minima!
If the pattern subtends a small angle at the slit then the minima are equally
spaced and the central maximum, which goes from the minima on either side,
is exactly double the width of the secondary maxima. This contrasts with the
double slit pattern where the maxima have equal widths.
narrower
secondary
maxima
broad central
LASER maximum
between the size of the object and the size of the pattern, so the smaller the
object the larger the angle at which the first minimum occurs. In practice, the
wavelength of the source needs to be known and the experiment consists of
measuring the angle at which the first minimum occurs.
object 1
*
*object 2 objecve
diameter D
Rayleigh criterion:
1.22 not resolved
q<
D
1.22 limit of resolution
q=
D
1.22 resolved
q>
D
signal vibrator
generator
or
frequencies that “fit” the length of the string – in other words, patterns that
have nodes at the ends. The lowest frequency, longest wavelength wave that
sets up a standing wave is called the fundamental.
The sequence of different standing waves is harmonics. Since the speed v of
transverse waves on a string is determined only by the tension and mass per
unit length of the string it is the same for all harmonics and the frequencies
form a simple sequence. The diagrams on the next page show the first few
harmonics for the string above.
It is clear that the harmonic frequencies are all integer multiples of the funda-
mental frequency f1. The frequency of the nth harmonic will be nf1. A similar
series is obtained if the boundary conditions have an antinode at both ends
(e.g., standing sound waves in a tube open at both ends).
If the boundary conditions are different at each end, so that there is a node
at one end and an antinode at the other, then the sequence is f1, 3f1, 5f1, etc.,
all the odd multiples of the fundamental frequency (e.g., standing waves in a
tube open at one end and closed at the other).
annode
node node
L = 1/2
f1 = v/1=v/2L
annode annode
L = 2
2nd harmonic):
L = 33/2
3rd harmonic):
T
v=
m
n T
fn =
2L m
A+ B A−B
cos A + cos B =
2 c o s sin
2 2
To give:
Y = 2 A cos kx cos w t
where we have reversed the order of the cosines.
The first part of this, 2A cos kx, can be regarded as a position dependent
amplitude. This will be zero when cos kx = 0 and this occurs when kx = 0, π,
2π, etc., that is, kx = nπ. These are the nodes and the separation of two nodes
is given by kΔx = π:
2p 2p∆x l
k= so = p giving ∆x = (separation of nodes)
l l 2
The maximum value of this term is when cos kx = 1. The amplitude is then 2A.
This occurs at the antinodes, half way between nodes.
The second term, cos ωt, is a simple harmonic oscillation at all points in the
standing wave.
15.5 EXERCISES
C
A 20 cm
26 cm D
4.0 m
Speakers A and B are 26cm apart along a line parallel to CD. They both
emit sound waves of a single frequency and the same amplitude α and
they are in phase with one another. D is equidistant from A and B. When a
microphone is placed at D it records a sound of maximum intensity. When
it is moved along the line DC it gradually fades to become a minimum at
C, immediately beyond C the intensity increases.
(d) State the amplitude and intensity of the resultant sound at D (assume
I = kα2 where k is a constant).
(e) State the amplitude and intensity of the resultant sound at C (assume
I = kα2 where k is a constant).
(f) What would happen to the amplitude and intensity of the sound at D
if one of the speakers was switched off?
(g) Use the dimensions on the figure to work out the wavelength and
frequency of the sound. The speed of sound is 340 ms–1.
(h) How would CD change if the wavelength of the sound was doubled?
(i) How would the distance CD change if the separation of the speakers
was doubled?
(j) Explain why the intensity of sound increases beyond D.
3. Explain in detail how and why the interference pattern from three slits
compares with one from a double slit if they both have the same slit width
and slit separation and if they are both illuminated with monochromatic
light of the same frequency. Illustrate your answer with a graph showing
how the intensity of the light varies with position on a screen.
4. A diffraction grating has 300 lines per mm and it is illuminated by light
containing two strong emission lines at 480 nm and 520 nm. The width of
each slit is 1.11 μm.
(a) What are the angular positions of the first order maxima for each of
these lines?
(b) What is the angular separation of lines in the second-order spectrum?
(c) How many orders of diffraction are there?
(d) Explain why the third order of diffraction will not be observable.
5. Describe an experiment using a diffraction grating to measure the wave-
length of light from a laser pen. Your description should include:
a labeled diagram of the apparatus.
what you will measure and how the measurements will be made.
how you will calculate the wavelength from the data you collect.
how you will maximize accuracy and precision.
6. Red light of wavelength 525 nm passes along a normal to a narrow vertical
slit of width 0.20 mm.
(a) Calculate the angle to the normal at which the first minimum of the
resultant diffraction pattern occurs.
(b) Sketch a graph of intensity against position (in mm) for the diffraction
pattern formed on a screen 1.2 m from the slit.
(c) Explain how the minima in the pattern are formed.
(d) How would the pattern change if the red light source was replaced
with a blue light source of wavelength 450 nm?
7. Describe an experiment that could be carried out to measure the thick-
ness of a human hair using a laser of a known wavelength. Your descrip-
tion should include:
a labeled diagram of the apparatus.
what you will measure and how the measurements will be made.
how you will calculate the wavelength from the data you collect.
how you will maximize accuracy and precision.
8. The pupil of the human eye is about 5.00 mm in diameter.
(a) Use the Raleigh criterion to calculate the theoretical minimum angu-
lar limit of resolution.
(b) If the eye could reach this diffraction limit calculate the maximum
distance at which it could resolve two point sources separated by a
distance of 1.00 mm.
(c) A pair of stars form a binary system 20 light years from Earth. It is just
possible to resolve them into separate images using the naked eye.
Estimate a lower limit for the separation of the stars.
(d) Explain why your answer to (c) is a lower limit.
(e) Explain why a small telescope, with aperture diameter of 15 cm can
easily resolve these two stars.
9. (a) Estimate the maximum distance at which you could resolve a car’s
headlamps into separate sources.
(b) It has been claimed that a spy satellite orbiting at a height of 600 km
could resolve the letters in a car number plate on the surface of the
Earth. Is this a realistic claim?
T
10. The speed of transverse waves on a string is given by v = where T is
m
the tension in the string (N) and μ is the mass per unit length of the string
(kgm−1).
A steel wire of a diameter of 0.12 mm is held at a tension of 26 N between
two fixed points 0.75 m apart. The density of steel is 7800 kgm−3
(a) Calculate the mass per unit length of the steel string.
(b) Calculate the speed of transverse waves in the steel string.
(c) What is the frequency of the fundamental?
(d) List the frequencies of the first three harmonics.
(e) The temperature of the wire increases. State and explain how this will
affect the frequencies of sound emitted by the vibrating wire.
16
Sound
oscilla on
direc on
wavelength λ
The sound wave itself can be described either by the varying par cle
The sound wave itself can be described either by the varying particle
displacements at each point or by the varying pressure at each point.
g RT
The speed of sound in an ideal gas is given by: v =
M
where M is the molar mass of the gas and g is the adiabatic gas constant.
E
The speed of sound in a solid material is approximately given by: v =
r
where E is the Young modulus and r is the density of the solid.
The speed of sound in a liquid is given by a similar equation but the Young
modulus is replaced by the bulk modulus of the liquid, a quantity that meas-
ures the compressibility of the liquid.
Typical values for the speed of sound are:
Air at atmospheric pressure and 20°C v = 343 ms- 1
(this falls to about 300 ms- 1 at the altitude of a commercial jet).
Water: v = 1482 ms- 1
Steel: v = 6000 ms- 1 (varies with type of steel).
signal
generator
vercal cylinder
L
containing air
The air column shown is closed at the bottom and open at the top. Particles
near the bottom cannot undergo longitudinal oscillations so this must be a
node of displacement. The top of the column is open to the atmosphere, so
particles are free to oscillate. This allows an antinode to form but it actually
occurs a small distance e above the top of the column. e is called the end cor-
rection and it is about half the radius of the tube. The effective length of the
resonating column is therefore L + e.
The diagrams below (shown horizontally) show the standing waves that can
be formed in a tube open at one end and closed at the other. Note that, while
the diagrams look like transverse waves, they simply represent the amplitude
of longitudinal vibrations at each position in the tube.
L+e
Fundamental
(1st harmonic)
L + e = λ/4
2nd harmonic
L + e = 3λ/4
3rd harmonic
L + e = 5λ/4
( 2 n + 1) ln ( 2 n + 1) v
=
L+e =
4 4 fn
=
fn ( 2 n + 1) f1
if the air column is open at both ends the boundary conditions will be
displacement antinodes at both ends (i.e., a column of length L + 2e). If the
column is closed at both ends the boundary conditions will be displacement
antinodes at both ends. In both cases, the harmonic series will be integer mul-
tiples of the fundamental frequency.
It was mentioned previously that we can describe the sound wave in terms
of particle displacements or variations of pressure. However, displacement
nodes are actually pressure antinodes and vice versa. This can be understood
by considering the particle motions on either side of a displacement node.
The particles move toward the displacement node and increase the pressure
and then move away from the displacement node and decrease the pressure.
The variation of pressure has a large amplitude here so it is a pressure antinode.
Method 1
mic. 1 mic. 2
sig.
gen.
to to
oscilloscope oscilloscope
The signal generator is set to a single frequency (5–10 kHz is suitable) and
the two microphones are connected to a dual-beam oscilloscope. The oscillo-
scope is triggered from the first microphone. The second microphone is then
positioned close to the first one and the two traces, which will be sinusoidal,
are compared. Then x is changed until the signals are in phase. This position
x = x0 is recorded. Now the second microphone is moved away from the first
and until the phase has changed by 2np (i.e., has gone in and out of phase n
times). The new position x = xn is recorded and the value of the wavelength
is calculated from l = (xn - x0)/n. The frequency of the sound can also be
measured from the oscilloscope (using f = 1/T) and the speed of sound is
calculated from v = fl .
Method 2
sig. A N A N A
gen.
to oscilloscope
solid reflector
A standing wave is formed where the reflected sound superposes with incident
sound. The microphone is moved along a line perpendicular to the reflector
and the signal on the oscilloscope screen has periodic maxima every time the
microphone passes through an antinode. The separation of adjacent maxima is
l/2 so an average value for the separation of adjacent nodes can be found and
the wavelength can be calculated. The oscilloscope can also be used to find the
frequency of the sound so that the speed can again be calculated from v = fl.
16.5 ULTRASOUND
Ultrasound has frequencies greater than 20 kHz, that is, above the highest
sound frequency audible to humans. Ultrasound scanning is used for medical
imaging (e.g., in pre-natal scans). Ultrasound pulses from a transmitter on
the surface of the patient’s skin partially reflect at each boundary inside the
patient. The times of the returning pulses can be used to determine the depth
of the boundary and to map out structure. This is done automatically so that
The low value of Z for air compared to body tissues means that if there is an
air gap between the ultrasound transmitter and the patient then most of the
ultrasound reflects from the surface and does not enter the body. To solve
this problem, a gel is spread on the skin under the transmitter. The gel has
an acoustic impedance similar to that of water or body tissues so most of the
ultrasound is transmitted rather than reflected.
excess pressure
me
The graph above shows the result of adding three sinusoidal sound waves of
the same amplitude but with frequencies f, 2f, and 4 f. This can also be repre-
sented by the equation:
16.7 EXERCISES
3. A student sets up the apparatus below and records the amplitude of sound
emitted by the air column as the tube is gradually filled with water. The
end correction for the tube is 5.0 cm and the frequency of the sound used
is 1200 Hz. The speed of sound is 340 ms- 1.
signal
generator
40 cm vercal cylinder
containing air
h water
supply
17
Electric Charge and
Electric Fields
17.1 ELECTRIC CHARGE
Electric charge Q is a fundamental property carried by some fundamental
particles. There are two types of charge, positive (e.g., the charge of the pro-
ton) and negative (e.g., the charge of the electron). Like charges repel one
another and unlike charges attract.
Charge is quantized. This means that the total charge on any object is always
a multiple of a fundamental amount equal to the magnitude of the charge on
an electron or a proton. The SI unit for charge is the coulomb (C) and the
charge on an electron is:
e = − 1.60217662 × 10−19 C
The charge on the proton has the same magnitude but opposite sign. Atoms
contain equal numbers of protons and neutrons and are neutral. If an atom
loses an electron it becomes a positive ion and if it gains an electron it
becomes a negative ion. You might have read that quark, the fundamental par-
ticles inside protons and neutrons, have fractional charges. However, quarks
are never found as individual particles, they are always combined in pairs
(mesons) or triplets (baryons) and they always combine in a way that makes
the total charge of the composite particle an integer multiple of e.
When charge flows it is called an electric current. Electric current I is defined
as the rate of flow of electric charge:
dQ
I=
dt
Q = It
Some materials, for example, metals, allow charge to flow through them.
These are called conductors. Others, such as plastics and rubber do not allow
charge to flow and are called insulators. A third class of materials, with inter-
mediate properties, is called semi-conductors.
17.2 ELECTROSTATICS
Electrostatics deals with situations where objects become charged, remain
charged, or lose charge. Most of these situations can be explained in terms
of the transfer of electrons. The reason for this is that the electrons are on
the outside of the atom so when atoms interact (e.g., if two materials are
rubbed together) the electrons can move from one place to another. Protons
are locked inside the atomic nucleus so proton transfer does not occur. When
a neutral object loses electrons, it becomes positively charged and when it
gains electrons it becomes negatively charged.
An object is said to be “earthed,” when it is connected to the Earth by a good
conductor (e.g., an electrical wire). From the point of view of electrostatics,
the Earth itself is a huge conducting sphere that can gain or lose any number
of electrons while remaining neutral. Another way of looking at this
is to say that the Earth is always at zero potential so that any object
connected to the earth is also at zero potential. The symbol to show
an earth connection is shown on the right.
surface negave
charge on rod
polythene rod
polythene rod
electron transfer + +
+ + +
+ ++
+
cloth rubbed back and
surface
forth along surface of
posive
An acetate rod can be charged positively in the same way. However, in this
case, electrons jump from the rod to the cloth and the rod retains a positive
surface charge. This is because atoms on the surface of the rod are no longer
electrically neutral; some of them have fewer electrons than protons and so
have a net positive charge. However, this positive charge has, once again, been
brought about by the movement of the negatively charged electrons.
steel cap
insulator
steel rod
gold leaf
earthed box
The cap rod and leaf are all metallic conductors but they are isolated from the
earth by an insulator. If a positively or negatively charged rod is held close to
the cap but not touching it, the leaf will rise. This is because the charge on the
rod exerts electrostatic forces on the free electrons making them move. The
rod and leaf gain the same charge and repel one another. The leaf is very thin
so responds to this by rising. In the diagrams below only the cap, leaf and rod
are shown.
The process by which the cap gains a charge opposite to the charge on the
rod is called electrostatic induction. In the examples above there is no net
charge on the cap leaf and rod, the electrons have just been redistributed.
However, it is possible to use induction to give the electroscope a net positive
or negative charge – this is called charging by induction and is explained in
the sequence of diagrams below (resulting in a net positive charge).
1. Hold a negatively charged rod close to the cap. The leaf rises.
2. Keeping the negatively charged rod in place momentarily earth the cap
and then disconnect the earth. Electrons flow to earth and the leaf falls.
3. Remove the rod. The electroscope has lost electrons so the cap, rod, and
leaf have a net positive charge. When the rod is removed electrons spread
out and there is a positive charge on the cap, leaf, and rod. The leaf rises.
charge is measured in
nano-coulombs
the volume of a sphere it acts like a point charge of the same total charge
located at the center of the sphere.
T T
F Q Q F
mg mg
The charges repel one another so the system hangs in equilibrium with the strings
making an angle q to the vertical. q and r can be determined using a digital cam-
era to capture an image of the apparatus with a suitable scale placed behind. The
force F can then be found using the condition for equilibrium of forces:
T cos q = mg
T sin q = F
so F = mg tan q
Q can be found by discharging the sphere to a coulomb meter.
If Q is kept constant (charging from the same high voltage) and the mass of the
spheres is varied then F and r will change, so the inverse-square law can be tested.
Q can be varied by charging the spheres from a different voltage but this will
also change r unless a compensating change in mass is made.
the SI unit for electric field strength is NC−1, which is equivalent to Vm−1. The
electric field is a “vector field”: the electric field points in the direction of the
force on a positive charge.
This can be represented by drawing field lines. The direction of the field lines
is the direction of the electric field and the separation of the field lines repre-
sents the strength of the field. The two diagrams below show the shape of the
electric field close to a point positive and a point negative charge.
+
EC
EB
C EA
A +
B +
The diagram below shows the electric field close to a dipole, two charges of equal
magnitude separated by a small distance. Many molecules have electric dipoles.
- +
x
+Q Q
E E+
2a P
+Q −Q − axQ
EP = + =
4pe 0 ( x + a ) 4pe 0 ( x − a ) pe 0 ( x2 − a2 )
2 2 2
E+
P
x E
+Q Q
2a
Sometimes the electric fields of two or more charged particles sum to zero
at a point. This is called a neutral point. For example, half-way between two
charges of the same magnitude and sign:
neutral point
a a
+ P +
P = − =0
πε0 πε 0
ΦE = EA = E×4p r2
spherical surface
of area A = 4r2
+
When this is rearranged and compared with the expression for the field
strength of a point charge it is clear that the flux through the surface is directly
related to the charge within that surface.
ΦE Q
=E =
4p r 2
4pe 0 r 2
so that: Φ E =Q
e0
This is an example of a general result known as Gauss’s theorem. This states
that:
The total flux through any closed surface is equal to the total charge con-
tained within that surface divided by the permittivity of free space (or of
the medium if the charge is not in a vacuum).
This is a powerful theorem that can be used to understand how the electric
field behaves in a range of important situations. Gauss’s theorem can be stated
more precisely using an integral:
∑
i= N
Qi
=
ΦE ∫=
E .dS i =1
surface
e0
of charges contained inside the closed surface. The next section shows how
Gauss’s theorem can be used to derive some important results.
+ + + + + + + + + + + + + + + +
conductor
The electric field through the lower surface must be zero because it is
inside the conductor.
The electric field through the sides of the cylinder must also be zero because
there can be no component of electric field on the surface of the conductor.
All of the electric fields must pass through the upper surface.
Using Gauss’s theorem:
Flux through upper surface = EA = charge contained inside Gaussian surface
divided by ε0 = σA/ε0.
Conclusions:
s
The electric field strength close to a conducting surface is E = and this is
perpendicular to the surface. e0
hollow Gaussian
conducng surface
box
This is an example of a Faraday cage – a conducting box used to shield its con-
tents from external electric fields and electromagnetic waves. Metal cars and
aircraft act as Faraday cages protecting their occupants from lightning strikes.
A room with conducting walls, floor, and ceiling can also be used to provide
security against unwanted communications – for example, mobile phone sig-
nals (which cannot enter or leave the room). The effect can be demonstrated
by placing a mobile phone inside a metal cookie tin. When the tin is closed it
is impossible to ring the mobile phone.
B
The electric field exerts + The electric field
a force downwards on exerts a force
A but A moves upwards downwards on B and B
so work is done on A by moves downwards so
an external agent and work is done by the
the electric potenal + electric field and the
energy increases. A electric potenal
energy decreases.
and the SI unit of potential is the joule per coulomb: JC–1 which is the volt, V.
In other words, 1 V = 1 JC–1. Electric potential, like energy, is a scalar quantity,
so the potential at any point in space is the sum of potentials due to all fields
at that point.
The electrical potential difference ΔV between two points is equal to the work
that must be done per unit charge in moving the charge between the two
points concerned.
ΔV = W/Q
E
F = EQ
+
x
This work has been done by the electric field so the electric potential energy of
the system has fallen (in the same way that gravitational potential energy falls
when an object is dropped):
δEPE =
−QEδx
δV
= −E
δx
∆V =∫ dV =− ∫ Edx
VA xA
equipotenal
surface
+
1
mv2 = QV
2
2QV
The velocity is then: v = where m is the particle mass.
m
However, we must be careful when using this equation. If the velocity is a
significant fraction of the speed of light (e.g., v > 0.05 c) then we should use
relativistic equations. The Newtonian equations above are useful for lower
velocities but only give an approximate value for the final velocity.
Many electron tubes use an electron gun to accelerate electrons and form a
beam. Here is a simplified diagram showing how the electron gun works.
accelerang
voltage V
Vacuum
Heater supply electron beam
electrons
(low voltage)
2eV
v=
m
y-axis
+V
electron beam
inial velocity v x-axis
d
eVt 2
y=
2 md
eV 2
y= 2 x
2 mv
∆V =− ∫ Edx
xA
can be used to find the absolute potential at point B if we know the potential
at point A. In order to find the potential at any point in the Universe we must
define the position where the potential is zero. This is an arbitrary decision
because forces and fields only depend on differences in potential and not
on their absolute value. However, it makes sense to choose the position for
the zero of potential in such a way that it is easy to use in calculations. The
zero of potential in the electric field is taken to be at infinity. In other words,
if all charges were separated so that they were at infinite distance from one
another, then the total electrical potential energy would be zero.
Here are two qualitative examples of how this work:
Consider two point positive charges a distance r from one another. These
will repel one another and if no other forces act on them, they will move
toward infinity. While they are moving electrical forces are doing work
on them transferring electrical potential energy to kinetic energy, so the
electrical potential energy is decreasing all the time. However, we know
that their electrical potential energy will be zero at infinity so the initial
electrical potential energy of two positive charges must have been posi-
tive. The same argument shows that the initial electrical potential energy
of two negative charges a distance r apart must also be zero.
Now consider separating a positive and a negative charge that is initially a
distance r apart. These two particles attract one another so we would have
to apply an external force to each of them to move them out to infinity.
Work must be done by this external agent to separate them so the electric
potential energy increases all the time and eventually becomes zero. The
initial electric potential energy must have been negative. This is the case
for the electron inside a hydrogen atom. It is attracted to the proton in
the nucleus of the atom and the system has a negative electric potential
energy. The energy that must be put in to separate the two particles is the
ionization energy for the hydrogen atom.
We can now derive an expression for the electric potential energy at a point in
the electric field of a point charge. This is done by deriving an expression for
r
x
+Q to infinity
P
the potential difference between that point and infinity and using the fact that
the potential is zero at infinity.
The electric field strength at a point distance r from a point positive charge Q is:
Q
E=
4pe 0 r 2
so the potential difference between P (at distance r from Q) and infinity is:
x=
∞ x=
∞ x= ∞
Qdx −Q −Q
∆V =V∞ − VP =−
=x r =x r
∫ Edx =− ∫ 4pe 0 x 2
=− =
4pe 0 r x = r 4pe 0 r
Q Q
VP =
V∞ − =
4pe 0 r 4pe 0 r
1. (a) A small electric cell can supply a continuous current of 2.0 A for 2
hours. How much charge passes through the cell in this time?
(b) A large capacitor (component for storing charge) is charged using a
steady current of 4.0 mA. It takes 0.75 s to reach full charge. How
much charge does it store?
(c) An electrostatic generator stores 8.0 C on a large metal dome. This
charge leaks away through the air in 40 s. What is the average electric
current during this time?
2. Write an instruction sheet explaining how to charge an electroscope posi-
tively using just a polythene rod and a cloth.
D 2.0 C A + 2.0 C B
x
4.0 m
The dipole is located on the x-axis with its center at the origin.
(a) Calculate the electric field strength (and direction) and the electric
potential at points A to E:
A (0, 0)
B (+4.0 μm, 0)
C (0, +4.0 μm)
D (−4.0 μm, 0)
E (0, −4.0 μm)
(a) that the electric field strength inside a charged conducting sphere is
zero,
(b) that the electric field strength immediately above a charged conductor
is perpendicular to the surface and has a magnitude E = σ/ε0 where σ
is the charge density on the surface,
(c) that the flux of electric field entering a volume of empty space is equal
to the flux of electric field leaving that volume of space.
8. (a) Prove that the uniform electric field between two parallel conducting
plates separated by distance d and connected to a potential difference
V is given by: E = V/d.
(b) Sketch the electric field lines and equipotentials between two p arallel
metal plates 2.0 cm apart with a potential difference of 5000 V (include
equipotentials at 1000V, 2000V, 3000V, and 4000V).
(c) Calculate the electric field strength between the plates.
(d) Calculate the force on an alpha particle (charge +2e) half way between
the plates.
(e) How does this force vary if the alpha particle moves close to the posi-
tive or negative plate (ignore induction effects)?
(f) An air molecule between the plates loses two electrons and becomes
ionized. What is the ratio of the acceleration of the electron to the
acceleration of this positive ion in the electric field? Assume that the
mass of the ion is 60 000 times greater than that of the electron.
9. An electron in a vacuum tube is accelerated horizontally in an electron gun
through a potential difference of 1500 V. It then enters a region of uniform
vertical electric field of strength 2.0×104 Vm−1 that extends 5.0 cm horizon-
tally. It is deflected by the field and emerges into a field-free region.
(a) Calculate the velocity of the electron as it enters the region of vertical
electric field.
(b) Describe the shape of the electron’s path in the field. Assume the
direction of the field is vertically upwards.
(c) Calculate the time spent by the electron in the vertical field.
(d) Calculate the vertical component of velocity gained by the electron in
moving through the vertical field.
(e) Calculate the angular deflection of the beam.
(f) Calculate the work done by the vertical electric field on the electron.
18
DC Electric Circuits
conventional current I
− − −
−
− − − −
Positive charge carriers (e.g., positive ions in an ion beam or holes in a p-type
semiconductor). The positive charge carriers move in the same direction as
the conventional current.
conventional current I
Positive and negative charge carriers (e.g., when current flows through an
electrolyte). The positive charges move in the direction of conventional cur-
rent and the negative charges move in the opposite direction.
conventional current I
− −
−
−
In all the examples above, the velocities of the charge carriers represent their
average drift velocities, the particles will also have random thermal motion
which might involve speeds that are several orders of magnitude greater than
the drift velocities.
cross-sectional
conventional current I area A
v v v v P
v v v
vt
There are n charge carriers per unit volume in the conductor and each charge
carrier has a charge q.
Consider the charge leaving the right-hand end of the wire at P during a short
time δt.
All charge carriers within a distance vδt will leave at this time. The volume
within this distance of point P is Avδt so the number of charge carriers is
N = nAvδt and the total charge δQ passing P in time δt will be:
Q qnAvt
Q
so the current is: I nAvq
t
Very often the charge carriers have a charge equal to the electronic charge, e,
so this equation becomes:
I = nAve
Typical values of the charge carrier density n for two different metals and two
types of semiconductors are shown below. The values are for a temperature
of 300 K.
Copper n = 8.5×1028 m− 3
Silver 1.1 × 1028 m− 3
Silicon 1.5 × 1013 m− 3
Germanium 2.4 × 1016 m− 3
Note that the carrier density in the pure semiconductors is very much
smaller than for the metals so the drift velocity in a semiconductor carrying
the same current as a metal of the same dimensions will be much greater
than in the metal. The carrier density is highly temperature dependent
in semiconductors and increases rapidly with temperature. Pure, or intrinsic,
semiconductors contain both negative charge carriers (electrons) and positive
charge carriers (holes) in equal numbers but doped, or extrinsic, semiconduc-
tors have small quantities of other elements added to increase their carrier
density of a factor of 106 or more.
The ammeter must be connected in series at the point where the current is
to be measured.
An ideal ammeter has zero resistance so it has no effect on the current it is
measuring. In practice ammeters do have a small internal resistance, so the
current measured by the ammeter is slightly less than the current would be
if the ammeter was not in the circuit. This difference is usually small enough
to be neglected but can be important, especially if the circuit resistance is
particularly low.
Small currents are often measured in milli-amps (mA) or micro-amps (µA).
1 mA = 0.001 A = 10− 3 A 1 µA = 0.001 mA = 0.000001 A = 10− 6 A
I2
A
I1 A A I3 I1 = I2 + I3 + I4
A
I4
If there are no junctions in the circuit, then the current is the same every-
where – that is, at all points around a series circuit.
The current that flows in each branch of a parallel circuit is inversely propor-
tional to the resistance of the branch.
An ideal voltmeter has infinite resistance so that it does not draw any current
from the circuit. In practice, a real voltmeter will have a large but not infinite
resistance and this can affect readings if there are also very large resistors in
the circuit.
E
r
emfs p.d.s
closed closed
loop loop
If the potential differences are all across resistive components this can be
written:
i N
emfs IiRi
closed i1
loop
where there are N resistors in the loop and the ith resistor has a resistance Ri
and current Ii.
The example below shows how Kirchhoff’s second law can be applied to two
different loops in an electric circuit:
Loop
V2
V1
E1 V V3
Loop
P
E2
E3 V4
Each loop starts and ends at P, works in a clcokwise direction and provides a
different equation:
Loop 1: E1 − E3 − E2 = V1 + V5
Loop 2: E1 − E2 = V1 + V2 + V3 + V4
Care must be taken over the signs of the emfs and pds.
While two loops have been shown in the diagram there is also a third loop in
the right-hand section of the circuit. This has not been included because it
is not independent of the other two. The fact that all parts of this thrid loop
were included in parts of the first two loops means that it would not provide
additional information.
Kirchhoff’s first and second laws generate a series of simultaneous equations
that can be used to solve complex circuit problems (see Section 18.6.1).
18.3 RESISTANCE
Resistance R is defined as the ratio of potential difference V across a compo-
nent to current I through the component.
V
R=
I
The SI unit for resistance is the ohm (Ω) and 1 Ω = 1 VA− 1. Large resistances
are measured in kilo-ohms (kΩ) and mega-ohms (MΩ).
resistance =
voltmeter reading
divided by ammeter
reading
V
A
While dedicated ammeters and voltmeters can be used, multimeters can be
used instead. A multimeter can be used as a voltmeter or an ammeter depend-
ing on its settings. When using a multimeter it is important to select appropri-
ate settings before connecting he circuit to the power supply otherwise there
is a danger of damaging the meter or the circuit. For example, if the multim-
eter was set as an ammeter but connected in parallel like a voltmeter it would
short out the component and draw a large current.
Multimeters can also be used as an ohm-meter to measure resistance directly.
When it is used like this the component must first be isolated from the circuit,
and then the meter must be connected directly across the component.
The image below shows a typical multimeter. The values shown on each range
indicate the maximum value that can be measured on that range and shows
the units in which it will be displayed. For example, a setting of 2 V DC would
measure voltages from 0.00 V to 2.00 V whereas a setting of 200 mV would
measure from 000 mV to 200 mV. When choosing a suitable scale it is best
to choose the most sensitive scale that has a maximum value greater than the
value to be measured.
Rotating
switch to
Resistance
select
ranges
range.
D.C. D.C.
current voltage
ranges ranges: this
meter is set
A.C. A.C.
current voltage
ranges ranges
10A
V terminal:
terminal:
the second
the second
lead must be
lead must
connected
be
to this
connected
terminal for
to this
the meter to
terminal
work as a
for the
voltmeter or
meter to
COM terminal: ‘common’ connection for all ohm-meter.
work as an
uses of the meter. One lead must be
ammeter.
connected to this terminal.
current / A
P
IP
P
P =
P
potential
0
0 VP difference / V
R
A
V
To reverse the current through the component the battery or the component
itself can be turned around. This allows both positive and negative values for
current / A
voltage / V
I∝V
V
= constant
= R
I
The resistance is constant.
Components that behave like this are described as “ohmic conductors” and
are said to obey Ohm’s law. Gustav Ohm investigated the electrical behavior of
metals kept at constant temperature and discovered that the current passing
through them and the potential difference across them were directly propor-
tional so Ohm’s law really only applies to metals at a constant temperature.
Nowadays Ohm’s law is invoked whenever current and voltage are directly
proportional. The equation V = IR is also sometimes referred to as Ohm’s law,
but this is not really correct – the equation defines resistance and only cor-
responds to Ohm’s law when the resistance is constant.
A metal filament (e.g., in a lamp).
Metal wires heat up when electric currents pass through them and their
resistance changes. They are non-ohmic conductors.
current / A
voltage / V
The fact that the graph is not a straight line through the origin shows that this
is a non-ohmic conductor.
The ratio of V to I is increasing so the resistance has increased as the current
has increased.
The reason this happens is that the charge carriers passing through the metal
transfer energy to the metal ions making them vibrate more rapidly and this
in turn increases the scattering of charge carriers. More work has to be done
to maintain the current (re-accelerate the charge carriers after scattering) so
a higher voltage is needed for the same current. This increases the ratio V/I
and increases the resistance.
Components whose resistance changes with temperature are called ther-
mistors. The resistance of a metal increases with temperature so it is a posi-
tive temperature coefficient (PTC) semiconductor. Platinum is commonly
used as a PTC thermistor and can be used as a resistance thermometer.
Semiconductors have a resistance that falls as temperature increases so they
are used as negative temperature coefficient NTC thermistors. The circuit
symbol for a thermistor is shown below:
A semiconductor diode.
The circuit symbol for a semiconductor diode is shown below:
This is a conductor that conducts with very low resistance in one direction
(forward bias) and acts as an insulator (infinite resistance) in the other (up to
its breakdown voltage, at which point the diode resistance suddenly drops and
it is likely to be destroyed by the current surge).
A small voltage is required in the forward direction before the diode begins
to conduct. This switch-on voltage depends on the material from which the
diode is made. For silicon diodes, it is about 0.6 V and for germanium diodes,
it is about 0.2 V. Once the forward voltage exceeds this value the diode begins
to conduct with very low resistance and care must be taken not to allow the
current to grow so large that it melts the diode (e.g., by having a fixed resistor
in series with it).
Diodes are important components in rectifier circuits, used to convert alter-
nating current to direct current.
current / A
forward bias
breakdown
voltage
voltage / V
switch-on
voltage
Some diodes emit light when they conduct. These are called light-emitting
diodes or LEDs. The circuit symbol for an LED is shown on the right:
V1 V2 V3
I I
R1 R2 R3
VS
I I
RS
VS V1 V2 V3
RS R1 R2 R3
The total resistance of several resistors connected in series is the sum of the
individual resistances. The general relationship for N resistors in the series is:
i N
Rseries Ri
i1
Resistors in parallel
To replace several resistors in parallel with a single resistor RP we would need
a resistor that has the same ratio of V to I as the set of parallel resistors. We
start by using the fact that the same potential difference is across each arm of
a parallel circuit:
V
i1
R1
i2
I I
R2
i3
R3
V
I I
RP
V V V V
RP R1 R2 R3
1 1 1 1
RP R1 R2 R3
In general, the reciprocal of the equivalent resistance is equal to the sum of
the reciprocals of all the resistances of the resistors connected in parallel. For
N resistors in parallel, this can be written more formally as:
1 i N 1
RP i1 Ri
For the simple but very common situation where there are just two resistors
in parallel, the equation can be rearranged to:
R1 R2
RP
R1 R2
this is easily remembered as “product over sum,” however it only works for
two resistors in parallel.
The formula for parallel resistors shows that the total resistance of several
resistors connected in parallel is always less than the smallest resistance in the
network.
18.3.4 Resistivity
The resistance of a component depends not only on the type of material from
which it is made but also on the dimensions of the component. Resistivity
is a property of the material alone and does not depend on its dimensions.
For example, the resistances of copper wires of different lengths and cross-
sectional areas differ but the resistivity of the copper from which they are
made is the same (at the same temperature).
For a cylindrical wire of length l, cross-sectional area A and resistance R the
resistivity is given by:
RA
l
The SI unit of resistivity is Ωm.
Typical resistivities (at 20°C) for different conductors are shown below:
Metal Resistivity
Silver ρ = 1.6 × 10− 8 Ωm
Copper ρ = 1.7 × 10− 8 Ωm
Aluminum ρ = 2.8 × 10− 8 Ωm
Tungsten ρ = 5.8 × 10− 8 Ωm
Platinum ρ = 1.1 × 10− 7 Ωm
Constantan ρ = 4.9 × 10− 7 Ωm
Steel ρ = 7.2 × 10− 7 Ωm (varies depending on type of steel)
Nichrome ρ = 1.3 × 10− 6 Ωm
The resistivity of a length of a resistance wire can be measured using the cir-
cuit below:
resistance wire
l sliding contact
4 I
V IR 2 l
d
4 I
The gradient of the graph of V against l is equal to 2 .
d
d
2
Resistivity, gradient .
4I
E = QV
V2
= =
P VI = I2R
R
If the current in the resistor is constant then the energy transfer E in time t
is given by:
E = VIt
E
representation r
of a real cell
E Ir
current Ito
external circuit
terminal p.d. V
The potential difference across the internal resistance is equal to the work
that must be done per coulomb of charge to move charge carriers through the
cell. The terminal pd is therefore equal to the difference between the cell emf
and these “lost volts”:
V E Ir
As the current drawn from the supply increases the terminal pd falls.
The internal resistance also limits the maximum current that can be drawn
from the cell. This occurs when the cell is short-circuited by connecting its
terminals together with a conductor of negligible resistance. The only resist-
ance in the circuit is the internal resistance so the short-circuit (maximum)
current from the cell is:
E
ISC =
r
The internal resistance of a typical 1.5 V alkaline AA cell is about 0.15 Ω so the
maximum current that could be drawn from it is 1.5/0.15 = 10 A. However,
shorting the cell would drain it very quickly so the working current for a
device operated by AA cells must be much less than this. If a cell or battery is
required to provide a very large current it must have a very low internal resist-
ance. Car batteries have an emf of 12 V but need to provide over 100 A when
the ignition is switched on and the battery turns the starter motor. Typical car
batteries have an internal resistance of less than 0.01 Ω.
E r
V
As the value of the load resistor is changed the current I in the circuit and the
terminal pd. V change too. I and V are related by the equation:
V E Ir
terminal p.d./V
gradient = r
0 current / A
0
E
r
E
I
R r
E2 R
P I2 R
R r 2
It is clear that P is zero when R = 0.
P also becomes asymptotic to zero as R → ∞ since the denominator then
dominates the expression.
The expression is positive for all values of R so there must be a maximum
value for P at some point. This occurs when R = r, that is, the load resistor has
a resistance equal to the internal resistance.
Power transfer P
Pmax
Load resistanceR
0
0 R=r
The fact that maximum power transfer occurs when R = r can be demon-
strated using calculus. You need to find the condition for dP/dR = 0, this must
be a stationary value of the function and, in this case, it is a maximum.)
It might be tempting to think that maximum power transfer also corresponds
to maximum efficiency for the system, but this is not the case.
power transferred to R
efficiency 100%
power transferred from the emf
E2 R
power transferred to R is: Pout I 2 R
R r 2
power transferred from the emf is:
E2
Pin EI
R r
R2
Vin
R1 Vout
Vout = IR1
Vout R1
Vin R1 R2
If Vout is connected across a load resistor RL then the fraction changes because
the lower part of the potential divider now has two resistors in parallel (R1
and RL) so that:
Vout RP
Vin RP R2
R1 RL
where RP .
R1 RL
Potential divider circuits can also be used as sensing circuits, producing an
output voltage that depends on external conditions such as temperature or
light intensity.
I I
R2 R2
Vin Vin
R1 R2 R3
i1 i2
i3 R4
E2
Applying Kirchhoff’s second law to the loop passing through resistors R1, R2,
and R3:
E1 i1 R1 i2 R2 i2 R3
Applying Kirchhoff’s second law to the loop passing through resistors R1, R4,
and E2:
E1 i1 R1 i3 R4 E2
h+ h+ e
e
h+ e
current I h+ current I
e
+ +
h h
e h+ e
e
+
e h+ h
intrinsic semiconductor
At absolute zero semiconductors are pure insulators with no free charge
carriers. However, as temperature increases, thermal agitation frees a small
proportion of the valence electrons and allows them to become free or con-
duction electrons. Whenever an electron is freed it leaves behind a hole which
behaves very much like a particle carrying a positive charge +e. This means
that a pure semiconductor contains equal numbers of electrons and holes and
that both contribute to current flow. While the density of holes and electrons
is equal they do not contribute equally to the current because their mobility
(ability to move through the material) differs.
The electrical properties of intrinsic semiconductors can be changed by intro-
ducing impurity atoms. Semiconductors such as silicon and germanium have
four electrons in the outer shell, so adding atoms of an element such as boron,
aluminum, or gallium, with three outer electrons, has the result of donating
one additional hole per doping atom. Semiconductors doped in this way have
an excess of holes (positive charge carriers) and are called “p-type semicon-
ductors.” If phosphorus, antimony, or arsenic, with five electrons in the outer
shell, is added this will donate one electron per doping atom. Semiconductors
doped in this way have an excess of electrons (negative charge carriers) and
are called n-type semiconductors. Semiconductor devices such as diodes,
transistors, and integrated circuits depend on doped semiconductors and are
often constructed from junctions between n-type and p-type materials.
0 1 T T0
ρ = resistivity at temperature T
ρ0 = resistivity at reference temperature T0
Values for the temperature coefficient of resistance for different metals based
on T0 = 293 K is shown below:
Metal Temperature coefficient of resistance
Aluminum α = 4.3×10− 3 K− 1
Copper α = 4.0×10− 3 K− 1
Silver α = 4.0×10− 3 K− 1
Tungsten α = 4.5×10− 3 K− 1
18.7 EXERCISES
(i) a metal;
(ii) a p-type semiconductor;
(iii) an n-type semiconductor.
6. The table below gives the resistivities of three metals at two different
temperatures
Metal Resistivity at 273K Resistivity at 373K
−8
Aluminum 2.45 × 10 Ωm 3.55 × 10− 8 Ωm
−8
Copper 1.55 × 10 Ωm 2.38 × 10− 8 Ωm
Iron 8.70 × 10− 8 Ωm 16.61 × 10− 8 Ωm
12.0 V
200 200
200
A
V
(b) In fact, the internal resistance is 10 Ω. How does this affect your
answers to part (a)?
1.0 k
6V
RLDR Vout
(a) Calculate the output voltage in the dark and in bright light.
(b) How does the value of the top fixed resistor affect the range of the
output voltages?
11. Use Kirchoff’s laws to determine the readings on the three ammeters and
the voltmeter in the circuit below. Assume that the cell’s internal resist-
ance can be neglected.
6.0 V
A1
V 500 30 10
A3
A2
3.0 V
19
Capacitance
VS
A A
S R C
connection
to
oscilloscope
When switch S is closed both ammeters jump to a positive reading that gradu-
ally falls back to zero. This shows that charge has moved around the circuit,
however, it cannot cross the gap between the two plates so what has happened
is electrons have left one plate (giving it a net positive charge) and electrons
have moved onto the other plate (giving it a net negative charge). The current
in the circuit is monitored by connecting an oscilloscope across the resistor in
series with the capacitor.
current
time
The oscilloscope trace actually displays the voltage across the resistor but this
is directly proportional to the current because the resistor is constant. The
current decays exponentially to zero as the capacitor charges up. Once fully
charged there is a charge of +Q on one plate and −Q on the other plate. While
it is true that the net charge on the capacitor is actually zero, we refer to the
charge on the positive plate and say that the capacitor is “charged” or “stores
a charge Q.”
19.1.2 Capacitance
The charge stored on a capacitor is directly proportional to the potential dif-
ference between the two conductors. This can be verified using the arrange-
ment shown below:
flying lead
coulomb
meter
Variable H.T.
aluminium plates
supply 0 – 5 kV
separated by a few
mm
charge stored / C
potential difference / V
Q∝V
Q = CV
V p. d. /
C
V
S
charge
R
Q stored / C
The two expressions for E are equivalent since C = Q/V. There is also a third
form of this expression in terms of just V and C (found by using Q = CV to
eliminate Q): E = ½ CV2.
Here are the three equations for energy stored on a capacitor:
Q2 CV 2 QV
=E == = ==
2C 2 2
One of the key uses for a capacitor is as a temporary store of electrical energy
that can then be used later when the capacitor is discharged – for example,
in a camera flashlight. One of the advantages of using capacitors as energy
storage devices is that the energy can be accumulated slowly and released
quickly so that the power delivered can be much greater than the power used
to charge the capacitor.
charge Q passes
through supply
A A
R C
+Q Q
charge Q leaves charge Q arrives
positive plate negative at plate
equation for the rate of flow on or off of the capacitor when it already stores a
charge q and has a potential difference V across its plates. It is simpler to set
this up for discharge than for charging so we will begin with the equations for
the discharge of a capacitor through a fixed resistor.
100
90
80
70
60
50
40
30
20
10
0 time / s
t = RC t = 2RC
The quantity RC is called the “time constant” for the circuit. The dimensions
of RC are the dimensions of time so in SI units it is a time in seconds. After
RC
one-time constant (t=RC) the fraction remaining is e e1 0.37 so there
RC
will be 37% of the initial charge remaining after one time constant. After n
time constants, the fraction remaining is e− n. This falls rapidly with n. After
three time constants, there is 0.050 of the original charge so the discharge is
95% complete. After five time constants 0.0067 of the original charge remains
so the discharge is more than 99% completed. As a rule of thumb discharging
(and charging) processes are considered complete when a time t = 5RC (five-
time constants) has elapsed.
The time constant controls the rate of charging and discharging.
percentage of Q0, I0 or V0
100
90
80
70
60
increasing time
50 constant (RC)
40
30
20
10
0 time
tim
time
VS VR VC
Q
VS IR
C
C dQ Q
R VS R
I dt C
dQ 1
+Q Q CVS Q
dt RC
dQ 1
VR VC QF Q
dt RC
where QF = CVS is the final charge when the voltage across the capacitor is
equal to the supply voltage.
This is a first-order linear differential equation whose solution is:
t
Q QF 1 e RC
The difference between Q and QF decays exponentially toward zero.
The voltage across the capacitor is directly proportional to the charge stored
(V = Q/C) so the voltage rises in a similar way to the charge:
t
V VS 1 e RC
Current is the rate of change of charge so the charging current decays expo-
nentially from an initial value I0 = VS/R.
t
I I0 e RC
100
80
60
40
RC= 2.0 s
20
0 time /s
0 2 4 6 8 10 12
C1 C2 C3
+Q Q +Q Q +Q Q
V1 V2 V3
The total charge on the set of series capacitors is therefore Q and the voltage
across the set is equal to the sum of voltages across the individual capacitors.
V V1 V2 V3
Q Q Q Q
Cseries C1 C2 C3
1 1 1 1
Cseries C1 C2 C3
For n capacitors in series:
i n
1 1
Cseries i 1 Ci
This has the same form as the equation for resistors in parallel but here it
applies when the capacitors are in series.
V
C1
Q Q1 Q2 Q3
C2 Cpara V C1 V C2 V C3 V
Cpara C1 C2 C3
C3
This has the same form as the equation for resistors in series but here it applies
when the capacitors are in parallel.
19.6 EXERCISES
V1
A1 A2
S R C
V2 V3
8.0 V
S1
C1 = 50 F
S2
C2 = 100 F
The plates are pulled apart by an external agent and their separation is
doubled.
Describe and explain what happens:
to the charge stored on the capacitor
to current flow in the circuit
to the energy stored on the capacitor
to the voltage across the capacitor.
7. A 470 µF capacitor is charged to 10 V and then discharged through a 2200
Ω resistor. Calculate the charge and voltage remaining on the capacitor
after 1.5 s.
20
Magnetic Fields
Each filing is like a small rod of iron that lines up with an invisible force field.
If a magnetic compass is moved nearby it too lines up with this force field and
can be used to trace its pattern.
Michael Faraday explained these effects by saying that a bar magnet has two
poles, north and south, and that these create a magnetic field in the space
surrounding the magnet. The lines of magnetic field begin on north poles and
end on south poles, they cannot start or end in space (although they can form
closed loops). The direction of the field lines is the direction of force that
would be exerted on a free north pole (even though free north poles do not
exist in nature). This is all very similar to the description of the electric field
and the link between electric field lines and electric charges.
N S
magnetic field
The Earth has its own magnetic field. The North Pole of compass points
toward the Earth’s North Pole. This implies that the geographic North Pole of
the Earth is actually a south magnetic pole. Sometimes the poles on perma-
nent magnets are called “north-seeking” or “south-seeking” poles so that the
north-seeking pole of a compass always points toward magentic north.
From a distance, the magnetic field in space created by the Earth has a similar
shape to that of a large bar magnet. However, the center of the Earth is very
hot and this high temperature would destroy any permanent magnetism (it is
above the Curie point for iron and nickel). The Earth’s field is generated by
electric currents.
The direction of the magnetic field can be predicted using a “right-hand grip
rule.” If you make a “thumbs up” with your right hand and align your thumb
with the conventional current direction, then your fingers are curling in the
direction of the magnetic field lines.
Electric current is a flow of charge so the source of the magnetic field is the
moving charges inside the wire. A beam of charged particles moving through
a vacuum would create a similar pattern of the magnetic field.
It turns out that ALL magnetic fields originate from moving charges. Even
the magnetic fields of permanent magnets originate from the movements of
electrons inside the atoms of the material.
circuit to
control current
A
current-carrying wire of length l
I
N x S
l
top pan
constant
balance
uniform
magnetic field
directed out
of page
In the diagram above, the X on the end of the wire represents a current into
the page.
A permanent magnet with flat pole pieces is placed on a top pan balance. The
magnet creates a uniform horizontal magnetic field (directed from N to S). A
separate circuit is used to control the current through a wire that passes hori-
zontally between the poles perpendicular to the magnetic field lines. As the
current is increased from zero the reading on the top pan balance decreases,
showing that the magnet is being pulled up and the wire (by Newton’s third
law) is being pulled down. The magnitude of the force increases as the current
is increased. Experiments like this can show that the magnetic force F
is directly proportional to the current I
is directly proportional to the length of wire in the field l
is perpendicular to the current and the magnetic field
F ∝ Il
thumb:
‘thrust’
or force
first finger: field
second finger:
current
F BIl sin
where θ is the angle between the magnetic field and the current.
conventional conventional
current I q q vq q v current I
v vq
q v v q v
l cross-sectional
magnetic field of strength B area A
directed into the page
F = BIl
I = nqAv
F = BnqAvl
This force arises from the individual forces on N charge carriers where:
N = nAl
f = Bqv
f Bqv sin
r
magnetic field of
f q strength B directed
into page
v
q
The magnetic field provides a constant centripetal force so the charged parti-
cle moves in an arc of a circle of radius r:
mv2
= =
f Bqv
r
mv
r=
Bq
When particles are created in particle accelerators like the LHC at CERN
particle physicists use strong magnetic fields to deflect them so that they can
determine their momentum and mass.
f = felectric + fmagnetic
f = qE + qv∧B
Note that the electric force is parallel to the electric field whereas the mag-
netic force is perpendicular to the magnetic field (in a direction given by
Fleming’s left-hand rule).
incoming beam of charged particles, then the electric and magnetic force act
along the same line and can be made to oppose one another. By adjusting the
values of the fields the two forces can be made to cancel out so that the beam
is undeflected.
For the beam to be undeflected:
fmagnetic = felectric
Bqv = Eq
E
v=
B
uniform electric field of strength E
fmagnetic
q q q
magnetic field of
strength B directed
into page felectric
For any ratio of electric field strength to magnetic field strength there is just
one velocity that will be undeflected. This arrangement is called a velocity
selector. If a stream of charged particles with a range of incident velocities
enters the region of crossed fields then only those satisfying the equation
above go straight through.
Mass spectrometer
A velocity selector is used to send ion beams with a particular velocity into a
mass spectrometer. The beams are then deflected in a constant magnetic field
so that they move in a semi-circular path. The radius of curvature of this path
can be used to determine the masses of the ions. It is also possible to measure
the amount of each type of ion that is present in the beam, so mass spectrom-
eters are ideal for analyzing ionic ratios in samples.
A simplified diagram to illustrate the principle of the mass spectrometer is
shown below. The entire apparatus is evacuated.
r
region of uniform
E magnetic field B2
source of ions
B1
with a range of
velocities
B2
current
element
I
l
I x
P
The direction of δB is found using the right-hand rule and using the continu-
ation of the current element as the current direction. The magnitude of δB is
given by the equation:
0 I sin
B l
4 x2
where µ0 is the permeability of free space, a constant that determines the
ability of a vacuum to support a magnetic field.
The diagram on the right has the axis of the coil into the page. The resultant
magnetic field is also into the page, shown by the cross at the center:
l I
x B
0 I sin / 2
B l
4 r 2
0 I sin / 2
l 2 r
B
l 0
4 r 2
l
All the terms in the curly bracket are constant so this is a very simple integral
resulting in:
0 I
B
2r
If the coil has N turns the magnetic fields add together:
0 NI
B
2r
An expression for the field strength at other points along the axis of the nar-
row coil can also be determined using the Biot–Savart law. The integration is
a little more complicated but the result is:
0 Ir 2
B
2 z2 r 2
3
2
Where z is the distance along the axis from the center of the coil. When z = 0
the expression reduces to the previous equation (as it should).
B P
x
r
l
l I
The resultant magnetic field forms concentric rings around the line of the
electric current:
0 I sin
B l
4 x2
The contribution from current elements to the left and right of P are the same
so the resultant magnetic field strength at P is:
l
0 I sin
B2
l 0
4 x2
dl
The three variables θ, x, and l are all related so in order to carry out the inte-
gration we need to express two of them in terms of just one of them and the
constant distance r:
r
x
sin
r dl r
l so 2
tan d sin
The integral now becomes:
0
0 I sin d
B 2 r
2
0 I
B
2 r
I I
An electromagnet consists of a long coil wound around a soft iron core. The
magnetization of the core increases the magnetic field strength and the field
at the center is then given by:
0 r NI
B
l
where µr is the relative permeability of the ferromagnetic core. Some typical
values are shown below but magnetic permeability depends strongly on the
magnetic field strength so these are representative values only.
Iron 5000
Ferrite 600
Nickel 300
Carbon steel 100
I
B.dl 0 I
closed enclosed
loop by loop
l
I
X
r
B.dl 2 rB
enclosed
0I 0I
closed
loop by loop
0 I
B
2r
B C
north pole
south pole
I I
b
A D
a
a
2 BIb
BIab BIA
2
Where A = ab is the area of the coil. If the coil has N turns the torque is:
NBIA
AB CD
north pole
south pole
AB . X CD
AB CD
The device used to reverse the current is called a split-ring commutator. For
a simple motor with a single coil, the commutator consists of a conducting
cylinder split in half so that the halves are separated by an insulator. The com-
mutator is attached to the axis of the motor and rotates with it. Current enters
and leaves the coil via brushes that make sliding contact with the surface of
the commutator.
brushes
conductors
coil rotates with
commutator
rotation axis
insulator separating
two halves of
commutator
The commutator acts as a rotating switch ensuring that the direction of cur-
rent in the coil stays the same as it rotates.
20.5 EXERCISES
1. The Earth’s magnetic field is very similar to the field of a dipole bar mag-
net. However, geophysicists are sure that the field is not caused by a per-
manent magnet inside the Earth. Explain why not.
2. Two long straight parallel wires separated by 0.045 m each carry a current
of 2.0 A in the same direction.
(a) Draw a diagram showing the magnetic field around one of the wires
interacts with the other wire, and use this diagram to explain why the
wires exert a force on one another. State the direction of this force.
(b) Calculate the magnitude of the force per unit length on each wire.
(c) Explain why coils of wire carrying a very large current must be able
to withstand large stresses.
3. A mass spectrometer is used to measure the masses and abundances of
different isotopes. It does so by accelerating ions of each isotope to the
same speed and then deflecting them into a semi-circular path in a strong
uniform magnetic field.
(a) Explain how a velocity-selector, consisting of perpendicular electric
and magnetic fields can be used to select ions of the same speed
from a group containing a wide range of different speeds.
(b) Ions of mass m1 and m2 and equal charge q enter the same uniform
magnetic field of strength B at the same velocity v at right angles to
the field lines and both are detected after moving through a complete
semi-circle in the field. Derive an equation for their separation.
4. The diagram below shows an end view of a rectangular coil in a uniform
magnetic field of strength 0.05 T. The dot represents current out of the
page and the cross represents current into the page.
(a) Use a diagram to explain why there is a resultant moment on the coil
and state the direction of this moment.
(b) Describe qualitatively how the moment on the coil changes as θ var-
ies from 0 to 90°.
The coil has 80 turns and an area of 0.012 m2. It carries a constant
current of 0.65 A.
(c) Calculate the moment on the coil when θ = 0°, 30°, 45°, 60°, and 90°.
(d) Explain how the coil would move if it was released from a horizontal
position and allowed to move freely about a central axis directed into
the page.
(e) Explain what has to be done to the current in the coil if it is to oper-
ate as a DC motor.
21
Electromagnetic Induction
mechanical Generator
electrical
energy energy
galvanometer deflects
N S when wire moves and
G
reads zero when wire is
staonary in or out of field
bar magnet S
moved along
doed line N
coil G
primary secondary
coil coil
so iron
core
to the primary coil with an AC supply. The output from the secondary coil is
now an AC emf that can be displayed on an oscilloscope. If a dual beam oscil-
loscope is used the emf in the secondary can be compared with the current in
the primary (which is in phase with the magnetic field).
primary secondary
coil coil
so iron
core
A.C.
to oscilloscope
input 2
to oscilloscope
input 1
Note that the peaks of the induced emf occur at times when the rate of change
(gradient) of the magnetic field through the secondary coil is greatest and that
the induced emf is zero at times when the rate of change of the magnetic field
through the secondary coil is zero (gradient is zero).
for the induced emf we need to define what we mean by “amount of magnetic
field.” This will depend on three factors, the strength of the field, its orienta-
tion, and the area through which it passes.
Magnetic Flux
If a constant uniform magnetic field of strength B passes normally through a
surface of area A the flux Φ is defined as the product BA.
magne
c field of strength B
area A perpendicular
to magne
c field
= BA
A
The total flux is found by integrating these contributions across the surface:
=Φ ∫ B cos θdA
surface
Magnetic Flux-Linkage
When magnetic flux passes through a coil of N turns it links each turn in the
coil, so it is convenient to define the magnetic flux-linkage as the product of
the magnetic flux and the number of turns linked by that flux:
flux-linkage = NΦ
If the flux is caused by a constant uniform field of strength B along the axis of
the coil the flux linkage is simply:
flux-linkage = NBA
where A is the area of the coil. N is a dimensionless number so the SI unit for
flux-linkage is the weber (Wb).
region of
magnec field of
strength B into
G v v d
The conductor runs along two parallel conducting rails connected back to a
stationary galvanometer. As the conductor moves it cuts through lines of mag-
netic field. In time δt it cuts all the magnetic field lines in the darker shaded
area and the flux through the circuit increases by an amount equal to the flux
through that additional area.
Flux cut in time δt:
δΦ = Bdvδt
Rate of flux-cutting:
dΦ
= Bvd
dt
Using Faraday’s law, the induced emf in the moving conductor is:
E= ( − ) Bvd
Lenz’s Law
The negative sign in the equation for Faraday’s law can be interpreted in the
following way:
The direction of the induced emf is such as to oppose the change that
caused it.
This sounds a little obscure but becomes clear when we apply it to the exam-
ple above. In order to induce an emf we needed to push the wire to the right.
The existence of an emf in the circuit causes a current to flow. However,
when a current flows in a conductor lying perpendicular to the magnetic
field there is a motor effect force on the conductor that is perpendicular to
both the current and the field. In this case, the force must lie either in the
direction of v or in the opposite direction. If it was in the direction of v then
once we had started the wire moving the induced emf would create a cur-
rent that experienced a force in the direction of motion and the wire would
accelerate with no need for us to do any further work on it. Energy would
be generated from nothing! This violates the law of conservation of energy
so cannot occur. The force on the induced current must oppose the force
moving the wire so that we do work to move it and “pay” for the electrical
energy it generates. Lenz’s law ensures that energy is conserved when an
emf is induced. It is the reason we need to burn fuel to turn the generators
in a power station.
B
t
cos
=
Flux-linkage: Φ NB0 A cos ωt
d ( NΦ )
Induced emf: E= − =
ωNB0 A sin ωt
dt
This is an AC output with peak value ωNB0A.
21.3 INDUCTANCE
When there is current in a coil the coil becomes an electromagnet and creates
a magnetic field. If the magnetic field changes, there is changing flux-linkage
inside the coil so an induced emf is created that opposes the external supply
that is changing the current. This is often called a “back emf.” This opposi-
tion to changing current is called inductance. Coils act as inductors in electric
circuits.
21.3.1 Self-inductance
The tendency for a coil to oppose changes in current is called inductance.
The greater the inductance, the stronger the opposition to changing currents.
Inductance is defined by the equation:
dI
E = −L
dt
−E
L=
dI
dt
Inductance is equal to the back emf per unit rate of change of current.
The SI unit for inductance is VsA− 1 or the henry (H). 1H = 1 VsA− 1.
An expression for the self-inductance of a long solenoid can be determined by
comparing the definition of inductance with Faraday’s law:
dI dNΦ
E=
−L = −
dt dt
µ 0 N 2 IA
=
Φ NBA
=
l
so
dI µ 0 N 2 A dI
L =
dt l dt
µ0 N 2 A
L=
l
This effect means that it takes time for the current to reach its steady final
value and that the supply does work against the back emf as this current is
established. During this time energy is being transferred from the supply to
the magnetic field.
VS current
VS /R
S
me
VR VL 0
0
dI
V=
S IR + L
dt
This is a first order differential equation that can be solved to give an expres-
sion for I.
− t
R
=I I0 1 − e L
This has a similar form to the equation for the charging of a capacitor and
(R/L) is the effective time constant.
If S is opened the current falls to zero in a very short time so that dI/dt is very
large. This results in a large back emf that can cause sparks across the switch
terminals. Interrupting the current in a coil is a way to generate spikes of high
voltage – for example, for a car’s spark plugs.
dI
= = IL
P IV
dt
separating variables and integrating:
t=∞ I = I0
1 2
=E
=t 0=I 0
∫=
Pdt ∫=
ILdI
2
LI
Coil 2: I2, N2 , A2
flux-linkage in the other coil and an induced emf. The strength of this cou-
pling is measured by the mutual inductance of the system.
The mutual inductance of two coils is defined by the equations:
dI dI
E2 = M 1 E1 = M 2
dt dt
Where k is a coupling constant that depends on how the coils are arranged
and how they are magnetically linked.
21.4 TRANSFORMERS
Electromagnetic induction is the key principle on which the generation and
distribution of electrical energy depend. Thermal power stations use fossil or
nuclear fuels to transfer chemical or nuclear energy to thermal energy and
then use this to power generators that output AC electricity. Transformers step
up the voltage so that electricity can be transmitted with low losses over large
distances. More transformers are used to step down the voltage for consumers.
I
output
o utput
A.C
A.C.
.C.. ssupply
upply V1 R
volta
tage
ge
voltage
pprimary
rim
ri il,, N1 turns
mary ccoil,
oil turns il,, N1 turns
secondary ccoil,
secondary oil turns
Flux in core
induced emf in
secondary
me
The peaks of emf correspond to times when the flux in the core has its greatest
rate of change.
For an ideal transformer all of the flux created by the primary passes through
the secondary. If there are equal numbers of turns on both primary and second-
ary the back emf in both coils will be equal and will equal the supply voltage.
However, the voltage on the secondary is also directly proportional to the num-
ber of turns on the secondary (V2 ∝ N2) so changing the turns ratio (N2/N1) must
change the voltage ratio (V2/V1) in a similar way. This leads to the transformer
equation:
V2 N2
=
V1 N1
I1 V1 = I2 V2
I2 V1 N1
= =
I1 V2 N2
“the current ratio is the inverse of the voltage and turns ratios.”
V1 V2 V3
step-up step-down
transformer transformer
P1
I2 =
V2
RP12 1
=
PR I=
2
2R 2
∝ 2
V2 V2
Flux losses: the magnetic circuit provided by the transformer core might
not confine all of the magnetic flux so that some of the flux created by the
primary coil does not pass through the secondary coil.
Eddy current losses: transformers work with AC so there is an alternat-
ing magnetic field in the core itself. If this is made from a conducting
material (e.g., soft iron) there will be induced current loops in the core.
These will dissipate heat. In order to reduce these losses the core is usually
laminated – cut into slices that are separated by thin layers of insulator.
High-frequency transformers, for example, in radio frequency circuits use
non-conducting cores (e.g., made ferrite cores to prevent these losses).
Hysteresis losses: the continual magnetization and demagnetization of
the core create alternating stresses inside the core that dissipate energy as
heat. Soft iron has low hysteresis losses so, in addition to its high perme-
ability, it is a good material for low-frequency transformer cores.
Our discussion of transformers has assumed that the loads attached to them
are purely resistive and ignore the phase relationships between currents and
voltages in the two coils. In practice, this is quite complex and the inductance
and capacitance of the circuits need to be taken into account.
south
north
Flux linkage:
=Φ NBA sin θ
me
T=2/
The negative sign in the equation has been ignored in the graph. The output
is an AC voltage with peak value E0 = ωNBA.
When the switch is closed the induced emf causes alternating currents in the
coil. By Lenz’s law, these currents flow in a direction that opposes the change
that caused them. As the north pole approaches the top of the coil the current
moves in such a direction that the top of the coil is also a north pole, repelling
the magnet. When the magnet moves away from the coil the induced currents
form a south pole at the top of the coil, attracting the magnet. It is clear that
the electromagnetic forces act in the opposite direction to the motion, like
friction, and provide additional damping. The oscillations decay more rapidly
with the switch closed.
S
oscillaons
N
switch
change that caused them. As the north pole approaches the top of the coil
Another simple demonstration involves placing a cylindrical neodymium mag-
net on a thick sheet of aluminum or copper and then tipping the sheet until
the magnet slides down. Aluminum and copper are not ferromagnetic metals
so when the magnet is stationary there are no magnetic forces acting on it.
However, when it begins to move the magnetic flux cuts through the conduc-
tor inducing emfs. The induced emfs create current loops and the magnetic
fields of these loops oppose the motion of the magnet. There is a magnetic
drag force that slows the magnet down. This can be a surprisingly large effect
if the metal sheet is titled at a large angle.
Another intriguing demonstration involves dropping a cylindrical neodymium
magnet down a copper water pipe with an internal diameter slightly greater
than the diameter of the magnet. The magnet falls at a slow terminal velocity.
Electromagnetic damping forces balance its weight.
copper
pipe
FEM
S
neodymium
v
magnet
mg N
As magnet falls its flux cuts the conductor surrounding it. This
induces current loops above and below the falling magnet. These
create magnetic fields that oppose the magnet’s motion.
moon of magnet
F
The moving magnetic field induces emfs in the metal that create current
loops. By Lenz’s law, these create magnetic fields that oppose the relative
motion, resulting in the forces on the magnet and conductor as shown above.
The effect is that the conductor tends to follow the moving magnetic field.
In an induction motor, coils are used to create a rotating magnetic field. A
non-ferromagnetic rotor is placed in the rotating field and it too rotates. This
has a big advantage that there are no brushes. The idea was first suggested by
Nikola Tesla and induction motors are now widely used from DVD players to
electric cars.
A rotating magnetic field can be created by superposing two alternating mag-
netic fields at right angles to one another and giving them a phase difference
of π/2. If we represent each magnetic field by a phasor then the resultant of
the two phasors represents the rotating field.
By B
Bx
=
Bx B0 cos ωt
=
By B0 sin ωt
=
B Bx2 + B=
2
y B0 cos2 ωt + sin 2 ω=
t B0
By
θ = tan −1 = tan −1 ( tan ωt ) = ωt
Bx
In other words, this creates a magnetic field of constant strength B that rotates
with constant angular velocity ω. A simple way to create such a field is to use
two pairs of coils arranged as shown below:
rotor
Vy = V0 sin t
Vx = V0 cos t
21.8 EXERCISES
bar magnet S
moved along
doed line N
coil G
While the magnet is moving toward the coil the galvanometer deflects to the
right.
2.5
2
current / A
1.5
0.5
0
0 0.05 0.1 0.15 0.2 0.25
me / s
(a) Use the graph to calculate the inductance and resistance of the coil.
(b) Calculate the energy stored when there is a current of 2.0 A in the coil.
(c) When the switch is opened, a spark is observed to jump across its contacts.
Explain why this occurs.
7. The diagram below shows an end view of a simple generator consisting
of a coil rotating at a constant rate in a uniform magnetic field. The AC
output is connected via slip rings to a purely resistive load.
south
north
22
AC
22.1 AC AND DC
DC stands for “direct current” and AC stands for “alternating current.”
DC current flows in one direction around a circuit, so the supply polarity
is c onstant. AC currents change direction periodically, so the polarity of the
supply alternates. Most AC supplies are sinusoidal and can be represented by
the equations:
=V V0 sin ωt
=I I0 sin ωt
voltage or
current
T= 2/
V0 or I0
0 me
0
= = I0 V0 sin 2 ωt
P IV
power
T= 2/
I0V0
½ I0V0
00 me
The power is always positive and peaks twice as frequently as the current or
voltage. The peak power is I0V0 and the average value of the power is ½ I0V0.
This can be shown using the trigonometric relation:
1 1
sin 2 A= − cos2 A
2 2
1 1
sin 2 ωt = − cos2ωt
2 2
The average value of a cosine term over a whole number of cycles is zero so
the average of a sine-squared term is ½.
The average AC power is therefore:
1
PAC = I0 V0
2
The root-mean-square values of current and voltage (for sinusoidal variation) are:
I0
Irms =
2
V0
Vrms =
2
By using the rms values instead of peak values we can express the formula for
average AC power in the same way as we express the formula for DC power.
This means that an AC supply voltage of 240 V rms would light a lamp to the
same brightness as a DC supply of constant value 240 V. In this sense, the
rms value is the “DC equivalent” value. However, an AC voltage of 240 V rms
actually peaks at ± 339 V. We can show where the factor of 1 / 2 comes from
as follows.
=V V0 sin ωt
=V 2 V0 2 sin 2 ωt
1 2
=
V2 V0 2 sin=
2
ωt V0
2
V0
V=
rms V=
2
V0 2 sin 2 ω=
t
2
Vrms 2
=P = Irms=Vrms Irms 2 R
R
dQ dV
=I = C
dt dt
=V V0 sin ωt
I =ωCV0 cos ωt =I0 cos ωt
I0 = 2p fCV0
ωCV0 =
This introduces a π/2 phase difference between the voltage and the current
with the current leading the voltage:
current in
I0
capacitor
voltage
V0 across
capacitor
0 time
0
The ratio of V/I is of little use because it varies continuously (being zero at some
points and infinite at others). However, the ratio of the peak values of voltage
and current is constant, this is the reactance XC (“kie-cee”) of the capacitor.
V0 1 1
X=
C = =
I0 wC 2p fC
The SI unit for reactance is the ohm (W) but reactance is not the same as
resistance because the peak values occur at different times.
Reactance can be used to find the peak or rms current in a capacitor if we
know the peak or rms AC voltage across it. However, the interesting thing
about reactance is its frequency dependence.
1
XC ∝
f
E is the back emf in the inductor which opposes the supply voltage V, hence
the change of sign.
If the current is:
=I I0 cos ωt
V0 = −ωLI0
Once again there is a π/2 phase difference but this time the voltage leads the
current by π/2.
I0 current in
inductor
V0
me
voltage
across
inductor
V
X L = 0 =ωL =2p fL
I0
This is also measured in ohms (Ω) and is frequency dependent. At low fre-
quencies the inductor has very low reactance, low-frequency signals pass
through an inductor easily. At high frequencies, the reactance becomes very
great so that high-frequency signals are severely impeded. This frequency
dependence is the opposite of that for the capacitor.
In common with the capacitor, however, there is no net energy flow into the
inductor. During one cycle energy is used to build up the magnetic flux in the
inductor but then energy flows out of the inductor as the field collapses.
In practice inductors consist of coils of conducting wire which has some resist-
ance, so we do not usually encounter pure inductance. A real inductor is mod-
eled as a pure inductor in series with a fixed resistor:
L R
VL = IX L
VR = IR
I
VC = IX C
The supply voltage is equal to the sum of the phasors across the three
components.
22.3.2 Impedance
The impedance Z of a load that contains resistive, capacitive, and inductive
components is defined as the ratio of the peak voltage across the load to the
peak current in the load:
V
Z= 0
I0
VL = IX L
VC = IX C
VR = IR
VS
I
VR 2 + ( VL − VC ) = I R 2 + ( X L − X C )
2 2
VS =
For particular values of L and C, VL, and VC are equal in magnitude and add to zero
(because they are p out of phase with each other). Under these circumstances, the
phase angle is 0 and the impedance is a minimum and purely resistive: the current
has a maximum value and is in phase with the supply voltage. This is called reso-
nance and if resistance is small it can result in a large increase in current.
The only component that dissipates energy is the resistor so the power dis-
sipated in this circuit is:
= =
P IVR IVS cos f
A.C.
C
R
VR VC
VR = IR
I
VS
VC = IX C
VS = VR 2 + VC 2 = I R 2 + X C 2
2
1
VS
= R + 2 πfC
2
=
Z = R2 + X C2
I
XC 1
tan=
φ =
R 2 πfRC
VS
A.C.
L
R
VR VL
VL = IX L VS
VR = IR
I
VS = VR 2 + VL 2 = I R 2 + X L 2
VS
R 2 + X L 2 = R 2 + ( 2p fL )
2
=
Z =
I
XC 1
tan=
φ =
R 2 πfRC
VS
A.C.
C
R
VR VC VL
VL = IXL
VS
I
VR = IR
VC = IX C
VL − VC X L − X C
=
tan φ =
VR R
XL = XC
1
ωL =
ωC
1
2 πf0 L =
2 πf0 C
1 1
f0 =
2 π LC
At this frequency, the current in the circuit has its maximum value
VS
Imax =
R
The circuit acts as a purely resistive load and the current and supply voltage
are in phase.
The graph below indicates how the resistance, impedance, and circuit current
vary with frequency for an RCL series circuit.
Resistance
Impedance
Current
impedance
resistance
current
0
0 f = f0 frequency
For f < f0 the reactance of the capacitor is greater than that of the inductor and
the circuit is said to be capacitive. For f > f0 the reactance of the inductor is
greater than that of the capacitor and the circuit is said to be inductive.
VS
A.C.
IR
IC
IL
IC = VS/X C
IR = VS/R
VS
IL = VS/X L
1 1
f0 =
2 π LC
At resonance, the currents in the capacitor and inductor are equal in magni-
tude but opposite in direction at every moment.
C L
When the capacitor is connected to the inductor the voltages across them
must be equal so that:
Q dI
= −L
C dt
Q d 2Q
= −L 2
C dt
d 2Q 1
2
= − Q
dt LC
This equation has exactly the same form as the equation of motion for simple
harmonic motion so the solutions will have the same form too:
Q = Q0 sin ù t
where
1
ω=
LC
1 1
f=
2 π LC
Q0
=
VC sin ωt
C
dQ
I= =
ωQ0 cos ωt
dt
d 2Q 1
2
= − Q
dt LC
d2 x k
2
= − x
dt m
Analogies like this occur all over physics. They help us to understand new phe-
nomena in terms of ones we are already familiar with but they are also useful
in their own right. For example, this correspondence between the mechanical
and the electrical means that we can model mechanical systems using electri-
cal circuits – for example, to test new designs for structures.
22.6 EXERCISES
The AC supply is adjusted until the lamp lights equally brightly from both
supplies.
(a) Calculate the peak value of the rms voltage.
(b) State the rms value of the AC voltage.
(c) Discuss whether the lifetime of the bulb (a filament lamp) will
depend on the type of supply used to light it.
16 V rms
A.C.
22 6.5 mH
VR VL
12 V rms
A.C.
10 22 F 50 H
VR VC VL
(a) Calculate the impedance of the circuit at 0 Hz, 1.0 kHz, 5.0 Hz,
10 kHz and 100 kHz.
(b) Explain what is meant by resonance and calculate the resonant
frequency for this circuit.
(c) Sketch a graph to show how the rms current varies with frequency
from 0 Hz to 100 000 Hz.
(d) Sketch a graph to show how the phase difference between the supply
voltage and the current varies as the frequency changes from 0 Hz to
100 000 Hz.
(e) Describe the energy transfers that take place in the circuit at
resonance.
23
The Gravitational Field
m1 F F m2
Gm1 m2
F= −
r2
their centers. This means that we can treat object like planets and stars as point
masses when considering the orbital motion. It is also important to note that,
by Newton’s third law, the forces on each mass have the same magnitude, even
if the masses are different. For example, the weight of an apple in the Earth’s
gravitational field is the same as the weight of the Earth in the apple’s gravita-
tional field. It is also the case that the gravitational force exerted on the Earth by
the Moon is equal in magnitude to the gravitational force exerted on the Moon
by the Earth.
The resultant gravitational force on a body affected by the gravitational fields of
several other objects (e.g., the Earth affected by the Sun, Moon, and other planets)
is the vector sum of the gravitational forces from each of the other objects.
The SI unit for gravitational field strength is newton per kilogram (Nkg− 1).
For a point or uniform spherical mass, the field strength at a distance r from
the center of mass M can be determined by considering the force per unit
mass acting on a small mass m placed at that distance:
F GM
g= = − 2
m r
ΦG = ∫ g.dA = −4p G ∑ m
enclosed
closed
surface
The negative sign again arises because of the attractive nature of the gravita-
tional field, the flux entering through a closed surface is equal to 4πG times the
total mass enclosed by the surface. In the same way, as in electrostatics, this
form of Gauss’s law is particularly useful for situations with spherical sym-
metry. For example, we can use it to show that the gravitational field strength
inside a hollow uniform spherical shell is zero everywhere.
4 4
r1
4 0
0
We can determine an expression for the field strength inside the earth by
recalling that the field strength inside a hollow sphere is zero. This implies
that the field strength at distance r < RE from the Earth’s center is that of the
mass inside radius r.
GME r
g ( r < RE ) =
−
RE 3
The magnitude of g is zero at the center of the Earth and increases linearly to
the surface. The gravitational field strength at the Earth’s surface is:
GME
gsurface = −
RE2
This is about 9.8 Nkg− 1 although it varies by a few percent at different locations.
g
r = RE
0 r
If an apple falls from a tree to the ground a gravitational force (its weight)
acts on the apple and does work on it, so its gravitational potential energy
decreases.
A m mg
x
The work done by the gravitational field to move it a short distance δx along
the line AB is:
d W mg cos θδx
=
where mg.dx is the scalar product of the two vectors mg and δx, that is, the
sum of the component of the gravitational force in the direction of motion
along each line element. The gravitational forces do work on the particle so its
gravitational potential energy falls:
B
− ∫m g.dx
∆GPE AB =
A
The differential form of this equation is:
dGPE AB
= −m g
dx
A
m
h
x
B
− ∫mg.dx =
∆GPE AB = mgx cos è =
= − mgh
A
While this is only exact when g is constant it is a good approximation for verti-
cal displacements close to the surface of the Earth that are small compared to
the radius of the Earth.
The SI unit for gravitational potential is the joule per kilogram (Jkg− 1).
The zero of gravitational potential is taken to be at infinity.
This is the same convention used to define the zero of electrical potential and
it makes sense because when particles are separated by very large distances
their interactions, under an inverse-square law, become negligible. Once we
have defined the zero of potential, we can determine the absolute potential
(and potential energy) at any point. This is equal to the work that must be
done per unit mass to move a small mass from infinity and to place it at point P.
M P mg m x
to infinity
r
x
where
GM
g= −
x2
x= ∞
GM GMm
GPE (• ) GPE ( r ) =
∆GPE = − ∫ m − 2 dx =
x=r x r
GMm
GPE ( r ) = −
r
GPE ( r ) GM
VG ( r ) = = −
m r
All gravitational potential energies are negative because work would have to
be done to move any mass to infinity. This is a consequence of the attractive
nature of gravitational forces.
Close to the surface of the Earth the gravitational field is approximately uni-
form. The equipotentials are planar surfaces parallel to the surface of the
Earth and are equally spaced.
equipotenal
increasing
surface of Earth: potenal
7 1
VG6.310 Jkg
GM
VG ( RE ) =
− −6.25107 Jkg −1
=
RE
The change in potential energy when a mass is moved from one place (A) to
another (B) in the field is given by:
∆GPE ( AB ) =m∆VG =m ( VG ( B ) − VG ( A ) )
Consider a mass m moved through a vertical height h from the surface of the
Earth:
GM GM GM GM
∆GPE ( AB ) =
m∆VG =
m − −− =
m − +
( RE + h ) RE ( RE + h ) RE
This is equal to the work that must be done against gravitational forces to
move a mass m from the surface of the Earth to infinity, that is, to completely
escape from the Earth’s field.
M
v
r m
1 GMm
TE = mv2 − ≥0
2 r
The escape velocity is therefore given by:
1 GMm
mvesc 2 − =
0
2 r
2GM
vesc =
r
2GME
=vesc = 1.1210 4 ms−1
RE
Black Holes
The speed of light is a limiting speed in the universe so if the escape velocity
reaches this value then nothing, not even light, can escape. Black holes are
objects that are sufficiently massive and compact that the escape velocity at a
certain distance from the object is equal to the speed of light. This distance is
called the Schwarzschild radius and while a thorough analysis of black holes
requires general relativity, it is possible to derive some useful results from
Newton’s law of gravitation.
Consider a black hole of mass M. At a certain distance from the center, called the
“Schwarzschild radius” RS, the escape velocity is equal to the speed of light, c:
2GM
vesc= c=
RS
2GM
RS =
c2
Another way to look at this is to realize that a body of mass M will become a
black hole if all of its mass is compressed inside its Schwarzschild radius, RS.
For the Earth to become a black hole this radius is just under 1 cm. The Earth
would have to be compressed to the size of a table tennis ball.
A sphere of radius RS surrounding the black hole is called the “event horizon.”
This is because no events inside this radius can communicate with the outside
Universe. In a sense everything inside the event horizon has been cut off from
the rest of the Universe. In May 2022, astronomers using the Event Horizon
Telescope created the first image of the supermassive black hole at the center
of the Milky Way galaxy.
Earth
explain the motions of objects in the Solar System and beyond. It is hard
to over-emphasize the importance of this work, it provided a single elegant
explanation for motions across the Universe. In our own Solar System, it
explained the motions of all the planets and other bodies (such as comets and
asteroids) using a single principle.
Perihelion: Aphelion:
closest point to farthest point
Sun, greatest from Sun,
orbital speed lowest orbital
speed
Assume that the central body has a much greater mass than the orbiting body
so that the motion of that central body can be neglected. The center of the
orbit is then the center of the central mass. (in fact, both objects orbit about
their mutual center of mass).
v
m
mv2 GMm
=
r r2
2p r
v=
T
r 3 GM
=
T 2 4p 2
1 GMm
Total energy =KE + GPE = mv2 −
2 r2
mv2 GMm
=
r r2
so
1 2 GMm 1
KE = mv = = − GPE
2 2r 2
The total energy is negative. This makes sense because an orbiting mass is in
a bound state. It would require an external agent to do work to remove the
mass to infinity.
r 3 GM
=
T 2 4p 2
4p 2 r 3
T=
GM
T ∝ r2
Satellites in low Earth orbits, like Sputnik, have periods of about 90 minutes.
Low Earth polar orbits are useful for weather satellites and other satellites
that need to scan the entire surface of the Earth because they pass over a dif-
ferent strip of the surface on each orbit and can observe the entire surface in
a 24-hour period.
GPS satellites are further out, at an altitude of about 20 000 km, and have an
orbital period of about 12 hours. The GPS system creates a constellation of
satellites around the globe and relies on several satellites being visible to a
GPS receiver at any point on the Earth’s surface at any time. The minimum
number of operational satellites required to achieve this is 24 but the target
number is 33. Over 70 GPS satellites have been launched but not all of them
are still operational.
“Geostationary satellites” are placed in an equatorial orbit at an altitude of
about 35 800 km (r = 42 200 km) so that they have an orbital period of 24
hours. This ensures that they remain stationary above the same point on the
Earth’s equator because they complete one orbit as the Earth itself rotates
once. This is called a “geosynchronous orbit” and is used for communications
satellites. The fact that they remain stationary in the sky means that ground-
based antennae do not have to track them.
field than B and A is in a weaker field so both are pulled away from mass B).
There will also be a smaller compression along line DE because of the inward
components of the gravitational field at the edges of body 1. The effect is to
stretch the body 1 along AC and to compress it along DE. This is shown in
the diagram below.
small squares
represent small
masses m at each
posion
body 1 of
radius r
gravitaonal field
A B C
distance from
E
center of body 1 to
center of body 2 = R
mass of body 2 = M
The tidal forces shown are those that act in the freely falling reference frame.
Tidal forces are differential forces and are defined as the difference between
the actual gravitational force FA on a small mass m and the force that would
be exerted on it if it was at the center of the body, FB. For example, the tidal
force on the mass at A is given by:
the negative sign here indicates that the direction of the tidal force is away
from the center of the body. This obeys an inverse-cube law with respect to
distance so tidal forces will only be significant for bodies that are relatively
close together.
Earth Sun
Moon Moon
relave posions
for spring des
The period of the Earth’s rotation is significantly less than the period of
the Moon’s orbit so the Earth effectively rotates under the tidal bulges
and there are approximately two high tides in each 24-hour period.
However, the Earth’s rotation pushes the bulge slightly ahead of the
Moon’s position. This has the effect of creating a retarding torque on
the Earth that reduces its rotation period, making day length gradually
increase, and reducing its angular momentum. However, angular momen-
tum for the system must be conserved so the radius of the Moon’s orbit
also increases. The present rate of increase in day length is about 1.7 ms
per century and the Moon’s orbital radius increases by about 3.8 cm per
year. When the Moon first formed, the length of a day on Earth was only
about 5 hours. The long-term effect of this interaction is that the Moon’s
orbital period and the Earth’s rotation period will eventually become the
same – they will be tidally locked.
Moon
Earth Sun
relave posions
for neap des
Moon
ball projected on
surface
As seen above, the presence of the central mass has caused the deflection of
the ball. As far as the ball is concerned it has simply followed the curvature of
the surface. John Wheeler summarized Einstein’s theory of general relativity
by saying that:
Matter tells space how to curve.
Space tells matter how to move.
In Newtonian mechanics, an object continues to move in a straight line
unless acted upon by a resultant external force. In Einstein’s Universe objects
move along the shortest paths in curved space–time. These paths are called
geodesics. A good analogy is found in the paths followed by aircraft on long-
haul flights around the Earth. The shortest path between two points on the
Earth’s surface is an arc of a great circle around the center of the Earth.
According to Einstein’s theory, the Earth’s orbit around the Sun is actually
a geodesic in space–time. The Earth is not being deflected by a gravitational
force it is simply following the local curvature of space–time and that c urvature
has been caused by the presence of the Sun.
a
g
A simple formula for time dilation can be derived using Newton’s theory of
gravity. Consider a photon emitted vertically from a source on the surface of
a star of mass M. As the photon travels away from the star its gravitational
potential energy increases, so photon energy and frequency fall. The change
in a time period of the photon in different positions in the gravitational field
can be used to compare the rates of clocks at these positions. The time period
of the photon at A is T and at B is T′.
M
A B
m is the mass equivalent of the photon energy and ΔV is the change in GPE
between A and B.
If the change in potential energy of the photon is small compared to its energy,
mΔV << hf then T′ ~ T so we can simplify the expression as follows:
hf ∆V
hf ′ − hf = 2
c
h h h∆V
− =
T ' T Tc2
∆T ∆V
=
T ' T Tc2
T ∆V
∆T ~
c2
∆V
T′ T 1 + 2
=
c
The greater the change in potential the greater the difference in clock rates.
direcon of
gravitaonal wave
However, the predicted amplitude of these vibrations is tiny, even from pow-
erful astronomical events such as the collision of black holes. Typical ampli-
tudes change the dimensions of an object by about 1 part in 1020. For an
object 1 m long this is 10 billion times smaller than the diameter of an atom.
This makes them extremely difficult to detect and the first evidence for the
existence of gravitational waves was indirect.
Gravitational waves transfer energy, so an orbiting binary system loses energy
as it radiates and its period gradually changes. Alan Hulse and Joseph Taylor
analyzed the period of a binary system that consisted of a pulsar and a normal
star over a period of 30 years. They showed that the reduction in the period
was within 0.2% of the change predicted on the basis of gravitational r adiation.
The energy radiated is about 7.35×1024 W, about 2% of the Sun’s luminosity.
Their work won them the 1993 Nobel Prize in Physics and provided the first
convincing experimental evidence for the existence of gravitational waves.
The most sensitive terrestrial detectors use an interferometer to detect small
changes in light path over two perpendicular arms.
mirror
laser
mirror
beam
superposed beams
to detector
Monochromatic light is split so that half of the beam travels along the
“vertical” arm and half along the “horizontal” arm (in reality these will both be
horizontal – for example, NS and EW, and perhaps several kilometers long).
The returning beams superpose and interfere. When a gravitational wave
passes, the lengths of the arms fluctuate introducing a periodic phase differ-
ence that makes the interference pattern change at the frequency of the wave.
These fluctuations were finally discovered at the LIGO, Laser Interferometer
Gravitational Waves Detector, in 2015 and announced in February 2016 (100
years after they were predicted). The signal detected by LIGO was consist-
ent with the source being the inward spiral and coalescence of a pair of black
holes of around 36 and 29 solar masses. This discovery opens the door to a
new age of astronomy in which the most violent cosmic events are observed
using gravitational wave “telescopes.”
23.6 EXERCISES
(a) Calculate the weight of a 1.0 kg mass close to the Earth’s surface.
g = 9.8 Nkg–1.
(b) Calculate the centripetal force required to keep 1.0 kg of the Moon’s
mass in its orbit around Earth. Moon’s orbital period = 27.3 days. The
mean radius of Moon’s orbit = 3.84×108 m.
Your answer in (b) is much smaller than in (c), but The Moon is fur-
ther from the Earth, so if both forces arise because of gravitational
attraction to the Earth then the strength of this attraction must fall
with distance.
(c) Show that the forces in (a) and (b) are consistent with the idea that
gravitational forces obey an inverse-square law with distance. The
radius of the Earth = 6.4×106 m.
(d) Give an argument to support the idea that gravitational attraction is
proportional to the mass of the object attracted (think of the Moon in
its orbit).
(e) Give an argument to support the idea that gravitational attraction is
proportional to the mass of the attracting object (think of Newton’s
third law).
(f) Use your answers to (c), (d), and (e) to justify the form of Newton’s law
of gravitation.
2. Consider a mass m suspended at a point P just above the surface of a
planet of mass M and radius R. Write down expressions for each of the
following quantities:
(d) Use Kepler’s third law to calculate the distance from the Earth at
which an artificial satellite would be geostationary (i.e., have an orbital
period of 24 hours or 1 day).
(e) Why must geostationary satellites be placed in equatorial orbits?
(f) Use Kepler’s third law to derive a lower limit for the period of an artificial
Earth satellite.
6. Ganymede is Jupiter’s largest moon. Its orbit is circular with a radius of
1.07 × 109 m. The orbital period is 7 days, 3 hours and 43 minutes. Use
Kepler’s third law to find Jupiter’s mass.
7. If the Olympics were to be held on the Moon suggest how the shot putt
record on the Moon would compare with the record on Earth. Explain
your answer.
8. Both the Sun and Moon exert tidal forces on Earth’s oceans.
(a) Explain, qualitatively, how tidal forces arise.
(b) Show that the tidal effects caused by the Sun are significantly less than
those caused by the Moon.
9. Astronomers think that black holes may exist at the center of most g alaxies.
The evidence is from the high orbital speeds of stars near the center of
the galaxies (including our own Milky Way). The star orbits because of
attraction to the central object and that object must have a radius smaller
than that of the star’s orbit. Use this information to show that the object is
likely to be a black hole if:
c
v≥
2
Where v is the orbital speed (assume the orbit is circular) and c is the
speed of light.
2GM
The equation for the Schwarzschild radius is: RS = 2
c
where M is the mass of the central object.
24
Special Relativity
from rest? This is an important question because if the answer is “yes” then
there is one special reference frame that is at rest and all other reference
frames are in motion. If the answer is “no” then the idea of being at rest is not
special at all. The idea of a privileged reference frame at rest in the Universe
leads to the concept of “absolute space,” a fixed background or reference
frame against which all other motions can be measured.
This is where the speed of light comes in. This speed emerges naturally
from Maxwell’s equations and represents the speed of a disturbance in the
electromagnetic field. Other kinds of waves are all vibrations in some mate-
rial medium so physicists naturally thought that light must also have its own
medium that fills the vacuum of space and is at rest in absolute space. They
called this all-pervasive medium the “luminiferous ether.” The speed of light
would then be the speed relative to the ether and to absolute space. They
immediately began to think about how they might detect the ether, or at least
detect our motion relative to the ether.
In principle, it should be relatively easy to detect the effect of the Earth’s
motion relative to the ether. The Earth orbits the Sun so its motion through
the ether ought to vary periodically during the year. If the speed of light is con-
stant relative to the ether, then the speed of light measured on Earth would
be a relative velocity and should also vary, sometimes above and sometimes
below the value given by Maxwell’s equations. Several ingenious experiments
were carried out to try to test this idea, the most famous of which was the
Michelson-Morley experiment in 1887.
In the Michelson-Morley experiment a light beam is divided in two and each
half is sent on a round trip perpendicular to the other half of the beam. The
two beams are then combined and form an interference pattern. Any differ-
ence between the time of flight on the two paths shows up as a shift in the
interference pattern when the returning beams superpose. Since the Earth
orbits the Sun the experimenters expected the interference pattern to shift
during the year as the Earth’s motion relative to the ether changes and this
changes the relative velocity of light, producing different time delays for each
path. For example, if the Earth moves through the ether in the same direc-
tion as light then the relative velocity is lower, and if the Earth moves in the
opposite direction the relative velocity is higher.
The arrangement of the apparatus for the Michelson-Morley experiment is
shown (simplified) below. This is an example of an “interferometer” consisting
of two arms of equal length. The motion of the laboratory through the ether
delays the light along both arms but the delay is greater along the path p
arallel
to motion through the ether. If the apparatus is rotated through 90° the greater
delay should shift to the other arm of the interferometer and the interference
pattern should shift. The apparatus used by Michelson and Morley was sensi-
tive enough to detect shifts caused by relative velocities comparable to the
Earth’s orbital speed, so they expected to detect the Earth’s motion through
the ether.
half-silvered
mirror
mirror (beam
monochromac splier)
light
interference paern
This is called a “null result” and is probably the most famous null result in the
history of physics. All attempts to detect the ether have failed and the fact that
there is no delay in experiments such as the Michelson-Morley experiment is
very hard to explain using the concept of an ether.
This failure to detect the ether is also a failure to detect the Earth’s motion in
absolute space. It seems that the Earth’s motion does not affect the speed of
light.
v
mirror
c
L
c
Observer B
mirror mirror mirror
c
c Observer A
mirror
mirror
mirror mirror
vT’/2 vT’/2
2L
For A, the time between ticks of his own light clock is T where: T = Also
c
for A, let the time between ticks on the moving (B’s) clock be T’. Now use
Pythagoras’s theorem to find the relationship between T and T’.
Considering either one of the two right angled triangles above:
c 2 T ′ 2 v2 T ′ 2
− =
L2
4 4
cT
L=
2
Therefore:
c 2 T ′2 v2 T ′ 2 c 2 T 2
− =
4 4 4
T
=T′ = gT
v2
1− 2
c
where:
1
g=
v2
1−
c2
This is Einstein’s “time dilation” formula, showing how time slows down in
a moving reference frame, that is, the time between “ticks” on the “moving”
clock is increased relative to the “rest” clock.
The “gamma-factor” determines the significance of relativistic effects. For
v << c it approaches 1 and T’ = T, which is consistent with an idea of an
absolute time. However, as v increases the factor grows so that time in the
moving reference frame (B’s frame) slows down when observed from the
rest frame (A’s frame). As v approaches the speed of light the gamma-factor
increases without limit so that time would slow to a halt at v = c. Note that
we are not just talking about clocks here. As far as B is concerned time in
his reference frame is measured correctly by his own light clock so when
observed by A it is not just that the clock in B’s frame slows down, but so
does B’s aging process – time slows down.
gamma-factor
γ =1
velocity of moving
γ =0
reference frame
v=0 v=c
For relative velocities small compared to the speed of light, the gamma-factor
is negligible so relativistic effects can be neglected. This allows us to assume
that we all share the same time and space and can agree over measurements
of time intervals and distances. For example, even at the speed of a jet airliner
(about 300 ms- 1) the value of the gamma-factor is just 1.0000000000005!
If A sees B’s time slow down then what does B see when he looks back at A?
Both of the observers are in inertial reference frames so the laws of physics
are the same for both of them and the speed of light is constant in their refer-
ence frames. B will see A moving in the opposite direction at speed v and will
see the light path in A’s clock stretched out. B will see A’s time run slow by
the same time dilation factor that A saw B’s clock run slow. The effect is the
same for both observers. At first sight, this seems to lead to a contradiction
but while the clocks are in motion they cannot be placed at rest next to one
another to compare them to find out which one has gained or lost time. To do
this we would have to take at least one of them on a round trip.
v
B
A
B
v
Star
Earth
However, all inertial reference frames are equivalent, so B might argue that
from her point of view A and the Earth move away and return and the star
approaches and recedes. During the motion, she sees A’s time run slowly and
so concludes that A should be the younger one when they reunite. Is this a
valid way to describe the journey? In fact, it is not. If we ignore the relatively
low velocity of the Earth then A stays in the same inertial reference frame
throughout and so A’s point of view is valid. B however, undergoes three sep-
arate periods of acceleration: as she leaves Earth, as she turns around and
accelerates back toward the Earth at the distant star, and then again as she
comes to rest back on Earth. To describe the journey from B’s point of view
we must take into account the effects of these periods of acceleration. From
B’s point of view, she will experience inertial forces during these periods that
are just like increases and decreases in the local gravitational field. These
changes introduce gravitational time dilation effects (see Section 23.5.3) that
are not present for A; so B’s view is not equivalent to A’s.
The traveling twin (B) will be younger when they reunite and the apparent
paradox is resolved.
Light source
c emits flash c
*
Clock A Clock B
clock A first so that clock starts first. Clock B starts later. The two clocks are not
synchronized in the stationary observer’s reference frame even though they
are synchronized in the moving reference frame. For the stationary observer
the synchronization “error” increases with the separation of the clocks.
staonary
observer
This is a profound result. Two events that occur near A and B when the clocks
show the same time are simultaneous in the moving reference frame but occur
at different times for the stationary observer. This means that one moment in
a particular inertial reference frame is spread over different times in different
inertial reference frames. This destroys the concept of absolute time which
assumes a unique progression of moments for all observers.
It should again be noted that there is nothing special about the so-called “sta-
tionary observer.” He is just in a different inertial reference frame and if he
were to synchronize two distant clocks in his own reference frame then an
observer in the “moving” reference frame would think that there was a syn-
chronization error.
If the observers on board the space craft work out the distance they have
traveled, it will be 5 (v/c) light years rather than the 100 000 (v/c) light years
as observed from the Earth. The distance traveled has contracted by the same
gamma-factor as the time. This is an example of relativistic length contraction.
v v
L L
L
L′ =
γ
1
γ=
v2
1−
c2
L is the length of an object measured in its rest frame: its “proper length.”
L’ is the length of the same object when it moves past the observer at velocity v.
event in one inertial reference frame to the coordinates of the same event in
another inertial reference frame moving at constant velocity with respect to
the first.
z z’ Event coordinates:
(x, y, z, t)
x x’
The diagram above shows the coordinate systems used in two inertial refer-
ence frames. The primed frame is moving at velocity v in the positive x- and
x’-direction. At time t = 0 the origins of both coordinate systems coincide.
An event with coordinates (x, y, z, t) in the unprimed frame has coordinates
(x’, y’, z’, t’) in the primed frame. The Lorentz transformation allows us to
transform from one frame to the other.
Lorentz transformation - to transform from the primed to the unprimed
reference frame:
x′ = γ ( x − vt )
y′ = y
z′ = z
vx
t′ =γt − 2
c
x= γ ( x′ + vt′ )
y = y′
z = z′
vx′
γ t′ + 2
t=
c
These equations can be used to derive the formulae for time dilation, length
contraction, and synchronization differences.
We will use them to derive an equation for relativistic velocity addition.
z z’
dx′
u=
dt′
dx dx′ dt′ dt′ dx′ dt′
w= = γ +v = γ γ ( u + v)
+ v =
dt dt dt dt dt′ dt
dt v dx′ uv
=γ 1 + 2 =γ 1 + 2
dt′ c dt ′ c
γ
=w ( u + v)
uv
γ 1 + 2
c
( u + v)
w=
uv
1 + 2
c
If u and v are small compared to the speed of light then this reduces to the
familiar equation for velocity addition: w = (u+v). The relativistic velocity
addition equation has the interesting property that no two sub-light speeds
can be added together to produce a speed greater than the speed of light.
However, long something accelerates it will approach but never quite reach
the speed of light. Consider the limiting case where u = v = c:
( u + v) ( c + c) 2c
=
w = = = c
uv c 2
2
1 + 2 1 + 2
c c
Eqδt′
δv =
m0
1
E = mc2 = γ m0 c2 = m0 c2 + mv2 + …+ terms converging to zero
2
The second term on the right is the classical kinetic energy and the converg-
ing series of terms beyond it (which are all negligible for v << c) are the rela-
tivistic corrections to the equation for kinetic energy.
The term E = mc2 on the left-hand side of the equation must represent the
particle’s total energy when it is moving at velocity v. This becomes E = m0c2
when the particle is at rest so this is called the rest energy of the particle and
it suggests that mass and energy are equivalent in some sense.
The kinetic energy of the particle is the difference between its total energy
and its rest energy:
= mc2 − m0 c2
KE
∆E = c2 ∆m
c2 is so large (9×1016 m2s- 2) that most energy transfers in everyday life make
negligible difference to the mass. The only place where we see measurable mass
changes because of energy transfers is in nuclear physics (see section 26.1).
On the other hand, the conversion of even a tiny amount of matter would
release a huge amount of energy. The only process that converts mass com-
pletely to energy is the annihilation of matter and antimatter, for example, the
annihilation that occurs when an electron meets a positron (anti-electron).
24.6.1 Invariants
The fact that measurements of lengths and times made in different inertial
reference frames differ from one another can be disconcerting. Physicists like
to discover quantities that are the same for all observers. Such quantities are
called “invariants.”
Consider two points on a 2D surface. The location of each point can be
described by stating the coordinates of each point relative to a fixed coordinate
system consisting of two perpendicular axes. However, there is an arbitrary
choice about which set of perpendicular axes to use and the coordinates will
be different for different choices, as shown in the diagram on the next page.
While the coordinates change when the axes are rotated, the distance AB does
not. It is an invariant under rotation of the axes:
y y’
A: ( x A, yA ) ( x A’, yA’ )
y B’
B B: ( x B, yB ) ( x B’, yB’ )
yB
yA A
yA
xA xB x
x A’
x B’
x’
24.6.2 Space–Time
Space–time is a four-dimensional continuum where the fourth dimension is
related to time. Whereas a point in space is defined by three coordinates (x, y,
and z) a point in space–time, called an event, has four (x, y, z, and t). In the
SI system, distances and times are measured in different units, meters and
seconds, so instead of simply using the time in seconds in a space–time dia-
gram we multiply the time by the speed of light so that the units on the time
axis are also meters. Minkowski realized that while measurements of time
intervals and distances will differ for observers in different inertial reference
frames, the Lorentz transformation ensures that the 4D “distance” between
two events is the same for all inertial observers, regardless of their velocity.
This 4D distance is calculated by Pythagoras’s theorem but by treating the
time differences as if they are mathematically imaginary quantities, in other
words, the fourth dimension is actually the ict dimension where i is the square
root of minus one.
ict
worldline of rocket
B moving at velocity v
ictB
ictA A
xA xB x
The diagram above shows a space–time diagram. Only one spatial dimension
is shown (x). Each point on the diagram represents an event – something
( xB − xA ) + ( ictB − ict A )
2 2
AB =
Since the interval is an invariant it must be the same using the coordinates of
events A and B in any inertial reference frame, including that of the rocket.
However, in the rocket’s reference frame both A and B occur where the rocket
is, in other words at x’ = 0 but at different times t′A and t′B. We cannot assume
that the times are the same as those in the other reference frame. The space–
time diagram in the rocket’s reference frame looks like this:
ict’
ict’B B
worldline of rocket at
rest in its own ineral
reference frame
ict’A A
0
x’
The interval AB in the rocket frame is:
( ict′B − ict′A )
2
=
AB
we can equate the two expressions for the interval (because it is an invariant):
( ict′B − ict′A =
) ( xB − xA ) + ( ictB − ict A )
2 2 2
we can also express the distance traveled in the un-primed frame in terms of
the velocity:
( xB − xA ) = v ( tB − tA )
After some algebraic rearrangement, we can express the time elapsed in the
rocket frame (primed frame) in terms of the time elapsed in the rest frame
(un-primed frame)
− c2 ( t′B − t′=
A) v2 ( t B − t A ) − c 2 ( t B − t A )
2 2 2
v2 ( tB − t A )
( t′B − t′A=) 1 − ( tB − t A=
)
c2 g
This is time dilation. The time elapsed in the moving reference frame is less
than the time elapsed in the rest frame by the gamma-factor.
This illustrates how a geometrical approach in flat space–time reproduces the
standard results of special relativity.
( m0 c2 )
2
E2 − p2 c2 =
24.7 EXERCISES
m = g m0
Explain why there is a time difference between the two clocks at the
(a)
end of the experiment and say clearly which clock will be fast or slow
with respect to the other.
Calculate the expected time difference.
(b)
5. A spacecraft carrying observer A, has clocks, X and Y, at each end and is
passing an observer B at speed v.
X A Y v
Z B
25
Atomic Structure and
Radioactivity
25.1 THE NUCLEAR ATOM
The concept of an “atom” comes from the ancient Greeks and derives from
the idea of an uncuttable smallest part of each element. While we still consider
each atom to be the smallest part of an element it is certainly not “uncutta-
ble.” At the end of the 19th century J.J. Thompson showed that electrons were
small parts of all atoms and at the start of the 20th century Ernest Rutherford
probed the atom with high energy alpha particles and showed that all atoms
have a tiny, massive core or nucleus. Later, in 1919, he managed to split a
nitrogen atom. Rutherford’s work resulted in the nuclear model of the atom.
alpha
gold foil
source
A small number (about 1 in 104) scattered through angles greater than 90°
(backscattered).
Rutherford assumed that any deflections must be electrostatic in nature and
explained the results in the following way:
the atom is mainly empty space, so most alpha particles do not pass close
to any concentrated charge centers;
there is a tiny electrostatically charged central nucleus. The small propor-
tion of alpha particles that pass close to a gold nucleus will be deflected
through large angles and those that make an almost direct hit will back
scatter.
The nucleus must contain most of the mass of the atom otherwise the
nucleus would recoil strongly and the alpha deflections would be less
significant.
Rutherford used the scattering data and Coulomb’s law to calculate an upper
limit for the radius of the nucleus and found that the nuclear radius was of the
order of 10-14 m or smaller, about 104 times smaller than the atomic radius.
Later work showed that the charge on a nucleus is equal to +Ze where Z is
the atomic number, equal to the position of the atom in the Periodic Table of
elements. The forces acting on the scattering nucleus and the scattered alpha
particle are shown below.
The diagram below shows typical alpha particle trajectories close to a nucleus.
On this scale, the outside of the atom would be about 10 m away! Therefore
the vast majority of alpha particles, which pass much further away from the
nucleus than those shown in this diagram, experience weak electrostatic forces
and suffer small deflections.
The greater the impact parameter (the perpendicular distance between the
initial path of the alpha particle and the nucleus) the greater the minimum
distance between the alpha particle and the target nucleus and the smaller the
angle of deflection. An alpha particle traveling directly toward the center of
the target nucleus makes the closest approach to it and is deflected back along
its original path (deflection angle 180°).
2
=
4
+2e r
+Ze 2
=
nucleus 4
2 Ze2 2 Ze2
Eα = and d =
4 πε 0 d 4 πε 0 Eα
So, the radius of the nucleus, rn must be less than this value:
incident alpha
parcles
increasing
impact
gold nucleus parameters
closest
approach d
2 Ze2
rn <
4 πε 0 Ea
The alpha particle energy in Rutherford’s experiment was 4.7 MeV and the
atomic number of gold is 79. This gives an upper limit for the radius of the
gold nucleus:
st
1 minimum of
high-energy diffracon
electron beam paern
θ
target
The first minimum of the diffraction pattern for a spherical object is at:
1.22λ
sin θ =
D
Where D is the diameter of the target nucleus and l is the de Broglie wave-
length of the electrons.
1.22l
D=
sin θ
h
λ=
p
However, in order to obtain a wavelength comparable to nuclear dimensions
the electron must be accelerated to very high energy so relativistic equations
must be used to determine the electron momentum. In particular:
E2 + ( m 0 c 2 )
2
p2 c=
2
the energy required E is much greater than the rest energy m0c2 so the equa-
tion simplifies to:
E
p=
c
giving:
1.22 hc
D=
E sin θ
For example, 12
C represents the nucleus of carbon-12 with 6 protons and 6
6
neutrons.
Elements are determined by their chemical behavior and this depends on
the arrangement of their outer electrons. Since the electronic configuration is
determined by the charge on the nucleus atoms with the same atomic number
are chemically identical even if the number of neutrons in the nucleus varies.
Atoms with the same atomic number and different neutron numbers (and
mass numbers) are isotopes of the same element. For example:
12
6 C, 136 C, and 146 C
are all isotopes of carbon. They are chemically identical but have slightly dif-
ferent physical properties because of their differing masses. Carbon-14 is also
unstable and, in common with many neutron-rich nuclei, it decays by beta-
minus emission.
∗
There are two types of beta emission, beta-minus (electrons) and beta-plus
(positrons). Here we are describing beta-minus.
These depend on energy so vary with different sources and are given here as
∗∗
α α
γ γ
β β
E-field B-field
upwards into page
R
I=
4 πr 2
sphere at radius
r from source
source
δx
I I+δI
x
The diagram above shows part of the path of a gamma-ray beam through
matter. Then the intensity of the beam changes by an amount dI as it passes
through a short thickness dx of material. Since the beam is being absorbed dI
is negative. The proportion absorbed per unit length is constant so:
δI
= −µ
Iδx
where m is the absorption coefficient (a constant) for the medium with units m-1.
In the limit that dx ® 0 this becomes the first-order differential equation:
dI
= −µI
dx
I = I0 e−µx
Intensity falls exponentially from its initial value I0. As with all exponential
changes, it has a constant proportion property in that the intensity will always
fall by the same fraction in the same distance. We can therefore derive an
expression for the half-thickness x½ of the material, that is, the thickness of the
material that will reduce any initial intensity of the radiation by 50%.
−µx 1
1
e 2
=
2
ln 2
x1 =
2 µ
While the surface densities for aluminum and copper are similar, the range of
a 1.0 MeV beta particle is very different because the two elements have quite
different densities.
Range in aluminum:
σ 5.55
=
=R = 2.1 × 10 −3 m
ρ 2700
Range in copper:
σ 6.29
R == = 0.70 × 10 −3 m
ρ 8960
Copper is roughly three times as dense as aluminum so the range in copper is
roughly a third of the range in aluminum.
cloud
source chamber
E1
E2
The high positive charge and large mass of an alpha particle make it interact
strongly with matter so that alpha particles transfer energy quickly and have
very short ranges in anything other than gas. Even in the air they are stopped
within a few centimeters.
Nuclear
Food weapons
Nuclear industry
Medical
Radon
Cosmic rays
The pie chart below shows the origins of the typical annual radiation dose for
a UK citizen. By far the largest contribution comes from radon gas, a natural
product of the uranium decay series that seeps into the atmosphere from the
ground and can accumulate in cellars and ground floor rooms if they are not
properly ventilated.
Patients who survive high radiation doses have a significantly increased risk of
developing cancers such as leukemia in the future. Those who have not been
sterilized by the radiation also risk passing on genetic damage to their chil-
dren, resulting in stillbirths and birth defects. Approximately 50% of people
exposed to 5000 mSv (500 rem) will die.
The annual exposure limit for nuclear industry employees in the UK is 20
mSv, about double the dose from a CT scan of the spine.
during the duration of the experiment and a list of actions that will be taken
to minimize this dose. A judgment must also be made – do the benefits of
carrying out the experiment (e.g., educational or medical benefits) outweigh
the increase in the risk of damage to your or other people’s health? In a school
laboratory, sources are usually weak and well protected so that their use will
result in a negligible increase in dose compared to the natural background;
however, it is always important to be confident that this is the case.
Here are some additional procedures that should be used:
Keep the source inside its shielded container except when carrying out
the experiment
Maximize the distance between the source and the experimenters
Minimize the time of the experiment
Do not direct a collimated beam toward anyone
Always handle sources remotely, for example, using tongs
Include shielding between the experiment and the experimenters
Do not eat or drink while carrying out the experiment
Wash your hands thoroughly after the experiment
Cover any open cuts or scratches with a plaster
If you drop the source or suspect it is damaged in any way report this
immediately
dN
∝ −N
dt
dN
= −λ N
dt
N is the number of nuclei present at time t and the minus sign indicates that
the number is falling with time.
l is the decay constant and depends on the nuclide being considered. It has an
SI unit s-1 and represents the probability of decay per unit of time in the limit
of small time intervals.
This is a first-order differential equation that can be solved by the separation
of variables. Its solution represents how the number of nuclei in the sample
varies with time:
N = N0 e−λt
This is exponential decay. The term e− l t is equal to the fraction of the initial
number of nuclei remaining after time t.
The “half-life” of the nuclide is the time taken for the number of nuclei in the
sample to halve.
−λt 1
1
e 2
=
2
ln2
t1 = k
2
The graph below shows the decay curves for three nuclides, A, B, and C.
Their half-lives are shown on the time axis. A has a half-life of about 0.35 y, B
has a half-life of about 0.69 y and C has a half-life of about 1.4 y.
The fraction remaining after n half-lives is 1/2n. While the mathematical
model suggests that the sample never completely decays, the number of nuclei
remaining will eventually become so small that the model does not apply and
the random nature of radioactive decay will result in significant fluctuations in
decay rate until the last nucleus eventually decays.
The “activity” of a radioactive source is defined as the number of disintegra-
tions (decays) per second taking place inside the source. This is not the same
% of N0
100
90
80
70
60
50
40
A
30
20
B
10
0
C
0 1 2 3 4 5
me
( C) ( B) ( A)
as the count rate in a detector because many emissions will miss the detector
or not be detected and some will be absorbed before even leaving the source.
Activity = number of decays per second inside the source
dN
A= −
dt
The SI unit for activity is the Becquerel (Bq). 1 Bq = 1 decay per second.
This is rewritten in terms of the number of nuclei in the source:
dN
A= − = −λN0 e−λt = −λN
dt
A= −λN
238
92 U→ 234
90 Th + 24 α
A
Z X→ A-4
Z-2 Y + 24 α
0
−1 e or −01 β
This is a subtle process involving weak nuclear force and occurs when a neu-
tron decays to become a proton inside the nucleus. In addition, the creation of
an electron conservation of lepton number (see Section 26.4.1) demands that
an anti-neutrino is also emitted. This is an anti-particle of the neutrino, it is
neutral, has a tiny rest mass, and is like an uncharged electron. Its symbol is: 00u
The bar over the symbol indicates that this is an anti-neutrino.
14
6 C → 147 N + −01 β + 00u
A
Z X→ A
Y + −01 β + 00 υ
Z +1
1
0 n → 11 p + −01 β + 00 υ
Beta decay results in three particles moving away from the position of the
original nucleus. This can occur in a variety of ways with almost all the energy
released being shared by the electron and anti-neutrino. The way in which
this energy is shared is random so the beta-particles emitted from a source
have a continuous range of kinetic energies up to some maximum value.
This differs from alpha decay where, with only two emerging particles, the
energy is shared in a definite way and the alpha particles have a discrete
energy spectrum.
number of beta
parcles
kinec
energy
cut off
∆E =hf
60
27 Ni
15
8 O → 157 N + 0
+1 β + 00 ν
A
Z X→ A
Y+
Z −1
0
+1 β + 00 ν
1
1 p → 01 n + 0
+1 β + 00 ν
25.6.5 Electron-Capture
Electron-capture is another way that a proton-rich nucleus can decay. The
nucleus captures one of the atom’s inner electrons and combines it with a pro-
ton to create a neutron and emit a neutrino, so the net effect on the nucleus
is the same as beta-plus decay. However, the loss of an inner electron allows
other electrons to cascade down to lower energy levels inside the atom emit-
ting characteristic X-rays.
The general equation for electron-capture is:
A
Z X+ 0
−1 e→ A
Z −1 Y + 00n
1
1 p+ 0
−1 e → 01 n + 00n
Electron capture is an alternative mode of decay for nuclei that undergo beta-
plus decay.
alpha source
ion pair
+
posive ions are
aracted to the cap
and negave ions are
repelled from it
negavely
charged
electroscope
fine wire
gap of ∼ 1 mm
some of the gas molecules become ionized. The electric field between the anode
and cathode accelerates these ions and there is a small pulse of current in the
external circuit. This generates a voltage pulse that can be detected and counted.
Alpha and beta radiation enter the Geiger-Müller tube through a thin end
window but gamma-radiation can also enter through the walls of the tube.
All of the pulses are identical, regardless of the type of radiation so a Geiger
counter does not indicate the type of radiation detected. There is also a “dead
time” following each voltage pulse. During this time the detector will not reg-
ister any further ionization events. This limits the maximum count rate that
can be measured accurately.
voltage pulse
central wire
anode (+)
−
variable supply
The count rate from a Geiger counter should not be confused with the activity
of a source. The latter is the number of disintegrations per second inside the
source. The count rate on the Geiger counter is related to this but is usually
only a small fraction of the activity because of the emissions that miss the
counter, are not detected by it, or are absorbed inside the source or between
the source and the detector.
Count rates are usually recorded as counts per minute (cpm) and often have
to be corrected for the average background count where the experiment is
being carried out.
constant distance
GM tube counter or
sealed
source rate meter
1
0 n + 147 N Æ 14
6 C + 01 p
f = f0 e−λt
f f
ln ln
=t = f0 f0
−λ − ln 2
This method assumes that there has been no contamination of the sample
and that the fraction of carbon-14 in the atmosphere has remained constant
over the time period being measured. Radiocarbon dating methods are cali-
brated against other methods such as dendrochronology (tree ring counting).
f/f0 × 100%
100
90
80
70
60
50
40
30
20
10
time /y
0
573 1146 17190
of 1.3 billion years). All such methods rely on assumptions about the initial
state of the rocks, for example, that there was no argon present in a rock
when it solidified so that all of the argon now present must have come from
the decay of potassium, and about the isolation of the rock since its forma-
tion, that is, so that no potassium or argon has been added from or lost to
external sources.
25.9 EXERCISES
3. (a) Copy and complete the table below for neutral atoms.
Isotope Symbol Atomic Nucleon protons electrons neutrons
number number
Hydrogen 1
12
Carbon - 12 6 C
Carbon -13
Carbon - 14
Oxygen - 16 8
56
Iron - 26 Fe
Gold - 79 197
Uranium - 235 92
Uranium - 238
(b) Use examples from the table to explain what is meant by an “isotope.”
(c) The heavier nuclei, such as iron, gold, and uranium, have an excess of
neutrons. Suggest a reason for this.
4. A radioactive rock is tested in a school laboratory. Here are the results:
Set-up Number of counts in 5 minutes
No rock present 100
Rock alone present 900
Rock behind card 402
Rock behind 2 mm aluminum sheet 398
Rock behind 1 cm lead 123
Rock alone present 24 hours later 197
(b)
The first three nuclides in the series are: uranium-238 (atomic
number 92), thorium-234, palladium-234, and uranium-234. Write
down balanced nuclear transformation equations for the first three
decays.
(c) The half-lives of the four nuclides above are: 4.51´109 years, 24.1 days,
77s, and 2.47´107 years. Suggest how the relative abundances of each
nuclide compare.
(d) The isotope bismuth-214 (Z = 83) can decay by either alpha or beta-minus
decay, but whichever decay it undergoes the series reaches lead-210
(atomic number 82) in two steps. Write down the nuclear transforma-
tion equations to show how this occurs by each route. (The element with
Z = 81 is thallium and the element with Z = 84 is polonium).
11. An experiment was carried out using a Geiger counter to monitor the
activity of a radioactive source. The table below gives the average counts
per minute, corrected for background, recorded during the experiment.
Time, s Activity, 1013 s-1
0 1.64
50 1.36
100 1.13
150 0.935
200 0.775
250 0.643
300 0.533
50 0.422
400 0.366
450 0.304
500 0.252
26
Nuclear Physics
energy B.E. of the nucleus is equal to the mass defect multiplied by the speed
of light squared.
repulsion
3 1015 m
attraction
m Zm p Nm n Mnucleus
Particle Mass, kg
electron 9.1094×10−31
proton 1.6726×10−27
neutron 1.6750×10−27
Oxygen-16 atom 26.5676×10−27
Atom Mass
4
2 He 4.002603 u
234
90 Th 234.04364 u
238
92 U 238.05082 u
The mass defect for the reaction is the difference between the nuclear masses
on each side of the equation. To calculate the nuclear masses we need to sub-
tract 92me from the left-hand side and (90 +2)me from the right-hand side.
These electron masses cancel so we can work directly with the atomic masses:
∆m = 238.05082 u − (234.04364 + 4.002603) u = 0.00458 u
The energy released is: E = 932.9 × 0.00458 = 4.27 MeV
This is shared between the alpha particle and the recoiling nucleus and linear
momentum must be conserved so (in the reference frame of the original ura-
nium-238 atom) the two must travel in opposite directions and:
mnucleus vnucleus m v
so that:
m 234
v nucleus vnucleus vnucleus
m 4
The alpha particle travels much faster than the recoiling nucleus and carries
away most of the kinetic energy.
2
1 1 m m 234
E m v 2 m nucleus vnucleus 2 nucleus Enucleus Enucleus
2 2 m m 4
The alpha particle gets 234/238 E.
Beta-minus decay of carbon-14
14
6 C 147 N 01 00
Atom Mass
0
1 0.000549 u
14
6 C 14.003242 u
14
7 N 14.003074 u
The mass defect for the reaction is the difference between the total rest masses
on each side of the equation. To calculate the nuclear masses of carbon-14 and
nitrogen-14 we must subtract 6 me from the atomic mass of carbon-14 and 7
me from the atomic mass of nitrogen-14.
Mass defect = (14.003242 − 6 × 0.000549) u − ((14.003074 − 7 × 0.000549)
+ 0.000549) u
When the beta-minus is included, the electron masses cancel:
∆m = (14.003242 − 14.003074) u = 0.000168 u
The energy released is: E = 932.9 × 0.000168 = 157 keV
This energy is shared between the recoiling nitrogen-14 nucleus, the beta-
particle, and the neutrino. The mass of the neutrino and beta-particle is neg-
ligible compared to the mass of the nucleus so virtually all the energy goes
to the two light particles. However, this energy is shared randomly between
them so beta-particles from a carbon-14 source are emitted with a continuous
range of kinetic energies up to a cut-off value of 157 keV.
nuclei with specific decay modes that result in product nuclei lying closer to
the band of stability.
band of stability
neutron number:
N = A Z
alpha emitters
beta-minus
emitters
beta-plus emitters
and electron capture
proton number: Z
Beta-minus emitters: N/Z too high - neutron rich nuclei that approach
stability by converting a neutron to a proton.
Beta-plus emitters and electron capture nuclei: N/Z too low – proton-rich
nuclei that approach stability by converting a proton to a neutron.
Alpha emitters: N/Z too low – heavy proton-rich nuclei that approach sta-
bility by reducing both N and Z by 2. Since N>Z this reduces the ratio of
N to Z.
Nuclear fission: some heavy nuclei close to the top of the band can
undergo induced or spontaneous nuclear fission to create pairs of neu-
tron-rich daughter nuclei which lie about half-way down the band and are
beta-minus emitters.
The diagram below shows the effect of alpha and beta decays on a plot of
proton number against neutron number:
neutron number
Z 2 Z 1 Z Z+ 1 Z+ 2
N+ 2
N+ 1
+
ec
N
N1
N2
proton number
pr
8.0
6.0
4.0
2.0
Nucleon number
0.0 50 100 150 250
200
It is better to compare binding energy per nucleon rather than total nuclear
binding energy because the latter depends on the number of nucleons in the
nucleus so that a large value does not necessarily mean that the nucleus is
particularly stable.
Binding energy per nucleon increases rapidly with nucleon number for
light nuclei.
The curve has a peak value that occurs for iron-56. This is the most stable
nuclide.
For nuclides heavier than iron-56 the binding energy per nucleon gradu-
ally falls.
Most nuclides (from Oxygen to Uranium) have a binding energy per
nucleon between 7.5 and 8.5 MeV/nucleon.
Some light nuclides, such as helium-4, carbon-12 and oxygen-16 have
particularly large binding energy per nucleon compared to other nearby
nuclides.
‘daughter
g nuclei’
B.E./A
nuclear fission
nuclear fusion
The initial steepness of the curve shows that nuclear fusion releases more
energy per kilogram of fuel than nuclear fission. However, the high nucleon
number of the fissioning nucleus shows that nuclear fission releases more
energy per reaction than nuclear fusion.
26.3.1 Nuclear Fission
Nuclear fission was discovered by Otto Hahn and Lise Meitner in 1938. They
were bombarding uranium with neutrons and noticed that lighter nuclei with
mass numbers approximately half that of uranium were being formed. They
came to the correct conclusion that the neutrons had induced fission reactions
in nuclei of uranium. However, natural uranium consists almost entirely of
two isotopes, uranium-238 (99.3%) and uranium-235 (0.7%). The fissionable
isotope is uranium-235 while uranium-238 is more likely to absorb neutrons
than fission.
There are many ways that the uranium-235 nucleus can split when it absorbs a
neutron. Here is one nuclear equation for the induced fission of uranium-235:
1
0 n 235
92 U 144
56 Ba 36 Kr 2 0 n
90 1
Atom Mass
1
0 n 1.009 u
235
92 U 235.044 u
144
56 Ba 143.923 u
90
36 Kr 89.920 u
We need to work with nuclear masses, but since there are equal numbers of
electrons to subtract from each side of the equation we can, once again, sim-
ply use the atomic masses:
Mass defect = (235.044 + 1.009) u − (143.923 + 89.920 + 2×1.009) u = 0.192 u
The energy released is: E = 932.9 × 0.192 = 179 MeV
The fact that additional neutrons are emitted could lead to further nuclear
fission reactions. The Hungarian physicist Leo Szilard realized that it would
be possible to initiate a chain reaction if more than one neutron per fission on
average went on to cause further fission reactions.
incident neutron
Uranium-235 nucleus
undergoing fission
One way around this problem is to use enriched uranium, that is, uranium
with a higher content of U-235, but this is difficult to obtain in large quanti-
ties because U-235 and U-238 are isotopes of the same chemical element and
so have the same chemical properties. Various separation techniques have
been used but the most successful involves centrifuging uranium hexafluor-
ide, a gaseous uranium compound, to increase the concentration of U-235.
Since this method can be used to produce fuel for nuclear reactors (about
4% enrichment) and for weapons (about 80% enrichment) it is very difficult
to distinguish between the peaceful production of enriched uranium and its
production for atomic bombs.
conventional
explosive
(detonator)
shaped explosives
sub-critical assembly of
plutonium-239
compressed, critical
assembly of
plutonium-239
While atom bombs are incredibly powerful weapons they release far less
energy than a hydrogen bomb, which is based on nuclear fusion reactions.
containment
control rods
steam to
pressure vessel
turbo-
generator
fuel rods
core
cool water
condensed
heat
from steam
light water
moderator
steam
pump
generator
There are two cooling circuits. In the primary circuit, water is pumped up
through the core and then returns to the core via a heat exchanger. In the heat
exchanger energy is transferred to water in the secondary circuit. This gener-
ates steam that is used to drive a turbogenerator to generate electricity. The
steam is then condensed and returned to the heat exchanger. Cold water from
a lake or river is needed to operate the condenser and large cooling towers are
used to cool this water once it has returned from the condenser.
26.3.4 Plutonium
Plutonium is a fissile material that can be used in bombs and reactors. However,
it does not occur naturally on Earth in any significant quantities but it is cre-
ated as a by-product of nuclear fission reactions in a reactor core. Neutrons
that are absorbed by uranium-238 create an unstable and short-lived nuclide,
uranium-239 that undergoes a beta-minus decay to form neptunium-239.
This is also unstable with a relatively short half-life and it undergoes a second
beta-minus decay to form plutonium-239:
1
0 n 238
92 U 239
92 U
239
92 U 239
93 Np 01 00
239
93 Np 239
94 Pu 01 00
Plutonium can be harvested from spent fuel rods. This is called “reprocessing.”
2
1 H 31 H 42 He 01 n
Atom Mass
1
0 n 1.008664 u
2
1 H 2.014102 u
3
1 H 3.0160492 u
4
2 He 4.002603 u
We need to work with nuclear masses, but since there are equal numbers of
electrons to subtract from each side of the equation we can, once again, sim-
ply use the atomic masses.
Mass defect = (2.014102 + 3.0160492) u − (4.002603 + 1.008664) u = 0.0189 u
The energy released is: E = 932.9 × 0.0189 = 17.6 MeV
This is about 3.5 MeV/nucleon compared to about 0.76 MeV/nucleon from
nuclear fission (combustion releases less than 1 eV per nucleon!)
For nuclear fusion to take place the reacting nuclei must come close enough
(a few times 10−15 m) for the short range strong nuclear force to bind them
together. However, all nuclei are positively charged and repel one another.
To approach close enough for fusion to take place they must have a very
large kinetic energy. This can be achieved by accelerating the nuclei and
then crashing them together in a device such as the Large Hadron Collider
(LHC) at CERN or by confining the reactants and heating them to extreme
temperatures.
Three situations that involve nuclear fusion reactions are:
Nucleosynthesis – the formation of heavy nuclei from light nuclei in the
cores of stars.
Thermonuclear weapons – the nuclear fusion of isotopes of hydrogen
in a bomb.
Fusion reactors – commercial reactors designed to produce electrical
energy from nuclear fusion.
26.3.6 Nucleosynthesis
Soon after the Big Bang the early Universe consisted mainly of hydro-
gen with some helium and trace amounts of other nuclei. Nuclei of all the
heavier elements were formed (and are still being formed) by nuclear fusion
As fuel for the fusion reactions in the core runs out gravitational forces cause
the core to collapse. What happens next depends on the mass of the star
(see Section 28.1.1) but when stars of mass greater than about 10 times the
mass of the Sun collapse, they undergo a sequence of fusion reactions and
create all the nuclides up to iron-56 and then explode in a supernova. Some of
the energy released in the explosion creates heavier nuclei up to uranium and
the explosion distributes them throughout space.
Our Sun is a medium-sized star and will spend almost all of its life s ynthesizing
helium from the hydrogen in its core. The net effect is to convert four protons
into a helium nucleus but the probability of this happening in one step by a
fortunate collision of four particles with enough energy to get close enough
to fuse is effectively zero. The main process by which helium is created in the
Sun is called the proton-proton cycle which proceeds in three steps.
Step 1: two protons collide to form a deuteron, a positron (anti-electron),
and a neutrino:
1
1 H 11 H 21 H 10 e 00
Step 2 (twice): a proton collides with a deuteron to form a nucleus of
helium-3 and emit a gamma-ray:
1
1 H 21 H 32 He 00
Step 3: two helium-3 nuclei collide to form a helium-4 nucleus and
release two protons:
3
2 He 32 He 42 He 2 11 H
This process releases about 26 MeV per helium-4 nucleus produced. The
overall reaction for the proton-proton cycle is then:
4 11 H 42 He 2 10 e 2 00
The two positrons created in the core almost immediately annihilate with
electrons creating high energy gamma-rays that contribute to the outward
radiation pressure that supports the star. The neutrinos are very weakly inter-
acting and pass through the outer layers of the Sun and into space. The flux
of solar neutrinos detected on Earth gives astronomers a way to monitor the
fusion processes going on in the Sun’s core. A different sequence of fusion
reactions (the CNO cycle) creates about 10% of the Sun’s energy and is domi-
nant in stars above about 1.3 solar masses. Astronomers estimate that there
is enough hydrogen left in the core for the Sun, which was formed about 5
billion years ago, to continue to shine for another 5 billion years.
shaped explosives
The blanket would also heat up and a suitable coolant (water) could be pumped
through the lithium blanket and used to raise steam to drive turbo-generators.
physics is called the Standard Model. Here we will simply describe the main
characteristics of each type of particle and the structure of the model.
26.4.1 Leptons
The anti-neutrino, emitted in beta-minus decay, and the neutrino, emitted in
beta-plus decay, are closely related to the electron and positron that are also
emitted in those decays. These are all “leptons,” particles that interact by the
weak nuclear force. Surprisingly it turns out that there are also heavier ver-
sions of these particles so that there are three generations of leptons:
The muon and tau are effectively more massive versions of the electron and
thy tend to undergo decays that eventually produce electrons. The neutrinos
are all neutral and have very low rest mass. Beams of neutrinos of any one type
begin to oscillate between the different “flavors” of neutrino so that soon the
beam contains equal numbers of electron-, muon-, and tau-neutrinos. The
discovery that this occurs solved the so-called “solar neutrino problem” where
only about 1/3 of the electron-neutrinos emitted by nuclear fusion reactions in
the Sun were detected here on Earth. The other 2/3 had oscillated to muons
or taus en route.
e exchange of e
virtual photons
The underlying process that explains how these three quantum forces work is
based on the “exchange of virtual particles.” A virtual particle can be created
by “borrowing” energy from the Universe for a short time and then “paying
it back” when the particle disappears. This is possible because of the energy-
time Uncertainty principle in quantum theory. For example, the electromag-
netic repulsion between two electrons comes about as a result of an exchange
of virtual photons.
The exchange particles for electromagnetism are photons, for the weak force
they are W+, W−, and Z0 particles and for the strong force they are differ-
ent kinds of gluons. Richard Feynman developed a pictorial way to represent
interactions. His method was useful because it provided a link to the mathe-
matical methods needed to solve problems in quantum electrodynamics. The
diagrams are known as “Feynman diagrams” and the diagram below shows
two of the many ways a pair of electrons might interact:
e e e e
exchange of a single photon exchange of two photons
The table below summarizes all the particles in the Standard Model.
Quarks Force
carriers
First generation Second generation Third generation
Leptons
Higgs boson
Charge 0
Spin 0
The fact that there are three generations of quarks and three generations of
leptons suggest that there is an underlying symmetry linking the quarks and
the leptons. This has led to several hypotheses about new particles and mech-
anisms for changing leptons to quarks and vice versa, but so far there has been
no experimental evidence to support these ideas. While the Standard Model
is incredibly impressive, it is unlikely to be the last word on particle physics;
there are too many arbitrary constants that have to be put into the model to
make it work.
ordinary
matter
5%
dark matter
23%
dark energy
72%
26.5 EXERCISES
(b) Sketch a graph to show how nuclear binding energy per nucleon var-
ies with nucleon number and use it to explain:
(i) Why iron-56 is regarded as the most stable nucleus.
(ii) How nuclear fission reaction of some heavy nuclei can release a
large amount of energy.
(iii) How nuclear fusion of some light nuclei can release a large
amount of energy.
(c) Suggest why a nucleus of an element with atomic number 120 is likely
to be highly unstable.
2. Use data from the table below to calculate the binding energy and binding
energy per nucleon for iron-56 (Z = 26).
Particle Mass
electron 0.000549 u
proton 1.007276 u
neutron 1.008665 u
iron-56 atom 55.934934 u
Particle Mass
0
1 0.000549 u
16
6 C (atom) 16.01470 u
16
7 N (atom) 16.006103 u
Atom Mass
Electron 0.000549 u
12 12.01864 u
7 N
13 13.005738u
7 N
14 14.003074 u
7 N
15 15.000108 u
7 N
16 16.006103 u
7 N
Atom Mass
17 17.00845 u
7 N
18 18.0142 u
7 N
13 13.003354 u
6 C
13 13.0248 u
8O
Proton 1.007276 u
Neutron 1.008665 u
27
Quantum Theory
black-body radiaon
spectrum
wavelength
The shape of the black body radiation spectrum was well known, but all
attempts to use classical physics to derive a formula for the spectrum failed.
In fact, they agreed with the spectrum at long wavelengths (low frequencies)
but diverged drastically from it at short wavelengths (high frequencies), pre-
dicting an infinite amount of high-frequency radiation from a hot body. This
was clearly wrong. It was called the “ultraviolet catastrophe.”
incident EM
radiaon of ejected electron
sufficiently high (‘photoelectron’)
frequency
−
metal surface
E = nhf
energy
n=4
n=3
n=2
n=1
This makes it far less likely that a high-frequency oscillator will be excited
because it needs a large energy to start vibrating. The effect is to suppress
high-frequency vibrations and dramatically reduce the high-frequency
E = hf
hf ≥ Φ
hf 0 = Φ
This was another radical departure from classical physics where the light was
considered part of the electromagnetic spectrum and was assumed to be a
continuous wave that can take any energy. Einstein’s theory treated electro-
magnetic radiation as if it consisted of discrete packets of energy and trans-
ferred that energy discretely too. The photon theory treated light more like a
particle model than a wave model.
F < f0 f > f0
cap: metal with work
−−−−−− −−−−−−
funcon Φ
−−−
−−−
electroscope is
discharged
If the metal is zinc, then visible light does not discharge the electroscope
even if it is very intense but ultraviolet radiation will discharge it even if
its intensity is low.
Increasing the intensity of the ultraviolet radiation discharges the
electroscope more rapidly.
When ultraviolet radiation is used, the leaf begins to fall as soon as it is
illuminated: there is no delay.
KEmax= hf − Φ
= hf − hf 0
consists of a metal emitter and a collector. When light strikes the emitter
electrons are ejected and travel across to the collector. If the photocell is
connected to a circuit it provides an emf that can transfer energy to other
components.
− +
collector emier
The circuit shown on the next page can be used to find the maximum kinetic
energy of the emitted electrons by applying an opposing voltage to the
cell and increasing this until the current in the circuit is reduced to zero.
Measurements of this “stopping voltage” for incident light with a range of
different frequencies can be used to find the Planck constant and the work
function of the emitter.
photocurrent
By making the emitter positive with respect to the collector electrons must do
work to move between them. When the voltage between the two electrodes
is V the work that must be done is eV. Increasing V increases the work that
must be done and the photocurrent falls to zero when the work is equal to
the maximum kinetic energy. The voltage at which this occurs is called the
stopping voltage VS:
eV=
S = hf − Φ
KEmax
If the stopping voltage is measured for light sources with a range of frequencies
(e.g., by using different colored filters in front of a white light source) then a
graph of VS against f can be used to find both the Planck constant and the work
function.
h Φ
=VS f −
e e
the gradient is h/e, the intercept on the voltage axis is – F/e, and the intercept
on the frequency axis is f0:
0
f0 frequency
− Φ/e
Robert Millikan used a similar method to make the first measurement of the
Planck constant in 1916.
Photon energy: E = hf
E
Mass equivalent to this energy: m =
c2
E hf h
Momentum associated with this mass: p= mc= = =
c c λ
h
p=
λ
This is the de Broglie relation, the link between wavelength and momentum.
While we have derived it for the photon, de Broglie’s hypothesis was that it also
applied to matter. This would imply that a moving particle has an a ssociated
wavelength given by:
h
λ=
p
The faster the particle moves the shorter its de Broglie wavelength.
6.6 × 10 −34
sin θ = ~ 10 −35
70 × 1
This is unbelievably tiny so the diffraction pattern would effectively have zero
width and so would not affect the classical expectation that the person contin-
ues through the doorway with no diffraction at all.
However, if an electron is traveling at 5% of the speed of light its de Broglie
wavelength is about 5 × 10- 11 m. This is comparable to the spacing of atoms
in a crystal lattice so we might expect to detect interference and diffraction
effects when electron beams accelerated to comparable speeds are directed at
crystals. This would only require an accelerating voltage of about 600 V.
The first physicists to detect electron diffraction were Davisson and Germer
in 1925 when they fired electron beams at a crystalline sample of nickel. They
noticed that the electrons were scattered into a pattern of maxima and minima
similar to the one that would be obtained if X-rays were passed through the
structure. Furthermore, the wavelength of X-rays needed to obtain the same
pattern corresponded to the de Broglie wavelength of the electrons.
At almost the same time G.P. Thomson, the son of J.J. Thomson (who had
shown that the electron can be treated as a particle), produced electron dif-
fraction rings by passing a beam of electrons through a thin slice of graphite.
The experimental arrangement is shown below.
Graphite consists of planes of atoms arranged in a hexagonal pattern. Rows of
atoms within these planes act like the lines in a diffraction grating. For planes
separated by a distance d, there will be diffraction maxima at angles given by:
nλ
sin θ =
d
accelerang
voltage V Vacuum
electrons diffracon
paern
Heater (rings)
supply (low
voltage) hot cathode anode graphite
target
fluorescent
screen
The fact that there are many such planes in all orientations relative to the
incident beam results in diffraction maxima being formed around the surface
of a cone with a half angle q for each set of planes and each order of d
iffraction.
Where these cones of diffracted electrons hit the end of the vacuum tube they
form a pattern of concentric rings.
planes of atoms d
acng like lines in a
diffracon grang
atomic arrangement
in graphite
The radius of diffraction rings can be used to find the spacing of atomic planes
in the crystalline structure. Changing the accelerating voltage changes the
wavelength of the electrons and the radius of each ring. Higher voltage gives
the electrons greater momentum and a smaller de Broglie wavelength so the
rings get smaller.
r
electron
beam θ
L
graphite
λ h h
sin θ = = =
d dmv d 2 meV
r
For small angles, sin θ ~ θ ~ so that:
L
hL
r~
d 2 meV
where d is the spacing of a particular set of atomic planes and V is the acceler-
ating voltage for the electrons. The radius of a diffraction ring is approximately
proportional to the reciprocal of the square root of the accelerating voltage.
incident photon
of wavelength λ
Scaered electron
with momentum p
− φ
−
θ
scaered photon of
wavelength λ’>λ
This was verified experimentally and showed once again that the particle
model must be used for some interactions between radiation and matter.
NA photons
boom slit from slit A
covered
P
B
LASER
A
single
slit diffracon
paern
single screen
slit B
NB photons
top slit from slit B
covered P
B
LASER
A
single
slit
diffracon
single screen paern
slit A
Zero
photons
both slits
open
P
B
LASER
A
single
slit interference
paern
double slit A and B screen
One, unlikely, possibility is that when both slits are open photons from A
and B somehow interact on their way to the screen and this prevents them
reaching P. To rule this out the experiment has been repeated using a filter
in front of the source to reduce the intensity so far that only one photon at a
time is interacting with the apparatus. Now photons arrive one at a time at the
screen and cannot interact with one another en route. What happens?
At first, the photons seem to be arriving completely randomly but after a short
while, it becomes apparent that the same patterns as before are produced
with single or double slits. If we insist that the photons behave like particles
then each photon can only pass through one of the slits. If it passes through
slit A when B is closed it can reach P. If it passes through slit B when A is
closed it can also reach P. However, if it passes through slit A when slit B is
open it cannot reach P! Similarly, if it passes through slit B when A is open it
cannot reach P! To maintain the particle model we would need to assume that
when the photon passes through one of the slits its future path is affected by
the state of the slit through which it did not pass.
LASER
filter single
slit
double screen
slit
interference paern
builds up gradually
A single photon passing through a double slit apparatus cannot reach any of
the minimum positions, even though it could reach all of them if it passed
through either slit when the other one is closed! This shows that photons do
not interfere with each other but every photon interferes with itself. Where
does this leave us?
The wave model can be used to explain the intensity distribution in super-
position effects.
The particle model can be used to explain the discrete emission and
absorption of radiation.
A similar experiment can be carried out with electron beams. The results are
exactly the same. Wave-particle duality affects matter and radiation in the
same way.
predictions about the future state of the system (e.g., where the electron
will hit the screen). Quantum theory is “indeterministic – the future of
the Universe is not uniquely determined by its present state. This is a
complete departure from the determinism of classical physics.
∂2 y 1 ∂2 y
=
∂x2 v2 ∂t 2
where y is the wave disturbance and v is the speed of the wave in x-direction.
A solution to this is:
=y A cos ( ωt − kx )
∂2 E ∂2E
=ε µ
0 0
∂x2 ∂t 2
∂2 B ∂2 E
=ε µ
0 0
∂x2 ∂t 2
1
c=
ε0µ0
ih ∂Ψ −h2 ∂ 2 Ψ
= + Vψ
2 π ∂t 8 π2 m ∂x2
where y is the wavefunction, i is the square root of minus one, and m is the
mass of the electron. V is the potential energy that might vary with position
and time. If the electron is moving freely in space then V = 0.
It is possible to find wave-like solutions to this equation. These represent the
de Broglie waves of the electron. Once Schrödinger had published his equa-
tion physicists could use it to solve a vast range of problems, and Schrödinger
himself showed how it could explain the energy level structure and spectrum
of the hydrogen atom. In Schrödinger’s atomic model the electron orbitals are
three-dimensional standing wave solutions to the equation.
According to the Copenhagen Interpretation, the square of the magnitude
of the wavefunction at each point in space is equal to the probability per unit
volume of finding the electron at that point. Since the wavefunction is itself a
complex quantity it is not directly observable, and its magnitude is found by
multiplying it by its own complex conjugate:
if this is integrated over all of space it must equal 1: the electron will be found
somewhere in the Universe.
acceleration. Orbiting electrons ought to radiate energy and fall rapidly into
the nucleus. Classical physics leads us to the conclusion that Rutherford’s
nuclear atom is unstable.
It was also known that excited atoms emit a line spectrum, that is, the radiation
emitted by isolated atoms contains a set of discrete frequencies. If the photon
model is considered this suggests that the emission of radiation involves dis-
crete energy jumps DE = hf for each spectral line. This further suggests that
the electrons inside an atom cannot have any value of energy but can only
exist at certain discrete energy levels.
electron
−
rn
F mvn
+
proton
The diagram above shows an electron in the nth orbit inside the hydrogen
atom. The dotted lines indicate other quantized orbits. The energy of the nth
orbit is derived below.
Angular momentum:
nh
mvn rn =
2π
Centripetal force:
mvn2 e2
=
rn 4pe 0 rn2
Kinetic energy:
1 e2
=
KE =mvn2
2 8 πε0 rn2
Potential energy:
e2
PE = −
4 πε0 rn
Energy:
1 e2 e2
En =
KE + PE =mvn2 − =
−
2 4 πε0 rn 8 πε0 rn
Bohr used these equations to eliminate rn from the energy equation and to
express the energy of an electron in the nth level in terms of the quantum
number n.
− me 4 1 −13.6 eV
=En =
8e 0 2 h2 n2 n2
The diagram below is an energy level diagram for the hydrogen atom based
on the Bohr model.
Energy
13.6
− = −0.85 eV 0 n=∞
4 n=4
13.6 n=3
− = −1.5 eV
3 n=2
13.6
− = −3.4 eV
2
13.6
− = −13.6 eV n=1
1
As n increases the energy levels get closer and closer together becoming a
continuum of states as n approaches infinity. The energy needed to remove
an electron from the ground state of the hydrogen atom (its first ionization
energy) is 13.6 eV, and the energy needed to move from n = 1 to n = ∞ where
the energy would be zero (a free electron).
Rydberg generalized this formula for all the series in the hydrogen spectrum:
1 1 1
= R 2 − 2
λn m n
Energy
13.6 0 n=∞
− = −0.85 eV n=4
4 n=3
13.6
− = −3.4 eV n=2
2
13.6
− = −13.6 eV n=1
1
− me4 1 − me 1
4
me4 1 1
∆E24 = E4 − E2 = − 2 2 2 = 2 2 2 − 2
8ε 0 2 h 2 4 2 8ε 0 h 2 8ε 0 h 2 4
3 me4
=
128ε0 2 h2
3 me4
hf24 =
128ε0 2 h2
3 me4
f24 =
128ε0 2 h3
c 128 cε0 2 h3
λ 24 = = 4
= 4.9 × 10 −7 m
f24 3 me
The Rydberg formula is derived in a similar way but using a transition from n
to m.
1 me4 1 1
= 2 − 2
λ mn 8 c0 2 h3 n m
h
2 πrn =n
mvn
The left-hand side of this equation is equal to the circumference of the cir-
cular orbit. The right-hand side of the equation is an integer number of de
Broglie waves. Bohr’s quantum condition is equivalent to saying that energy
levels can only occur when a circumference is an integer number of electron
wavelengths. This guarantees only a discrete set of energy levels in the atom
in a similar way that a stretched string can vibrate in only a discrete set of fre-
quencies (the fundamental and harmonics).
The energy levels of an atom correspond to standing wave patterns for the de
Broglie electron waves. The lowest or ground state is when the circumference
of the orbit corresponds to a single electron wavelength.
e2
PE =
4 πε 0 r
Then he needed to solve the equation in spherical polar coordinates (r, θ, φ).
These coordinates are used because of the symmetry of the problem – solving
1s 2s 2p
Wolfgang Pauli realized that no two electrons in the same atom can have the
same set of quantum numbers. This is an example of the “Pauli Exclusion
Principle” and the consequence is incredibly important. If it were not the
case then all the electrons in a multi-electron atom would fall into the lowest
energy level and different elements would have very similar chemical prop-
erties. The Exclusion principle prevents this and ensures that the electrons
must fill up each energy level in turn. In a very general sense this is what gives
us chemistry. It certainly accounts for the Periodic Table. For example:
When n = 1: l = 0, m = 0 and s = ±1. Two electrons can occupy these states.
These are the 1s states in the atom and correspond to a spherically symmetric
wavefunction. Hydrogen, with one electron has a half-filled 1s shell. This can
be represented by:
1s1
The first number is the energy level or principal quantum number, the letter
refers to the type of orbital (related to the value of l) and the superscript is the
number of electrons in the shell.
This accounts for its reactivity – it can complete the shell by reacting with
other elements with one or more outer electrons.
Helium, has two electrons, so the 1s shell is full and the atom is very stable.
Its electronic configuration is:
1s2
Lithium has three electrons so the n = 1, l = 0, m = 0 and s = ±1, states are
filled by two electrons so the 1s shell is full and the third electron has to go
into one of the eight available states with n = 2 (the 2s shell). The lowest
energy state is the n = 2, l = 0, m = 0, s = 1 (or -1) state. This leaves the 2s shell
partially filled and lithium is again reactive. Its electronic configuration is:
1 s2 2 s1
This filling of energy shells continues as we move to larger and larger atoms.
The periodicity of the periodic Table comes about as successive shells fill up
and electrons start to fill the next shell.
The fifth element in the Periodic table is Boron. This must accommodate five
electrons so both the 1s and 2s shells are full and the next electron must have
an orbital quantum number l = +1. This is called a p-shell and has a different
shape to the spherical s-shells. There are three p-shells (m = − 1, 0, and +1)
in different orientations, and each is shaped like a dumbbell. Two electrons
can go into each shell (with s = +1 or s = −1). The electronic configuration for
Boron is:
1 s2 2 s2 2 p1
The table on the next page shows the order of the first few electronic energy
levels and the corresponding electronic configurations. Just a few examples
will show how the electronic configuration, determined by solutions to the
Schrödinger equation, affects chemical properties.
Atoms with filled shells tend to be very stable – for example, helium and
neon.
Atoms with a single electron in an otherwise empty shell tend to be very
reactive because they easily lose that electron to an atom that needs one
or more electrons to complete its own outer shell – for example, sodium.
Atoms with an almost full shell are also very reactive, easily gaining elec-
trons from other atoms that have only one or a few electrons in their outer
shell – for example, fluorine.
Carbon has 2 electrons in the 2p shell, so to complete this shell carbon
must gain four electrons. This makes it “4-valent.” Carbohydrates (com-
pounds of carbon and hydrogen) are the most important class of com-
pounds for living things and illustrate how valency is linked to electronic
configuration. In methane, CH4, each hydrogen atom shares one electron
so that the carbon 2p shell is completed and the hydrogen 1s shell is com-
pleted, so both achieve more stable (lower energy configurations) and
four covalent bonds are formed.
−1 1s2 Helium
2s shell 2 0 0 +1 1 s2 2 s1 Lithium
−1 1 s2 2 s2 Berylium
2p shell 1 −1 +1 1 s2 2 s2 2 p1 Boron
−1 1 s2 2 s2 2 p2 Carbon
0 +1 1 s2 2 s2 2 p3 Nitrogen
−1 1 s 2 2 s 2 2 p4 Oxygen
+1 +1 1 s 2 2 s 2 2 p5 Fluorine
−1 1 s2 2 s2 2 p6 Neon
3s shell 3 0 0 +1 1 s2 2 s2 2 p6 3 s1 Sodium
−1 1 s2 2 s2 2 p6 3 s2 Magnesium
Etc.
This gives a very simple explanation of key chemical properties. More detailed
analysis of chemical bonding requires a solution of the Schrödinger equation
for electrons in the field of both atoms. The Schrödinger equation is the fun-
damental equation in chemistry.
The diagram below gives a simplified explanation of the double slit experi-
ment using the Copenhagen Interpretation.
Prior to observation the photon could be anywhere within the probability dis-
tribution. It is said to be in a “superposition of states.” After the observation,
the photon has arrived at a particular place on the screen.
Here is a summary of the main features of the Copenhagen Interpretation.
The wavefunction y (a solution of the Schrödinger equation) gives the
most complete description of a quantum state (this is unobservable).
Schrödinger’s Equation governs the behavior of the wavefunction and its
interaction with the apparatus.
The evolution of the wavefunction is continuous and deterministic (i.e.,
if we know its initial state we can calculate, with certainty, its future state.
|y|2 gives probability of finding photon at each point in space (Born’s sta-
tistical interpretation).
Before an observation or measurement photons are in a superposition of
states.
Observation or measurement results in the discontinuous collapse of the
wavefunction.
We can only make statistical predictions about the outcome of wavefunc-
tion collapse so the quantum theory is indeterministic.
When we use high-intensity light (large numbers of photons) the quantum
distribution is identical to that expected from classical physics. This is not
really surprising, if there is a large amount of energy the discrete nature
of individual quanta is hard to observe. The idea that quantum predictions
merge smoothly into classical predictions at high energies is called the cor-
respondence principle.
If the hole diameter is Dx the angular spread of the first diffraction maximum
(which is where most electrons will be found after passing through the hole)
is given by:
1.22λ
sin θ = ±
∆x
For small angles this is approximately:
λ
θ~±
∆x
The de Broglie wavelength links wavelength to momentum so:
h
λ=
p
h
θ~±
p∆x
where p is the original momentum of the electrons (perpendicular to x). After
the electron passes through the hole its momentum also has an x-component
Dp perpendicular to its original momentum:
∆p
p
h
∆p ~
∆x
∆ p∆ x ~ h
While this is a rough derivation, it does suggest that the uncertainty in x-posi-
tion and the uncertainty in x-momentum are inversely proportional and that
their product is of the order of the Planck constant. The more precisely we
measure the position of the electron (smaller hole) the greater the uncer-
tainty in its momentum (greater spread). But this is not just about our ability
to make precise measurements. The wavefunction contains complete infor-
mation about the electron, so the properties of position and momentum are
indeterminate – the electron does not possess independent properties of posi-
tion and momentum at each moment.
This discovery, made by Werner Heisenberg in 1927, is one of the defining
features of quantum theory and is known as the uncertainty principle or inde-
terminacy principle. Its more formal statement is in terms of an inequality:
h
∆x∆p ≥
4π
If particles do not possess defined properties such as position and momentum
then it is impossible to predict the future in detail based on the total informa-
tion about the state of the present. This makes quantum theory “indetermi-
nate” and leaves the future open. It also undermines our classical ideas about
the nature of reality – how can an electron be regarded as a real particle if it
does not possess definite values for momentum and position?
arrive at any point on the screen. The phase can be worked out if we know
the wavelength associated with the photon (e.g., a path of length 72.5 wave-
lengths introduces a phase difference p). The next step is to add up the pha-
sors at each point on the screen. The square of the resultant phasor amplitude
is proportional to the probability that the photon is found at that point. The
diagram below shows three phasors for three of the many possible routes a
photon could take from one point, A, to another, B.
possible path 1
phasor:
Point B path 1
possible path 2
phasor:
phasor at path 2
point A sum of
phasor::
possible path 3 path 3 phasors
path via B
B
Path B is longer than path A so the phasor has rotated more times and there
is a phase difference between phasors via A and via B.
via B
via A
resultant
The length of the resultant will vary as the detector is moved to different
positions. If the two phasors arrive in phase they reinforce so the probabil-
ity at that point is a maximum value. At positions where the phasors arrive
in antiphase (p phase difference) they undergo destructive interference and
cancel out. The probability at that point is zero so no photons actually arrive
there.
This “sum-over-histories” approach is particularly helpful in particle physics
where all possible mechanisms for a particular interaction contribute a phasor
and the sum of all the phasor amplitudes (squared) gives the probability of the
process. As an interpretation of quantum theory, it suggests that beneath what
we regard as the real world of actual events there are potential events (possi-
ble paths) that may not actually be where the photon is found but nonetheless
contribute to what it can do.
system and the observer and that this wavefunction contains a superposition
of all possible states of both. For example, if an atom of a radioactive element
has a 50% chance of decaying in 1 hour then the wavefunction of the system
consists of two parts, one representing the undecayed atom and the other
representing the decayed atom and the emitted radiation. During 1 hour
the amplitude of the first part gets weaker and the second part gets stronger
until at the end of the hour both parts have equal strength. This represents
an equal chance of having decayed or not having decayed. Now consider an
observer who can detect whether the atom has decayed or not. According to
Everett, the observer is also described by a wavefunction and this contains
a superposition of two states, one in which the observer detects a decay and
the other in which he does not. As time goes on the amplitude of the first
part grows and the amplitude of the second part falls until after 1 hour they
have equal amplitudes. No collapse has occurred but the description of the
Universe now contains two observers, one having observed the decay, the
other having observed no decay. The single world has split into two.
According to Everett’s model, the wavefunction for the whole Universe con-
tains a superposition of all possibilities. We only think that the wavefunction
collapses because we, as individuals, only experience one path through this
ever-splitting Universe even though copies of ourselves occur in many of the
other parallel worlds. The attraction of the many-worlds theory is that it solves
(or at least removes) the measurement problem. Its drawback is the weird
multiplicity of Universes it imagines.
Every measurement causes the world to split into multiple copies of itself,
each copy containing one possible outcome of the measurement AND a
copy of the measuring device giving that outcome.
This solves the “measurement problem” – No “collapse of the wavefunction.”
But it suggests that world is continually splitting into multiple copies of
itself – “many-worlds.”
M1 1 World 1
[Observer 1]
[[Observer]
IN
M M2 2 World 2
[Observer 2]
M3 3 World 3
[Observer 3]
M is a measurement and M1, M2, and M3 are three different possible results of
that measurement that exist as superpositions in the wavefunction.
In the many-worlds interpretation everything that is not impossible actually
occurs in some world – if life is highly improbable but not impossible it must
exist in part of the multiverse.
vial of
poison
?
hammer
release
detector
radioactive
atom
27.6 EXERCISES
28
Astrophysics
28.1 STARS
Before the start of the 20th century physicists were unable to explain how
a star like the Sun could continue to radiate energy for billions of years. No
known chemical or gravitational process could provide a large enough source
of energy. This was not a major problem until the theories of long-term geo-
logical processes and evolution by natural selection became established. Both
required the Earth to have existed, and had a source of energy, for billions of
years.
The problem of the Sun’s energy source was solved by the discovery of
Einstein’s mass-energy equation and the process of nuclear fusion (see Section
26.3.4). Stars fuse light nuclei into heavy nuclei in their cores releasing a huge
amount of energy. Radiation from the core supports the star against gravita-
tional collapse while energy is transferred to its surface where it radiates out
into space.
28.1.1 Mass
The most important parameter when modeling a star is its mass. The greater
the mass of the star the higher the temperature at its core. This can allow
fusion reactions to proceed faster and nucleosynthesis (see Section 26.3.5) to
produce heavier elements. Increasing mass rapidly increases the reaction rate
in the core so that more massive stars use up their fuel relatively more quickly
than less massive stars and reach the end of their lives earlier.
Mass determines the fate of a star.
Low-mass stars are stable for a long time but their cores are not hot
enough to create nuclei beyond carbon. When they run out of fuel, they
swell to become red giant stars before shedding their outer layers of gas
and forming a planetary nebula. This exposes the white-hot core. This
final state is called a white dwarf star. Fusion reactions have now ceased
so the white dwarf cools down over a long period of time (of the order of
a billion years) eventually becoming a dense black dwarf star.
High-mass stars have shorter lives but the core becomes hot enough for
nuclear fusion to create iron. Iron is the most stable nuclide so at that
point fusion reactions stop suddenly and the star undergoes a violent col-
lapse and explosion called a supernova. This can increase the luminosity
of a star by a factor of around 1010 for a short period (days or weeks). The
supernova explosion has two effects – some of the energy creates heavier
nuclei than iron and the process blasts these out into space, where they
can become part of the raw material for second and third generation stars
to form. Our own Solar System must have formed from supernovae rem-
nants because it contains significant amounts of the heavy nuclides (e.g.,
uranium in the Earth’s crust).
Mass also determines what happens to the core left after the supernova
explosion.
The core is so massive (typically 2-3 times the mass of the Sun) that the
forces that prevent the collapse of ordinary matter – that is, forces that
stop atoms being crushed (called electron degeneracy pressure) – are
overcome by gravity. This effectively forces orbital electrons and nuclear
protons to combine to form neutrons. When this happens the core radius
decreases enormously (to about 10 km!) and the density of the core,
which is now almost entirely made of neutrons, increases spectacularly
to around 1017 kgm−3. One centimeter cubed of this neutron star material
would have a mass of one hundred million metric tonnes! The core is now
called a neutron star. This collapse causes the rotation rate to increase too
so neutron stars spin rapidly, sometimes completing a revolution in milli-
seconds). Neutron stars form from the collapse of stars with initial masses
in the approximate range of 10–30 times the mass of the Sun.
Another effect of the collapse is to intensify the magnetic field of the star.
This has the effect of directing a beam of radio waves out along the mag-
netic axis of the star. Since this axis can be in a different place to the rota-
tion axis the beam sweeps around like the light from a lighthouse. If the
Earth happens to be struck by this beam we receive regular pulses. When
these pulses were first discovered they were so regular that astronomers
thought they might be alien radio signals. They are called “pulsars.”
If the mass of the core is greater than about 5 times the mass of the Sun
then the collapse to a neutron star is not the end of the story. The gravita-
tional forces are so strong that the neutrons themselves are crushed and
at the present time we know of no physical force that prevents collapse to
a point or singularity. A black hole is formed. The reason for the name is
that at a certain distance from the central singularity the escape velocity
is c, the speed of light. Since this is a universal speed limit no material or
information from points closer to the singularity can reach the outside
world. The surface at which this occurs is called the event horizon of the
black hole (see Section 23.2.5).
Red giant
Red supergiant
supernova
planetary
nebula explosion
nucleosynthesis up
to uranium-238
wavelength
peak
2.90 ´ 10-3 mK
λ max T =
2.90 × 10 −3
T=
λ max
If the luminosity is also known it is possible to use Stefan’s law to calculate the
radius of the star:
L =σAT 4 =4 πr 2 σT 4
L
r=
4 πσT 4
super giants
short lifeme: Antares
7
10 years
Siriu
giants
Sun
white dwarf
stars
4
long lifeme:
10 11
10 years
O B A F G K M
The diagram on the previous page shows (in a very simplified form) the
main regions of the HR diagram. The letters refer to spectral classes used
in astronomy. The Sun is in class G and has a surface temperature of about
5800 K.
A diagonal band runs from large luminous hot blue-white stars at the top left
to small dim red stars at bottom right. This is called the “main sequence” and
stars spend most of their lives on this band. At the end of their lives, when
nuclear fusion fuel in their core runs out, they move off the band as they
become red giants or supergiants and eventually white dwarf stars, neutron
stars, or black holes. These final two-star types do not appear on the HR dia-
gram because their luminosity and spectrum are not measured directly (and
luminosity is very low).
28.2 DISTANCES
One of the greatest challenges for astronomers and cosmologists is to find
ways to determine accurate distances to the objects they observe. Ancient
Greek astronomers managed to find ingenious methods to estimate the size of
the Earth and the distances to the Moon and Sun but modern space explora-
tion has provided accurate methods for surveying our immediate surround-
ings in space. Distances to objects beyond the solar system are determined
by several different overlapping methods and these regions of overlap can be
used to calibrate one technique against another.
R
background of
d
Telescopes can be used to measure the parallax angle a. This is half of the
total angular shift in the star’s position during a 6-month period (as the earth
completes half of an orbit).
R
= tan α
d
parallax angles are tiny so we can use the small angle approximation and
replace tan (a) with a (in radians).
R
d=
α
where R is the radius of the Earth’s orbit. This is known very accurately from
laser ranging within the solar system and trigonometry. If R is measured in
meters and a in radians then d will also be in meters. Astronomers often use
different (non-SI units):
1 Astronomical unit = 1 AU = 149 597 870 700 m
They also measure the parallax angle in seconds of arc where:
1 second of arc = 1/3600 degree
When these units are substituted into the equation for distance above, the
result is in parsecs (pc) so that a star with a parallax angle of 0.1 seconds of arc
is at a distance of 10 pc.
1 parsec (pc) = 3.0857 ´ 1016 m = 2.26156 light years (ly)
The parallax method using Earth-based telescopes is limited to about 100 pc
because as distance increases the parallax angles soon become too small to be
resolved. However, space telescopes (e.g., the Hipparcos satellite) can extend
this method to about 1000 pc.
L
d=
I
flux of radiaon
at Earth
mean flux
at Earth
period of Cepheid
variable, T
luminosity
period
T
Cepheid variables are very luminous so they can be detected out to a very
great distance. This gave astronomers a method to extend distance measure-
ments from our own galaxy to other quite distant galaxies.
The way to determine distance using Cepheid variables is summarized below:
Monitor the flux from a Cepheid variable.
Measure its period T and mean intensity I.
Use the period-luminosity relation to find the luminosity L.
Use the inverse-sqaure law to find the distance d.
This method can be calibrated against the parallax method using nearby
Cepheids.
Hubble used this distance method with groups of Cepheids in nebulae and
showed that the nebulae were actually separate galaxies outside the Milky Way.
dλ = ( λ − λ0 )
where λ0 is the wavelngth from a stationary source and λ is the wavelength
from the moving source.
δλ
Astronomers usually work with the fractional shift, z = . If this is positive
λ0
the wavelengths are increased and it is called a “red-shift.” If it is negative the
wavelengths are decreased and it is called a “blue-shift.”
For velocities small compared to the speed of light the relationship between
δλ v
=
z =
λ0 c
δλ
z= is positive for all distant galaxies, that is, light from all distant galax-
λ0
ies is red-shifted.
The red-shift is directly proportional to the distance of the galaxy:
δλ v
z= = ∝d
λ0 c
v∝ d
v = H0 d
This final equation is the “Hubble law.”
red-shi or
recession velocity
distance
Having established Hubble’s law, red-shifts can be used to determine the dis-
tances to distant galaxies. However, for very distant objects the source must
be extremely bright otherwise the flux of radiation reaching the Earth is too
weak for measurements to be made. Fortunately, there are extremely bright
objects that can act as “standard candles” for these measurements. One such
object is a type 1a supernova. Astronomers understand the physics of these
stars and can predict their luminosity, which is great enough for them to be
seen in the most distant galaxies. At these distances, the recession velocities
are a significant fraction of the speed of light so relativistic effects must also
be taken into account.
28.3 COSMOLOGY
Cosmology is the science of the Universe as a whole, dealing with its origin,
nature, evolution, and end. All of the evidence that we have suggests that the
laws of physics we have discovered from our own planet operate throughout
the Universe, so we use these to try to understand it. While cosmology deals
with physics on the largest scale it is intimately linked to physics on the small-
est scales and discoveries in particle physics and cosmology are often linked.
The enormous energies present soon after the Big Bang are reproduced in
particle physics experiments such as the Large Hadron Collider at CERN.
Big Bang A B C
A B C
me
in both diagrams. This confirms the Hubble law – a galaxy at double the dis-
tance has double the recession velocity, v µ d. The same argument would hold
equally well for observers at B or at C.
If the Universe is expanding now, it must have been much smaller in the past.
Stephen Hawking and Roger Penrose showed that it must have begun as a
point or singularity of infinite density that exploded and has been expanding
ever since. This initial explosion is called the Big Bang.
We can use Hubble’s law to estimate how much time has passed since the Big
Bang. This is an estimate of the age of the Universe. The method assumes that
the galaxies have always had their present relative velocities. We can calculate
how long they have been separating by dividing their current separation by
their current recession velocity:
d 1
T= =
v H0
This is the “Hubble time.” It does not take into account the variation in galac-
tic velocities caused by gravitational forces but does give a good order of
magnitude for the age of the Universe. More sophisticated methods using
evidence from the cosmic background radiation (measured by the Wilkinson
Microwave Anisotropy probe, WMAP, in 2012) have given a much more pre-
cise estimate of the age of the Universe:
Age of Universe = 13.772 ± 0.059 × 109 years
So far we have described the red-shifts as if they are caused by the motions
of galaxies through space. However, we have already seen that the concept of
absolute space had to be abandoned and only relative motions are significant.
Einstein’s general theory of relativity goes further showing that space has geo-
metrical properties that can be changed by the presence of matter or energy
(space-time curvature). This theory provides a different interpretation of
red-shifts and the Big Bang and expansion. According to general relativity
space itself is expanding (rather like the surface of the balloon in the analogy
above). The red-shifts are therefore a result of the stretching of electromag-
netic waves as they cross the expanding space between galaxies. Hubble’s law
is consistent with this approach (as can be seen from the example above). One
new consequence of Einstein’s approach is that there will be “horizons” in the
Universe. While no object can move through space faster than the speed of
light it is possible for the space between two galaxies to expand so fast that
light cannot travel between them. When this happens the galaxies effectively
disappear over a cosmic horizon.
a brief period after the Big Bang when the Universe was hot and dense
enough for some nucleosynthesis to take place but this soon stopped as
the Universe expanded and cooled. All the heavier elements were synthe-
sized by nuclear fusion reactions in stars.
The Big Bang and expanding Universe model is consistent with Einstein’s
general theory of relativity. In fact his theory requires a Universe that
either expands or contracts.
28.4 EXERCISES
1. (a) The surface temperature of the Sun is about 5800 K. Sketch a graph
to show how the intensity of radiation varies with wavelength for EM
waves leaving the surface and calculate the wavelength at which the
peak of this distribution occurs.
(b) The luminous flux from the Sun is about 1400 Wm−2 at the radius
of the Earth’s orbit. Calculate the luminous flux at the orbit of Mars
(about 1.5 times further away than the Earth.
(c) The luminosity of the Sun is about 4 × 1026 W with a surface tempera-
ture of about 5800 K. Calculate its radius.
(d) Toward the end of its life the Sun will become a white dwarf star
with a surface temperature of about 105 K. How will this affect the
spectrum of radiation it emits? Support your answer with a relevant
calculation.
Wien’s constant = 2.90 × 10− 3 mK, Stefan’s constant = 5.67 × 10− 8 Wm− 2K− 4
2. The peak wavelength in the spectrum of light emitted by the super-giant
star Betelgeuse is 830 nm and its luminosity is 3.5 × 1031 W
(a) Calculate its surface temperature.
(b) Calculate its radius.
Wien’s constant = 2.9 × 10− 3 mK, Stefan’s constant = 5.67 × 10− 8 Wm− 2K− 4
3. A distant galaxy has a red-shift of 0.05. The Hubble constant is about
2.2 × 10− 18 s− 1 in SI units.
(a) Calculate the velocity of the galaxy relative to the Earth and state the
direction in which it is moving.
(b) Calculate the distance of the galaxy.
(c) Use the Hubble constant to estimate the age of the Universe in years
and explain why the actual age is likely to differ from this value.
4. (a) Describe a parallax method for measuring the distance to a relatively
nearby star.
(b) Show that 1 parsec is about 3 ´ 1016 m and about 3.26 light years.
(c) Explain why stellar parallax cannot be used to measure the distance to
very distant stars.
1 astronomical unit (AU) = 1.496 ´ 1011 m
(d) How far away (in pc) is a star with parallax 0.052 seconds of arc?
5. (a) Two similar galaxies are observed. The brightest blue stars in galaxy A
have an apparent brightness 10 000 times greater than those in galaxy B.
Galaxy A is 107 light years away. How far away is galaxy B?
(b) Two type Ia supernovae are observed one week apart. The first is 100
Mpc away but its apparent brightness is only 0.070 of the apparent
brightness of the second. How far away is the second supernova?
6. (a) Explain what is meant by a “standard candle” in astronomy.
(b) Explain how a Cepheid variable can be used as a standard candle to
measure distances to galaxies beyond the Milky Way.
7. (a) A rocket is traveling away from the Earth at a velocity of 9.0 ´ 106 ms−1
when it transmits a signal to the Earth on a carrier frequency of
10.000 ´ 1010 Hz. To what frequency should the receiver on Earth be
tuned?
(b) A distant galaxy has a red shift of 0.030. Aliens on a planet in this galaxy
transmit a signal to Earth on a carrier frequency of 10.000 ´ 1010 Hz. To
what frequency should the receiver on Earth be tuned?
(c) Compare your answers to (a) and (b) and discuss whether or not the
shift in frequency has the same physical cause.
(d) Estimate the time taken for the signal in (b) to reach the Earth.
H0 = 2.2 × 10− 18 s− 1
8. (a) Explain how observations of galactic spectra led to the ideas of the
expanding universe, Hubble’s law and the Big Bang.
(b) State and explain two other pieces of evidence for the Big Bang as the
origin of the Universe.
29
Medical Physics
29.1 ULTRASOUND
29.1.1 Overview of Ultrasound
Ultrasound (sonography) uses high-frequency sound waves to form images
of structures inside the human body and is particularly suited to imaging soft
tissues. Typical frequencies are in the range of 1–20 MHz with corresponding
wavelengths from 2 to 0.1 mm. Distances to tissue boundaries are computed
using the time for reflected pulses to return:
1
distance to boundary = × speed of ultrasound in tissue × time for pulse to return
2
The speed of ultrasound in the body is about 1550 ms- 1. The higher frequency
waves provide higher resolution but are absorbed more strongly so do not
penetrate so far into the body.
Ultrasound transducers use piezoelectric crystals to generate and detect the
waves. When an alternating voltage is applied across the crystal it vibrates and
when the crystal experiences alternating stress (as it absorbs an ultrasound
wave) it generates an alternating voltage.
There are several different types of an ultrasound scan. An A-scan (“amplitude
mode”) is the simplest procedure, using a single transducer to detect echoes.
As the scanner is moved along a line the depth is computed and d isplayed
on a screen. A B-scan (“brightness mode”) uses a linear array of transducers
and creates a 2D image as the scanner is moved. It is also possible to ultra-
sound pulses to compute the velocities of tissue boundaries in order to create
cornea rena
gel lens
ultrasound
trasnducer
Signal strength
ultrasound pulse
me / µs
0 40
B-scans of the eye are used to diagnose problems such as a detached retina,
glaucoma, or cataracts or to monitor the shape and size of a tumor. In the
B-scan, the incident ultrasound is moved back and forth to scan slices of the
eye. These can then be combined to form an image. The eye is quite small so
ultrasound does not have to penetrate far and higher frequencies can be used.
Recently frequencies up to 50 MHz have been used to provide extremely
high-resolution images of structures at the front of the eye. At these frequen-
cies, penetration is just a few millimeters.
B-scans in pre-natal scanning have to penetrate farther into the body so these
are limited to lower frequencies. This reduces their resolution but the struc-
tures being imaged tend to be larger so this is not a major problem.
ultrasound transducer
coupling gel
blood flow
As the blood flow pulses the frequency difference shows a series of peaks:
frequency
me
29.2 X-RAYS
29.2.1 Overview of Medical X-rays
X-rays were discovered in 1895 by Wilhelm Roentgen and he won the very
first Nobel Prize for physics in 1901. X-rays are a form of high frequency,
short wavelength electromagnetic radiation emitted when electrons moving
at high-speed crash into a target and stop suddenly. The radiation is ionizing
and highly penetrating and Roentgen took the first X-ray photograph, of his
wife’s hand, soon after his discovery.
X-rays are now used routinely in medicine to create images of the inside of the
human body. In conventional X-ray radiography the X-rays pass through
the body and are absorbed to differing extents by the tissues through which
they pass, creating a shadow image that can be captured on film or by arrays
of detectors. A more sophisticated technique, called Computed Tomography
(CT-scanning) involves rotating the X-ray source and detectors around the
body to create a 3D image.
7 6
6
5
dose / mSv
4
3
average annual radiaon dose
2
2 1.5 1.5
1 0.1 0.4
0.001 0.005
0
procedure
X-rays are ionizing radiation so they can damage tissues and doctors must
always balance risk against benefit when deciding whether to use them. The
risks depend on the wavelength and intensity of the X-rays, the duration of
the procedure, and the tissues being exposed. To assess the risk the -ray dose
is compared to the annual radiation dose from natural background sources.
The chart below shows typical doses from different types of X-ray procedures.
rotang tungsten
anode (target)
cathode
vacuum cooling system for
anode
electron
beam
X-rays
There are two main adjustments to the tube – the electron current and the
accelerating voltage. Increasing the current increases the number of electrons
per second striking the target and this increases the number of X-ray photons
emitted per second. Increasing the voltage increases the energy of the elec-
trons and the maximum frequency of the X-ray photons. It also increases the
number of photons emitted.
The spectrum of X-rays produced is continuous, but also contains some sharp
emission lines that are characteristic of the target element. These lines are
created when electrons strike atoms in the target and eject an electron from
an inner orbit (e.g., K-shell or L-shell). The vacancy is then filled by electrons
from higher orbits cascading down and emitting photons as they do so.
X-ray intensity
K-lines
L-lines
wavelength
The largest energy jumps are for electrons dropping into the innermost shell,
the K shell, and these correspond to the shortest wavelength spectral lines.
The energy jumps into the K and L shells correspond to X-ray photon energies.
The short wavelength cut-off corresponds to all the energy of one incident
electron being transferred to a single X-ray photon. The rest of the continuous
spectrum corresponds to more complex interactions and multiple collisions.
The higher frequency, shorter wavelength X-rays are the most penetrating.
These are called “hard X-rays” and are the ones needed for image formation.
Longer wavelength, “soft X-rays” are usually filtered out because they do not
contribute to the image but do increase the radiation dose. The absorption of
X-rays by tissues attenuates the beam. The amount of attenuation increases
with the density of the tissue, so bones absorb X-rays more strongly than the
surrounding soft tissues. This is what creates the contrast in an X-ray image
and X-rays are particularly good for imaging bones and bone damage. Soft
tissues do not create much contrast so often a contrast medium is injected
prior to the X-ray. CT scans are better at imaging soft tissue than standard
X-rays.
Absorption coefficients for soft tissues do not vary much so contrast media
are often used to create X-ray images of blood vessels, the fallopian tube, the
urinary tract, the digestive system, etc. These are based on iodine or barium
compounds that absorb X-rays strongly.
X-rays
paent
lead grid
film
Some of the X-rays passing through the patient are scattered off axis so a lead
grid is slowly moved between the patient and the film during the exposure.
This eliminates off-axis X-rays and increases the signal-to-noise ratio for the
image.
An intensifying screen can be used to increase the number of light photons
created from each X-ray photon. This consists of thin layers of fluorescent
material placed in front of the film. The fluorescent layers absorb the X-rays
and emit visible photons. The arrangement is housed in a cassette that is
placed under the patient for exposure.
An X-ray filter (usually a metal plate) is placed between the X-ray source and
the patient to filter out the low energy (long wavelength) X-ray photons. This
reduces the total dose given to the patient but does not affect the intensity of
the image because low energy photons would have been absorbed in the body.
An X-ray image intensifier can also be used to increase the brightness of the
image. Incident X-rays strike a phosphor screen that converts the X-rays to
photons of visible light. These strike a photocathode that emits electrons.
X-rays
to camera/viewer
vacuum
CT scans rotate an X-ray source and collimator around the patient to create
detailed images of slices of the body. An array of fixed detectors surround
the patient.
X-rays X-rays
fixed detectors
CT
X-rays
8 voxel cube
detectors
the human body, mainly in water molecules, they are ideal for imaging all
parts of the body.
When an external magnetic field is applied the spinning nuclei precess around
it at a fixed frequency called the Larmor frequency:
γB0
f=
2π
superconducng coils in
liquid helium cryostat
magnec field
layers of thermal
insulaon gradient coils
radio frequency
transmit coils
scanner moveable
table detecon coils
MRI
that emit electrons and amplify the signal. A computer processes its electrical
output to produce the final image. A lead grid is placed in front of the scintil-
lator to reject off axis gamma-rays and create a sharper image.
gamma-rays
collimator:
lead grid
scinllator: sodium
iodide crystal
photomulpliers
Compute
Display
0
1 e+ 0
−1 e → 2 00 γ
The positron emitted in this decay only travels a short distance (less than
1 mm) through the tissue before meeting an electron and annihilating. The
annihilation of the electron-positron pair results in the emission of a pair of
gamma rays that travel in opposite directions (a pair must be emitted in order
to conserve linear momentum). The position of the annihilation along the
line determined by the two gamma rays is determined from the time delay in
arrival at detectors on either side of the patient.
The detectors only respond to near-simultaneous pairs of photon arrivals
(within about 10 ns of each other) and then measure the small additional time
delays for each pair.
B
PET
annihilaon
For a time delay dt the annihilation event must have been at a position
that is cdt/2 closer to A than the center of the chord AB, that is, a distance
AB/2 - cdt/2 from A and AB/2 + cdt/2 from B.
In practice, the two gamma rays emitted from an annihilation event are not
emitted at exactly 180° so this introduces an uncertainty into the position
of the chord. In addition to this, the detector can only resolve events that
are more than about 0.50 ns apart so this introduces an uncertainty into the
position along the chord. The image quality and resolution improve as more
events are detected (signal-to-noise ratio falls) and resolutions of about 1-2
mm are possible with clinical scanners. This is not as good as a CT image but
PET scans can be used to investigate a very wide range of metabolic pathways
and when used alongside CT or MRI scans information about both structure
and function can be combined.
The injection of a radioactive tracer means that the patient remains
radioactive for a short time after the procedure. For a typical PET scan
involving fluorine-18 the total activity injected is about 370 MBq. The patient
will absorb a radiation dose equivalent to that of a full body CT scan (about
7 mSv). If the PET scan is combined with a CT scan the total dose will be the
sum of doses from the two procedures.
29.6 EXERCISES
State and explain the risks associated with X-ray imaging of the
1. (a)
human body.
When an X-ray tube is used to produce a photographic image of a
patient an aluminum plate is placed between the patient and the
X-ray tube.
Explain how this can reduce the radiation dose without affecting the
(b)
intensity of the X-ray image that is formed.
2. Some tissue injuries can be imaged equally effectively using ultrasound
or an MRI scan.
X-ray intensity
K-lines
L-lines
wavelength
Explain how the continuous spectrum arises and why it has a short
(a)
wavelength cut-off.
(b) Calculate the cut-off wavelength for a 40 kV X-ray machine.
Explain how the line spectra come about and account for the differ-
(c)
ence between the K and L lines.
4. State and explain four factors that must be considered when selecting a
suitable radioisotope to use as a tracer in the human body.
Explain why the annihilation of an electron and a positron during
5. (a)
a PET scan is likely to result in two gamma-rays of the same wave-
length emitted in opposite directions.
(b) Calculate the wavelength of these gamma rays.
A
Estimations and Fermi Questions
It’s not clear what method Fermi used to make his estimation but there
are usually many ways to solve a Fermi problem and here is a (very simple)
approach to this one.
Let’s assume that the energy released by the explosion pushes the atmos-
phere back so that a hemisphere of air with a radius equal to Fermi’s distance
from ground zero (the position of the explosion) is displaced outwards by 2.5
m. Work must be done against the atmospheric pressure further out, so the
energy transfer can be calculated using force times distance:
Surface area of hemisphere = 2pr2
Outward force F = 2pr2 × p where p is the atmospheric pressure (about
105 Pa).
Work done = F δx where δx is the outward displacement of the air.
Work done = 2p × (12 000)2 × 105 × 2.5 = 2 × 1014 J
1 kT = 4.2 × 1012 J
Fermi
This gives a result of about 50 KT, much greater than Fermi’s estimate and
about 2.5 times the actual yield of the Trinity test. However, it IS the correct
order of magnitude, which is pleasing given the incredibly simple model used
to make the estimate. Fermi would have used a more sophisticated model,
taking into account the actual pressure differences in the shock wave and
the proportion of the input energy that went into it. His estimate of 10 kT
was impressive, but did not win the bet. Isodor Rabi was the winner with an
estimated yield of 18 kT. We do not know how he did this (or maybe he just
got lucky).
ρgh =
10 5
10 5
=h = 10 4 m
1 × 10
This is much less than the radius of the Earth so the atmosphere can be
treated as a thin layer (like a carpet with an area equal to the surface area of
the Earth). The volume of this layer can then be used to find how many moles
of gas are present and this can be used to find the number of molecules. The
atmosphere is equivalent to a 10 km deep uniform layer. The fact that the
density of the atmosphere varies with height does not affect this estimate – it
would have the same weight and exert the same pressure at the surface for
any pattern of density variation because it is only the total weight of all the
molecules that are responsible for the surface pressure.
The volume of air at atmospheric temperature and pressure that would exert
the same pressure at sea level as the Earth’s atmosphere is:
V = 5.15 × 1014 × 10 4 m3
5.15 × 1014 × 10 4 m3
=
n = 2.1 × 10 20 moles
0.024
1 mole contains 6.02 × 1023 molecules so the total number of air molecules in
the atmosphere is:
Method 2
This method has some similarities to the first but uses the total mass of the
atmosphere and the mass of an “air molecule.” The total weight of the atmos-
phere is equal to the atmospheric force exerted on the entire surface of the
Earth:
F
m= = 5.25 × 1018 kg
g
5.25 × 1018
=
n = 1.75 × 10 20 moles
0.030
v2 = u2 + 2 gh = 0 + 2 × 9.8 × 2 = 39.2
v = 6.3 ms-1
During a time dt the canopy moves down through a distance vdt and sweeps
through a volume pr2vdt of air. The mass of air in this volume is equal to
rpr2vdt where r is the density of the air. If we assume that all of this air must
be accelerated up to speed v as the canopy passes we can work out the rate of
change of momentum:
ρπr 2 v2 = mg
This equation can be rearranged to give an expression for r, the radius of the
parachute:
mg
r=
ρπ2 v2
Try it. Which estimates are you most confident about and which have larg-
est uncertainty? How would varying these parameters affect the final esti-
mate? Can you put an upper and lower limit on the number of piano tuners in
Chicago? Is there any way to test your estimate?
B
Experimental Investigations
To carry out and write up an experiment effectively you will need to:
Have a clear aim
Consider relevant underlying physical principles (this might involve research)
Identify relevant parameters and decide which will be varied and which
will be held constant
Select appropriate measuring instruments bearing in mind their range
and precision
Plan an experimental procedure
Carry out a risk assessment
Decide how to record and analyze the data including uncertainties
Draw a conclusion
Evaluate the experiment
List any references you have used
B.1.1 Variables
To make your experiment a “fair test,” the parameter you are investigating
must be affected only by the parameter you are varying. For example, if you
are investigating factors affecting the acceleration of a dynamics trolley you
might vary its mass by adding loads to the trolley. However, if you then pull
it with different forces you have varied two parameters, both of which affect
its acceleration, so you will not be able to separate the effect of mass from the
effect of the resultant force. To make this fair you need to keep the resultant
force constant while varying the mass and then carry out a separate experi-
ment in which the mass is kept constant and the resultant force is varied. The
three parameters, mass, resultant force, and acceleration are examples of the
three types of variable parameters in all experimental work:
Independent variable: the unique variable that we change (e.g., mass)
Dependent variable: the variable that we are investigating (e.g., acceleration)
Control variable: the variable we keep constant (e.g., resultant force) so that
it will not affect the dependent variable
In most experiments, there are many control variables that must be kept
constant.
There are guidelines that must be followed when you work with hazardous
chemicals, electricity, radioactive sources, vacuum containers, etc, Your
teacher or supervisor should be consulted about these and if you are ever in
doubt about the risk of an experimental procedure, do not proceed.
∆E
−
so the current should be given by an equation of the form: I = Ce kT
.
Variables: state the independent and dependent variable and all of the
important control variables, for example, the dependent variable is current,
the independent variable is temperature, and the control variable is a poten-
tial difference across the thermistor.
Apparatus: you could give a list of instruments, or label them clearly on a
diagram showing the experimental setup. Either way, it is important to state
(and if necessary justify) their range and precision. For example, a suitable
thermometer and ammeter. A voltmeter will also be needed to ensure that the
potential difference is constant.
Procedure: a clearly labeled diagram showing the experimental arrange-
ment can save a lot of writing and is easier to understand than a block of text.
Take care of the diagram and if you draw it yourself use a ruler. Explain what
you will measure and why and state the range and number of readings you
intend to take, including any repeats. For an electrical experiment draw a
circuit diagram and make sure the symbols you use are correct.
Risk assessment: once you have planned a procedure look at it critically.
Assess the risks and plan precautions to minimize them. This might mean
modifying your experiment or seeking advice from your teacher or supervi-
sor. Do not carry out an experiment if you have any doubts about its safety.
Include a summary of the risk assessment as part of the experimental account.
Conclusion
It is surprising how often students forget to state a conclusion to their experi-
ments. This should be a simple statement of what you have achieved in your
experiment and should relate back to the original aims.
Evaluation
Having carried out the experiment and drawn a conclusion you should consider:
How strongly the conclusion is supported.
How do any calculated values compare with expected or known values?
The significance of the uncertainties.
Likely sources of error.
Suitability of the apparatus used.
How the experiment could be improved.
Glossary
Include a list of technical terms you have used in your account along with
brief explanations.
References
Include a list of references indicating where you have used each source.
Provide enough information so that someone reading your report can easily
find the information you used. This should include:
the title of the work or article,
the author or authors,
publisher and publication date,
the page or pages used, and
URL and date accessed (for websites).
B.2 INVESTIGATIONS
An investigation is a more open-ended project that involves a considerable
amount of preliminary research and pilot experiments (to try things out and
to explore the phenomena). These will help you to plan a sequence of experi-
ments, but as you proceed you should be prepared to modify your plans in the
light of results, or when you discover that a particular experiment does not work.
It takes time to get into an investigation, and you must not be easily deterred,
especially at the start when there is a lot of uncertainty about how to begin.
Researchers spend a lot of time getting nowhere, but without trying a range
of different approaches you are unlikely to hit upon the method that actu-
ally works. You might also discover interesting and unexpected aspects of the
problem along the way.
Science is a collaborative endeavor and carrying out an investigation usually
requires you to discuss ideas with your teacher or supervisor and to think
about the feedback they give you. You are also likely to need a fair amount of
apparatus, and this could involve discussing your needs with laboratory tech-
nicians or equipment suppliers (in advance.) In both cases it is important to
be as clear as you can about what you need and what you need it for – asking
for a “block of wood” is pretty meaningless: what type of wood? What dimen-
sions? What is it for?
Writing up an investigation can be a daunting task, especially if you leave
it all until you have finished in the laboratory. It is worth using a laboratory
notebook and it is essential to write up and process data from every experi-
ment before moving on. A good investigation cannot be completely planned
in advance, it evolves as you discover more about the problem you are tack-
ling. There are many ways to write up an investigation but the written report
could take the following form:
Aim: statement of what is to be investigated
Background physics: summary of research about the problem identifying
aspects for investigation.
Pilot experiments: these should be used to explore the phenomena and test
methods or instruments to see if they are suitable. These pilot experiments
are not intended to produce precise data for analysis although they might
suggest relationships to be investigated in more detail later.
Plan of investigation: this should outline a sequence of related experiments
that can be used to collect good relevant data to move the investigation forward.
Experiments: each experiment should be written up fully (using the guide-
lines in Section B.1) including risk assessments and taking account of anything
learned in earlier experiments. The glossary and references can be left until
the end of the complete report.
Conclusions: while each individual experiment might lead to its own conclu-
sion this section takes an overview of the whole investigation and must be
written at the end of the work. It should relate back to the original aims.
Evaluation: this covers the same points as an individual experimental evalua-
tion but refers to the whole investigation. It might consider the relevant merit
and contribution of different experiments.
Glossary: a list of technical terms you have used in your investigation along
with brief explanations.
References: The point of a reference is so that someone reading your work
can find the information easily without having to search through an entire
book or article, so give full details. There are several standard ways to provide
references; one of the most widely used is the Harvard system.
C
Units, Constants, and Equations
C.1 SI UNITS
Base units
Quantity Name Symbol
SI base unit
Length Meter m
Mass Kilogram kg
Time Second s
Electric current Ampère A
Thermodynamic temperature Kelvin K
Amount of substance Mole mol
Luminous intensity Candela cd
Derived units
Derived quantity Name Symbol
SI-derived units SI base units
Force Newton N kgms- 2
Pressure Pascal Pa kgm- 1s- 2
Energy Joule J kgm2s- 2
Power Watt W kgm2s- 3
Charge Coulomb C As
Resistance Ohm W kgm2s- 2A- 2
Potential difference Volt V kgm2s- 2A- 1
Capacitance Farad F kg- 1m- 2s3A2
dy n −1
Power law, for example, y = Axn = nAx
dx
dy x
Exponential function: y = ex =e
dx
dy bx
Exponential relationship: for example, y = Aebx = bAe
dx
dy
Sine function: y = sin x = cos x
dx
dy
Function: y Derivative (rate of change):
dx
dy
Cosine function: y = cos x = − sin x
dx
dy
Sinusoidal variation: for example, y = A sin (bx) = bA cos ( bx )
dx
dy
Cosinusoidal variation: y = A cos (bx) = − bA sin ( bx )
dx
dN
Radioactive decay = −λt N = N0 e−λt N = N0 at t = 0
dt
dQ Q −
t
Capacitor discharge = − Q=Q e RC Q = Q0 at t = 0
dt RC 0
2
Simple Harmonic d x A, w , f constants
2
= −ω x =x Acos ( ωt + φ )
Motion 2 (w = 2pf )
dt
ds
Dynamics v= s = ∫vdt
dt
dv
Dynamics a= v = ∫ adt
dt
d ( mv )
Newton’s laws F= mv − mu =
∫Fdt
dt
dQ
Electric circuits I= Q = ∫Idt
dt
dN dN
Radioactivity
dt
= −λN ∫ N
= − ∫ λ dt
dQ Q dQ dt
Capacitors
dt
= −
RC ∫ Q
= −∫
RC
C.6 EQUATIONS
Mechanics
Equations for constantly accelerated motion:
( u + v) t 1 1
v= u + at s = = s ut + at 2 =
s vt − at 2
2 2 2
v=
2
u2 + 2 as
u2 sin 2θ
Range of a projectile: R=
g
Coefficient of friction: Flimit = m N, m S = tan q (q is limiting angle)
Weight: W = mg
m
Density: ρ =
V
F
Pressure: p =
A
Linear momentum: p = mv
dp F
Newton’s second law: F= a= (constant mass)
dt m
Impulse and change
of momentum: ∫Fdt = ∫dp = ∆ p
=
Ft ( mv − mu ) (constant mass)
m0
Rockets: v= v + u ln
f 0
mf
∂ ∂L ∂L
Euler-Lagrange equations: =
∂t ∂q ∂q
Fluids
Materials
Hooke’s law: F = ke
1 1 1 1
Spring systems: = + + …
kseries k1 k2 k3
kparallel = k1 + k2 + k3 …
1 1
Strain energy (spring): EPE = Fe = ke2
2 2
EPE 1
Strain energy (wire): = σε
V 2
F e
Stress and strain: σ = ε=
A l0
σ
Young’s modulus: E=
ε
Thermodynamics
pV
Ideal gas equation: = constant pV = nRT
T
Adiabatic compression: pV g = constant
1 1
Kinetic theory equation: pV = Nm v2 p= ρ v2
3 3
1 3 R 3
Mean molecular KE: =
m v2 = T kT
2 2 NA 2
3 kT
RMS molecular speed: v=
rms =
v2
m
1 3
Internal energy: U==Total KE = N A m v2 RT
2 2
3 5
Heat capacities (ideal gas): cV = R cP = R
2 2
c
Adiabatic gas constant: γ= P
cV
γp
Speed of sound in gas: c=
ρ
∆E
−
Boltzmann factor: f = e kT
Qhot 1
Heat pump: CoPheat pump = ≤
W 1 − Tcold
Oscillations Thot
1
Frequency: f=
T
Angular frequency: ω = 2 πf
d2 x
SHM: a = = −ω2 x
dt 2
=x A cos ( ωt + δ )
=x A cos ωt
dx
v= = −ωA sin ωt
dt
dv
a = = −ω2 A cos ω = −ω2 x
dt
Mass-spring system: 1 k T = 2 π m
f=
2π m k
1 g l
Simple pendulum: f= T = 2π
2π l g
1
Total energy: =
TE mω2 A2
2
Damped SHM: =x Ae− γt cos ( ωt )
Resonance condition: f d = f0
Rotational Dynamics
l
Angles in radians: θ=
r
Small angle approximations:
v
Angular velocity: ω=
r
a
Tangential acceleration: α=
r
v2
Centripetal acceleration: a= = rω2
r
Centripetal force: mv2
= =
F ma = mrω2
r
Equations of motion for constant angular acceleration (by analogy):
v= u + at → ω f = ωi + αt
s=
( u + v) t
→ θ=
(ω + ω ) t
i f
2 2
1 1
=
s ut + at 2 → θ = ωi t + αt
2 2
1 1
=
s vt − at 2 → θ = ω f t − αt 2
2 2
v=
2
u2 + 2 as → ω f 2 = ωi 2 + 2αθ
Moment of inertia: = (∑ i= N
i=1
m i ri 2 )
1
Rotational KE: RKE = Iω2
2
Angular momentum: L = Iω
d ( L) dω
Torque: Γ = = I = Iα
dt dt
Moments of inertia for uniform objects:
Point mass: I = mr 2
1 2 1
Rod: Iend = ml ICM = ml 2
3 12
Thin hoop: I = mr 2
1
Disc/cylinder: ICM = ma2 (radius = a)
2
2
Sphere: ICM = ma2 (radius = a)
5
Waves
λ
Wave speed: v = = fλ
T
1D travelling wave: =y A cos ( ωt − kx )
Intensity: I ∝ A2
c
Wave speed in medium: v=
n
Refraction: n1 sin=
θ1 n2 sin θ2
n2
TIR: sin c =
n1
Malus’s law polarization): Itrans = I0 cos2 q
n
Brewster’s law: tan θB = 2
n1
1
Speed of light: c= (vacuum)
ε 0µ 0
1
v = (medium)
ε 0 ε r µ 0µ r
Refractive index: n= εr µr
1
Power of a lens: P=
f
hi v
Linear magnification: =
m =
ho u
1 1 1
Lens equation: = +
f u v
β fo
Astronomical telescope: M= =
α fe
v
Doppler shift (light): ∆λ = ± λ 0
c
λ′ − λ 0 v
Red shift: z= =
λ0 c
Hubble’s law: v = H0 d
2 π∆ x
Phase difference: ∆φ =
λ
sy
Young’s double slit: λ=
d
Diffraction grating maxima: =
nλ d sin θ
1.22λ
Single slit minima: sin θ =
D
1.22λ
Rayleigh criterion: θ≥ (for resolution)
D
T
Waves on a string: v=
µ
γRT
Speed of sound: v= (ideal gas)
M
E
v = (solid)
ρ
I
Decibel scale: intensity level ( B ) = log10
I0
Acoustic impedance: Z = (speed of sound) × (density)
2
Reflection: Ir Z2 Z1
I0 Z2 Z1
Electricity
dQ Q
Electric current: I= I = (constant current)
dt t
Q1Q2
Coulomb’s law: F=
4 π∈0 r 2
F dV
Electric field strength: E= E = −
q dx
F Q
E-field strength (point charge): E= =
q 4pe 0 r 2
EPE
Electric potential: V=
Q
W
Potential difference: ∆V =
Q
Q
Electric potential (point charge): E =
4 πε0 r
∑
i= N
Qi
Gauss’s theorem: ∫
surface
E.dS = i=1
ε0
V
Resistance (Ohm’s law): R=
I
Rseries = ∑ i=1 Ri
i= N
Resistors in series:
1 1
= ∑ i=1
i= N
Resistors in parallel:
Rparallel Ri
RA
Resistivity: ρ =
l
Electrical energy: E = VIt
Real cell: V= E − Ir
Q
Capacitance: C=
V
Q2 CV 2 QV
Energy stored (capacitor): =
E = =
2C 2 2
ε0 εr A
Parallel plate capacitance: C=
d
t t t
− − −
Capacitor discharge: Q ( t ) = Q0 e RC
I ( t ) = I0 e RC
V ( t ) = V0 e RC
Capacitor charging: −
t
=Q QF 1 − e RC
−
t
=V VS 1 − e RC
t
−
I ( t ) = I0 e RC
Time constant: t = RC
1 1
= ∑ i=1
i= n
Capacitors in series:
Cseries Ci
i= n
Capacitors in parallel: Cpara = ∑Ci
i =1
Q
Capacitance (charged sphere): Csphere = = 4 πε0 a
V
Magnetism
µ 0 I sin θ
Biot-Savart law: =
δB δl
4 πx2
µ 0 NI
Magnetic field strength: B= (narrow coil)
2r
µ0 I
B= (long straight current-carrying wire)
2 πr
µ µ NI
B= 0 r (long solenoid at center)
l
µ µ NI
B= 0 r (long solenoid at end)
2l
Ampère’s theorem: ∫
closed
=
B.dl ∑µI
enclosed
0
loop by loop
d ( NΦ )
Faraday’s law: E= −
dt
dI
Self-inductance: E = −L
dt
dI
Mutual inductance: E2 = M 1
dt
E1 = M dI2
dt
1
Energy stored in inductor: W = LI 2
2
Ideal transformer: I2 V1 N1
= =
I1 V2 N2
A.C. Circuits
V0 1 1
Reactance (capacitor): X=
C = =
I0 ω C 2 π fC
V
Reactance (inductor): X L = 0 =ωL =π
2 fL
I0
V0
Impedance: Z =
I0
2
V 1
Z = S = R2 + ( X L − X C ) =
2
R 2 + ωL − (RCL series circuit)
I ωC
VL − VC X L − X C
Phase angle: =
tan φ = (RCL series circuit)
VR R
1 1
Resonant frequency (RCL circuit): f0 =
2 π LC
Gravitational fields
Gm1 m2
Newton’s law of gravitation: F= −
r2
gravitational force F
Gravitational field strength:
= g =
mass m
F GM
g= = − 2 (point mass or uniform spherical mass)
m r
GPE
Gravitational potential: VG =
m
GM
VG ( r ) = − (point mass or uniform spherical mass)
r
∆GPE = (h>>r)
mgh
2GM
Escape velocity: vesc =
r
2GM
Scwarzschild radius: RS =
c2
−2GMmr
Tidal forces: ∆F =
R3
∆V
Gravitational time dilation: T ′ T 1 + 2 (weak fields)
=
c
Special relativity
1
Gamma-factor: g=
v2
1−
c2
Time dilation: T′ = g T
L
Length contraction: L′ =
g
Lorentz transformations:
x′ g ( x − vt )
= z′ = z
y′ = y vx
t′ =γt − 2
c
Inverse Lorentz transformations:
x = γ ( x′ + vt′ ) z = z′
vx′
y = y′ γ t′ + 2
t=
c
( u + v)
Velocity addition: w=
uv
1 + 2
c
Mass increase: m = γm0
E2 − p2 c2 = ( m0 c2 )
2
Mass and energy: ∆E = c2 ∆m
ln 2
Half-thickness: x 1 =
2 µ
σ
Range of radiation: R=
ρ
Radioactive decay: N = N0 e − l t
N
Activity: A = − = −λN0 e−λt = −λN
dt
ln 2
Half-life: t 1 =
2 λ
Nuclear binding energy: B.E. = c2Dm
Binding energy per nucleon: B.E./A = c2Dm /A
Quantum physics
Photon energy: E = hf
Photoelectric effect: KEmax= hf − Φ= hf − hf0
eV=
S KEmax= hf − Φ
h
de Broglie relation: λ =
p
h
Compton effect: ( λ=
′ − λ) (1 − cos θ )
me c
− me4 1 −13.6 eV
Hydrogen atom energy levels: = En =
8e 0 2 h2 n2 n2
1 1 1
Balmer series: = R 2 − 2
λn 2 n
1 1 1
Rydberg formula: = R 2 − 2
λn m n
me4
Rydberg constant: R= 2 3
8 cε0 h
h
Heisenberg indeterminacy relation: ∆x∆p ≥
4π
Astrophysics
L
Radius of a star: r= 4
4 πσT
Hubble’s law: v = H0 d
1
Hubble time: TH =
H0
Medical Physics
C.7 CONSTANTS
Speed of light in a vacuum c 3.00 × 108 ms− 1 299 792 458 ms− 1
− 19
Electronic charge (magnitude) e 1.60 × 10 C 1.60217662 × 10− 19 C
Planck constant h 6.63 × 10− 19 Js 6.62607004 × 10− 34 Js
− 11 −2
6.67408 × 10− 11Nm2kg− 2
2
Gravitational constant G 6.67 × 10 Nm kg
Avogadro constant L 6.02 × 1023 mol− 1 6.02214086 × 1023 mol− 1
− 23 −1
Boltzmann’s constant k 1.38 × 10 JK 1.38064852 × 10− 23 JK− 1
Molar gas constant R 8.31 J mol− 1 8.3144598 mol− 1
− 27
Unified atomic mass unit u 1.66 × 10 kg 1.660539040 × 10− 27 kg
−7 −1 −6 −1
Permeability of free space m0 4p × 10 Hm 1.25663706 × 10 Hm
− 12 −1 − 12
Permittivity of free space e0 8.85 × 10 Fm 8.85418782 × 10 Fm− 1
Stefan’s constant s 5.67 × 10− 8 Wm− 2K− 4 5.670367 × 10− 8 Wm− 2K− 4
−3
Wien’s constant W 2.90 × 10 mK 2.8977685 × 10− 3 mK
Proton mass mp 1.67 × 10− 27 kg 1.6726219 × 10− 27 kg
− 31
Electron mass me 9.11 × 10 kg 9.10938356 × 10− 31 kg
− 27
Neutron mass mn 1.67 × 10 kg 1.674927471 × 10− 27 kg
Rydberg constant R 1.10 × 107 m− 1 1.0 973 731.568 508 × 107 m− 1
−2
Standard acceleration of gravity g 9.81 ms 9.806 65 ms− 2 (defined)
Standard atmospheric pressure atm 1.01 × 105 Pa 101 325 Pa (defined)
1.2 × 10− 2 kg⋅mol− 1 1.2 × 10− 2 kg⋅mol− 1 (defined)
12
Molar mass of carbon-12 M( C)
D
Solutions to Exercises
Descriptive answers are not usually included and can be found by referring to
the relevant chapter.
1. displacement 0.41 s
0.4 0.83 s 1.18 s
0.82 m
0.82
0 82 m
me
−1.2 m
velocity
4.0 ms−
1
me
7.5 ms−
1
acceleraon
me
− 9.8 ms−
2
1. All of them.
2. (a) 2.83 N at 45° to left of vertical. (b) 5.10 N at 32.1° left of vertical
3. 6.2 N
4. 0.75 N
5. (a) FA = 1225 N FB = 1715 N (b) 2940 N
6. (a) mS = 0.466 (b) mK = 0.443
(c) Force
2.69 N
2.56 N
me
(d) These are opposite ends of the same interaction so they must be
equal (N3).
3. (a) 0.50 ms−2 to the right (b) a = 19.4 ms−2 to the right
4. (a) 30 000 kgms−1 (b) 0.16 kgms−1
5. 1.3 ms−1 to the right
6. (a) v1 = 0.594 ms−1 v2 = 0.316 ms−1
(b) Initial KE = 0.32 mJ and final KE = 0.23 mJ so the collision was
inelastic. 094 mJ has been transferred to other forms in the collision.
7. (a) 2500 N
(b) (i) Power needed to overcome drag is unchanged but additional
power must be supplied to increase the GPE of the car as it rises.
(ii) 111 kW
mu2
8. (a)=
s uT +
2B
(b)
distance 2
braking distance
2
thinking distance
speed
(c) Braking distance depends on u2, so halving u reduces this by a factor of 4.
CHAPTER 6: FLUIDS
(b) The largest Reynold’s number is ~ 1000 in the aorta. This is close to
the limit for laminar flow so some turbulence could occur, but else-
where the flow will be laminar.
CHAPTER 9: GASES
1. (a) 4.4 × 105 Pa, 15°C
(b) No change. Work is done on the gas but this is equal to the heat trans-
ferred to the surroundings.
(b)
(a)
volume
9. (a) −
(b) (i) DU = 0, (ii) W = H
10. (a) −
(b) (i) exponential increase (ii) exponential decay (iii) exponential decay.
(c) Increasing T increases the B.F. so a larger fraction of collisions exceeds
the activation energy and the reaction rate increases.
11. 2.5×
12. 1.3 × 10−5
13. (a) As T increases B.F. increases so a larger fraction of the electrons have
enough energy to jump to the conduction band.
∆E
−
(b)=
I GV= const.×V×e kT so a graph of ln I against 1/T has a gradi-
ent -DE/k
14. (a) − (b) 0.67 (c) 0.174 gs−1
N N
mg mg mg mg
(c)
No. Ball will have tangential acceleration as well as centripetal
acceleration and resultant force is parallel to resultant acceleration.
(d) v = rg
3. Velocities shown on diagram. Accelerations all equal to v2/r and directed
toward center.
2v
C
2 v
45 B
D
45
2 v
v=0
A
4. (a) 1.82 s (b) 0.55 Hz (c) 3.5 rad s−1 (d) 7.8 × 10−3 Js (e) 0.054 Nm
(f) 0.013 J
5. −
6. (a) 4.7 × 109 kgm2
(b) Astronaut experiences an inward contact force that maintains his cir-
cular motion about the center of rotation. This feels like the reaction
to a gravitational field of strength g = rw2.
(c) 0.22 rad s−1 (d) 94 MJ
7. (a) 0.63 Js (b) 26.2 s
8. No external resultant torque. Angular momentum is conserved. L = Iw. I
is reduced so w increases. Work must be done by the children as they use
forces to move inwards. This increases RKE.
9. 8.3 × 1028 Nm
3. First row: 20 cm, 2, virtual erect; second row: infinity, infinity, not
defined; third row: 60 cm, 2, real inverted; fourth row: 40 cm, 1, real
inverted; fifth row: 20 cm, 0, not defined.
4. First row: 6.7 cm, 0.67, virtual erect; second row: 10 cm, o,50, virtual
erect; third row: 12 cm, 0,40, virtual erect; fourth row: 20 cm, 0, not
defined.
5. (a) 30 times (b) 20 times and length increased by 2.0 cm.
6. (a) 12.5 ms−1 (b) 3.4 × 1024 m
7. For example, by measuring the Doppler shift of 21 cm hydrogen lines
from spiral arms and calculating their velocities relative to earth. From
these 9 and some geometry) the angular velocity of the galaxy can be
calculated.
6. field lines originate on charge and meet the surface at 90°. Equipotentials
perpendicular to field lines.
7. (a) Gaussian surface inside the sphere encloses zero charges so net flux is
also zero. By symmetry zero everywhere inside.
(b) Use the surface of a small flat cylinder embedded in a conductor with
ends inside and outside the surface of the conductor. All field lines
must pass through the external surface and must be perpendicular to
the conductor so EA = Q/e0 and E = s/e0 where A is the surface of the
conductor inside the cylinder.
(c) Gaussian surface enclosing empty space encloses zero charge so net
flux through surface is zero and flux entering volume must equal flux
leaving it.
8. (a) -
(b) -
(c) 250 000 Vm−1 (d) 8.0 × 10−14 N (e) No change (uniform field)
(f) ae = 60 000 aion
9. (a) 2.3 × 107 ms−1 (b) parabolic downward (c) 2.2 × 10−9 s
(d) 7.7 × 106 ms−1 (e) 18.5° (f) 2.7 × 10−7 J
1. (a) − (b) −
2. 7.0 × 10−5 ms−1
3. (a) 0.34 A (b) 4.3 × 103 Am−2 (c) 2.2 W −1m−1 (d) 0.17 W −1
(e) 2700 ms−1 (f) e− , h+ . e−
4. (a) B lights normally, A does not light.
(b) A lights almost normally and B lights just below normal brightness.
ER
5. (a) − (b) V = (c) Plot 1/V against 1/R: gradient = r/E and
( + r)
R
intercept = 1/E (d) −
6. (a) copper (b) 0.62 W, 0.39 W, 2.22 W (c) 44%, 54%, 91%
(d) 0.43 W (e) 4.6 A
2. (a) Circular field lines around each wire: direction from RH rule. Field of
either wire intersects another current at 90° creating a “motor effect”
force: directions from FLHR. Attraction.
(b) 1.78 × 10−6 Nm−1
(c) Currents on opposite sides of coil have a repulsive interaction pro-
ducing an outward (explosive) force. This will be large if current and
number of turns are large.
m1 v m2 v 2 ( m1 − m2 ) v
3. (a) − =
(b) r1 = r2 (c) separation =
Bq Bq Bq
4. (a) − (b) Distance of line of action of force from axis varies as coil turns.
Max. at 0°, min. at 90°.
(c) 0.0312 Nm, 0.027 Nm, 0.022 Nm, 0.016 Nm. 0 Nm
CHAPTER 22: AC
1. (a) 9.8 N (b) 2.72 × 10−3 N (c) 3600 and 3602 − that is, the ratios are
the same so it is consistent. (d) − (e) − (f) −
12
Carbon - 12 6 C 6 12 6 6 6
13
Carbon -13 6 C 6 13 6 6 7
14
Carbon - 14 6 C 6 14 6 6 8
16
Oxygen - 16 8 O 8 16 8 8 8
56
Iron - 56 26 Fe 26 56 26 26 30
157
Gold - 79 Au 79 197 79 79 118
235
Uranium - 235 92 U 92 235 92 92 143
235
Uranium - 238 92 U 92 238 92 92 146
(b) For example, carbon isotopes: same atomic number but the different
mass number
(c) Electrostatic repulsion between protons is a cumulative long-range
repulsion. Strong nuclear interaction between neutrons and protons is
short-range. For a large nucleus, more neutrons are needed to stabi-
lize the nucleus against the coulomb force.
4. (a) 20 cpm (b) radioactive decay is a random process so it fluctuates
(c) 8 h (d) alpha and gamma
5. (a) 1.22 × 10−4 y−1 or 3.85 × 10−12 s−1
(b) 2 2 800 y assuming no contamination with “younger” carbon.
(c) 19 000 years
6. 2
7.
Time, years P Q
0 N 0
1000 N/2 N/2
2000 N/4 3N/4
3000 N/8 7N/8
5000 N/32 31N/32
(c) 0.0052 u
4. (a) 166 C → 167 N + −01 β + 00 υ
(b) 8.0 MeV (c) Energy is shared randomly with anti-neutrino
(d) Neutron-rich nucleus. Large mass defect for decay. High probability
of decay and therefore short half-life.
5. (a) 137 N → 136 C + +01 β + 00 υ (1) 137 N → 138 O + −01 β + 00 υ (2)
(b) For (1) Dm = 0.0024 u, for (2) Dm = - 0.019 u, so mass of products
in (2) is greater than mass of original nucleus so that reaction cannot
proceed spontaneously.
(c) 7.5 MeV / nucleon, 7.7 MeV / nucleon − both are stable and nitrogen-15
might be expected to be more abundant. In fact, nitrogen-14 is much
more abundant because the odd-odd configuration is much more sta-
ble than an odd-even configuration of nucleons, allowing protons and
neutrons to pair up.
(d) t1/2 (C-16) > t1/2 (C-17) > t1/2 (C-18). Neutron excess destabilizes the
nucleus because neutrons are themselves unstable.
6. (a) −
(b)(i) 146 C → 147 N + −01 β + 00 υ (ii) 0 n → 1 p + −1 β + 0 υ
1 1 0 0
(iii) 1/ 3
−2 / 3 d→ 1/ 3
+1/ 3 u + −01 β + 00 υ
(c) Yes − proton (uud) changes to a neutron (udd) so an up quark changes
to a down quark.
6. −
7. (a) 9.7 × 1010 Hz (b) 9.7 × 1010 Hz
(c) Shift for rocket is caused by Doppler effect due to relative motion,
shift for galaxy is caused by cosmological expansion of space.
(d) 4.3 × 108 years
8. −
Dependent variable: the variable you are investigating to find out how it
depends on an independent variable.
Derived unit: unit built up from more than one base unit, for example, ms- 1
for velocity. Some derived units have their own name, for example, the joule,
for energy. The joule is equivalent to kgm2s- 2 in base units.
Deterministic: where having complete knowledge of the present state
of a system is sufficient to make a complete prediction of its future state.
Newtonian mechanics is a deterministic system.
Dielectric material: insulating material that is polarized by an external elec-
tric field. When a dielectric material is placed between the plates of a capaci-
tor increases its capacitance.
Diesel cycle: idealized cycle for a diesel internal combustion engine.
Diffraction grating: an optical component consisting of a large number of
narrow parallel slits – used in spectroscopy to analyze light.
Diffraction: spreading of a wave into a region of geometric shadow after
passing through an aperture or past an object or edge.
Dimension: the nature of a physical parameter independent of its quantity,
for example, the dimension of time but not the actual duration of a particular
time.
Dipole (electric): two charges of opposite sign and equal magnitude sepa-
rated by a distance – for example, in a polar molecule.
Dipole (magnetic): object (e.g., a bar magnet) with a south magnetic pole at
one end and a north magnetic pole at the other end.
Direct current (D.C.): current that always flows in the same direction
around a circuit. The polarity of a d.c. supply is constant.
Dispersion: when refractive index depends on wavelength (e.g., for differ-
ent w
avelengths of visible light in glass) the amount of refraction will also be
wavelength dependent (e.g., when a triangular prism spreads white light into
a spectrum of colors).
Doppler effect: shift in observed wavelength and frequency of a wave as a
result of relative motion between the source and the observer.
Drag coefficient: constant in the drag equation related to the shape and
nature of the surface of a body moving through a fluid.
Impedance: ratio of the peak voltage to the peak current in an A.C. system
even though these values occur at different times. Resistance and reactance
are special cases of impedance when the phase difference between voltage
and current is 0 or p/2, respectively. Measured in ohms.
Impulse: integral of force and time equal to the change of momentum. When
force is constant impulse is Ft = mv - mu.
Independent variable: the variable you vary to find its effect on the depend-
ent variable. Sometimes called the “manipulated variable.”
Indeterministic: where having complete knowledge of the present state
of a system is insufficient to make a complete prediction of its future state.
Quantum theory is an indeterministic system.
Indicator diagrams: plot of pressure against volume for a cyclic process in
a heat engine.
Inertial force: an apparent force that is a result of applying Newton’s laws to
an accelerating reference frame as if it is not accelerating. For example, think-
ing that a force throws you forward when the train in which you are traveling
suddenly slows down. No physical force pushes you forward.
Inertial reference frame: an unaccelerated reference frame, one that is at
rest or moving at a constant velocity.
Intensity: energy per unit area per second in a wavefront, measured in Wm- 2
and proportional to the amplitude-squared.
Interaction: all forces arise from interactions. In nature, there are four fun-
damental interactions: gravitational, electromagnetic, and strong and weak
nuclear forces.
Interferometer: instrument in which light rays traveling along two per-
pendicular paths are brought back together and then allowed to superpose
and interfere in order to measure small differences in the optical paths (e.g.,
to detect the distortions caused by gravitational waves passing through the
apparatus).
Internal energy: sum of random thermal kinetic energies and potential
energies of all particles in the body.
Internal resistance: resistance inside a cell or battery that dissipates energy
when current is drawn from the cell and results in a lost voltage so that the
terminal voltage is less than the emf of the supply.
Invariant: a quantity that is the same for all inertial observers (e.g., the 4D
interval between events).
Neutral point: point in space where the fields caused by two or more sources
cancels out.
Neutron star: fate of a heavy star that has formed a planetary nebula. Its core
continues to collapse beyond the white dwarf stage until it is prevented from
further collapse by neutron degeneracy pressure.
Newton’s law of gravitation: the gravitational force between two point
masses is proportional to the product of the masses and the inverse-square of
their separation: F = Gm1 m2 .
r2
Newton’s laws of motion: three fundamental laws of mechanics related to
the effects of resultant forces and the nature of interactions.
Nucleon: particle found in the nucleus – a proton or a neutron.
Null result: when an expected effect is absent even though the method and
precision should have detected it. The Michelson-Morley experiment is the
most famous example of a null result.
Optical Fiber: narrow transparent fiber along which light or infrared radia-
tion can be transmitted because it repeatedly undergoes total internal reflec-
tion at the boundary. Used to transmit information, for example, for computer
networks, telephone systems, and cable TV.
Oscillation: periodic motion about an equilibrium position, such as the vibra-
tion of a mass on a spring or the swing of a simple pendulum.
Otto Cycle: idealized cycle for a petrol internal combustion engine.
Pair annihilation: conversion of the mass of a matter particle and its corre-
sponding anti-particle into energy in the form of gamma rays.
Parsec: unit of distance in astronomy equal to about 3.26 light years.
Percentage uncertainty: ratio of absolute uncertainty dx to measured value
δx
x expressed as a percentage: × 100% .
x
Perihelion: point on an elliptical planetary orbit when the planet is closest
to the Sun. The equivalent point in the orbit of an Earth satellite is called
perigee.
Permanent magnet: magnetic material in which the atoms are themselves
magnetic dipoles. If these are aligned (e.g., in a ferromagnetic material such
as iron) the sample becomes a magnetic dipole.
Permeability: property of a medium related to its ability to support a mag-
netic field
Standard model: current best model of all particles and forces in the
Universe.
Standing/stationary wave: localized wave disturbance consisting of a sta-
tionary p
attern of nodes (positions of minimum disturbance) and antinodes
(positions of maximum disturbance).
Static friction: frictional force between two surfaces in contact with each
other and at rest.
Stefan-Boltzmann law: relationship between the power per unit area emit-
ted by a radiator and its temperature.
Stiff: material that has a large stress to strain ratio (large Young modulus) –
that is, hard to stretch.
Strain energy: energy stored because of deformation, for example, in a
stretched spring.
Strong: large breaking force (for a sample) or large breaking stress (UTS) for
a material.
Sum-over-histories: approach to quantum theory suggested by Richard
Feynman in which all possible paths contribute a phasor and the square of
the resultant phasor at each point represents the probability of the process
taking place.
Supernova: explosion of a massive star at the end of its life.
Superposition: when two or more waves are present at the same point in
space the resultant disturbance is the vector sum of the disturbances due to
each wave.
Symmetry principle: when an operation carried out on a system leaves it
unchanged.
Systematic error: measurement error that affects all measurements in the
same way – for example, making them all too large or too small by the same
quantity or proportion. If the error is known it can be corrected (e.g., by sub-
tracting a constant value from each measurement).
Tensile strain: ratio of extension to the original length. Dimensionless.
Tensile stress: ratio of axial force applied to the cross-sectional area of sam-
ple perpendicular to the force, measured in Nm- 2.
Thermal conduction: transfer of heat as a result of particle-to-particle
interactions.
Thermal equilibrium: when two objects are at the same temperature and,
if placed in thermal contact, there is no net transfer of heat between them.
Thermal radiation: emission of electromagnetic radiation with a spectrum
that depends on the temperature of the emitting body.
Thought experiment: an imagined experiment used to explore the implica-
tions of the theory, for example, Schrödinger’s cat or the twin paradox.
Tidal forces: differential forces arising because of the difference in gravita-
tional force across the diameter of an orbiting body. Tidal forces tend to dis-
tort the body along and perpendicular to the line joining it to the body around
which it orbits.
Time dilation: the observed slowing of time in a reference frame that is mov-
ing relative to the observer.
Time period: time for one complete cycle of oscillation.
Torricelli vacuum: space above the mercury inside a mercury barometer
containing very low-pressure mercury vapor.
Total internal reflection: when a wave strikes the boundary between a
medium of higher refractive index and one of lower refractive index above a
certain critical angle and all of the wave energy is reflected back into the first
medium.
Tough: undergoes a considerable amount of plastic deformation and absorbs
a lot of energy before fracture.
Transformer: electrical device usually consisting of a primary coil and a sec-
ondary coil wound onto the same ferromagnetic core. An A.C. voltage in the
primary can be stepped up or down according to the transformer equation:
V2 N2 .
=
V1 N1
Transverse wave: wave in which the vibration direction is perpendicular to
the direction in which the wave transfers energy (e.g., all EM waves).
Trigonometric parallax: method for determining the distance to a star by
measuring the change in its apparent direction (parallax) angle as the Earth
orbits the Sun. The more distant the star, the smaller the parallax angle. A
method of triangulation with the Earth’s orbital diameter as baseline.
Triple point of water: unique temperature at which ice, water and water
vapour are in equilibrium.
Fleming’s left hand rule, 422 General theory of relativity, 58, 84, 307,
Flotation, 119 503, 505, 508, 530, 588, 590, 639, 652,
Fluids, 111 653, 683
Flux losses Generations
transformers, 453 leptons, 587
Forced (or driven) oscillator, 230 quarks, 587
Foucault, 289 Generator, 453
Fourier analysis, 346 Geocentric model, 495
Fourier’s equation, 159 Geodesics, 504
Fourier synthesis, 346 Geometry
Frames of reference, 102 and special relativity, 529
Free-body diagrams, 55, 56, 85, 134, 226 Geostationary satellites, 499
Freely falling reference frames, 505 Geosynchronous orbit, 499
Frequency, 265 Glasses, 151
Friction GPS satellites, 499
static, 72 Gradient
Frictional forces, 71 how to calculate, 31
Fundamental, 332 Graham’s law of diffusion, 181
Fundamental interactions, 588 Gran Telescopio Canaris, 303
Fusion reactors, 581, 584 Graphs of motion, 42
Gravitational fieldlines and
G equipotentials, 490
Galilean relativity, 80, 81, 513, 516, Gravitational field strength, 484
534, 746 Gravitational field strength of the
Galilean transformation, 105 Earth, 486
Galileo, 79–83, 103, 496, 513 Gravitational mass, 83
Gamma-camera, 666 Gravitational potential, 487
Gamma emission, 555 Gravitational potential energy, 93, 487
Gamma-factor, 519, 520, 523, 524, Gravitational time dilation, 508
533, 534 Gravitational waves, 508
Gamma-rays Gray (S.I. unit), 549
inverse-square law, 543 Great Magellan telescope, 303
Gauss’s law, 359 Ground state, 617
Gauss’s theorem, 360, 361, 371, Gyromagnetic ratio, 665
484, 704
gravitational field, 484 H
Geiger, 537 Hadrons and quarks, 587
Geiger counter, 558 Hahn, Otto, 575
General relativity, 493, 504, 506, 590, Half-life, 552
652, 684, 747 Half-thickness, 545
S absorption, 324
Scalar product, 18 line, band and continuous, 323
Scalars, 16 Speed of light
Schrödinger atom, 621 constancy, 291
Schrödinger equation, 614, 615, 621, measuring, 288
622, 624–627, 632, 757 Speed of sound, 181, 195, 336, 339, 340,
Schrödinger’s cat, 634 343–347, 703, 737
Schwarzschild radius, 493 in a gas, 181
Scientific notation, 5 Spherical aberration, 300
Second derivatives, 11 Spin quantum number, 622
Second law of motion for rotation, 250 Spontaneous process, 551
Second law of thermodynamics, 188, Spreadsheet
199–204, 212–217 using, 31
Self-inductance, 446 Spring constant, 140
Semiconductors, 350, 396 Spring tides, 502
Shear stress, 111 Sputnik 1, 498
Sievert (S.I. unit), 549 Stability, 68, 69
Simple harmonic motion, 223 Standard candles, 650
Simple pendulum, 225 Standard Model, 587, 590, 591
Simultaneity Standing (stationary) waves, 331
relativity of, 522 Standing waves in air columns, 341
Single photon interference, 612 Standing waves on a string, 331
Single slit diffraction equation, 329 Stars, 640
Singularity, 641, 651 Stars as black bodies, 643
Slipher, Vesto, 307, 648 Statistical interpretation
Small angle approximations, 236 of quantum theory, 613
Snell’s law of refraction, 271 Statistical thermodynamics, 199
Solenoid, 430 Stefan–Boltzmann law, 162
Sound waves, 339 Stellar spectra, 644
Space-time, 504, 531, 758 Stokes’ law, 3, 123
Space-time curvature, 504 Stopping voltage, 604
Spark counter, 558 Strain, 143
Special relativity, 513, 730 Strain energy, 142
Specific heat capacity, 162 Streamlines
measuring, 164 in a fluid, 121
Specific latent heat, 165 Stress
Spectrometer, 325 tensile, 143
Spectroscopy, 323 Strong nuclear force (color force), 588
Spectrum Sum-over-histories, 626, 630, 632