Introduction To Calculus of Vector Fields
Introduction To Calculus of Vector Fields
Introduction To Calculus of Vector Fields
April 4, 2001 1
These course-notes are a draft and were prepared for a course in fall 2000 at ASU.
These notes are partially based on work supported by the National Science Foun-
dation grant DUE 97-52453: “Vector Calculus via Linearization: Visualization
and Modern Applications.” The affiliated explorations via computer algebra sys-
tems and interactive JAVA software have not yet been integrated into these notes.
These notes for this class of fall 2000 do not contain reviews of most prerequisite
material as be easily be found in the class’ textbook. Similarly, there are some
vector calculus topics, especially routine exercises, where the class will be referred
to the textbook.
2
Chapter 1
Vector fields
1.1 Introduction
Vector fields are abstractions of very familiar images that convey information using lots of arrows.
One example are force fields and magnetic field lines that surround are magnet. Another common
picture are geographical maps of prevailing winds or currents.
Figure 1.1: A magnetic force field, and a map showing prevailing winds
These examples associate to every point (usually on some grid) an arrow distinguished by its
direction and its length. The arrow points in the direction of the force or in the direction into
which the wind is blowing at that point. The length of the arrow symbolizes the strength, or
magnitude, of the force, or the speed of the wind at that point.
Mathematically a vector field is a special kind of function: The inputs in the examples above
are points in the plane. The outputs are vectors – which abstract the pictorial arrows.
The first task is to make mathematically precise this notion of a vector field as a function.
Thereby we will gain access to a familiar machinery of algebraic and analytic tools.
Our development shall be a two way street: Mathematical objects provide the tools for modeling
physical phenomena. In turn these provide questions and problems which motivate, often even
guide, further mathematical constructions.
3
4
For a very naive preview consider again the two examples given above. One natural question
asks which path a speck of dust will take as it is blown around by the wind. Similarly, one may
ask which path a small particle of magnetic material, or an ion, will follow if subject only to the
magnetic force field. Mathematically, these lead to two different notions of integrals of vector
fields. One is commonly studied under the name of differential equations, the other is connected
to line integrals in vector calculus.
For another question think of the vector field modeling the current in some ocean. A natural
question asks how much water will flow every year through the narrows between two islands.
Similarly, a fisherwoman may ask how much water will flow through her fishing net during one
night. Mathematically, these questions will lead to different notions of integrals of vector fields,
different kinds of line integrals, and surface integrals.
Remembering the first calculus course, a natural question is whether there is an analogue of the
fundamental theorem of calculus.
Exercise 1.1.1 Recall the first and second fundamental theorems of calculus. Look up, and
write down each theorem in precise mathematical terms, with all hypotheses. Then write down
in plain English what they say. Finally give one example each for how they are used.
As another motivation, consider the space shuttle as it travels from the ground to its desired orbit
around the Earth. A natural question is whether some flight paths use less energy than others
as the shuttle is lifted against the gravitational force field of the Earth. Mathematically, this
becomes a question about whether every, or some, or no vector field(s) have an antiderivative.
(Recall that every (continuous scalar) function (of a single variable) is the derivative of some
function. A major thread of vector calculus investigates if this generalizes in some sense to
vector fields).
Physically such antiderivative may correspond to a notion of potential energy. The associated
fundamental theorem for line integrals then has an interpretation in terms of conservation of
energy.
This course we will find several more candidates for fundamental theorems of calculus of vector
fields. Initially these may appear quite different from one another. The final goal of the course is
the triumphant formulation, and use, of the general Stokes’ theorem that contains all as special
cases.
Derivatives of vector fields may arise from questions about rates of change of vector fields. There
are quite a few possible directions in which such questions may lead. However, meaningful
derivatives of vector fields most naturally arise after first developing useful integrals of vector
fields. The desired fundamental theorems will provide additional guidance to develop useful
notions of derivatives! This is much more natural than the experience of first year calculus
may suggest: Integration is about multiplying and adding, differentiation about differences and
quotients. Arguably the natural order is to first learn addition and multiplication, and then
develop subtraction and division by asking e.g.: How much do we have to add to a to get b?
How many times a is equal to b?
Exercise 1.1.2 Make a list of at least five different phenomena that may be suitably pictured by
an arrow at every point. Try to make as diverse a collection as possible. Think broadly, go beyond
physics, and consider e.g. migration in biology and social studies, gradients of temperatures or
of concentrations of chemicals, and others . . .
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 5
Exercise 1.1.3 For each of your examples in exercise 1.1.2 formulate one or two practical
questions that might have answers in terms of integrals or derivatives.
Other places use e1 , e2 and e3 to denote the same vectors. For example the arrows based at the
point P (2, 1, 3) with head at Q(−1, 5, 3) may then be denoted by either
−3
~v = −3~ı + 4~ or ~v = 4 (1.2)
0
Note the ambiguity in the first expression which does not make it clear whether ~v is a vector
in the plane or in 3-space! Throughout this introduction to vector calculus we will use either
notation as is convenient. One good motivation for this is that on one side the ~ı, ~, ~k is very
firmly established in the sciences. On the other side, most any computer program simply works
with pairs or triples of numbers.
Exercise 1.2.1 Write each of the following vectors in the form a~ı + b ~ + c ~k, and as a column
vector.
6
√
a. The vectors along the sides of the triangle with vertices at (−1, 0), (1, 0) and at (0, 3).
b. A vector of unit length that is tangent to the graph of y = x2 at the point (1, 1).
c. A vector in the plane that has unit length and is perpendicular to tangent line to to the
graph of y = x2 at the point (1, 1).
d. A vector of unit length that is perpendicular to the triangle with vertices at (4, 1, 1), (1, 5, 1)
and (1, 1, 6).
On the side we note that once one makes the step from arrows to pairs or triples of numbers, it
is natural to consider n-tuples of numbers, i.e. generalize vectors to n dimensions. This way a
collection of n measurements or data points may be considered a vector, and a small step further,
functions are considered vectors in infinite dimensional spaces. The formal development of this
mathematical notion of vectors is studied in linear algebra. In this introduction to vector calculus
hardly any of this is needed – only occasionally shall we appeal to a little open-mindedness, e.g. to
explore whether our constructions make sense in general settings, or are restricted to very special
e.g. two dimensional phenomena. For the sake of completeness, we note that mathematically ,
vectors are – loosely speaking – objects for which sums and scalar multiples (that obey the usual
rules) make sense. More precisely, vectors are the elements of a vector space, which is a set
together with two operations, called addition and scalar multiplication, which satisfy a list of
basic axioms.
Thus we have a preliminary definition:
Definition 1.2.1 A vector field is a function that assigns to every point in (some subset of the)
plane or 3-space a unique vector in the plane 3-space, respectively.
Just like functions that assign a number y to every number x (possibly in some subset of
numbers), vector fields may be communicated in different ways: The first examples where given
by pictures. For practical computations tables of numbers are often the most suitable way.
For symbolic calculations algebraic formulas are generally desirable. Finally, very powerful are
verbal descriptions such as: “Let F~ be the vector field in 3 space that is defined everywhere except
at (0, 0, 0), that has unit length and points towards (0, 0, 0) at every point.”
symbolic
-
verbal
i
P 1
6 PPP PP P
6
PPPPP
PPP
PPP
PPPPPP
PP PP
? ) PPq ?
graphic
-
numeric
One of the most basic, essential skills is to go from any one of these descriptions to any other.
Some of these directions have the usual ambiguities, e.g. any finite collection of pictured arrows
or numerical data can be fitted exactly with infinitely many different formulas.
Clearly we do not have the space to discuss in detail all twelve directions. But rather than
despair, note that often it is convenient to go an indirect route, e.g. from a verbal description,
to graphical sketch, then read off a number of specific values, and use these to guess a possible
formula.
In the following we consider selected translations between these descriptions.
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 7
Example 1.2.1 An infinitely long straight wire carrying a constant electric current generates a
magnetic vector field. The magnitude of the field is inversely proportional to the distance from
the wire. The field is at every point perpendicular to both the wire and the shortest line from the
point to the wire. Its direction is further governed by the right hand rule: Roll up the fingers to
make a fist, but leave the thumb extended. If the thumb points in the direction of the current,
then the fingers indicate the direction of the magnetic field.
Translate this description into an analytic formula.
Example 1.2.2 Find a formula for the magnetic field described in exercise 1.2.1.
We are looking for three scalar functions M1 , M2 and M3 of three variables each such that
M~ (x, y, z) = M1 (x, y, z)~ı + M2 (x, y, z)~ + M3 (x, y, z)~k is everywhere perpendicular to the wire,
and such that the magnitude |M ~ (x, y, z)| is inversely proportional to the distance of the point
(x, y, z) from the wire.
To simplify matters, choose coordinates such that the z-axis is aligned with the wire. To be
perpendicular to the wire, the vector field must satisfy
!
0 = ~k · M
~ (x, y, z) = M3 (x, y, z) for all x, y, z. (1.3)
This leaves only two functions left to be determined. The shortest line segment from the wire
~ (x, y, z)
to a point P = (x, y, z) is described by the vector ~r = x~ı + y~. The condition that M
~
be perpendicular to this vector and to the wire means that M (x, y, z) must be a multiple of the
cross-product ~k × ~r, i.e.
for some scalar function m still to be determined. The right hand rule forces that m is always
positive.
The magnitude of the field at any point (x, y, z) is inversely proportional to the distance |~r| =
p
x2 + y 2 , i.e. for some constant C > 0
~ (x, y, z) = −y~ı x ~
M + for all (x, y, z) 6= (0, 0, ∗) (1.6)
x2 + y 2 x2 + y 2
where we have chosen the scaling constant C = 1. This formula is well worth memorizing as
this vector field is arguably one of the most important fields in physical applications. Moreover,
as we shall see soon, it is also one of the most interesting fields mathematically.
Given a formula for a vector field in the plane it is very easy to obtain a table of values, and
use this to obtain a graphical image. Today, most computer programs combine these steps,
apparently obviating the need to first create a table. However, without some practice by hand,
the machine’s plots are easily conceived as magic. Moreover, every once in a while the machines
don’t work as expected, have bugs, or show some unexpected results. In such cases it is an
essential skill to be able to verify the correctness of the machine plot, or to track down the
origin of the trouble.
8
Exercise 1.2.2 Use your favorite computer program to plot the vector field M~ (x, y) = sin(12xy)~ı+
cos(11x2 − 9y)~. Explain in words why, say on the rectangle [−10, 10] × [−10, 10], the image ap-
pears to be very complicated.
Example 1.2.3 Make a table of function values for the magnetic field discussed above, and
sketch a suitable cross-section of the vector field.
Since the vector field M~ depends only on the first two coordinates and is horizontal, it may also
be considered as a vector field in the plane! As a function of points in the plane, i.e. of pairs of
numbers, we need a two-dimensional, rectangular table. To facilitate going back and forth from
pictures to tables, it is convenient to arrange the x-values from left to right, and the y-values
from bottom to top. Evaluating the component functions M1 = ~ı · M ~ and M2 = ~ · M ~ at the
grid points, yields the following table of approximate values
−2 −1 − 12 0 1
2 1 2
2 −.25~ı − .25~ −.40~ı − .20~ −.47~ı − .12~ −.50~ı + 0.0~ −.47~ı + .12~ −.40~ı + .20~ −.25~ı + .25~
1 −.20~ı − .40~ −.50~ı − .50~ −.80~ı − .40~ −1.0~ı + 0.0~ −.80~ı + .40~ −.50~ı + .50~ −.20~ı + .40~
1
2 −.12~ı − .47~ −.40~ı − .80~ −1.0~ı − 1.0~ −2.0~ı + 0.0~ −1.0~ı + 1.0~ −.40~ı + .80~ −.12~ı + .47~
0 +0.0~ı − .50~ +0.0~ı − 1.0~ +0.0~ı − 2.0~ undefined +0.0~ı + 2.0~ +0.0~ı + 1.0~ +0.0~ı + .50~
− 12 +.12~ı − .47~ +.40~ı − .80~ +1.0~ı − 1.0~ +2.0~ı + 0.0~ +1.0~ı + 1.0~ +.40~ı + .80~ +.12~ı + .47~
−1 +.20~ı − .40~ +.50~ı − .50~ +.80~ı − .40~ +1.0~ı + 0.0~ +.80~ı + .40~ +.50~ı + .50~ +.20~ı + .40~
−2 +.25~ı − .25~ +.40~ı − .20~ +.47~ı − .12~ +.50~ı + 0.0~ +.47~ı + .12~ +.40~ı + .20~ +.25~ı + .25~
Since the vector field looks the same in any horizontal plane, it suffices to sketch it in the
xy plane. It is straight forward to draw at each grid point (xi , yj ) of the above table the
corresponding vector M ~ (xi , yj , ∗). The picture shown is a computer generated plot that simply
fills more intermediate plots.
Figure 1.2: The magnetic field about a straight wire carrying a constant current
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 9
Care needs to be taken as many computer plotting programs will be unable to plot the field due
to the division by zero at (x, y) = (0, 0). Mathematically the vector field is undefined at (0, 0)
– and it may take some effort to enter this into the plotting program. Physically, the points
(0, 0, z) correspond to the inside the wire where the magnetic field is ????, is it zero inside? .
Exercise 1.2.3 For each the following vector fields create a table of values, and use it to sketch
by hand an image of the field. Then check your result against a plot using a computer program.
~ y) = ~ı + x2~
• S(x,
~
• H(x, y) = −y~ı + x~
~
• G(x, y) = (x~ı + y~)/(x2 + y 2 )
In many cases one can directly sketch the vector field from a verbal description without having
to go through formulae and tables – but that detour always is an option.
Exercise 1.2.4 Sketch a vector field in the plane that has the same magnitude everywhere and
points everywhere in the direction of the positive x-axis.
Exercise 1.2.5 Sketch a vector field in the plane that points everywhere in the direction of the
positive y-axis and whose magnitude is proportional to the distance from the x-axis.
Exercise 1.2.6 Sketch a vector field in the plane that points everywhere in the direction of the
positive y-axis and whose magnitude is proportional to the distance from the y-axis.
Exercise 1.2.7 Sketch a vector field in the plane that at every point P is perpendicular to the
~ from the origin to P , and whose magnitude is proportional to the distance from the
vector OP
origin.
Exercise 1.2.8 Sketch a vector field in the plane that at every point P is perpendicular to the
~ from the origin to P , and which has unit magnitude.
vector OP
Exercise 1.2.9 Sketch a vector field in the plane that at every point P points towards the origin,
and whose magnitude is proportional to the distance from the origin.
Exercise 1.2.10 Sketch a vector field in the plane that at every point P points towards the
origin, and which has unit magnitude.
Exercise 1.2.11 Sketch a vector field in the plane that at every point P points towards the
origin, and whose magnitude is inversely proportional to the distance from the origin.
There are many other ways to go from one to another representation of a vector field. Just like
reading a graph of a scalar function, it is a basic skill to read off values of a vector field and
make a table when given a picture.
Exercise 1.2.12 Consider the vector field pictured in figure 1.1. Choose a coordinate system
and make a table of values of the pictured vector field.
Another valuable skill is to be able to precisely describe in words the key features of a pictured
vector field. This takes practice. As examples for concise verbal description consider the set of
exercises above which go in the other direction.
10
Another direction that takes practice is visualize a vector field from just a brief look at a table
of numerical values. We will practice this in the upcoming section on linear vector fields.
Some of the hardest translations are finding possible formulas that match a picture or table of
values. For us this is important primarily in a few special cases – just like in the first calculus
one should immediately recognize the shapes of a few basic function graphs! Here the most
important ones are constant and linear fields, and the special fields of primary physical interest,
the magnetic field discussed above, and the gravitational (or electro static) field about a point
mass (charge). Linear fields will be discussed in detail in a subsequent section, allowing for
practice of matching formulas to pictures and tables.
We conclude this section with a few exercises to go from formulas to pictures to words.
Exercise 1.2.14 Plot each of the following vector fields using a computer program. Then de-
scribe this vector fields in words as possible. Pay special attention to the changes of magnitude,
the direction (e.g. horizontal radially outwards etc.), and whether the horizontal and vertical
components depend on x, y or whatever else. Be creative, but precise. Think of giving instruc-
tions via telephone to a friend who needs to draw an accurate copy of a pictured vector field that
is in your hands. Pretend that you do not have a formula available.
• F~ (x, y) = ~ı − 2~
~
p
• G(x, y) = x2 + y 2 (~ı + ~)
~
• H(x, y) = −y~ı + x~
~
• K(x, y) = (y~ı + x~)
~ (x, y) = (−y~ı + x~)/ x2 + y 2
p
• U
Take a quick look at the dimensions: The coordinates x and y are lengths, the angular velocity
1
has the dimension of time , and thus the vector field has the expected dimension of a velocity,
distance per time, e.g. meters per second. Now suppose one changes the units, say from meters
to centimeters. Then the numerical values of the coordinates x and y of the same point increase
by a factor of 100, and so does the numerical value of the velocity of this point.
Associated with this vector field is a system of ordinary, autonomous differential equations:
dx
dt = ωy (= ~ı · ~v )
dy (1.7)
dt = −ω x (= ~ · ~v )
The general solution x(t) = x0 cos ωt + y0 sin ωt, y(t) = −x0 sin ωt + y0 cos ωt describes the path
traced out by the point on the rigid body that was located at (x0 , y0 ) at time t = 0.
Exercise 1.4.1 Plot the vector field ~v and solution curves (x(t), y(t)) for different initial points
(x0 , y0 ). Algebraically verify that the curves are indeed circles by simplifying (x(t))2 + (y(t))2 .
Describe the relation of the vector field ~v and the curves (x(t), y(t)).
Recall that the solution curves of differential equations are also called integral curves. A little
reflection should lead one to agree that this is a well chosen name: Going from the velocities
to the positions basically means integrating with respect to time. Thus integrating a system of
differential equations is a first kind of integral for vector fields!
Note that there is nothing special about this vector field. Indeed, every vector field determines
a system of autonomous differential equations. Conversely, every system of autonomous differ-
ential equations may be identified with a vector field. Thus in some sense one may claim that
differential equations and vector fields are the same. Indeed many mathematicians only work
with vector fields, which as functions have better properties than equations.
However, viewing a vector field just as a system of differential equations is not always the most
useful approach. The upcoming examples of force fields shall underline this reservation.
into a pipe. Another rewarding example is the water (or blood) flowing through a pipe or blood
vessel that has a constriction (e.g. a calcium deposit) or that has an enlarged section (e.g. an
aneurysm)
Questions about modeling and different notions of solutions of fluid flows have been some of
the hottest research areas in recent decades, and no final solution is in sight. Thus it is natural
for us to consider very special cases, making simplifying assumptions. Our first assumption is
that the flow does not show any turbulence, characterized by eddies of all sizes with a chaotic
appearance. We shall also ignore boundary layer effects which basically model that the fluid
sticks to the surfaces in some sense.
Simple formulas that model such fluid flows may be obtained fairly easily using functions of a
complex variable.
These models for two dimensional flows have the obvious advantage that they are more easily
to picture. Almost all physical flows are three dimensional. But the two dimensional models
may still be useful to describe flows in thin films, or shallow water as e.g. in the Everglades or
in a flooded plane. For us most important is that such two dimensional models already capture
many phenomena that are of primary interest to us.
Just as before for the rigid body, one can solve the associated system of differential equations and
thus obtain the curves along which idealized small fluid elements flow. But we may also consider
different kinds of integrals. One of the principal characteristics of fluids like water is that they
are virtually incompressible. (This makes fluids useful for example in hydraulic braking systems
in a car!) An fundamental conservation principle thus asserts that the volume entering the pipe
on one end must be equal to the volume exiting from the pipe on the other end. Let us take a
look at the dimensions – again in the case of a three dimensional pipe: Suppose the vector field
determines the velocity in units of centimeters per second. The cross-section of the pipe might
be measured in square centimeters. The product of these dimensions yields cubic centimeters
per second, i.e. a volume per time. Suppose the velocity ~v is constant and perpendicular to
the imagined surface at the end of the pipe. In this case it appears reasonable that the volume
per time passing through the end is obtained by simply multiplying the magnitude |~v | of the
velocity with the cross-section area. In general, we need to account for velocities that are not
perpendicular to the imagined surface. Here vector algebra and dot products shall come in
handy to account for the angles. Integral calculus shall serve to account for curved surfaces and
variable velocity fields ~v . There is a lot of work ahead to fill in the technical details, but this
exploration is the beginning of a different kind of integral of vector fields: The flux of a vector
field across a surface, and in the two-dimensional analog, the flux across a curve.
We conclude this discussion of velocity fields with a few remarks about gases and particles. The
suggestive example of a gas exiting from a rocket engine dramatizes the main difference between
fluids and gases: Gases can be compressed substantially, and they also expand dramatically
when heated. Thus, back in the picture of the flow through a pipe, the volumes entering and
exiting may be very different. Indeed, the difference in volume – again available through surface
integrals must be attributable to what has happened inside the pipe. This might be heating,
some chemical reactions, or due to compression. Nonetheless, mass will be conserved. Thus
aside from the velocity field ~v (measured e.g. in meters per second), one might also consider the
mass flow ~u in units of e.g. kilograms per square meter per second.
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 13
These units may be a little confusing: E.g. when changing the units from meters to centimeters,
the magnitude of the vector field apparently would decrease by a factor of 10,000, where we
expected an increase by a factor of 100. this mystery is easily resolved by regrouping the units
as (kilograms per cubic meter) times (meter per second). When changing the units in the domain
of the vector fields from meters to centimeters, one should only change the meters ion the second
term. The first should has the dimensions of a mass-density and should be treated as one block.
A final scenario is that the vector field models the velocity field of a very large number of
individual particles. Think of huge numbers of electrons or photons. Again the associated
differential equations yields the curves along the particles move. Nothing new here. The surface
integral shall provide the number of particles that cross an imagined surface per unit of time.
Thus the dimensions of the vector field should be number of particles per area per unit of time.
Again, more intuitive might be (number of particles per unit volume) times (distance per time).
In the case of charged particles, the (total charge per unit volume) takes the place of the (number
of particles per unit volume).
turbulence, vorticity, and line integrals!!
The technical names for these kinds of vector fields are contravariant if the changes are in the
same direction – like in a velocity field, and covariant if the changes are in the opposite direction
(like in gradient fields). These names are very unfortunate choices, but – the reversed names
would make more sense. But they have such a long tradition that today nobody has the guts
to champion the change . . .
Let us take a brief look at questions about gradient fields that may lead to notions of integrals
(and possibly, derivatives) of vector fields. Clearly one always can write down a system of
differential equations
ẋ = − ∂f∂x (x, y), and
(1.10)
ẏ = − ∂f∂y (x, y).
(be patient regarding the minus sign). The question is whether these differential equations, or
their solutions have any practical values. Consider first the case in which the function z = f (x, y)
is describes a real hilly landscape. In this case a very suggestive interpretation is that the solution
curves of the differential equations (with the minus sign) describe the path water (a creek)
running down the mountain will take. However, this is only correct in a rough approximation as
water will flow up-hill over obstacles, provided it arrives with sufficiently speed! Thus on a small
scale, of at most a few meters and very fast rushing water the solution curve of the differential
equations does not give the correct path of the water. However, on a sufficiently large scale,
with elevations changes of hundreds, or thousands of meters, and typical water velocities as in
rivers and streams, the solution curve provides a very good model for the path of the stream.
A simple explanation of this surprise is that the gradient field is not a velocity field, but rather
almost an acceleration field: The steepness of the mountain is intimately related to the effect of
the gravitational force, which in turn, via Newton’s law translates into accelarations. The next
subsection will further follow up on this.
Mathematically the system differential equation (1.10) is very useful for numerically calculating
(local) minima of a function z = f (x, y). The idea is very simple: Calculate the gradient of the
function f , and numerically solve the differential equations (1.10) with many randomly chosen
initial conditions. If the function f is bounded from below, every solution curve will eventually
reach an equilibrium which must be a critical point of the function. The simplicity of this
approach lends it much robustness – and with many technical improvements this idea leads to
very efficient numerical optimization algorithms. but this belongs into numerical analysis, not
into vector calculus.
Thus for gradient fields the integration of the system of differential equations (1.10) does not
appear to solve very compelling problems about vector fields.
Similarly, one may explore whether there is any good use for an analogue of the flux integrals
discussed for velocity fields. This would lead to descriptions like: “How much steepness flows
across a given curve?”, “How much temperature flows across a curve or through a surface?”.
Neither one sounds very exciting – indeed, unless a gradient field also can be suitably interpreted
as velocity field, or some similar object, we claim that such integrals of gradient fields are in
general rather meaningless!
However, as a derivative of a function, it is natural to ask whether, and how, one may recover
the function from the gradient field! If possible, this should be considered a way of integrating
vector fields. The idea is actually very simple: If the gradient field tells us at every point how
the graph of the original function is tilted, then one should be able to reconstruct changes in
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 15
altitude from the distance traveled together with the information of the slope. This is very
compelling in the special case where the original graph is a plane, and the path is a straight line.
Next it is intuitive that one should be able to put seevarl such planar-linear pieces together.
Finally calculus comes in – if the function f is differentiable – its graph is on a sufficiently
small scale arbitrarily close to a plane. Similarly, a differentiable path is on a sufficiently small
scale arbitrarily close to a straight line. With this in mind it will be rather straight forward to
construct line integrals – which will return the original function. Given a reference height at one
point, one can calculate the function value at any other point simply by adding the elevation
change along any path from the reference point to the desired point.
Briefly revisit the case of the gradient field of a function modeling temperature in a planar
region. In this case, integrating the rate of change of temperature along any path will yield the
total change of temperature between the starting and end point of the curve.
This general description should appear very familiar from first year calculus. The powerful
property of line integrals of gradient fields returning the total change in height, temperature,
. . . deservedly will be cast as the fundamental theorem of line integrals.
then F 0 = f , G0 = g and H 0 h. The functions F , G, and H are well defined, in a very constructive
way. For any value of x e.g. very basic numerical algorithms allow one to calculate the function
value to arbitrary accuracy.
~
Consider the vector field H(x, y) = (y~ı + x~) in figure 1.6. If it was a gradient vector field, then
the vectors would indicate the direction of steepest increase. Consider traveling along the unit
circle, starting and ending at the point (1, 0). At every point along this curve the vector field
is tangent to the path (in the forward direction!). Thus the curve is always climbing as steeply
as possibly. But at the end the path returned to its starting point – and the total change in
elevation must be zero – if the vector field was a gradient field. However, since the line integral
is definitively strictly positive this leads to a contradiction. Consequently, H ~ cannot possibly be
the gradient of any function!
Exercise 1.6.1 Consider the magnetic field M ~ (x, y) = (y~ı + x~)/(x2 + y 2 ) pictured in figure
1.6. Again pretending that this might be a gradient field, analyze the hypothesized total change
in elevation along closed curves that consist of two circular arcs with radii r1 and r2 , and two
radial line segments along lines that form angles Θ1 and Θ2 with the positive x-axis.
Recall that the magnitude of M ~ is inversely proportional to the the distance from the origin.
Use this, the tangency / perpendicularity of the vectors to the curve, and the length of the curve
segments to explain why along any such closed curve the total change in elevation is zero. (Thus,
the analysis along these curves would be consistent with the vector field being a gradient field.)
On the other hand verify that the argument made for the vector field H ~ along the unit circle still
works. Conclude, that even though along many closed curves there does not appear any conflict
~ is a gradient field, the argument along the circle shows that it is not
with the supposition that M
a gradient field.
Thus a simple picture compellingly shows that not every vector field is the gradient field of some
function. Indeed almost every vector field is not a gradient field. Nonetheless, the few vector
fields that are gradient fields have very important physical applications. Thus a detailed study
of when a vector field is a gradient field is well worth the time and effort.
Later we will establish a very simple algebraic criterion that decides whether a vector field
is a gradient field, or not. Nonetheless, we claim that the picture above, and some further
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 17
explorations of this situation, provide a much deeper understanding of what is going on.
Indeed, this discussion here is just the starting point for a lot of wonderful mathematics, most
notably algebraic topology. Also many of the intriguing features of complex analysis can be
traced back to the above argument about this simple picture of M ~ (x, y).
Traditionally vector calculus has emphasized the study of vector fields that are gradient fields,
largely because of their major importance in physics (compare the next subsection). However,
the defect of not being a gradient field has just as important applications. Arguably one of the
most important applications is in thermodynamics and models the process by which machines
(generators) convert (differences in) heat into mechanical energy. Thus every steam engine,
every gasoline burning car engine, and every electric power plant based on burning some fuel,
whether nuclear or fossil, works because of some line integral along a closed curve being nonzero.
Indeed, the vertical gap in the spiraling staircase picture of the preceding discussion is precisely
the amount of energy converted!
a cat can land on its feet, even when dropped with feet up (with zero total angular momentum).
These latter examples are a topic of an accompanying month- or semester long project.
Summarizing, unlike in functions in first year calculus, not every vector field is a derivative
(gradient field). The mechanism behind this defect is apparent already in very simple pictures.
While gradient fields play an very prominent role in classical physics, the defect of being gradient
has at least as important applications. Thus developing a deep and comprehensive understanding
of being gradient or not is a major goal in vector calculus.
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 19
Let us consider some questions that are motivated by the daily life and physics inquiries, and
which provide the motivation for calculus of vector fields.
First of all, we would like to know more details about the gravitational, electric and magnetic
fields that are generated by a mass, a charge, or a current. The easiest case is when the mass
or charge is concentrated in a single point. In that case the directions of the fields are quite
intuitive, radially outward or inward. However, the reason why the force fields’ magnitudes
decrease at the rates that they do is not immediately clear. Indeed, it will only be the big
integral theorems of vector calculus that really will shed light on why these rates of decay have
the values that they have. They will also resolve the key issues of the mass or charge is not
concentrated in one point, but distributed over a ball or a sphere, or even an irregular shaped
body.
The next interesting pictures arise when adding the effects of two masses, or two charges. It
is the very nature of vectors, characterized by addition and scalar multiplication, which makes
vector fields so useful to model force fields in physics. With modern computer tools it is very
easy to graph e.g. the gravitational field in the neighborhood of two or three masses (which
need not be equal).
Exercise 1.7.1 A gravitational field in the plane generated by a mass m1 at the point (x1 , y1 )
is modeled (using suitable units such that the gravitational constant has unit magnitude G = 1)
20
by
~ 1 (x, y) = −m1 · (x − x1 )~ı + (y − y1 )~
G (1.12)
(x − x1 )2 + (y − y1 )2
Plot the vector fields that results from superpositioning the gravitational fields of two or three
different point masses. In the case of three masses consider generic configurations, as well as
special ones like all three masses on a line, or at the corners of an equilateral triangle.
Plotting hints: Depending on the choice of your graphing tool, you may need to truncate the
vector fields near the singularities. In general you may have to redefine your function G ~ such
~
that G(x, 2 2 2
y) = 0 whenever (x − x1 ) + (y − y1 ) < ε for some cutoff value ε that you choose.
Other graphing programs may allow you to simply pre-multiply the above formula by a function
that is zero if (x − x1 )2 + (y − y1 )2 < ε2 . Depending on what functions are predefined on your
system, you may use the Heaviside function u, the signum function, or create a cutoff function
yourself via u(x) = (x + |x|)/2.
A briefly look at the other integral suggested in our initial discussion of velocity fields, provides
little insight at this time: What would we mean by how much force flows across a surface (or
curve)? In applications there will indeed be a need for such flux integrals, but they are not the
kind of questions suitable to motivate a first study of integrals of vector fields.
Instead force fields suggest a different question that is related to work and energy, and to gradient
fields. If you throw a ball into the air, it starts with lots of kinetic energy (it is moving fast).
The climb against the gravitational field of the Earth requires work (in the technical sense of
modern physics). This work uses up the kinetic energy until the vertical velocity becomes zero,
and the ball starts again falling down. The physicist says that as the ball climbed it gained
potential energy. Practically this means that due to its height, the ball could do work, again
accelerate and gain kinetic energy by giving up some of its potential energy, alas height.
In a slightly different scenario, the space shuttle climbs against the gravitational field, again
doing work and gaining potential energy. This time it is the rocket engines that provide the
thrust, which both accelerate and raise the shuttle.
The main concern for us is to develop a mathematical structure and formalism that is suitable
to model the work being done, and which captures the concept of potential energy.
The starting point is the work ∆W that is done by moving in a constant vector field F~ . If the
~ then the work equals by definition the scalar product
total displacement is ∆s,
∆W = F~ · ∆s
~ (1.15)
For example, if we throw a ball we may assume that the gravitational field is practically constant
along the flight path. However, the space shuttle experiences a weakening gravitational field.
This changing field, together with the curved path, calls for calculus. The key idea is simple, and
familiar from any kind of integral calculus: Chop up the flight path into small segments so that
along each segment the field is practically constant. The pieces must also be small enough so
that each segment of the path is practically linear! Then use the simple formula (1.15) on each
piece, and add up the results. If properly done, we expect that it will be possible to consider
the limit as the segments become infinitely small, which in turn gives rise to a notion of line
integral. Indeed, this line integral will be the same as the one desired in the preceding sections
on gradient fields.
This raises an immediate question: Are all, or some force fields gradient fields? If yes, what is
the physical meaning of “the function that the field is the gradient of ”? Maybe not unexpectedly,
if the force field is a gradient field – and some, but not all are – then this function will play the
role of a potential energy, possibly best thought off as stored work, or ability (potential!) to do
work. A major theme in the sequel is the in-depth study of the interconnections between line
integrals, whether a field is a gradient field, and the potential function (if there is one).
A final look at the units of force fields is less revealing, but still useful. The standard unit
for a force is one Newton, which is defined as one kilogram times one meter, divided by one
second squared. This may sound complicated, but makes sense in light of Newton’s third law.
Integrating a force with respect to distance yields Newton meters, which are also known as
Joules, the basic unit of energy. In electric settings the potential difference is measured in volts,
and the units for the electric field are volts per meter. This appears more suggestive: Changing
the units for distances in the domain from meters to centimeters will increase the coordinates
of a point by a factor of 100. However, the numerical value of the electric field, now measured
in volts per centimeter will decrease by a factor of 100. This suggests that electric fields be
22
regarded as co-variant vector fields, further suggesting a close relation to gradient fields. This
electric example also suggests that for general force fields one might consider the magnitude of
the field in (Joules per meter) rather than in Newtons!
Chapter 3
Line integrals
3.1 Introduction
The preceding section suggested the need for a number of different notions of integrals of vector
fields. Flow-lines, that is, integral curves of the associated system of differential equations are
one kind of integral. They are traditionally studied in courses on differential equations. Our
focus here is to develop two different kinds of line integrals. The development is guided by the
desire that the line integrals are useful to provide solutions for the practical problems considered
before: One notion of line integral shall provide the work done when traveling in a force field.
One (as it turns out, the same) line integral shall recover the original potential function in the
case that the vector field is a gradient field. Another line integral shall yield the total flux of a
moving fluid or gas across a curve.
The first two applications work as well in three dimensions as they do in the plane. However,
the flux is different in each case: In the plane we have the flux across a curve. In 3-space we
consider the flux across a surface. However, we shall initially focus on vector fields and their
integrals in the plane as these are much easier to visualize, and they already exhibit most of the
features of interest.
Some key points that are worth to be pointed out before getting into the details.
• Every kind of integral assigns a number to a pair of objects. One, the integrand, is “to be
integrated”. The other is the object over which the first is to be integrated.
In the first calculus course, the pair consists of a (piecewise continuous) function f , and a
finite interval [a, b]. The resulting number is denoted ab f (u) du.
R
For line integrals, the pair consist of a vector field F~ and a curve C. One traditional
notation for the resulting number uses the symbol c F~ · dR.
R
~
Either notation is loaded with ambiguities. The first step towards more precise terminology
~ in place of the function f or vector field F~ .
is to consider differential forms f dx or F~ · dR
The detailed study of differential forms is subject of advanced courses.
• Every integral is a limit of sums. The key idea (in Riemann integrals) is to break the
interval, or curve, or surface, or region in the plane, or . . . into smaller and smaller pieces.
The usual assumption that the integrand is (piecewise) continuous in a suitable sense
means that it will be practically constant on each piece – provided the pieces are small
enough. The precise technical hypotheses which make this intuitive foregoing rigorous are
the subject of (mathematical) analysis.
23
24
• Every integral is related to some notion of derivative via some form of a fundamental the-
orem. The basic theme is that integration and differentiation are inverse operations in a
suitable sense.
Recall that an integral yields a number for every fixed interval [a, b], curve C, surface S,
. . . , and any kind derivative of a number is zero. Thus it is clear that any such fundamental
theorem must employ some subtle changes of point of view. In first year calculus the key
for one fundamental theorem is to consider the integral ab f (u) du as a function of the up-
R
d Rb
per limit b of integration. The two fundamental theorems state that db a f (u) du = f (b),
Rb 0
and f (b) − f (a) = a f (u) du for suitably nice functions f .
Vector calculus has its own generalized fundamental theorems. The second fundamental
theorem of first year calculus, Green’s theorem, the divergence theorem and Stokes’ theo-
rem are all special cases of one general Stokes’ theorem. A key to success is to understand
the general structure common to all these special cases.
of the magnitude at the surface, respectively. Thus one may obtain a very crude estimate for the
space shuttle by assuming that the gravitational field is constant. However, for geostationary
satellites one cannot avoid using integral calculus to account for the changing field. On the other
hand, when calculating trajectories of base-balls and other objects in everyday surroundings we
are used to assuming that the gravitational field is constant – both in magnitude and direction!
The key idea in the calculation of the work required to lift the satellite into orbit is to break up
the flight path into many small pieces. These should be so small that on each piece one may
reasonably assume that the gravitational field is constant. The second issue is the curved flight
path. Again, if the path is broken up into enough small pieces one may reasonably approximate
each piece by a straight line segment.
On each of these small pieces use the definition for work from physics
∆W = F~ · ∆R
~ (3.2)
Figure 3.1: A path through a gravitational field on large and small scales
In general, suppose P and Q are the endpoints of a curve. Subdivide the curve into small
segments with respective endpoints Pi , i = 0, . . . n with P0 = P and Pn = Q. Let ∆R ~ i = Pi−1~ Pi
denote the respective displacement vectors between the endpoints of each segment. Write Fi for ~
the value of F~ (x, y) at points (x, y) on, or in the vicinity of, the i-th segment of the curve. Then
an approximate value for the total work is
n
F~i · ∆R
~ i
X
∆W = (3.3)
i=1
Choosing a small value for the number of pieces, create the following tables: The first columns
contain the coordinates of the endpoints of the segments of the curve. The last column of the
first table contains the values of the vector field at these sample points. The second table has
one fewer row the the first, since the number of segments is always one less than the number
of endpoints. The first column contains the displacement vectors between the endpoints of the
segments. The middle column lists the average value of the vector field at the endpoints of each
segment. The last column lists the dot products of the respective averaged values of the vector
field and of the displacement vectors.
26
i xi yi F~ (xi , yi ) ~ i
i ∆R F~i (∆W )i = F~i · ∆R
~ i
0 0.00 2.000 0~ı − .2500 ~
1 −0.017~ı + 0.261~ −0.2228~ −0.0582
1 -0.017 2.261 0~ı − .1956 ~
2 −0.051~ı + 0.257~ −0.1766~ −0.0454
2 -0.068 2.518 0~ı − .1577 ~
3 −0.084~ı + 0.248~ −0.1442~ −0.0358
3 -0.152 2.766 0~ı − .1307 ~
4 −0.116~ı + 0.234~ −0.1209~ −0.0283
4 -0.268 3.000 0~ı − .1111 ~
5 −0.145~ı + 0.218~ −0.1038~ −0.0226
5 -0.413 3.218 0~ı − .0966 ~
6 −0.173~ı + 0.196~ −0.0912~ −0.0179
6 -0.586 3.414 0~ı − .0858 ~
7 −0.197~ı + 0.173~ −0.0818~ −0.0141
7 -0.783 3.587 0~ı − .0777 ~
8 −0.215~ı + 0.145~ −0.0748~ −0.0108
8 -1.000 3.732 0~ı − .0718 ~
9 −0.234~ı + 0.116~ −0.0697~ −0.0081
9 -1.234 3.848 0~ı − .0675 ~
10 −0.248~ı + 0.084~ −0.0661~ −0.0056
10 -1.482 3.932 0~ı − .0647 ~
11 −0.257~ı + 0.051~ −0.0639~ −0.0033
11 -1.739 3.983 0~ı − .0630 ~
12 −0.261~ı + 0.017~ −0.0628~ −0.0011
12 -2.000 4.000 0~ı − .0625 ~
Summing the products in the last columns yield the approximate value W ≈ −0.2511. This
value is close to the estimate, and appears reasonable. Using finer subdivisions, one expects
that the values become more accurate, and that they approach a limit.
n 3 6 12 24 48 96 512 1024
W -0.2655 -0.2542 -0.2511 -025036 0.250067 -0.250016 -0.250001 -0.250000
Exercise 3.2.1 Repeat the calcualtions of the example,. but instead of using the adapted trape-
zoidal rule, use an adapted midpoint rule. That is, replace F~i = 12 (F~ (xi−1 , yi−1 ) + F~ (xi , yi )), by
F~i = F~ ( 12 (xi−1 + x1 ), 12 (yi−1 + yi )).
Exercise 3.2.2 Use N = 12 segments to numerically estimate the line integral of the vector
~ (x, y) = (−y~ı + x~)/(x2 + y 2 ) over the unit circle x2 + y 2 = 1 traversed counterclockwise.
field M
Exercise 3.2.3 Write a procedure in any software package that creates data as in the tables
above, and uses these to obtain a numerical estimate of the line integral for any pair of a vector
field and a curve. Test your procedure on the example worked above.
finer subdivision
define line integral
convergence criteria
piecewise C 1 and C 0 uniform easy proof appendix
proposition estimate upper bound
calculate use parameterization
independent of parameterization
intuitively clear
proof simple chain rule
Green’s theorem
4.1 Introduction
Only very few vector fields are gradient fields. For most pairs of a vector field and a closed
curve one may expect the line integral is different from zero. This chapter analyzes the defect of
vector fields that might not be conservative. The final highlights are the formulation of Green’s
theorem, powerful demonstrations of typical uses, and compelling notions of derivatives of vector
fields.
The development is deliberately slow and thorough. The rationale is that the case of vector
fields in the plane is the most accessible, and the development of Green’s theorem is a natural
model for other versions of Stokes’ theorem, in three and higher dimensions.
The emphasis is to develop a deep and coherent understanding of derivatives of vector fields,
and of the associated analogues of the fundamental theorem of calculus. A main theme is
that before one attempts to understand differential calculus and the fundamental theorem of
calculus, one ought to first develop a profound understanding the linear setting. After all,
differentiability merely means that on a small scale the object is well approximable by a linear
object. Moreover, in the linear context any fundamental theorem reduces to a precalculus (or
calculus-free) statement that everyone should comprehend first, including the basic ideas behind
its proof.
The main story line for this chapter is the investigation of the implications of not being conser-
vative. As this will eventually lead to developing notions of derivatives of vector fields, it is wise
to prepare by formally introducing linear vector fields.
~
L(x, y) = (ax + by)~ı + (cx + dy)~ = L1 (x, y)~ı + L2 (x, y) ~. (4.1)
29
30
~ 1) = L1 (1, 0)
a = ~ı · L(0, ~ 0) = L1 (0, 1)
b = ~ı · L(1,
~ 1) = L2 (1, 0) ~ 0) = L2 (0, 1) (4.3)
c = ~ · L(0, d = ~ · L(1,
81 cases each of the coefficients a, b, c, d taking one of the values −1, 0, +1.
Linear fields in polar coordinates, use ~ur and ~uΘ . Linearity defn as opposed to looks linear
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 31
Exercise 4.3.1 For a constant vector field F~ (x, y) = a~ı + b~ find a potential function ϕ(x, y).
The structurally next most simple case is that of linear vector fields
~
L(x, y) = (ax + by)~ı + (cx + dy)~ (4.4)
(ii) Look for patterns. Use the last two columns for additional data as needed. Make conjectures.
Test the conjectures by evaluating additional line integrals over other curves, or by modifying
the vector field.
Your conjecture should be powerful enough to predict – with only minimal calculation, without
paper or calculator! – the value of any line integral of a linear vector field over any closed curve
similar to those considered above.
Conjecture 4.3.1 For a linear field the line integral is scaled by the area of the region R
enclosed by the curve C: The ratio of the line integral divided by the area
H
L ~
~ · dr
C
= constant. (4.5)
(area of R)
is independent of the location, the shape, and the size of the curve C. The constant equals (c − b)
(in above notation) and only depends on the vector field.
Compare this with the slope of a line: The difference quotient (“rise over run”)
y2 − y1
= m is constant. (4.6)
x2 − x1
The constant depends only on the choice of the line, but is independent of both the location
of the points, and the size of the increment (x2 − x1 ). Of course, this constant is the slope of
the line, and it plays a central role in the definition of the derivative of a function of a single
variable. In analogy the quotient (c − b) plays a key role in vector calculus, especially in the
definition of the curl. It is appropriate to consider (c − b) as a generalization of “slope” of a
straight line.
The conjecture deserves proof – this special case will
lead to Green’s theorem! From there we will pro-
ceed in small steps, from simple regions to increasingly
more general curves. In the next sections we continue
with the generalization to nonlinear vector fields. In
the end we shall have developed both a compelling new
notion of a derivative for vector fields, and a power-
ful and useful analogue of the fundamental theorem of
calculus.
~
(i) Parameterize the line segment and directly evaluate the line integral for the field L(x, y) =
(ax + by)~ı + (cx + dy)~.
34
~
Lemma 4.3.2 The line integral of the linear vector field L(x, y) = (ax + by)~ı + (cx + dy)~ over
the edges of the rectangle with corners (x0 ± ∆x, y0 ± ∆y) equals
I
L ~ = (c − b) · 4∆x∆y
~ · dr (4.7)
C
Proof. (outline). Use the observation about the exactness of the midpoint rule, and calculate:
I I I I I
L ~ =
~ · dr L ~ +
~ · dr ~ +
~ · dr
L ~ +
~ · dr
L ~
~ · dr
L
C C1 C2 C3 C4
= (ax0 + b(y0 − ∆y)~ı + (. . .)~) · (2∆x~ı) + ((. . .)~ı + c(x0 + ∆x) + dy0~) · (2∆y~) +
(ax0 + b(y0 + ∆y)~ı + (. . .)~) · (−2∆x~ı) + ((. . .)~ı + c(x0 − ∆x) + dy0~) · (−2∆y~)
= ...
= (c − b) · 4∆x∆y.
Take a closer look how the scaling by area comes to be: It really consists of two components. On
one side, there is the contribution of the increments ∆x and ∆y to the change in the vector field,
e.g. comparing the vector field along the lower edge and along the upper edge. Dividing by the
distance between these edges gives a measure of a rate of change of the vector field. The other
increments ∆x and ∆y are contributed by the lengths of the curve segments. On a very small
scale the vector field is almost constant, and the value of the line integral along each edge is
almost directly proportional to the length of the line segment. Taken together, the line integral
over the closed contour is scaled by the square of the linear dimension, or by the area of the
region enclosed by the curve.
Exercise 4.3.5 Replace the rectangle above by a triangle with two sides aligned with the coor-
dinate axes. E.g. choose the corners to be (x0 , y0 ), (x0 + ∆x, y0 ), and (x0 , y0 + ∆y). Carry out
the analogous calculation.
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 35
Lemma 4.3.3 Suppose C is the curve consisting of the edges of the triangle with corners at
~ is the linear vector field L(x,
(x1 , y1 ), (x2 , y2 ), and (x3 , y3 ) (oriented counter clockwise). If L ~ y) =
(ax + by)~ı + (cx + dy)~, then
I
L ~ = (c − b) · ( area of the triangle )
~ · dr (4.8)
C
Exercise 4.3.6 Prove the lemma (4.3.3) via direct calculation analogous to that for lemma
(4.3.2)
Exercise 4.3.7 Using linearity rewrite the vector field L ~ as L~ = L
~ 0 + ∆L
~ where L
~ 0 (x, y) =
~ 0 , y0 ) for all (x, y) and ∆L(x,
L(x ~ ~ − x0 , y − y0 ).
y) = L(x
H
~ ~
(i) Explain why C L0 · dr = 0 and hence C L
H
~ = H ∆L
~ · dr ~
~ · dr.
C
−→
(ii) Evaluate ∆L at the midpoints of the four edges of the rectangle. Use the midpoint rule to
H
evaluate C ∆L ~
~ · dr.
−→
~
(iii) Alternatively, observe that ∆L (x0 + ∆x, y0 + ∆y) = L(∆x, ∆y), and hence the line integral
H
~ ~
∆L · dr equals
H
~ ~
L · dr where C0 is the curve C translated back to the origin (i.e. each point
C C0
(x, y) is moved to the point (x − x0 , y − y0 )). Thus it suffices to consider rectangles centered at
the origin! Repeat the calculations for the case of (x0 , y0 ) = (0, 0).
(iv) Compare these approaches with the calculations above. Which one illustrates best what is
going on?
(v) Go even a step further. Using linearity to combine some of the terms occurring in the
~
calculation. E.g., simplify L(∆x, ~
0) − L(−∆x, ~ 0), and eventually arrive at
0) = 2∆xL(1,
I
L ~ = L(1,
~ · dr ~ 0) · ~ − L(0,
~ 1) ·~ı · ( area of the region R) . (4.9)
C
Note that this formulation does not make any explicit reference to the coefficients of the vector
field when written out in rectangular coordinates. This very geometric point of view is most
useful when working with polar or other curvilinear coordinates.
So far we have proven that the conjecture holds true for (linear vector fields integrated over)
rectangles that are aligned with the coordinate axes, and, in the exercises, for any triangles.
The next step is to establish that the conjecture is true for (linear vector fields integrated over)
arbitrary polygonally bounded regions.
It is intuitively clear, but takes a little work to
prove rigorously that every region in the plane
that is bounded by a polygonal curve, i.e. made
up of line segments that do not intersect, can be
triangulated. This means that the region can be
decomposed into a union of a finite number of
triangles, any two of which have either no points
in common, one corner in common, or one edge
in common.
Consider a polygonal curve C that encloses a region R as in the illustration. Suppose that the
region is decomposed into a union of triangles (and/or rectangles) labeled R1 , R2 , . . . Rn (which
only meet on their corners or along their edges). Denote the boundaries of the regions Rk by
Ck = ∂Rk , respectively. It is important that all these curves are oriented in a compatible way,
i.e. all are oriented counterclockwise.
The area of the entire region R is just the sum
of the areas of the small regions Rk . More excit-
H
ing is that the sum of the line integrals Ck L ~
~ · dr
over all the edges of the small regions adds up
H
to just the line integral C L ~ over the outside
~ · dr
curve: The integrals over all the interior edges
cancel as each is traversed once in either direc-
tion. Formally this reads:
I n I
~ =
~ · dr ~
~ · dr
X
L L
C k=1 Ck
Xn
= (c − b) · ( area of Rk )
k=1
n
X
= (c − b) · ( area of Rk )
k=1
= (c − b) · ( area of R)
In words: For line integrals of linear vector fields over any region that is bounded by piecewise
smooth curves: The ratio of the line integral divided by the area of the enclosed region is a
constant. This constant is independent of the location, the shape, and the size of the curve.
The constant may be considered as an analogue of the slope of a straight line. As such it will
be the precursor for a geometric definition of the scalar curl, one derivative of vector fields.
This proposition also reaffirms the curl test of the previous section in the special case of linear
vector fields.
~
Corollary 4.3.6 A linear vector field L(x, y) = (ax + by)~ı + (cx + dy)~ is conservative if and
only if b = c.
In words: For line integrals of linear vector fields over any region that is bounded by piecewise
smooth curves: The ratio of the line integral divided by the area of the enclosed region is a
constant. This constant is independent of the location, the shape, and the size of the curve.
The constant may be considered as an analogue of the slope of a straight line. As such it will
be the precursor for a geometric definition of the scalar curl, one derivative of vector fields.
Exercise 4.3.12 In analogy to exercise 4.3.3, verify that the trapezoidal and midpoint rules
are exact for flux line integrals of linear vector fields L~ over any line segment C from (x1 , y1 )
to (x2 , y2 ). I.e. verify (i) by direct calculation using a parameterization, (ii) pictorially as in
exercise 4.3.3, or (iii) directly from the definition of linearity.
~
Lemma 4.3.8 The flux line integral of the linear vector field L(x, y) = (ax + by)~ı + (cx + dy)~
over the edges of the rectangle with corners (x0 ± ∆x, y0 ± ∆y) equals
I
~ ·N
L ~ ds = (a + d) · ∆ ∆y (4.13)
C
Exercise 4.3.13 Prove lemma (4.3.8), following the outline of the calculation in the proof of
lemma (4.3.2) and exercise (4.3.4).
Exercise 4.3.14 Carry out an analogous calculation for the case when the rectangle is replaced
by a triangle with two sides aligned with the coordinate axes. E.g. choose the corners to be
(x0 , y0 ), (x0 + ∆x, y0 ), and (x0 , y0 + ∆y).
Lemma 4.3.9 Suppose C is the curve consisting of the edges of the triangle with corners at
~ is the linear vector field L(x,
(x1 , y1 ), (x2 , y2 ), and (x3 , y3 ) (oriented counter clockwise). If L ~ y) =
(ax + by)~ı + (cx + dy)~, then
I
~ ·N
L ~ ds = (a + d) · ( area of the triangle ) (4.14)
C
Exercise 4.3.15 Prove the lemma (4.3.9) via direct calculation analogous to that for lemma
(4.3.8)
Exercise 4.3.16 In analogy to exercise (4.3.7) use linearity rewrite the vector field L ~ as L
~ =
~ 0 + ∆L
L ~ where L~ 0 (x, y) = L(x
~ 0 , y0 ) for all (x, y) and ∆L(x,
~ ~ − x0 , y − y0 ).
y) = L(x
~ ~ ~ ~ ~ ~ ds.
H H H
(i) Explain why L0 · N ds = 0 and hence
C L · N ds =
C ∆L · N
C
−→
(ii) Evaluate ∆L at the midpoints of the four edges of the rectangle. Use the midpoint rule to
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 39
~ ·N
~ ds.
H
evaluate C ∆L
−→
(iii) Alternatively, observe that ∆L (x0 + ∆x, y0 + ∆y) = L(∆x,~ ∆y), and hence the flux line
~ ~ ~ ~
H H
integral C ∆L· N ds equals C0 L· N ds where C0 is the curve C translated back to the origin (i.e.
each point (x, y) is moved to the point (x − x0 , y − y0 )). Thus it suffices to consider rectangles
centered at the origin! Repeat the calculations for the case of (x0 , y0 ) = (0, 0).
(iv) Compare these approaches with the calculations in the two preceding exercises. Which one
illustrates best what is going on?
(v) Go even a step further. Using linearity to combine some of the terms occurring in the
~
calculation. E.g., simplify L(∆x, ~
0) − L(−∆x, 0) = 2∆xL(1,~ 0), and eventually arrive at
I
~ ·N
L ~ ds = ~ı · L(1,
~ 0) + ~ · L(0,
~ 1) · ( area of R) . (4.15)
C
Exercise 4.3.17 Similar to exercise (4.3.17) consider a region R that lies between two polygonal
curves C1 and C2 , oriented as shown in exercise (4.3.17). The region always lies to the left as
one moves along the curve, and thus the normal vector always points outward from the region.
Note that for the inner curve this means that the outward normal points into the hole. By
H
carefully going through the steps of prior arguments, show that it is still true that C L ~ =
~ · dr
(c−b) ( area of R) if C now denotes the generalized curve consisting of the two connected pieces
C1 and C2 .
Exercise 4.3.18 Develop a formula for the area of a region that is bounded by a polygonal
curve. The formula should take as input the list (x1 , y1 ), . . . , (xn , yn ) of coordinates of the vertices
(corners). Advice: Pick a suitable linear vector field L ~ – there are many possibilities – so that
~ ~
H
C L · dr = 1 · ( area of R). Utilize the midpoint or trapezoidal rule, each of which is gives the
exact value for linear vector fields.
In words: For flux line integrals of linear vector fields over any region that is bounded by
piecewise smooth curves: The ratio of the line integral divided by the area of the enclosed
region is a constant. This constant is independent of the location, the shape, and the size of the
curve. The constant may be considered as an analogue of the slope of a straight line. As such
it will be the precursor for a geometric definition of the divergence, another derivative of vector
fields.
40
In order to see the derivative via zooming it is important that the zoom
rates in domain and range are equal. In the case of a graph of a function
f (x) in single-variable calculus this means that the horizontal and vertical
magnification rates must be equal. A typical field on a small scale
In the case of a vector field the analogues of horizontal and vertical need
clarification. The horizontal axis represents the domain, and is to be re- The drift, a constant field
placed by the xy-plane. The vertical axis represents the range – which
is now represented by arrows, instead of heights (y-coordinates of points Compare the field and drift
on the graph). We need to zoom at equal rates in the domain, here the
xy-plane in the background, and in the range, here represented by arrows. Subtract the drift from the field
One key observation is that when zooming for the derivative of f at a point
(a, f (a) it is impossible to keep the “x-axis” on the screen! The vector field Viewing only the local changes
analogue means that it should be impossible to keep the feet of the arrows
in view. We need to zoom in on the arrow heads.
A simple way to obtain the desired pictures is to consider differences, anal-
ogous to the numerator f (a + ∆x) − f (a) in the difference quotient in
single variable calculus. Fix a point p = (x0 , y0 ) where we want to zoom
for the derivative of the vector field. Then compare the values F~ (x, y) =
F~ (x0 + ∆x, y0 + ∆y) of the vector field at points (x, y) = (x0 + ∆x, y0 + ∆y)
near p to the value F~ (x0 , y0 at p. Define a constant vector field F0 by
F~ 0 (x, y) = F~ (x0 , y0 ), and compare its values with those of the original
field. Practically consider, and plot, the vector field
−→
∆F (x, y) = F~ (x, y) − F~ (x0 , y0 ) (4.17)
[[Aside ([Move to appendix, include the picture!]): In the case of curved spaces it
is not at all clear what one means by a constant vector field. Indeed, this question
immediately leads to challenging questions in differential geometry, which have
important consequences. For example according to Einstein’s theory of general
relativity the universe in which we live is curved according to the mass-density.
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 41
Exercise 4.4.1 Use the JAVA vector field analyzer, or any other graphing tool to plot the
−→
vector field ∆F (x, y) for (x, y) very close to (x0 , y0 ) for the vector fields vecF and points
p = (x0 , y0 ) listed below. In each case plot the field on smaller and smaller scales (i.e. for
(x, y) = (x0 + ∆x, y0 + ∆y) closer and closer to (x0 , y0 ). In each case write down the window
size, i.e. basically the magnification, when the plotted vector fields stabilize, do not change any
more upon further magnification.
Note that most any plotting program will automatically scale the arrows so that they are of
maximal length, but do not overlap. This basically automatically takes care of the analogue of
the scaling factor h1 in the first year calculus difference quotient h1 (f (a + h) − f (a)).
Due to the limited, very small, number of pixels available to draw each arrow, it takes surprisingly
little magnification for the zooming to stabilize: Pixel for pixel the magnified plots of the vector
−→
field ∆F show the limit . . . In each case you should have observed that the final plot shows a
−→
linear vector field, a candidate suitable to be considered the derivative DF of the vector field F~
at the point p.
Before formalizing this notion, it is worthwhile to take a brief look at some numerical tables.
−→
The following table lists the values for the vector field ∆F (x, y) = F~ (x, y) − F~ (x0 , y0 ) when
F~ (x, y) = sin x · cos y~ı + (sin x + cos y) ~ near the point (x0 , y0 ) = (2, 1).
−→
100 ∆F x = 1.98 x = 1.99 x = 2.00 x = 2.01 x = 2.02
y = 1.02 −1.114~ı − 0.8796~ −1.325~ı − 1.282~ −1.540~ı − 1.694~ −1.760~ı − 2.114~ −1.985~ı − 2.544~
y = 1.01 −0.335~ı − 0.030~ −0.549~ı − 0.433~ −0.768~ı − 0.844~ −0.991~ı − 1.265~ −1.220~ı − 1.695~
y = 1.00 0.440~ı + 0.814~ 0.222~ı + 0.412~ 0.000~ı + 0.000~ −0.227~ı − 0.421~ −0.459~ı − 0.850~
y = 0.99 1.209~ı + 1.653~ 0.989~ı + 1.250~ 0.763~ı + 0.839~ 0.532~ı + 0.418~ 0.296~ı − 0.012~
y = 0.98 1.974~ı + 2.486~ 1.750~ı + 2.084~ 1.520~ı + 1.672~ 1.286~ı + 1.251~ 1.047~ı + 0.822~
−→
The table shows that ∆F (x, y) while not linear is very close to linear. For example,
−→ −→
∆F (2.02, 1.00) = −0.459~ı − 0.850 ~ ≈ 2 · (−0.227~ı − 0.421 ~) = 2· ∆F (2.01, 1.00), and
−→
∆F (1.99, 1.02) = −1.325~ı − 1.282 ~ ≈ (0.222~ı + 0.412 ~) + (−1.540~ı − 1.694 ~)
−→ −→
= ∆F (1.99, 1.00) + ∆F (2.00, 1.02).
(4.18)
42
These differences may be considered as numerical approximations of the values of the partial
derivatives of the components ~ı · F~ (x, y) = F1 (x, y) = sin x · cos y and ~ · F~ (x, y) = F2 (x, y) sin x +
cos y at (x0 , y0 ). The following table shows the approximate values obtained from the table, and
the decimal approximations obtained by symbolically differentiating the components of F and
evaluating at (x0 , y0 )
~ 0) !
a = ~ı · L(1, = −0.227 ≈ 1
0.01 ~ı · (F~ (x0 + 0.01, y0 ) − F~ (x0 , y0 )) ≈ ∂ ~
∂x ~ı · F (x0 , y0 ) ≈ −0.2248
~ 0) !
b = ~ı · L(1, = −0.786 ≈ 1
0.01 ~ı · (F~ (x0 , y0 + 0.01) − F~ (x0 , y0 )) ≈ ∂ ~
∂y ~ı · F (x0 , y0 ) ≈ −0.4161
~ 0) =! −0.421 ≈
c = ~ · L(1, 1
· (F~ (x0 + 0.01, y0 ) − F~ (x0 , y0 )) ≈ ∂
· F~ (x0 , y0 ) ≈ −0.7652
0.01 ~ ∂x ~
~ 0) =! −0.844 ≈
d = ~ · L(1, 1
· (F~ (x0 , y0 + 0.01) − F~ (x0 , y0 )) ≈ ∂
· F~ (x0 , y0 ) ≈ −0.8415
0.01 ~ ∂y ~
(4.23)
Exercise 4.4.2 Prepare a similar table of values for smaller values of (∆x, ∆y), e.g. for x =
1.998, 1.999, 2.000, 2.001, 2.002 and y = 0.998, 0.999, 1.000, 1.001, 1.002. Again check at a few
values how close to linear the table is. Use the table to read off better numerical estimates for
the partial derivatives.
Exercise 4.4.3 Prepare a similar tables of values for the vector field M ~ (x, y) = (−y~ı+x~)/(x2 +
y 2 ). Again check at a few values how close to linear the table is. Use the table to read off better
numerical estimates for the partial derivatives. In particular, what are the values of (c − b) and
(a + d) in your approximation, and what is the significance of these values?
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 43
~
Exercise 4.4.4 Repeat the previous exercise for G(x, y) = (−x~ı − y~)/(x2 + y 2 ).
~
Exercise 4.4.5 Prepare a similar tables of values for the vector field L(x, y) = (2x − 3y),~ı +
(3x + y)~ı, p = (2, 1). Again check at a few values how close to linear the table is. Use the table
to read off numerical estimates for the partial derivatives. Explain how your findings relate to
the statement that the derivative of a linear function is the same linear function.
It is time to formally define differentiability and derivatives of a vector field. The intuitive, and
fundamental, idea is that a vector field (just like any other function) is differentiable at a point,
if after sufficient magnification it is practically indistinguishable from a linear field (function)
(and remains upon further magnification). In first year calculus the slope of the graph of that
linear function is called the derivative. More precisely, we shall call the linear function itself the
derivative.
It is helpful to return briefly to first year calculus. Basically, instead of calling the number
m = f 0 (x0 ) the derivative of f at x0 , we now consider the linear function f 0 = fx0 0 : dx 7→ m dx the
derivative. Other common, suggestive but less precise, notation for this function is ∆y = m ∆x.
In the end, the key is to remember that any derivative f 0 is a function that takes two inputs, the
point x0 , and the increment ∆x or dx. First year calculus commonly focuses on how f 0 changes
at different points x0 . Sometimes to the extent that the main idea is almost lost . . . : For fixed
x0 , as a function of the increment dx, the derivative is a linear function.
Recall that not every function has a derivative. For example, no matter how much one zooms
in on the graph of f (x) = |x| at x0 = 0, it never gets close to a straight line. Thus one says that
f (x) = |x| is not differentiable at x0 = 0.
Definition 4.4.1 A vector field F~ in the plane is called differentiable at a point p = (x0 , y0 ) if
~ such that
there exists a linear vector field L
The divergence ~ ~
of the vector field F at the point (x0 , y0 ) is defined to be the trace of L, i.e. the
~ 0) − ~ · L(0,
quantity ~ı · L(1, ~ 1) (i.e. (a + d) in our usual notation).
The scalar curl of the vectorfield F~ at the point (x0, y0 ) is defined to be twice the skew symmetric
~ i.e. the quantity L(1,
part of L, ~ 0) · ~ − L(0,
~ 1) ·~ı (i.e. (c − b) in our usual notation).
The divergence ~ ~
of the vector field F at the point (x0 , y0 , z0 ) is defined to be the trace of L, i.e.
~ ~ ~ ~
the quantity ~ı · L(1, 0, 0) + ~ · L(0, 1, 0) + k · L(0, 0, 1) .
The curl of the vector field F~ at the point (x0 , y0 , z0 ) is defined to be twice the skew symmetric
~ i.e. the vector
part of L,
~k · L(0,
~ 1, 0) − ~ · L(0, ~ 0, 0) − ~k · L(0,
~ 0, 1) ~ı + ~ı · L(1, ~ 0, 1) ~ + ~ · L(1, ~ 1, 0) ~k.
~ 0, 0) −~ı · L(0,
(4.26)
44
The numerical explorations suggest that the coefficients of the derivative(s) of the vector field
F~ can be easily obtained as partial derivatives (evaluated at (x0 , y0 )).
−→
For most practical purposes one obtains L ~ =DF via this simple partial differentiation. Nonethe-
less, there exist vector fields F , e.g. defined by F~F (x, y) = xy~ı/(x2 + y 2 ) if (x, y) 6= (0, 0) and
~
F~F (0, 0) = 0 for which the partial derivatives are defined everywhere, but the vector field is not
differentiable.
Exercise 4.4.6 Plot the vector field F~ of the preceding comment. Zoom in on it at the point
(0, 0) and verify that it does not converge to a linear field at this point. Then use the definition
of partial derivatives, e.g. ∂f 1
∂x (0, 0) = limh→0 h (f (h, 0) − f (0, 0)), to verify that the Jacobian
matrix of partial derivatives JF exists at (0, 0), and is equal to the zero matrix.
continuity
no proof or prop or of sufficiency
emphaisize linear approx as central idea
If the flux/trace case case was dealt with in previous section, include div in the same box/defn.
this might be the most effective way.)
Exercise: When zooming of the first kind at any point in a linear field what does one see? In
particular, are linear fields differentiable? What is their scalar curl? How does the scalar curl
change from point to point in a linear field. Is zooming required at all? 0mm
Several more exercises, mostly JAVAscope related. E.g. find a vector field that has positive
scalar curl in the right half plane, and negative scalar curl in the left half plane.
Since the scalar curl (c − b) of a linear field naturally appeared when analyzing line integrals, it
is natural to ask how the scalar curl of differentiable nonlinear fields is related to line integrals.
Class exercise: Pick a typical nonlinear vector field, say F~ (x, y) = (cos x + sin y))~ı +
(cos y sin y) ~. Split the class into three groups. Each group picks a different base point in
the plane, such as (x0 , y0 ) = (4, 7). Each student picks, or is assigned one or several different
contours (just as in the class exercise in the section on linear fields, e.g. rectangles, squares,
various triangles, circles, half circles, etc.) that is based at her/his base point.
Starting with a diameter (or side-length) a = 1 set up and evaluate the line integral of the field
over the contour. Tabulate the results - keep many digits, don’t truncate!
Classnotes for Introduction to Vector Calculus. Matthias Kawski. April 4, 2001 45
Just as the slopes of line segments connecting two points on the graph of a function of a single
variable usually are different depending on the location of the points, and their distance from
each other, here, too, we obtain ratios that change with the location and the size of the contours.
However, as the contours become increasingly smaller the ratios approach a limit. The limit is
the same for any shape at the same base point, but it varies from point to point. This is in
complete analogy to single variable calculus where the difference quotients approach well-defined
limits at each point, bit generally different limits at different points. Here, there is the added
feature of also being able to consider the shapes. The explanation that the shapes do not matter
is easy: On a sufficiently small scale, the differentiable field is practically linear, and we proved
in the preceding section that for linear fields the ratio of the line integral divided by the area is
independent of the shape of the curves considered!
It remains to obtain formulas for the scalar curl in the usual coordinate systems. In rectangular
coordinates we simply consider the Taylor approximations of the ~ı and ~ components of the
differentiable vector field at the point of interest: (refer back to prior section or appendix).
Partition the region as before. But this time don’t worry about shapes at all. On the other
hand mae the regions all sufficiently small.
approximate identities
Main issue: Where to put the error estimates, in the preceding section – we already have o(k∆rk)
or into this section. maybe here put the epsilons for the Riemann sums if any at all
on small scale linear, work with approximate equal
make this an extremely short and elegant section. all the work was done in the linear case, and
when zooming, maing the right definition. this should be a freebie
Conjecture: For.
control examples
back to the magnetic field and induced voltage in time varying magnetic field? or moving loop.