Lecture03 PDF
Lecture03 PDF
Lecture03 PDF
1 Algorithmic Techniques
One of the things that students in the previous semesters found most difficult in this class was coming
up with their work algorithms. One difficulty with this is that one does not know where to start.
Question 1.1. Have you designed algorithms before. If so what approaches have you found
helpful?
Given an algorithmic problem, where do you even start? It turns out that most of the algorithms
follow several well-known techniques. For example, when solving the shortest superstring (SS)
problem we already mentioned three techniques: brute force, reducing one problem to another,
and the greedy approach. In this lecture, we will go over these techniques, which are key to both
sequential and parallel algorithms, and focus on one of them, divide and conquer, which turns out to
be particularly useful for parallel algorithm design. We will also talk about asymptotic cost analysis
and how to analyze divide-and-conquer algorithms.
Brute Force: The brute force approach involves trying all possible solutions and picking the best or
one of the satisfying solution (typically the first encountered). In the SS problem, for example, we
argued that every solution has to correspond to a permutation of the inputs with overlaps removed.
The brute force approach therefore tried all permutations and picked the best. In some problems,
it suffices to pick a solution; in such cases, it is not necessary to consider all possible solutions, a
brute-force algorithm can return the first encountered. Since there are n! permutations, this solution
is not “tractable” for large problems. In many other problems there are only polynomially many
possibilities. For example in your current assignment there should be only O(n2 ) possibilities to try.
However, even O(n2 ) is not good, since as you will work out in the assignment there are solutions
that require only O(n) work.
†Lecture notes by Umut A. Acar, Guy E Blelloch, Margaret Reid-Miller, and Kanat Tangwongsan.
1 Version 2.0
Parallel and Sequential Data Structures and Algorithms — Lecture 3 15-210 (Fall 2013)
Question 1.3. Given that they are often inefficient, why do we care about brute-force algo-
rithms?
One place the brute force approach can be very useful is when writing a test routine to check
the correctness of more efficient algorithms. Even if inefficient for large n the brute force approach
could work well for testing small inputs. The brute force approach is usually the simplest solution to
a problem, but not always.
Question 1.4. When trying to apply the brute-force technique to solve a problem, where
would you start first?
Since brute-force requires enumerating all possible results and checking them, it is often helpful
to start by identifying the structure of the expected results (all strings, all natural numbers between 0
and n) and a way to assign values to them so that you can pick the best.
Reducing to another problem: Sometimes the easiest approach to solving a problem is just reduce
it to another problem for which known algorithms exist.
Question 1.5. You all know about the problem of finding the minimum number in a set. Can
you reduce the problem of finding the maximum to this problem?
Question 1.6. Can you think of an exercise for becoming better in using this technique?
When you get more experienced with algorithms you will start recognizing similarities between
problems that on the surface seem very different. But one exercise that you might find helpful would
be to formulate new algorithmic problems, as this can help in understanding how seemingly different
problems may relate.
Inductive techniques: The idea behind inductive techniques is to solve one or more smaller
instances of the same problem, typically referred to as subproblems, to solve the original problem.
The technique most often uses recursion to solve the subproblems. Common techniques that fall in
this category include:
• Divide and conquer. Divide the problem on size n into k > 1 independent subproblems on
sizes n1 , n2 , . . . nk , solve the problem recursively on each, and combine the solutions to get the
solution to the original problem. It is important than the subproblems are independent; this
2 Version 2.0
Parallel and Sequential Data Structures and Algorithms — Lecture 3 15-210 (Fall 2013)
f(n11 )
Divide Combine
f(n12 )
f(n13 )
Contract Combine
f(n11 ) f(n12 ) f(n13 )
makes the approach amenable to parallelism because independent problems can be solved in
parallel.
• Greedy. For a problem on size n use some greedy approach to pull out one element leaving a
problem of size n − 1. Solve the smaller problem.
• Dynamic Programming. Like divide and conquer, dynamic programming divides the problem
into smaller subproblems and then combines the solutions to the subproblems to find a solution
to the original problem. The difference, though, is that the solutions to subproblems are
reused multiple times. It is therefore important to store the solutions for reuse either using
memoization or by building up a table of solutions.
Randomization is also crucial in parallel algorithm design as it helps un “break” the symmetry in
a problem without communication.
3 Version 2.0
Parallel and Sequential Data Structures and Algorithms — Lecture 3 15-210 (Fall 2013)
Question 1.8. Have you heard of the term “symmetry breaking”? Can you think of a real-life
phenomena that is somewhat amusing and can even be uncomfortable where we engage in
randomized symmetry breaking?
We often perform randomized symmetry breaking when walking in a crowded sidewalk with lots
of people coming in our direction. With each person we encounter, we may pick a random direction
to turn to and other person responds (or sometimes we do). If we recognize each other late, we
may end up in a situation where the randomness becomes more apparent, as we attempt to get past
each other’s way but make the same (really opposite) decisions. Since we make essentially random
decisions, the symmetry is eventually broken.
We will cover a couple of examples of randomized algorithms in this course. Formal cost analysis
for many randomized algorithms, however, can (but not always) require knowledge of probability
theory beyond the level of this course.
What is a good solution? When you encounter an algorithmic problem, you can look into your
bag of techniques and, with practice, you will find a good solutions to the problem. When we say a
good solution we mean:
2. Low cost: Out of the correct solutions, we would prefer the one with the lowest cost.
Ideally you should prove the correctness and efficiency of your algorithm. For example, in exams,
we might ask you do this. Even in real life, we would highly recommend making a habit of this as
well. It is often too easy to convince yourself that a poor algorithm is correct and efficient.
Question 1.10. How can you prove correct an algorithm designed by using an inductive
technique? Can you think of a proof technique that might be useful?
Algorithms designed with inductive techniques can be proved correct using (strong) induction.
Question 1.11. How can you analyze an algorithm designed by using an inductive technique?
Can you think of a technique that might be useful?
We can often express the complexity of inductive algorithms with recursions and solve them to
obtain a closed-form solution.
4 Version 2.0
Parallel and Sequential Data Structures and Algorithms — Lecture 3 15-210 (Fall 2013)
Divide and conquer is a highly versatile technique that generally lends itself very well to parallel
algorithms design. You probably have seen the divide-and-conquer technique many times before
this course. But it is such an important technique that it is worth seeing it over and over again. It is
particularly suited for “thinking parallel” because it offers a natural way of creating parallel tasks.
The structure of divide and conquer. The structure of a divide-and-conquer algorithm follows the
structure of a proof by (strong) induction, which makes it easy to show correctness. Often it also
makes it easy to determine costs bounds since we can write recurrences that match the inductive
structure. The general form of divide-and-conquer looks as follows:
— Base Case: When the problem is sufficiently small, we return the trivial answer directly or
resort to a different, usually simpler, algorithm, which works great on small instances.
— Inductive Step:
1. First, the algorithm divides the current instance I into independent parts, commonly
referred to as subproblems, each smaller than the original problem.
2. It then recurses on each of the parts to obtain answers for the parts. In proofs, this is
where we assume inductively that the answers for these parts are correct.
3. It then combines the answers to produce an answer for the original instance I, and in the
reasoning about correctness and proof we have to show that this combination gives the
correct answer.
Question 2.1. Can you identify the base case and the inductive step for the quicksort algorithm
working on a set of numbers?
The base case is when the set being sorted contains only one element. The inductive case
partitions the set into two sets by picking a random pivot. This partition is the divide step. The
algorithm then recurses on the two subproblems and combines the solution by appending them. In
quicksort combine is easy and cheap. Most of the work is done in the divide step.
Question 2.2. Can you think of different base cases for quicksort that may lead to lower cost?
It turns out asymptotically inefficient algorithms such as insertion sort have lower constant factors
and may be more efficient in practice when solving small problem instances (typically less than 100).
We can thus improve the efficiency of quicksort by picking a larger base case and reverting to a
different algorithm in the base case.
Question 2.3. Can you identify the base case and the inductive step for the mergesort
algorithm working on a set of numbers? How does it differ from quicksort?
5 Version 2.0
Parallel and Sequential Data Structures and Algorithms — Lecture 3 15-210 (Fall 2013)
In the base case, we have two or fewer elements in the set. In the inductive case, we split the
set into two equal halves (divide step), solve them recursively, and merge the results (combine
step). Mergesort is different from quicksort because the divide step in trivial but combine is more
involved—that is where most of the work is done.
As we will see later, we can express divide and conquer algorithms with trivial divide steps with
reduce by using higher-order functions.
Exercise 1. Can you think of different base cases for mergesort that may lead to lower cost?
Will it be the same as that for quicksort?
Strengthening. In most divide-and-conquer algorithms you have encountered so far, the subprob-
lems are occurrences of the problem you are solving (e.g. recursive sorting in merge sort). This is
not always the case. Often, you’ll need more information from the subproblems to properly combine
results of the subproblems. In this case, you’ll need to strengthen the problem definition, much in the
same way that you strengthen an inductive hypothesis when doing an inductive proof. Strengthening
involves defining a problem that solves more than what you ultimately need, but makes it easier or
even possible to use solutions of subproblems to solve the larger problem.
Question 2.4. You have recently seen an instance of strengthening when solving a problem
with divide and conquer. Can you think of the problem and how you used strengthening?
In the recitation you looked at how to solve the Parenthesis Matching problem by defining a
version of the problem that returns the number of unmatched parentheses on the right and left. This
is a stronger problem than the original, since the original is the special case when both these values
are zero (no unmatched right or left parentheses). Not only does the modified version tells us more
than we, it was necessary to make divide-and-conquer work. In fact, if we do not strengthen the
problem, we will not be able to combine the results from the two recursive calls (which tell you
only whether two halves are mathed or not) to conclude that the full string is matched, because for
example there can be an unmatched open paranthesis on one side that matches a close parenthesis
on the other.
Work and Span. Since the subproblems can be solved independently (by assumption), the work
and span of such an algorithm can be described using simple recurrences. In particular for a problem
of size n is broken into k subproblems of size n1 , . . . , nk , then the work is
k
X
W (n) = Wdivide (n) + W (ni ) + Wcombine (n) + 1
i=1
k
S(n) = Sdivide (n) + max S(ni ) + Scombine (n) + 1
i=1
6 Version 2.0
Parallel and Sequential Data Structures and Algorithms — Lecture 3 15-210 (Fall 2013)
Note that the work recurrence is simply adding up the work across all components. More
interesting is the span recurrence. First, note that a divide and conquer algorithm has to split a
problem instance into subproblems before these subproblems are recursively solved. We therefore
have to add the span for the divide step. The algorithm can then execute all the subproblems in
parallel. We therefore take the maximum of the span for these subproblems. Finally after all the
problems complete we can combine the results. We therefore have to add in the span for the combine
step.
Applying this formula often results in familiar recurrences such as W (n) = 2W (n/2) + O(n). In
the rest of this lecture, we’ll get to see other recurrences.
3 Asymptotic Complexity
What is O( f (n))? Precisely, O( f (n)) is the set of all functions that asymptotically dominated by the
function f (n). Intuitively this means that these are functions that grow at the same or slower rate
than f (n) as n goes to infinity. We write g(n) ∈ O( f (n)) to refer to a function g(n) that is in the set
O( f (n)). We often use the abusive notation g(n) = O( f (n)) as well.
In an expression such as 4W (n/2) + O(n), the O(n) refers to some function g(n) ∈ O(n).
For a function g(n), we say that g(n) ∈ O( f (n)) if there exist positive constants n0 and c such
that for all n ≥ n0 , we have g(n) ≤ c · f (n).
If g(n) is a finite function (g(n) in finite for all n), then it follows that there exist constants k1 and
k2 such that for all n ≥ 1,
g(n) ≤ k1 · f (n) + k2 ,
Pn0
where, for example, we can take k1 = c and k2 = i=1 |g(i)|.
Remark 3.2. Make sure to become very comfortable with asymptotic analysis. Also its
different versions such as the Θ(·) and Ω(·).
Exercise 2. Can you illustrate graphically when g(n) ∈ O( f (n))? Show different examples,
to hone your understanding.
Using the recursion W (n) = 2W (n/2) + O(n), we will review the tree method, which you have seen
in 15-122 and 15-251. Our goal is to derive a closed form solution to this recursion.
By the definition of asymptotic complexity, we can establish that
W (n) ≤ 2W (n/2) + k1 · n + k2 ,
7 Version 2.0
Parallel and Sequential Data Structures and Algorithms — Lecture 3 15-210 (Fall 2013)
k1 n + k2 k1 n + k2
k1 (n/2) + k2 k1 (n/2) + k2 k1 n + 2 k2
To apply the tree method, there are some key questions we should ask ourselves to aid drawing
out the recursion tree and to understand the cost associated with the nodes:
Our answers to these questions lead to the following analysis: We know that level i (the root is
level i = 0) contains 2i nodes, each costing at most k1 (n/2i ) + k2 . Thus, the total cost in level i is at
most
n
2i · k1 i + k2 = k1 · n + 2i · k2 .
2
Since we keep halving the input size, the number of levels is bounded by 1 + log n. Hence, we
have
log
Xn
W (n) ≤ k1 · n + 2i · k2
i=0
n n
= k1 n(1 + log n) + k2 (n + 2
+ 4
+ · · · + 1)
≤ k1 n(1 + log n) + 2k2 n
∈ O(n log n),
where in the second to last step, we apply the fact that for a > 1,
a n+1 − 1
1 + a + · · · + an = ≤ a n+1 .
a−1
8 Version 2.0
Parallel and Sequential Data Structures and Algorithms — Lecture 3 15-210 (Fall 2013)
The tree method involves determining the depth of the tree, computing the cost at each level, and
summing the cost across the levels. Usually we can easily figure out the depth of the tree and the cost
of at each level relatively easily—but then, the hard part is taming the sum to get to the final answer.
It turns out that there is a special case in which the analysis becomes simpler. This is whet the
costs at each level grow geometrically, shrink geometrically, or stay approximately equal.
By recognizing whether the recurrence conforms with one of these cases, we can almost immedi-
ately determine the asymptotic complexity of that recurrence.
The vital piece of information is the ratio of the cost between adjacent levels. Let costi denote
the total cost at level i when we draw the recursion tree. This puts recurrences into three broad
categories:
You might have seen the “master method” for solving recurrences in previous classes. We do not
like to use it since it only works for special cases and does not give an intuition of what is going on.
However, we will note that the three cases of the master method correspond to special cases of leaves
dominated, balanced, and root dominated.
W (n) ≤ 2W (n/2) + k1 · n + k2 ,
9 Version 2.0
Parallel and Sequential Data Structures and Algorithms — Lecture 3 15-210 (Fall 2013)
to guess the answer first and check it. This is often called the “substitution method.” Since this
technique relies on guessing an answer, you can sometimes fool yourself by giving a false proof. The
following are some tips:
1. Spell out the constants. Do not use big-O—we need to be precise about constants, so big-O
makes it super easy to fool ourselves.
Let’s now redo the recurrences above using this method. Specifically, we’ll prove the following
theorem using (strong) induction on n.
Theorem 5.1. Let a constant k > 0 be given. If W (n) ≤ 2W (n/2) + k · n for n > 1 and W (1) ≤ k for
n ≤ 1, then we can find constants κ1 and κ2 such that
W (n) ≤ κ1 · n log n + κ2 .
Proof. Let κ1 = 2k and κ2 = k. For the base case (n = 1), we check that W (1) = k ≤ κ2 . For the
inductive step (n > 1), we assume that
W (n/2) ≤ κ1 · 2n log( 2n ) + κ2 ,
And we’ll show that W (n) ≤ κ1 · n log n + κ2 . To show this, we substitute an upper bound for W (n/2)
from our assumption into the recurrence, yielding
W (n) ≤ 2W (n/2) + k · n
≤ 2(κ1 · 2n log( 2n ) + κ2 ) + k · n
= κ1 n(log n − 1) + 2κ2 + k · n
= κ1 n log n + κ2 + (k · n + κ2 − κ1 · n)
≤ κ1 n log n + κ2 ,
10 Version 2.0