Algorithm 6

Advanced Core in Algorithm Design #6
算法設計要論第 6 回
Yasushi Kawase
河瀬康志
Nov. 8th, 2022

last update: 12:46pm, November 8, 2022
Yasushi Kawase Advanced Core in Algorithm Design 1 / 24

Schedule
Lec. # Date Topics

1 10/4 Introduction, Stable matching
2 10/11 Basics of Algorithm Analysis, Greedy Algorithms (1/2)
3 10/18 Greedy Algorithms (2/2)
4 10/25 Divide and Conquer (1/2)
5 11/1 Divide and Conquer (2/2)
6 11/8 Dynamic Programming (1/2)
7 11/15 Dynamic Programming (2/2)
— 11/22 Thursday Classes
8 11/29 Network Flow (1/2)
9 12/6 Network Flow (2/2)
10 12/13 NP and Computational Intractability
11 12/20 Approximation Algorithms (1/2)
12 12/27 Approximation Algorithms (2/2)
13 1/10 Randomized Algorithms

Algorithmic Paradigms
Divide-and-Conquer
• Divide up problem into several independent problems
• Solve each subproblems recursively
• Combine solutions to subproblems into overall solution
Dynamic Programming
• Divide up problem into several overlapping problems
• Solve each subproblems recursively (or bottom up)
• Combine solutions to subproblems into overall solution

Outline
1 Weighted interval scheduling
2 Subset sums and knapsacks
3 Segmented least squares
4 Sequence alignment

Weighted interval scheduling
Problem
• Input: jobs J = {1, 2, . . . , n},
job j starts at sj , finishes at fj , and has value vj > 0
• Goal: find maximum value subset of mutually compatible jobs
two jobs that don’t overlap
Example
v=2
4
4
7
2
1
time

Weighted interval scheduling
Problem
• Input: jobs J = {1, 2, . . . , n},
job j starts at sj , finishes at fj , and has value vj > 0
• Goal: find maximum value subset of mutually compatible jobs
two jobs that don’t overlap
Example
v=2
4
4
7
2
1
time

Recursive relation
• sort the jobs by finish time (f1 ≤ f2 ≤ · · · ≤ fn )
• OPT(j): optimal solution for subproblem consisting only of {1, 2, . . . , j}
Recursive formula
0 if j = 0,
(
OPT(j) =
max{vj + OPT(p(j)), OPT(j − 1)} if j > 0
maximum i such that f (i) ≤ s(j) (found in O(log n) via binary search)
v=2
4
p(j) 4
7
2
j 1
time
Time complexity: O(n log n)

Example
j
1 v=2
2 4
3 4
4 7
5 2
6 1
time
j 0 1 2 3 4 5 6
OPT(j)

Example
j
1 v=2
2 4
3 4
4 7
5 2
6 1
time
j 0 1 2 3 4 5 6
OPT(j) 0 2 4 6 7 8 8

Quiz
What is the optimal value of the following problem?

j
1 v=3
2 4
3 5
4 7
5 4
6 8
7 10
8 5
9 8
10 6
time

Outline

Subset sum
Problem
• Input: positive integers a1 , a2 , . . . , an and W
• Goal: find I ⊆ {1, 2, . . . , n} such that i∈I ai = W
P
Examples
• a = (1, 3, 7, 8, 11), W = 20 possible (1 + 8 + 11 = 20)
• a = (2, 4, 8, 10, 12), W = 19 impossible

Dynamic programming for subset sum problem
ψ(k, w): solution for a1 , a2 , . . . , ak and w (True or False)
Recursive formula
True if k = 0 and w = 0


if k = 0 and w > 0

False

ψ(k, w) =

 ψ(k − 1, w) if w < ak
ψ(k − 1, w) ∨ ψ(k − 1, w − ak ) otherwise


Compute for k = 0, 1, . . . , n and w = 0, 1, . . . , W O(nW ) time
Example: a1 = 2, a2 = 4, a3 = 3, W = 5
k \ w 0 1 2 3 4 5
0 — — — — — —
a1 = 2 1 — — — — — —
a2 = 4 2 — — — — — —
a3 = 3 3 — — — — — —
Recursive formula


if k = 0 and w > 0

False

ψ(k, w) =

 ψ(k − 1, w) if w < ak


Example: a1 = 2, a2 = 4, a3 = 3, W = 5
k \ w 0 1 2 3 4 5
0 True False False False False False
a1 = 2 1 — — — — — —
a2 = 4 2 — — — — — —
a3 = 3 3 — — — — — —
Recursive formula


if k = 0 and w > 0

False

ψ(k, w) =

 ψ(k − 1, w) if w < ak


Example: a1 = 2, a2 = 4, a3 = 3, W = 5
k \ w 0 1 2 3 4 5
a1 = 2 1 True False True False False False
a2 = 4 2 — — — — — —
a3 = 3 3 — — — — — —
Recursive formula


if k = 0 and w > 0

False

ψ(k, w) =

 ψ(k − 1, w) if w < ak


Example: a1 = 2, a2 = 4, a3 = 3, W = 5
k \ w 0 1 2 3 4 5
a2 = 4 2 True False True False True False
a3 = 3 3 — — — — — —
Recursive formula


if k = 0 and w > 0

False

ψ(k, w) =

 ψ(k − 1, w) if w < ak


Example: a1 = 2, a2 = 4, a3 = 3, W = 5
k \ w 0 1 2 3 4 5
a2 = 4 2 True False True False True False
a3 = 3 3 True False True True True True
Knpsack problem
Problem
• Input: items E = {1, 2, . . . , n} and a capacity W ∈ Z+
item i has value vi ∈ Z+ and size wi ∈ Z+
• Goal: maximize i∈I vi subject to I ⊆ {1, 2, . . . , n} i∈I wi ≤ W
P P
Examples
W = 16
i vi wi
1 55 4
2 61 2
3 82 9
4 38 1
5 63 3

Knpsack problem
Problem
• Input: items E = {1, 2, . . . , n} and a capacity W ∈ Z+
item i has value vi ∈ Z+ and size wi ∈ Z+
• Goal: maximize i∈I vi subject to I ⊆ {1, 2, . . . , n} i∈I wi ≤ W
P P
Examples
W = 16 optimal value is 61 + 82 + 38 + 63 = 244
i vi wi
1 55 4
2 61 2
3 82 9
4 38 1
5 63 3

Dynamic programming for knapsack problem
OPT(k, w) = max vi | X ⊆ {1, 2, . . . , k}, wi ≤ w
P P
i∈X i∈X
Recursive formula
0 if k = 0,


OPT(k, w) = OPT(k − 1, w) if wk > w
max{OPT(k − 1, w), OPT(k − 1, w − wk ) + vk } otherwise


Example (wi , vi ) = (4, 55), (2, 61), (9, 82), (1, 38), (3, 63), W = 16
k\w 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
1 0 0 0 0 55 55 55 55 55 55 55 55 55 55 55 55 55
2 0 0 61 61 61 61 116 116 116 116 116 116 116 116 116 116 116
3 0 0 61 61 61 61 116 116 116 116 116 143 143 143 143 198 198
4 0 38 61 99 99 99 116 154 154 154 154 154 181 181 181 198 236
5 0 38 61 99 101 124 162 162 162 179 217 217 217 217 217 244 244

Outline

Least squares
Problem (xi , yi )
• Input: n points p1 , p2 , . . . , pi , . . . , pn ∈ R2
• Goal: find a line y = ax + b that minimizes the sum of the squared error
Pn
i=1 (yi − axi − b)2
Example
y
x
Pn Pn Pn Pn Pn
n i=1 xi yi −( i=1 xi )( i=1 yi ) yi −a i=1 xi
Optimal solution: a = Pn n 2 ,b= i=1
n
n i=1 xi2 − i=1 xi
P
O(n) time

Least squares
Problem (xi , yi )
Pn
i=1 (yi − axi − b)2
Example
y
x
Pn Pn Pn Pn Pn
n
n i=1 xi2 − i=1 xi
P
O(n) time

Least squares
Problem (xi , yi )
Pn
i=1 (yi − axi − b)2
Example
y
x
Pn Pn Pn Pn Pn
n
n i=1 xi2 − i=1 xi
P
O(n) time

Segmented least squares
Problem (xi , yi )
• Input: n points p1 , p2 , . . . , pi , . . . , pn ∈ R2 and c > 0

Pn
• Goal: find segments y = f (x) that minimizes i=1 (yi − f (xi ))2 + cL
# of segements
Examples
y

Segmented least squares
Problem (xi , yi )
• Input: n points p1 , p2 , . . . , pi , . . . , pn ∈ R2 and c > 0

Pn
• Goal: find segments y = f (x) that minimizes i=1 (yi − f (xi ))2 + cL
# of segements
Examples
y
L=2

Dynamic programming
Notation
• OPT(j): opt. val. of segmented least squares for p1 , p2 , . . . , pj
• eij : opt. val. of least squares for pi , pi+1 , . . . , pj computable in O(n) time
Recursive formula
0 if j = 0
(
OPT(j) =
min1≤i≤j {eij + c + OPT(i − 1)} if j > 0
Compute for j = 0, 1, . . . , n, each of which takes O(n 2 ) O(n 3 ) time

y
x
Improve the computational time
Pj
Bottleneck: computing eij = k=i (yk − aij xk − bij )2 takes O(n) time
(j−i+1)n jk=i xk yk −( jk=i xk )( jk=i yk )
P P P
• aij = 2
(j−i+1) jk=i xk2 −
Pj
k=i xk
P
Pj
yk −aij jk=i xk
P
• bij = k=i
j−i+1
Approach
Pj Pj Pj 2
Pj
• precompute k=1 xk , k=1 yk , k=1 xk , k=1 xk yk (∀j) in O(n) time
• using cumulative sums, eij can be computed in O(1) time
segmented least squares is solvable in O(n 2 ) time

Outline

String similarity
Q: How similar are two strings X = x1 x2 . . . xm and Y = y1 y2 . . . yn
Alignment of X and Y
set of ordered pairs xi –yj such that each character appears in at most one
pair and no crossing
Example “ocurrance” and “occurrence”
o c u r r a n c e – o c – u r r a n c e o c – u r r – a n c e
o c c u r r e n c e o c c u r r e n c e o c c u r r e – n c e
6 mismatches, 1 gap 1 mismatch, 1 gap 0 mismatches, 3 gaps

Edit distance
Edit distance: minimum sum of gap and mismatch penalties
• Gap penalty δ ≥ 0
• Mismatch penalty αpq ≥ 0 (αpp = 0)
Example
o c – u r r a n c e o c – u r r – a n c e
o c c u r r e n c e o c c u r r e – n c e
cost = δ + αae cost = 3δ
Problem
• Input: two strings X = x1 x2 . . . xm and Y = y1 y2 . . . yn , penalty δ, αpq
• Goal: compute the edit distance

Dynamic programming for knapsack problem
OPT(i, j): edit distance for x1 x2 . . . xi and y1 y2 . . . yj
Recursive formula
jδ if i = 0



iδ  if j = 0




OPT(i, j) = αxi yj + OPT(i − 1, j − 1)

 
OPT(i 1, j) otherwise

min δ + −



δ + OPT(i, j − 1)

  
  
Compute for i = 0, 1, . . . , m and j = 0, 1, . . . , n O(mn) time
X: o c u r r a n c e
Y: o c c u r r e n c e

Example
δ = 2, αpq = 1 (for any distinct p, q)
o c u r r a n c e
o
c
c
u
r
r
e
n
c
e

Example
δ = 2, αpq = 1 (for any distinct p, q)
o c u r r a n c e
0 2 4 6 8 10 12 14 16 18
o 2 0 2 4 6 8 10 12 14 16
c 4 2 0 2 4 6 8 10 12 14
c 6 4 2 1 3 5 7 9 10 12
u 8 6 4 2 2 4 6 8 10 11
r 10 8 6 4 2 2 4 6 8 10
r 12 10 8 6 4 2 3 5 7 9
e 14 12 10 8 6 4 3 4 6 7
n 16 14 12 10 8 6 5 3 5 7
c 18 16 14 12 10 8 7 5 3 5
e 20 18 16 14 12 10 9 7 5 3

Quiz
What is the edit distance if δ = 3, αpq = 2 (for any distinct p, q)
o c u r r a n c e
o
c
c
u
r
r
e
n
c
e

Algorithm 6

Uploaded by

Copyright:

Available Formats

Algorithm 6

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Algorithm 6

Uploaded by

Copyright:

Available Formats

Advanced Core in Algorithm Design #6

Nov. 8th, 2022

Yasushi Kawase Advanced Core in Algorithm Design 1 / 24

Lec. # Date Topics

Yasushi Kawase Advanced Core in Algorithm Design 2 / 24

Yasushi Kawase Advanced Core in Algorithm Design 3 / 24

1 Weighted interval scheduling

2 Subset sums and knapsacks

3 Segmented least squares

Yasushi Kawase Advanced Core in Algorithm Design 4 / 24

Yasushi Kawase Advanced Core in Algorithm Design 5 / 24

Yasushi Kawase Advanced Core in Algorithm Design 5 / 24

Time complexity: O(n log n)

Yasushi Kawase Advanced Core in Algorithm Design 6 / 24

Yasushi Kawase Advanced Core in Algorithm Design 7 / 24

Yasushi Kawase Advanced Core in Algorithm Design 7 / 24

What is the optimal value of the following problem?

Yasushi Kawase Advanced Core in Algorithm Design 8 / 24

1 Weighted interval scheduling

2 Subset sums and knapsacks

3 Segmented least squares

Yasushi Kawase Advanced Core in Algorithm Design 9 / 24

• a = (2, 4, 8, 10, 12), W = 19 impossible

Yasushi Kawase Advanced Core in Algorithm Design 10 / 24

Compute for k = 0, 1, . . . , n and w = 0, 1, . . . , W O(nW ) time

Compute for k = 0, 1, . . . , n and w = 0, 1, . . . , W O(nW ) time

Compute for k = 0, 1, . . . , n and w = 0, 1, . . . , W O(nW ) time

Compute for k = 0, 1, . . . , n and w = 0, 1, . . . , W O(nW ) time

Compute for k = 0, 1, . . . , n and w = 0, 1, . . . , W O(nW ) time

Yasushi Kawase Advanced Core in Algorithm Design 12 / 24

Yasushi Kawase Advanced Core in Algorithm Design 12 / 24

Compute for k = 0, 1, . . . , n and w = 0, 1, . . . , W O(nW ) time

Yasushi Kawase Advanced Core in Algorithm Design 13 / 24

1 Weighted interval scheduling

2 Subset sums and knapsacks

3 Segmented least squares

Yasushi Kawase Advanced Core in Algorithm Design 14 / 24

Yasushi Kawase Advanced Core in Algorithm Design 15 / 24

Yasushi Kawase Advanced Core in Algorithm Design 15 / 24

Yasushi Kawase Advanced Core in Algorithm Design 15 / 24

• Input: n points p1 , p2 , . . . , pi , . . . , pn ∈ R2 and c > 0

Yasushi Kawase Advanced Core in Algorithm Design 16 / 24

• Input: n points p1 , p2 , . . . , pi , . . . , pn ∈ R2 and c > 0

Yasushi Kawase Advanced Core in Algorithm Design 16 / 24

Compute for j = 0, 1, . . . , n, each of which takes O(n 2 ) O(n 3 ) time

Yasushi Kawase Advanced Core in Algorithm Design 18 / 24

1 Weighted interval scheduling

2 Subset sums and knapsacks

3 Segmented least squares

Yasushi Kawase Advanced Core in Algorithm Design 19 / 24

Q: How similar are two strings X = x1 x2 . . . xm and Y = y1 y2 . . . yn

Example “ocurrance” and “occurrence”

Yasushi Kawase Advanced Core in Algorithm Design 20 / 24

Yasushi Kawase Advanced Core in Algorithm Design 21 / 24

Compute for i = 0, 1, . . . , m and j = 0, 1, . . . , n O(mn) time

Yasushi Kawase Advanced Core in Algorithm Design 22 / 24

Yasushi Kawase Advanced Core in Algorithm Design 23 / 24

Yasushi Kawase Advanced Core in Algorithm Design 23 / 24

Yasushi Kawase Advanced Core in Algorithm Design 24 / 24