Research Log 1
Research Log 1
Research Log 1
Weeks 1 & 2
Theory
1 Introduction
minimize f0 (x)
(1)
subjected to fi (x) ≤ bi , i = 1, ..., m
where:
x = (x1 , ..., xn ) → optimizationvariable
f : Rn → R
→ objectivef unction
0
.
fi : Rn → R, i = 1, ..., m → constraintf unctions
b , ..., b
1 m → limits
There are classes of optimization problems, characterized by particular forms of the objective and con-
straint functions. An important example of the optimization problem is called a linear program if the
objective and constraint functions f0 , ..., fm are linear, i.e., satisfy:
1
Adriana Silva Research Log September | 2018
2 Convex sets
where θ ∈ R, form the line passing through x1 and x2 . The parameter value θ = 0 corresponds to y = x2
and θ = 1 corresponds to y = x1 .
Values of the parameter θ between 0 and 1 correspond to the (closed) line segment between x1 and x2 .
From another perspective, y is the sum of the point x2 and the direction x1 − x2 (from x2 to x1 ) scaled
by the parameter θ. Thus, θ gives the fraction of the way from x2 to x1 where y lies. As θ increases from
0 to 1, the point y moves from x2 to x1; for θ ¿ 1, the point y lies on the line beyond x1.
A set C ⊆ Rn is affine if the line through any two distinct points in C lies in C. That is, if for any
x1 , x2 ∈ C and θ ∈ R: thetax1 + (1 − θ)x2 ∈ C.
The affine set C contains the linear combination of any two points in C, if the sum of the linear coefficients
is 1.
Generalizing, a point of the form θ1 x1 + ... + θk xk , where θ1 + ... + θk = 1, is an affine combination of the
points x1 , ..., xk .
The set of all affine combinations of points in a set C is the affine hull of C:
A set C is convex if the line segment between any two points in C lies in C. That is, if for any
x1 , x2 ∈ C and any θ with 0 < θ < 1: thetax1 + (1 − θ)x2 ∈ C.
2
Adriana Silva Research Log September | 2018
Every affine set is also convex, since it contains the entire line between any two distinct points in it,
therefore it also contains the line segment between the points.
A point of the form θ1 x1 +...+θk xk , where θ1 +...+θk = 1 and θi ≥ 0, i = 1, ..., k, is a convex combination
of the points x1 , ..., xk .
The set of all convex combinations of points in C is the convex hull :
2.1.4 Cones
A set C is called a cone, if for every x ∈ C and θ ≥ 0: θx ∈ C. A set C is a convex cone if it is convex
and a cone, which means that for any x1 , x2 ∈ C and θ1 , θ2 ≥ 0: θ1 x1 + θ2 x2 ∈ C. Geometrically, points
of this form can be described as forming a two-dimensional pie slice.
Figure 3: Cone: all the points of the form θ1 x1 + θ2 x2 , where θ1 , θ2 ≥ 0. θ1 = θ2 = 0 is at 0 and the edges
(which correspond to θ1 = 0 or θ2 = 0) pass through the points x1 and x2 .
A point of the form θ1 x1 + ... + θk xk with θ1 , ..., θk ≥ 0 is called a conic combination of x1 , ..., xk . If xi
are in a convex cone C, then every conic combination of xi is in C. A set C is a convex cone if and only
if it contains all conic combinations of its elements.
• The empty set ∅, any single point x0 and the whole space Rn are affine, therefore convex.
3
Adriana Silva Research Log September | 2018
2.2.1 Hyperplane
Figure 4: Hyperplane in R2 with normal vector a and a point x0 in the hyperplane. For any point x in
the hyperplane, (x − x0 ) is orthogonal to a.
2.2.2 Halfspace
The halfspace can also be expressed as {x | aT (x − x0 ) ≤ 0}, where x0 is any point on the associated
hyperplane and satisfies aT x0 = b. The halfspace consists of x0 plus any vector that makes an obtuse (or
right) angle with the outward normal vector a.
4
Adriana Silva Research Log September | 2018
2.2.3 Ball
where r > 0. r is the radius and xc is the center of the ball. ||.||2 denotes the Euclidean norm.
A ball is a convex set if: ||x1 − xc ||2 ≤ r, ||x2 − xc ||2 ≤ r and 0 < θ < 1.
2.2.4 Polyhedra
A polyhedron is defined as the solution set of a finite number of linear equalities and inequalities:
Figure 6: The polyhedron P is the intersection of five halfspaces, with outward normal vectors a1, ...., a5.
In compact notation:
P = {x | Ax ≺ b, Cx = d} (10)
The symbol ≺ means vector inequality.
• Intersection
• Affine functions
• Perspective function
• Linear-fractional functions
2.3.1 Intersection
5
Adriana Silva Research Log September | 2018
• K is convex.
• K is closed.
x K y ⇔ y − x ∈ K, (11)
x ≺K y ⇔ y − x ∈ intK. (12)
Many properties of K are similar to ≤ on R.
Figure 7: Left: the set S1 has a minimum element x1 . Right: x2 is a minimal point of S2 .
6
Adriana Silva Research Log September | 2018
Suppose C and D are nonempty disjoint convex sets, C ∩ D = ∅. Then there exists a 6= 0 and b such that
aT b ≤ b ∀x ∈ C and aT b ≥ b ∀x ∈ D.
Figure 8: The hyperplane x | aT x = b is called a separating hyperplane for the sets C and D.
{x | aT x = aT x0 } (15)
K ∗ = {y | xT y ≥ 0, ∀x ∈ K}. (16)
7
Adriana Silva Research Log September | 2018
• K ∗∗ is the closure of the convex hull of K. (Hence if K is convex and closed, K ∗∗ = K.)
2.1
y = θ 1 x1 + θ 2 x2
θ2 = 1 − θ1
⇒ y = x2 + θ1 (x1 − x2 )
By the definition of convex set: y = x2 + θ1 (x1 − x2 ) ∈ C.
Now, for k=3 suppose:
C ⊆ R3
x1 , x2 , x3 ∈ C
θ1 , θ2 , θ3 ∈ R
θ1 ≥ 0, θ2 ≥ 0, θ3 ≥ 0
θ1 + θ 2 + θ3 = 1
y = θ 1 x 1 + θ2 x 2 + θ3 x 3
Assuming that at least one θi is not equal to 1 ⇒ θ1 6= 1.
θ 2 + θ3 = 1 − θ1
(1 − θ1 )
⇒ y = θ1 x1 + (θ2 x2 + θ3 x3 ) = θ1 x1 + (u2 x2 + u3 x3 )(1 − θ1 )
(1 − θ1 )
θ2 θ3
where u2 = 1−θ1 , u3 = 1−θ1 and u2 , u3 ≥ 0.
θ2 + θ 3 1 − θ1
u2 + u3 = = =1
1 − θ1 1 − θ1
C is a convex set. x2 , x3 ∈ C ⇒ u2 x2 + u3 x3 . Since x1 ∈ C ⇒ y ∈ C.
8
Adriana Silva Research Log September | 2018
General case:
C ⊆ Rn
x1 , ..., xK ∈ C
θ1 , ..., θK ∈ R
θi ≥ 0, i = 1, ..., k
θ1 + ... + θK = 1
y = θ1 x1 + ... + θK xK
Assuming θ1 6= 1 ⇒ θ2 + ... + θK = 1 − θ1 .
(1 − θ1 )
⇒ y = θ1 x1 + (θ2 x2 + θ3 x3 + ... + θK xK ) = θ1 x1 + (u2 x2 + u3 x3 + ... + uK xK )(1 − θ1 )
(1 − θ1 )
θ2 θ3 θK
where u2 = 1−θ1 , u3 = 1−θ1 , uK = 1−θ1 and u2 , u3 , ..., uK ≥ 0.
C is convex and x2 , x3 , ..., xk ∈ C. Thus, u2 x2 + u3 x3 + ... + uK xK ∈ C. And since x1 ∈ C ⇒ y ∈ C.
2.2
A set is convex if and only if it combines every convex combination of its points. This means, it contains
the line between any two points in the set (definition of convex set).
Assuming C as a convex set. Suppose the intersection of C with any line is convex. Take any two points
x1 , x2 ∈ C. The intersection of C with the line that goes through x1 and x2 is convex. All the convex
combinations of x1 and x2 are in the intersection line and therefore they are in C.
A set is affine if the line that goes through any two distinct points of the set lies in the set. That is, it
contains every affine combination of two points in it.
Take any two distinct points of the affine set C, x1 , x2 ∈ C. The line that goes through x1 and x2 contains
all the affine combinations of x1 and x2 . The intersection of the line with set C also contains all the affine
combinations of x1 and x2 . And that applies for any two points of the set.
2.3
Midpoint convexity (definition): a set C is midpoint convex if whenever two points a, b ∈ C, the
midpoint (average) a+b
2 ∈ C. A convex set is midpoint convex.
x+y
Suppose that C is a closed midpoint convex set. x, y ∈ C. The midpoint 2 ∈ C.
Assume that z is a point on the line segment between x and y.
xi + yi x0 + y0 x+y
zi = where z0 = =
2 2 2
(
zi , zi ≤ z0
xi+1 =
xi , zi > z0
9
Adriana Silva Research Log September | 2018
(
zi , zi > z0
yi+1 =
yi , zi ≤ z0
By this construction, z is always on the line segment between xi and yi and zi converges to z0 .
Due to midpoint convexity of C, zi ∈ C ∀i. And because C is closed, z0 ∈ C. This applies to all z on the
line segment between x and y, so C is convex.
2.12
A halfspace is a convex set and a slab is the intersection between 2 halfspaces. The intersection of 2
convex sets is convex, therefore a slab is a convex set.
b) A rectangle: {x ∈ Rn | αi ≤ xi ≤ βi , i = 1, ..., n}.
Just like in a), a rectangle is a finite intersection of 2 halfspaces, therefore is a convex set.
c) A wedge (cunha?): {x ∈ Rn | aT1 x ≤ b1 , aT2 x ≤ b2 }.
Again, this is an intersection of 2 halfspaces, therefore is convex.
d) The set of points closer to a given point than a given set: {x | ||x − x0 ||2 ≤ ||x − y||2 , ∀y ∈ S}, S ⊆ Rn .
Applying the Euclidean norm:
⇔ xT x − 2xT0 x + xT0 x0 ≤ xT x − 2y T x + y T y
⇔ 2(y − x0 )T x ≤ y T y − xT0 x0
This defines a halfspace, therefore is convex.
e) The set of points closer to one set than another: {x | dist(x, S) ≤ dist(x, T )}, S, T ⊆ Rn and
dist(x, S) = inf {||x − z||2 | z ∈ S}.
Example of 2 sets: S = {0, 10} and T = {5}.
x + S2 ⊆ S1 if x + y ∈ S1 ∀y ∈ S2
\
⇔ {x | x + S2 ⊆ S1 } = {x | x + y ∈ S1 }
y∈S2
10
Adriana Silva Research Log September | 2018
g) The set of points whose distance to a does not exceed a fixed fraction θ of the distance to b: {x | ||x −
a||2 ≤ θ||x − b||2 }, a 6= b, 0 ≤ θ ≤ 1.
This set has the form of a ball, therefore is convex.
2.16
2.25
→ C is a closed convex set.
→ x1 , ..., xK are on the boundary of C.
→ C ⊆ {x | aTi (x − xi ) ≤ 0, i = 1, ..., k}: aTi (x − xi ) = 0 defines a supporting hyperplane for C at xi .
→ Polyhedra Pinner = conv{x1 , ..., xK }
→ Polyhedra Poutter = {x | aTi (x − xi ) ≤ 0, i = 1, ..., k}
Pinner is the set with all the convex combinations of the points x1 , ..., xK . The points xi are on the
boundary of C and C is closed and convex, therefore Pinner ⊆ C.
If x ⊆ C ⇒ x ⊆ Poutter because C ⊆ {x | aTi (x − xi ) ≤ 0, i = 1, ..., k}.
Thus, Pinner ⊆ C ⊆ Poutter .
2.31
11
Adriana Silva Research Log September | 2018
b) K1 ⊆ K2 ⇒ K2∗ ⊆ K1 ∗.
y ⊆ K2 ∗ means xT y ≥ 0 ∀x ∈ K2 and K2 includes K1 . Therefore xT y ≥ 0 ∀x ∈ K1 . [ver o desenho no
caderno]
c) K ∗ is closed.
K ∗ is an intersection of two halfspaces, that include the boundary points ((aT x) = 0). It also includes
the origin as a boundary point. Therefore, K ∗ is closed.
d) The interior of K ∗ is given by intK ∗ = {y | yT x > 0∀x ∈ clK}.
clK: closure of K and includes interior plus boundary points of K.
{y | y T x = 0} is the boundary of K ∗ (the edges). So, the interior of K ∗ is given by {y | y T x > 0}.
e) If K has nonempty interior then K ∗ is pointed.
Pointed: contains no line (x ∈ K, −x ∈ K ⇒ x = 0).
If y ∈ K ∗ and −y ∈ K ∗ , K ∗ is not pointed. This means, y T ≥ 0 and −y T ≥ 0, ∀x ∈ K. So,
y T x = 0∀x ∈ K. Thus, K has empty interior.
f ) K ∗∗ is the closure of K. (Hence if K is closed, K ∗∗ = K.)
y 6= 0 is the normal vector of a halfspace containing K if and only if y ∈ K ∗ . The intersection of all
halfspaces containing a convex cone K is the closure of K. Therefore the closure of K is K ∗∗ .
g) If the closure of K is pointed then K ∗ has nonempty interior.
If K has empty interior, there is a a 6= 0 such that aT y = 0, ∀y ∈ K ∗ . This means a and −a are in K ∗ ,
so K ∗ is not pointed.
Next Week
⇒ Chapter 3
References
[1] Boyd S., Vandenberghe L., Convex Optimization, Cambridge University Press
12