Optimization Algorithms On Matrix Manifolds
Optimization Algorithms On Matrix Manifolds
Optimization Algorithms On Matrix Manifolds
Follow links Class Use and other Permissions. For more information, send email to:
[email protected]
Chapter One
Introduction
in that direction have been dramatic over the last two decades and have led
to a solid conceptual framework. However, generalizing a given optimization
algorithm on an abstract manifold is only the first step towards the objective
of this book. Turning the algorithm into an efficient numerical procedure is a
second step that ultimately justifies or invalidates the first part of the effort.
At the time of publishing this book, the second step is more an art than a
theory.
Good algorithms result from the combination of insight from differential
geometry, optimization, and numerical analysis. A distinctive feature of this
book is that as much attention is paid to the practical implementation of
the algorithm as to its geometric formulation. In particular, the concrete
aspects of algorithm design are formalized with the help of the concepts of
retraction and vector transport, which are relaxations of the classical geomet
ric concepts of motion along geodesics and parallel transport. The proposed
approach provides a framework to optimize the efficiency of the numerical
algorithms while retaining the convergence properties of their abstract geo
metric counterparts.
The geometric material in the book is mostly confined to Chapters 3 and 5.
Chapter 3 presents an introduction to Riemannian manifolds and tangent
spaces that provides the necessary tools to tackle simple gradient-descent op
timization algorithms on matrix manifolds. Chapter 5 covers the advanced
material needed to define higher-order derivatives on manifolds and to build
the analog of first- and second-order local models required in most optimiza
tion algorithms. The development provided in these chapters ranges from
the foundations of differential geometry to advanced material relevant to
our applications. The selected material focuses on those geometric concepts
that are particular to the development of numerical algorithms on embed
ded and quotient manifolds. Not all aspects of classical differential geometry
are covered, and some emphasis is placed on material that is nonstandard
or difficult to find in the established literature. A newcomer to the field of
differential geometry may wish to supplement this material with a classical
text. Suggestions for excellent texts are provided in the references.
A fundamental, but deliberate, omission in the book is a treatment of the
geometric structure of Lie groups and homogeneous spaces. Lie theory is
derived from the concepts of symmetry and seems to be a natural part of
a treatise such as this. However, with the purpose of reaching a community
without an extensive background in geometry, we have omitted this material
in the present book. Occasionally the Lie-theoretic approach provides an
elegant shortcut or interpretation for the problems considered. An effort
is made throughout the book to refer the reader to the relevant literature
whenever appropriate.
The algorithmic material of the book is interlaced with the geometric ma
terial. Chapter 4 considers gradient-descent line-search algorithms. These
simple optimization algorithms provide an excellent framework within which
to study the important issues associated with the implementation of practi
cal algorithms. The concept of retraction is introduced in Chapter 4 as a key
INTRODUCTION 3
1 http://press.princeton.edu/titles/8586.html
4 CHAPTER 1
over the last decade. The Notes and References section at the end of each
chapter is an attempt to give proper credit to the many contributors, even
though this task becomes increasingly difficult for recent contributions. The
authors apologize for any omission or error in these notes. In addition, we
wish to conclude this introductory chapter with special acknowledgements
to people without whom this project would have been impossible. The 1994
monograph [HM94] by Uwe Helmke and John Moore is a milestone in the
formulation of computational problems as optimization algorithms on man
ifolds and has had a profound influence on the authors. On the numerical
side, the constant encouragement of Paul Van Dooren and Kyle Gallivan
has provided tremendous support to our efforts to reconcile the perspectives
of differential geometry and numerical linear algebra. We are also grateful
to all our colleagues and friends over the last ten years who have crossed
paths as coauthors, reviewers, and critics of our work. Special thanks to Ben
Andrews, Chris Baker, Alan Edelman, Michiel Hochstenbach, Knut Hüper,
Jonathan Manton, Robert Orsi, and Jochen Trumpf. Finally, we acknowl
edge the useful feedback of many students on preliminary versions of the
book, in particular, Mariya Ishteva, Michel Journée, and Alain Sarlette.