Temporal Shape Analysis via the Spectral Signature
Elena Bernardis1 , Ender Konukoglu2, Yangming Ou1 , Dimitris N. Metaxas3 ,
Benoit Desjardins1 , and Kilian M. Pohl1
1
Dept. of Radiology, University of Pennsylvania, Philadelphia, PA 19104, USA
2
Microsoft Research, Cambridge, CB3 0FB, UK
3
Dept. of Computer Science, Rutgers University, Piscataway, NJ 08854, USA
Abstract. In this paper, we adapt spectral signatures for capturing morphological
changes over time. Advanced techniques for capturing temporal shape changes
frequently rely on first registering the sequence of shapes and then analyzing the
corresponding set of high dimensional deformation maps. Instead, we propose a
simple encoding motivated by the observation that small shape deformations lead
to minor refinements in the spectral signature composed of the eigenvalues of
the Laplace operator. The proposed encoding does not require registration, since
spectral signatures are invariant to pose changes. We apply our representation to
the shapes of the ventricles extracted from 22 cine MR scans of healthy controls
and Tetralogy of Fallot patients. We then measure the accuracy score of our encoding by training a linear classifier, which outperforms the same classifier based
on volumetric measurements.
1 Introduction
Capturing the shape and function of anatomy through volumetric measurements extracted from 4D medical scans has become of central importance in diagnosing diseases. For example, cardiologists rely on ejection fraction extracted from ultrasound
or cine MR scans to assess patients. These volumetric measurements, however, are not
sensitive enough to aid the diagnosis of many focal or diffuse cardiac diseases. In this
paper, we introduce a new encoding of the shape and its temporal changes based on
the spectral signature and show that this encoding is more sensitive for comparing two
shapes and their temporal dynamics than volumetric measurements.
Advanced techniques for capturing the changes in shape over time frequently rely
on registering the sequence of images and then analyzing the corresponding set of deformation maps [1,2]. Examples of this type of analysis, specific to the heart, include
mappings motivated by biomechanical models [3] or mathematical properties [4], from
which statistical models of the cardiac ventricles can be extrapolated [5]. While promising, these approaches are susceptible to negatively biasing the analysis due to the underlying assumptions and parameter settings of the registration framework, as well as
the accuracy in reducing high dimensional deformation maps to a few features.
An alternative avenue is to characterize the shape of each structure with a lowdimensional set of parameters and use the latter for structure discrimination. Spectral
signatures, i.e. eigensystems of the Laplace and Laplace-Beltrami operators, have recently gained popularity as powerful shape descriptors [6,7]. The eigenvalues of the
N. Ayache et al. (Eds.): MICCAI 2012, Part II, LNCS 7511, pp. 49–56, 2012.
c Springer-Verlag Berlin Heidelberg 2012
50
E. Bernardis et al.
a: LV & RV segmentations
b: Spectrum of the Laplacian
RV
c: Learned Eigencurves
LV
Subject 1
...
basis 1
basis 2
basis 3
basis 4
...
T
1
T 1
T
Time
RV
LV
Subject K
1
...
T 1
T
Time
...
T
1
T 1
T
Time
Fig. 1. Our method. a: Right (RV) and left (LV) ventricle segmentations for each heart sequence
at three sample timepoints. b: Eigenvalue curves that encode temporal shape changes displayed
as a function of time. c: RV and LV reduced shape signatures learned by a low dimensional
embedding of these curves, while assuming no prior physical or statistical model of the data.
Laplace operator implicitly carry information on local shape invariants such as curvature, surface area and volume, and allow to encode shape information with few parameters without need of prior registration [8]. In this paper, we specifically explore the
adaptation of this technology from capturing individuals shapes to capturing temporal
morphologic changes.
Our work is motivated by the observation that small shape deformations lead to minor refinements in the spectral signature. Spectral signatures are thus well suited for
capturing temporal shape changes of anatomical structures that vary slowly between
measurements, such as cardiac ventricles from cine MR scans. Our representation captures temporal shape changes by first computing the spectral signature for each time
point yielding a family of eigenvalue curves (Fig. 1b). We then encode temporal shape
changes by a low dimensional embedding of the eigenvalue curves (Fig. 1c). By doing
so, our simplistic representation assumes no prior physical or statistical model of the
data and only depends on two parameters, namely, the number of eigenvalues and the
dimension chosen for the lower dimension embedding.
We apply our representation to the shapes of the ventricles extracted from 22 cine
MR scans of 11 healthy controls and 11 Tetralogy of Fallot (TOF) patients. We choose
this specific scenario as there is no uncertainty, unlike with other diseases, about the
diagnosis, so that the labeling of individual data sets can be viewed as ground truth. In
addition, TOF is suitable for temporal analysis as it affects both RV shape and cardiac
function. We then measure the accuracy score of our encoding by training a linear classifier and recording the leave-one-out cross-validation accuracy in distinguishing these
two populations. Our representation outperforms the linear classifier based on volumetric measurements of the ventricles. Before we describe our encodings and experiments
in further detail, we just note that we do not attempt to provide a representation tailored
Temporal Shape Analysis via the Spectral Signature
51
towards cardiac disease detection but rather derive a shape representation for implicitly
encoding temporal morphological changes that can be applied to the cardiac domain.
2 Temporal Shape Encoding
We now present in further detail the extension of the spectral signature of the Laplacian
operator to the temporal domain. Laplace operators and their spectra have been studied
in mathematics for a long time [9,10,11]. Their introduction in computational shape
analysis is, however, rather recent [12]. We start with an introduction to Laplace operators and then describe our spectral shape encoding for temporal shape deformations.
2.1 Spectrum of Laplace Operator
Our brief overview of the Laplace operators is meant to give the necessary background
to understand their role in temporal shape encoding. For a more thorough discussion,
we refer the reader to [11,12].
We denote an object as a closed bounded domain Ω ⊂ Rd with piecewise smooth
boundaries. With respect to medical imaging domain, Ω corresponds to the volume
extracted from the segmentation of an anatomical structure. The Laplace operator ∆Ω
d ∂ 2
on Ω is defined as ∆Ω f
i=1 ∂x2 f for a twice differentiable function f , where
i
x {x1 , . . . xd } are the spatial coordinates. The importance of this operator for shape
analysis arises from its eigenvalues and eigenfunctions, which are the solutions of the
Helmholtz equation with Dirichlet type boundary conditions, ∆Ω f + λf = 0, ∀x ∈
Ω and f (x) = 0, ∀x ∈ ∂Ω, where ∂Ω denotes the boundary of the object and λ ∈ R
is a scalar [11]. There are infinite pairs of {(λj , fj )}∞
j=1 that satisfy this equation and
the ordered set of eigenvalues form a positive diverging sequence 0 < λ1 ≤ λ2 ≤ . . . ,
called the Dirichlet spectrum of ∆Ω , which we simply refer to as the ‘spectrum’.
The spectrum has several advantageous properties for shape analysis in medical image analysis [11]. First, the spectrum encodes information regarding the intrinsic geometry of the object. This information content is
due to an identity
∞called heat-trace and
∞
its equivalent polynomial expansion Z(τ ) j=1 e−λj τ = m=0 am/2 τ −d/2+m/2 ,
with τ > 0. The coefficients am/2 are given as sums of volume and boundary integrals
of some local invariants of Ω [13,11], such as its volume or its surface mean curvature.
∞
The functional relation between {λj }∞
j=1 and {am/2 }m=0 links the geometry of an object to its spectrum, and is the ingredient that makes the Laplace operator interesting
for shape analysis. Second, the eigenvalues are invariant to isometric transformations.
This invariance can be even extended to scaling [12], thus eliminating the need to align
the shapes to a common coordinate system for further analysis. Finally, the eigenvalues
change continuously with the deformations applied to the object’s boundary, i.e. there
is a continuous link between the differences in eigenvalues and the difference in shape.
This continuous link is a critical component for our encoding of temporal shape deformation. For a deeper intuition, let us consider an object that changes its shape with time.
We represent the temporal dependence of the object’s shape with Ω(t) and the temporal
dependence of its spectra with Λ(t). Now, if we interpret the motion of an object as the
deformation between two time points, Λ(·) not only then captures the geometry of Ω(t)
52
E. Bernardis et al.
at a given t but it also provides information regarding the motion of Ω(·). In this paper,
we make use of both information when encoding 3D+t(ime) objects.
2.2 Learning Temporal Shape Changes
We now outline the framework for encoding 3D + t. Given a set of K 3D+t volumes
defined over T time points, we first compute for each volume i ∈ {1, . . . , K} and time
point t ∈ {1, . . . , T } the first N eigenvalues of the spectrum (see also Fig. 1b), which
we denote with {λi1 (t), λi2 (t), . . . , λiN (t)}. We then construct temporal
curves from the
eigenvalues across the N signatures of each volume, i.e. λij := λij (1), . . . , λij (T ) ,
for j ∈ {1, . . . , N }, is the j th temporal (eigenvalue) curve of volume i. The final step
encodes the temporal shape deformations by learning a low dimensional embedding of
those curves across all K volumes. By doing so, further analysis of our encoding always
has to consider the subject specific matrix of eigenvalues defining curves across time
instead of viewing λij individually.
For dimensionality reduction, we apply Non-negative Matrix Factorization (NMF)
[14] to the N · K temporal curves. In general, the dimension of the data matrix V is
M × O where M is the number of measurements and O the number of objects. In our
temporal shape encoding, each column of the data matrix V represents a temporal curve
λij . Thus, the dimension of V becomes the (number of timepoints T ) ×(K · N ). NMF
then factorizes matrix V into a basis matrix H and coefficient matrix W so that
V ≈ W H subject to minimizing F (W, H)
N
·K
T
[Viµ log(W H)iµ − (W H)iµ ],
i=1 µ=1
where V refers to the entries of the corresponding matrix. Setting the number of basis
vectors to B and b ∈ {1, . . . , B}, the optimal H and W are obtained by finding the
local minimum of F (·, ·) via the following iterative algorithm:
N
·K
W′
Viµ
Viµ
Hbµ 2: Wib ← N·Kib
3: Hbµ ← Hbµ
Wib
′
(W
H)
(W
H)iµ
iµ
W
j=1
µ=1
jb
i=1
1: Wib′ ← Wib
T
All temporal curves can now be represented by the basis matrix H and the corresponding B coefficients provided by W . These B coefficients across all the sequences of N
eigenvalues are then the temporal shape encodings of our 3D+t objects.
We note that one could have chosen any other dimensionality reduction method. We
simply choose NMF as it does not make any assumption of the underlying distribution,
unlike for example Principle Component Analysis as motivated by our experiments.
3 Experimental Setup
We evaluate our temporal shape encoding by applying it on ventricle segmentations
from short-axis cardiac MRI scans and comparing its accuracy score in a classification
problem to discriminate healthy individuals from patients who had Tetralogy of Fallot
(TOF) corrected surgery in infancy. Our dataset includes the cine MR scans of 11 TOF
cases and 11 healthy volunteers (K=22). TOF patients are post repair and age-matched
Temporal Shape Analysis via the Spectral Signature
53
to normal controls. The cine MR scans of each case are normalized to 23 timepoints. A
medical expert then semi-automatically segmented the blood pool of the right ventricle
and myocardium of the left ventricle at the end-diastole (ED) timepoint using ‘Segment’
[15] with manual corrections of the results. For simplicity, we refer to the blood pool
of the right ventricle just as right ventricle or RV, and to the myocardium of the left
ventricle as left ventricle or LV. We then propagate this segmentation to the other time
points via the non-rigid registration of [16]. We refine those segmentations via dilation
and erosion to remove possible holes, disconnections or topology changes caused by
registration errors. Sample segmentation sequences are shown in Fig. 1a.
For computing the Laplace spectrum, we define the shapes of LV and RV based
on the corresponding segmentations. We then compute the Dirichlet eigenvalues using
finite differences on the regular grid. While using a high order FEM method might provide more accurate eigenvalues, this is outside the scope of this article as our focus is to
apply the signature to the temporal domain, hence we use the simplest implementation.
Furthermore, one can also consider using Neumann eigenvalues however they would be
more sensitive to noisy segmentations e.g. isolated islands.
We analyze the accuracy of our new temporal shape encoding in describing the shape
changes of the RV and LV by first applying it as well as related representations to our
sequences of segmentations. We then measure the accuracy of a linear Support Vector
Machine (SVM) [17] in correctly labeling images based on those features via the leaveone-out principle. Table 1 records the outcome when applying the classifier to LV and
RV separately as well as when we combine the eigencurves of both ventricles.
We start by extracting the volumes and computing the first eigenvalue of the RV and
LV. We view each volume or first eigenvalue as a feature so that classification of every
RV and LV is based on 23 features (given from the original 23 timepoints). When concatenating the ventricles (which we will denote by RV-LV), classification is based on 46
features instead. Classification results based on volume and based on the first eigenvalue
are shown in Table 1 (a) and (b) respectively. Simply using volume for distinguishing
shape changes between TOF and normal subjects only leads to an accuracy of approximately 60% for LV and RV separately and of 68% when combined. As mentioned in the
∞
previous section, the functional relation between {λj }∞
j=1 and {am/2 }m=0 is the link
between the geometry of an object and its spectrum. In particular, the first eigenvalue
is directly proportional to the volume of object i since a0 = (4π)−3/2 Vi . As predicted,
the results obtained using the first eigenvalue alone are comparable to the volume ones.
As shown in the rest of the experiments, increasing the number of eigenvalues adds
shape information, improving the overall recall. We start by considering the eigenvalue sequences λij individually. We do so by computing the eigenvalues for each of
the hearts, setting the spectrum cutoff at N = 100. For comparison to our new encoding, the results of Table 1 (c) do not explicitly model temporal dependencies as we first
combine the 23 · N features for each ventricle (or 46 · N for RV-LV). We then reduce
the M = 23 · N eigenvalues (or 46 · N for RV-LV) for the O = K hearts via NMF
(see Sec. 2.2) to B = 22 features. Parameters N and B are chosen empirically to be
minimum while maximizing classification accuracy. We experimented for N from 1 to
200 and for B from 2 to 200. Furthermore, B is set to the same for the LV, RV, and
RV-LV scenarios. The resulting classification scores (c) are higher compared to (a,b).
54
E. Bernardis et al.
Table 1. TOF classification results. For each experimental setup, we record precision (prec.),
recall and accuracy for LV, RV and RV-LV, obtained from our leave-one-out cross validation.
The first eigenvalue (b) alone performs similarly to the volume feature (a). Increasing number
of eigenvalues (c) improves the performance, especially recall. d-e) Our 3D+t shape signatures
method yields increased scores. f) Choosing PCA instead of NMF on the same eigencurves used
for (d) results in lower scores.
LV
Prec.
a: volume
0.7727
b: λ1
0.8182
c: λ1:1:100
0.7143
d: λ1:1:100 (3D+t) 0.6970
e:λ1:10:100 (3D+t) 0.7879
f: PCA λ1:1:100 0.7273
Recall
0.5862
0.5625
0.5914
0.8214
0.8667
0.6957
Accuracy
0.6136
0.5909
0.6104
0.7727
0.8333
0.7045
RV
Prec.
0.9394
0.8636
0.7557
0.7576
0.8182
0.7273
Recall
0.5741
0.6441
0.6425
0.8065
0.8438
0.7500
Accuracy
0.6212
0.6932
0.6676
0.7879
0.8333
0.7424
RV-LV
Prec. Recall
0.9301 0.6274
0.8864 0.6290
0.8182 0.7674
0.8788 0.8286
0.9545 0.8400
0.7879 0.7647
Accuracy
0.6888
0.6818
0.7851
0.8485
0.8864
0.7727
We further improve the classification score by using our new 3D+t temporal shape
representation of Sec. 2.2 (Table 1 (d,e)). Here, we reduce the dimensionality of the
N · K eigencurves from T = 23 entries (respectively 46 for the RV-LV) to B = 4
after exploring the entire range from 2 to 23. Fig. 2a illustrates sample eigencurves. We
then feed B · C features to the classifier, where C is the number of curves selected for
the classification step. In line (d), we use all the eigencurves for classification, hence
setting C = N (= 100), while we subsample the eigencurves in (e) setting C = 10. The
3D+t encoding yields a higher precision and significantly improves recall, hence giving
an overall improved accuracy. As shown from the classification results in (e), selecting
a subset of the eigencurves, once all the eigencurves have already been used for the
learning step, allows to improve the outcome of the classifier, indicating that another
method for data compression or classification could improve even further the results of
our temporal shape descriptor as well as that of the other scores.
To motivate the use of NMF, we also applied PCA (f) to learn the temporal eigencurves using the same setup of (d). The accuracy scores drop to the level we measured
for the volumes scores indicating that the Gaussian assumption of PCA is violated by
these sets of temporal eigencurves. Our quantitative findings are also reflected in the
visual comparison of the basis vectors for B = 4 of the two dimensionality reduction
methods in Fig. 2b. While the bases obtained by PCA are very noisy, the NMF ones are
cleaner and better describe the smooth temporal changes observed in the eigencurves.
Summary. In this paper, we exploit the implicit local shape properties captured by spectral signatures, i.e. eigensystems of the Laplace and Laplace-Beltrami operators, and
adapt them to capture morphological changes over time. We propose a fairly simple encoding based on the observation that small shape deformations lead to minor refinements
in the spectral signature. The shape analysis is independent of the original segmentation
used and, given initial segmentations, computing the spectral signature is independent
of registration. The accuracy obtained in the classifications demonstrate that that our
temporal-shape representation can be successfully used to classify TOF cardiac disease
patients. Our results indicate that our new temporal shape representation better incor-
Temporal Shape Analysis via the Spectral Signature
RV
LV
λ
LV
λ
RV
55
1
23 1
23
1
23 1
Time
23
Time
a: Sample eigencurves for healthy (left) and TOF (right) cases
RV
LV
ciaociaoco NMF
basis 1
basis 2
basis 3
basis 4
1
23 1
Time
23 1
Time
ciaociaoco PCA
basis 1
basis 2
basis 3
basis 4
1
basis 1
basis 2
basis 3
basis 4
23 1
23
Time
basis 1
basis 2
basis 3
basis 4
23 1
Time
RV-LV
basis 1
basis 2
basis 3
basis 4
23 1
Time
basis 1
basis 2
basis 3
basis 4
23 1
23
Time
b: Reduced spectral signatures
Fig. 2. Shape signatures. a: Sample eigencurves before reduction. b: Learned spectral signatures
reduced via NMF and PCA for RV and LV (over 23 timepoints), and for RV-LV (over 46 timepoints). In the absence of the correct initial statistical model, PCA results in a high level of noise
and does not capture the smooth temporal changes of the eigencurves.
porates the temporal relation of the data. Thus, we are able to better capture RV and
LV deformations in this population than scores capturing shape changes over time by
separately measuring the 3D shape at each timepoint.
Acknowledgments. We would like to thank DongHye Ye for his help on generating
the cardiac dataset. This project was supported in part by Grant Number UL1RR024134
and by the Institute for Translational Medicine and Therapeutics’ (ITMAT) Transdisciplinary Program.
56
E. Bernardis et al.
References
1. Ardekani, S., Weiss, R.G., Lardo, A.C., George, R.T., Lima, J.A.C., Wu, K.C., Miller, M.I.,
Winslow, R.L., Younes, L.: Cardiac motion analysis in ischemic and non-ischemic cardiomyopathy using parallel transport. In: ISBI, pp. 899–902 (2009)
2. Beg, M.F., Helm, P.A., Mcveigh, E., Miller, M.I., Winslow, R.L.: Computational cardiac
anatomy using MRI. Magnetic Resonance in Medicine (2004)
3. Wang, X., Chen, T., Zhang, S., Metaxas, D., Axel, L.: LV Motion and Strain Computation
from tMRI Based on Meshless Deformable Models. In: Metaxas, D., Axel, L., Fichtinger, G.,
Székely, G. (eds.) MICCAI 2008, Part I. LNCS, vol. 5241, pp. 636–644. Springer, Heidelberg
(2008)
4. Helm, P., Beg, M.F., Miller, M.I., Winslow, R.L.: Measuring and mapping cardiac fiber and
laminar architecture using diffusion tensor mr imaging. Annals of the New York Academy
of Sciences 1047(1), 296–307 (2005)
5. Mansi, T., Durrleman, S., Bernhardt, B., Sermesant, M., Delingette, H., Voigt, I., Lurz, P.,
Taylor, A.M., Blanc, J., Boudjemline, Y., Pennec, X., Ayache, N.: A Statistical Model of
Right Ventricle in Tetralogy of Fallot for Prediction of Remodelling and Therapy Planning.
In: Yang, G.-Z., Hawkes, D., Rueckert, D., Noble, A., Taylor, C. (eds.) MICCAI 2009, Part
I. LNCS, vol. 5761, pp. 214–221. Springer, Heidelberg (2009)
6. Reuter, M., Niethammer, M., Wolter, F.-E., Bouix, S., Shenton, M.: Global medical shape
analysis using the volumetric laplace spectrum. In: Proceedings of the 2007 International
Conference on Cyberworlds, NASA-GEM Workshop, pp. 417–426. IEEE Computer Society,
Los Alamitos (2007)
7. Reuter, M.: Laplace Spectra for Shape Recognition. Books on Demand GmbH (2006)
8. Niethammer, M., Reuter, M., Wolter, F.-E., Bouix, S., Peinecke, N., Koo, M.-S., Shenton,
M.E.: Global Medical Shape Analysis Using the Laplace-Beltrami Spectrum. In: Ayache,
N., Ourselin, S., Maeder, A. (eds.) MICCAI 2007, Part I. LNCS, vol. 4791, pp. 850–857.
Springer, Heidelberg (2007)
9. Weyl, H.: Das asymptotische verteilungsgesetz der eigenwerte linearer partieller differentialgleichungen. Math Ann., 441–469 (1912)
10. Kac, M.: Can one hear the shape of a drum? The American Mathematical Monthly 73(7),
1–23 (1966)
11. Courant, R., Hilbert, D.: Method of Mathematical Physics, vol. I. Interscience Publishers
(1966)
12. Reuter, M., Wolter, F.E., Peinecke, N.: Laplace-Beltrami spectra as ’Shape-DNA’ of surfaces
and solids. Computer-Aided Design 38, 342–366 (2006)
13. Protter, M.: Can one hear the shape of a drum? Revisited. SIAM Review 29(2), 185–197
(1987)
14. Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: NIPS, pp. 556–
562 (2000)
15. Heiberg, E., Sjogren, J., Ugander, M., Carlsson, M., Engblom, H., Arheden, H.: Design and
validation of segment - freely available software for cardiovascular image analysis. BMC
Medical Imaging 10(1), 1 (2010)
16. Ou, Y., Sotiras, A., Paragios, N., Davatzikos, C.: Dramms: Deformable registration via attribute matching and mutual-saliency weighting. Medical Image Analysis 15(4), 622–639
(2011)
17. Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Data Mining
and Knowledge Discovery 2, 121–167 (1998)