1
Pattern Recognition Letters
journal homepage: www.elsevier.com
Hypergraph Isomorphism Using Association Hypergraphs
Giulia Sandia,∗∗, Sebastiano Vascona,b , Marcello Pelilloa,b
a Dept.
of Environmental Sciences, Informatics and Statistics, Ca’ Foscari University, Via Torino 155, 30172 Mestre, Venice, Italy
Centre for Living Technology, Ca’ Foscari University, Ca’ Bottacin Dorsoduro 3911, 30123 Venice, Italy
b European
ABSTRACT
Association graphs represent a classical tool to deal with the graph matching problem and recently the
idea has been generalized to the case of hypergraphs. In this article, the potential of this approach is
explored. The proposed framework uses a class of dynamical systems derived from the Baum-Eagon
inequality in order to find the maximum (maximal) clique in the association hypergraph, that corresponds to the maximum (maximal) isomorphism between the hypergraphs to be matched. The proposed approach has extensively been tested with experiments on a large synthetic dataset, including
hypergraphs of different cardinalities, order and connectivities. In particular the isomorphism version
of the problem has been analyzed. The results obtained are impressive in terms of correctness, thus
showing that, despite its simplicity, the Baum-Eagon dynamics has an outstanding capacity of finding
globally optimal solutions and solving the hypergraph isomorphism problem.
© 2020 Elsevier Ltd. All rights reserved.
1. Introduction
In recent years hypergraphs (as opposed to graph) have been
used to encode structural information in different fields such
as computer vision, pattern recognition and machine learning,
thanks to the benefits that derive when dealing with relationships among more than two elements, thus encoding a higher
pool of information. In Bai et al. (2016) the authors evaluate the complexity traces of different hypergraphs representing images, getting the idea from the work on graphs by Xiao
et al. (2009), in order to cluster the data; in Zhang and Hancock (2012) and Zhang et al. (2016) hypergraphs are used for
feature selection, relying on the high cardinality of graphs to
reflect the functional similarities between more than two features. Moreover, matching this type of structures is of particular
interest for solving problems such as, e.g., feature tracking, object recognition, scene registration and shape matching, where
high-order relations are commonly used. Various studies have
tried to solve this question maximizing the sum of the matching
scores, thus transforming it into an optimization problem (see
e.g. Duchenne et al. (2011); Lee et al. (2011); Yan et al. (2015)
and the references therein).
∗∗ Corresponding
author:
e-mail:
[email protected] (Giulia Sandi)
In Pelillo (1998, 1999) the author focuses on the isomorphism problem using the classical technique of computing
the association graph between the structures that have to be
matched. On this newly built graph, techniques from evolutionary game theory are then applied. Moreover in Pelillo (2003)
the use of this approach both to graph isomorphism and to subgraph isomorphism has been widely discussed, highlighting in
the obtained results a relation between the complexity class of
the analyzed problems and the quality of the obtained results.
Recently, Zhang and Ren (2017) and Hou and Pelillo (2018)
tried to apply a similar methodology on uniform hypergraphs,
working on the association hypergraph using dynamics inspired
from the Baum-Eagon inequality (see Baum and Eagon (1967);
Pelillo (1997); Rota Bulò and Pelillo (2013)). In the aforementioned papers, only hypergraphs of cardinality 3 have been
taken into consideration, obtaining good results, however with
the developed approaches even uniform hypergraphs of larger
cardinality can be treated.
Inspired by these recent papers, in this article, which extends
the work done in Sandi et al. (2018), we investigate the potential of this technique. In particular, we focus on the simplest
case of the matching problem, namely the isomorphism case.
Specifically, we carried out a set of experiments using replicator
dynamics on a large synthetic dataset, that include 11400 different uniform hypergraphs of varying cardinalities, order and
connectivities. The results obtained are impressive: 100% of
2
the isomorphisms have been successfully returned in all cases,
despite the simplicity of the dynamics used.
The article is structured as follows. In section 2 an overview
of the state of the art is given; in section 3 we present the definition of association hypergraph together with some fundamental results that are required for using this auxiliary structure in
solving the isomorphism problem; in section 4 the Baum-Eagon
inequality is introduced, and the related dynamics, both in their
linear and exponential form, are described; section 5 illustrates
the set-up of the experiments and presents the related results,
which proved out to be exceptional in terms of precision; finally, section 6 concludes the article.
2. Related Works
In Zass and Shashua (2008) a probabilistic point of view on
the matching problem is offered. The authors employed the
Sinkhorn projection to match nodes on graphs and hypergraphs
of same sizes, and applied this technique for solving non-rigid
image matching. This work has been used in Mirzaalian et al.
(2009) to match moles in patients’ skin across different scanning time. In Duchenne et al. (2011) the authors proposed a
matching algorithm based on high order power iteration, employing an affinity tensor to capture the high order relations
among feature tuples, while in Leordeanu et al. (2011) a semisupervised learning approach that learn the parameters for the
hypergraph matching is outlined and the solution is obtained
through an Integer Quadratic Program (IQP).
Lee et al. (2011) extended Cho et al. (2010) proposing a hypergraph matching algorithm based on random walk, which was
able to embed strong prior on one-to-one correspondences of
nodes. Albarelli et al. (2012) use high order matching to classify objects, while in Yan et al. (2015) the hypergraph matching
problem has been solved using a discrete formulation that is
optimized with a particular adaptive discrete gradient assignment method, that is theoretically guaranteed to converge to a
fixed discrete point. In Nguyen et al. (2015) a tensor block coordinate ascent methods for hypergraph matching is proposed,
while Jin Chang et al. (2016) used the reweighted random walk
approach proposed by Lee et al. (2011) to find correspondences
between two dynamic kinematic structures of objects in sequences.
More recently Zhang and Ren (2017) proposed an approach
based on game-theory to refining multi-source feature correspondences. Although the optimization methodology is the
same of our work (we both optimizes a polynomial with the
Baum-Eagon inequality), we differ from this paper in the spirit
of the analysis. We are in fact interested in a different problem: the unweighted hypergraph isomorphism. A similar approach has been used in Hou and Pelillo (2018) to perform feature matching across a set of images. Finally Zhang and Wang
(2018) use a Markov Chain Monte Carlo method to narrow the
search space and solve the hypergraph matching problem.
The vast majority of the works presented here use only small
hypergraphs of cardinality 3, due to computational reasons.
Even though the cost for solving the hypergraph matching problem increases exponentially with the cardinality and order of
the structures to be matched, our method overcome this limitation being able to work with hypergraphs of higher cardinality.
In fact we have successfully tested our approach with uniform
hypergraphs of cardinality 3, 4 and 5.
3. Hypergraph matching using association hypergraphs
An unweighted undirected hypergraph is formally defined by
the triplet H = (V, E, K(H)), where:
• V = {1, . . . , n} represents the finite set of vertices or nodes;
• E ⊆ 2V \ 0 represents the finite set of hyperedges or hyperarcs of various cardinalities (where 2V denotes the powerset of V);
• K(H) = {|F| : F ∈ E} represents the finite set of edge types
of H, that are all the possible cardinalities of the edges in
E.
In this article we focus only on uniform hypergraphs, or kgraphs, where all the edges have the same cardinality, and the
set of edge cardinalities K(H) is made of one single element
k ≥ 2, hence E ⊆ V × · · · × V for k times. If k = 2 we have the
classic notion of graph, where only pairwise relations are taken
into consideration. The order of H is the number of its vertices,
while its size determines the number of its edges. Given n nodes
i1 , ..., in ∈ V, they are said to be adjacent if there exists an e ∈ E
such that {i1 , ..., in } ⊆ e. The degree of a vertex i ∈ V, denoted
by deg(i), is the number of vertices adjacent to it.
Given a k-graph H = (V, E, k), a clique is defined as a subset of vertices C such that for all distinct i1 , ..., ik ∈ C, they
are mutually adjacent, that is {i1 , ..., ik } ∈ E. A clique that is
not contained in any larger clique is called a maximal clique,
while the largest clique in the hypergraph is called the maximum clique. The cardinality of the maximum clique is called
the clique number ω(H).
In order to ease the lecture of this article, from now on the
words hypergraph, k-graph, or graph will be used interchangeably for indicating uniform un-weighted hypergraphs with hyperarcs of cardinality k, when no ambiguity can arise from the
context. Similarly the words edge or arc will refer to hyperedge
or hyperarc.
Given two hypergraphs H ′ = (V ′ , E ′ , k) and H ′′ =
(V ′′ , E ′′ , k), an isomorphism between them is defined by any
bijection φ : V ′ → V ′′ for which {i1 , ..., ik } ∈ E ′ ⇔
{φ(i1 ), . . . , φ(ik )} ∈ E ′′ , ∀i1 , ..., ik ∈ V ′ . Two k-graphs are said to
be isomorphic if there exists an isomorphism between them.
Usually, when speaking about the graph matching problem
we indicate the maximum common sub-graph problem, that
aims at understanding if there is a subset of vertices, possibly
the largest one, in the two graphs, that are isomorphic. However in this article we are going to focus on the hypergraph isomorphism problem, that is a special case of the more general
matching question. We therefore want to understand if two hypergraphs are isomorphic and in case finding the isomorphism
itself.
The hypergraph isomorphism problem is easier than the general matching problem (see Garey and Johnson (1979)): the
first one is located in the low hierarchy of NP, thus meaning
that is not NP-complete nor it has been demonstrated to belong
3
Fig. 1. Two isomorphic and symmetric hypergraphs. There is not a unique
isomorphism between these structures. Two possible matches are: (1,A),
(2,B), (3,C), (4,D), (5,E), (6, F) and (1,A), (2,C), (3,B), (4,D), (5,F), (6,E),
but other matches are possible too. This will result in saddle points when
running the optimization process (see Section 5)
to P, while the second one has been demonstrated to be an NPcomplete problem.
We can generalize to k-graphs, without too much effort, the
notion of association graph. This is an handy auxiliary graph
structure conceived for solving general graph matching problems, that has been introduced in Barrow and Burstall (1976)
and also in Kozen (1978).
approaches that are known to work well in solving the latter
problem.
Let’s consider the arbitrary undirected hypergraph of order n
H = (V, E, k), and let S n indicates the standard simplex of IRn :
n
X
n
x ∈ IR : xi ≥ 0, for all i = 1, ..., n, and
Sn =
xi = 1
i=1
Given C a subset of vertices of H, xc indicates its characteristic vector, and it corresponds to the point in S n described by:
1/|C|, if i ∈ C
c
xi =
0,
otherwise
where |C| indicates the cardinality of C.
Let’s now consider the Lagrangian of H, which is the polynomial function defined as:
XY
f (x) =
xi
(1)
e∈E i∈e
Definition 1. The association hypergraph derived from unweighted uniform hypergraphs H ′ = (V ′ , E ′ , k) and H ′′ =
(V ′′ , E ′′ , k) is the undirected unweighted hypergraph H =
(V, E, k) defined as:
V = V ′ × V ′′
and
E = {{(i1 , j1 ), ..., (ik , jk )} ∈ V ′ × V ′′ :
i1 , ... , ik , j1 , ... , jk ,
{i1 , ..., ik } ∈ E ′ ⇔ { j1 , ..., jk } ∈ E ′′ }.
Generalizing to hypergraphs an equivalent result achieved by
Pelillo (1998, 1999) for graphs, we can demonstrate a one-toone correspondence between the maximum clique problem and
the graph isomorphism problem.
Theorem 1. Let H ′ = (V ′ , E ′ ) and H ′′ = (V ′′ , E ′′ ) be two hypergraphs of order n and edge cardinality k, and let H = (V, E)
be the related association k-graph. Then H ′ and H ′′ are isomorphic if and only if ω(H) = n. In this case, any maximum
clique of H induces an isomorphism between H ′ and H ′′ , and
vice versa. In general, maximum and maximal common subgraph isomorphisms between H ′ and H ′′ are in one-to-one correspondence with maximum and maximal cliques in H, respectively.
Sketch of proof. Suppose that the two k-graphs are isomorphic, and let φ be an isomorphism between them. Then the
subset of vertices of H defined as Cφ = {(i, φ(i)) : ∀i ∈ V ′ } is
clearly a maximum clique of cardinality n. In reverse, let C be
an n-vertex maximum clique of H, and for each (i, h) ∈ C define φ(i) = h. Then it is easy to see that φ is an isomorphism
between H ′ and H ′′ because of the way the association k-graph
is constructed. The proof for the general case is analogous.
With this result we can move our original problem of finding
a matching between graphs to the problem of finding a maximum clique in the related association graph. Obviously this
does not reduce the complexity of our problem since, again,
the clique decision problem is NP-complete (see Karp (1972)),
however it gives us the possibility to use one of the existing
∗
A point x ∈ S n is said to be a global maximizer of f in S n if
f (x∗ ) ≥ f (x), ∀ x ∈ S n . On the other hand it is said to be a local
maximizer if ∃ an ǫ > 0 such that f (x∗ ) ≥ f (x) ∀ x ∈ S n whose
distance from x∗ is smaller than ǫ. Moreover, x∗ is said to be a
strict local maximizer if f (x∗ ) = f (x) implies x∗ = x.
When dealing with graphs (namely, k = 2), the MotzkinStraus theorem (see Motzkin and Straus (1965)) provides a notable relation between local (global) maximizers of function (1)
in S n and maximal (maximum) cliques of the graph itself. In
particular, it claims that a subset C of vertices of a graph G,
is a maximum clique if and only if a global maximizer of the
Lagrangian of the graph G in the standard simplex S n is in the
form of the characteristic vector xC . This result interestingly
gives the opportunity to move from a discrete to a continuous
formulation of our problem.
The formulation of f (x) as the Lagrangian of the hypergraph
is the same used in Zhang and Ren (2017), Hou and Pelillo
(2018) and Sandi et al. (2018), and is derived by directly extending the Motzkin-Straus theorem on graphs. Given the good
results on the aforementioned papers, in our experiments we are
going to adopt this function. However different formulations
are possible, for example the ones that generalize the MotzkinStraus theorem, together with Bomze’s formulation, to both
uniform and non-uniform hypergraphs, presented in Rota Bulò
and Pelillo (2009) and Peng et al. (2016) respectively.
4. Finding isomorphisms using the Baum-Eagon dynamics
Given the definitions and results just presented, the problem of finding an isomorphisms can be reduced to solving the
following (linearly constrained) polynomial optimization problem:
XY
maximize f (x) =
xi
e∈E i∈e
(2)
subject to x ∈ S n
Applying a result achieved by Baum and Eagon (1967) we
obtain an easy but powerful tool for optimizing this function.
4
Their work introduces a class of non-linear transformations in
the standard simplex, and proves a key result, generalizing a
previous one introduced by Blakley (1964) on related characteristics for specific homogeneous quadratic transformations. The
Baum-Eagon inequality is presented in the following theorem.
Theorem 2. (Baum and Eagon (1967)) Let Q(x) be a homogeneous polynomial in the variables x j with non-negative coefficients, and let x ∈ ∆. Define the mapping z = M(x) from ∆ to
itself as follow:
n
∂Q(x) X ∂Q(x)
, j = 1, ...n.
(3)
xl
zj = xj
∂x j l=1
∂xl
Then Q(M(x)) >Q(x) unless M(x) = x.
This theorem defines a continuous mapping that is generally
known as a growth transformation. It is worth noticing that,
when defining the mapping M, only first-order derivatives are
involved. Nevertheless M is able, with a finite number of steps,
to increase the polynomial Q, thus strongly differentiating itself
from classical gradient methods, that need instead to estimate
high-order derivatives for determining the size of the infinitesimal steps to be taken. Moreover the use of gradient descend
methods induce some problems for the points on the boundary,
since some projection operator are performed. By contrast, a
simple normalization on rows has to be performed when using
results of theorem 2.
Therefore the Baum-Eagon inequality provides a powerful
tool and has been widely used for maximizing polynomials
functions in S n . In particular they are a main component in
various statistical estimation methodologies developed within
the theory of probabilistic functions of Markov Chains (see
Baum et al. (1970)), and they are used to analyze the dynamical
properties of relaxation labelling processes (see Pelillo (1997)).
Theorem 2 can be employed to optimize our problem in equation (2), since we have to maximize the function f that is indeed
an uniform polynomial with non-negative coefficients, with the
constraint of having x laying within the standard simplex.
The following discrete time dynamic can be defined from the
Baum-Eagon inequality:
δ j (x)
, j = 1...n,
(4)
x j (t + 1) = x j (t) Pn
i=1 xi δi (x)
where for readability reasons we have defined δ j (x) =
∂ f (x)/∂x j .
At time 0, the dynamic in equation 4 starts from a random
point within the standard simplex S n , then, one step after the
other, the state vector is updated until convergence. At the end
of the iterations the resulting state vector will have the form of
a characteristic vector, that can be thresholded against a small
amount close to zero thus returning only the subset of nodes
in the association graph that are part of the (maximum) clique.
Even though we have no theoretical guarantee that the discrete
time dynamic returns a global maximum of the function, and
not a local maximum, we will see in section 5 that the basins of
attraction of the global optimum are quite large, and the use of
this dynamic derived from the Baum-Eagon inequality always
returned the maximum clique in the association graph, in all the
performed experiments, and never incurred in local solutions.
An exponential version of the proposed dynamic can be defined, in order to obtain a faster convergence:
eκδ j (x)
,
κδi (x)
i=1 xi (t)e
x j (t + 1) = x j (t) Pn
j = 1...n.
(5)
Using this exponential dynamic might decrease the time needed
in the optimization step, however it comes with a price: we have
in fact to tune the newly introduced parameter κ. In setting this
parameter we have to be careful to speed-up the process without
loosing the correctness of the results. In section 5 some remarks
are made on how the value of κ influences the search for the
global optimum.
5. Experiments
In this section we are first going to provide a description of
how we produced the dataset and how we carried out the experiments. Then the obtained results are analyzed.
5.1. Experimental set-up
The proposed approach has been systematically tested on
random hypergraphs of different cardinalities, sizes and connectivities, in order to evaluate its efficacy and to understand if
the results heavily differs, depending of the various combination of parameters used in constructing the graphs.
The hypergraph isomorphism problem has been inspected
with a specifically designed dataset considering hypergraphs
of cardinality 3, 4 and 5. For each cardinality the dataset is
made of graphs of different orders, in particular: for k = 3 we
have n = {25, 50, 75, 100}, for k = 4 we have n = {25, 50, 75},
while for for k = 5 we have n = {25, 50}. For each possible
cardinality-order pair, 15 different connectivity rates are tested
in the interval [0.01%, 0.99%] (see Table 1). Moreover, all the
different densities have been evaluated on 100 different hypergraphs, except for the combination of parameters k = 4, n = 75
and k = 5, n = 50 that for complexity reasons have been tested
on 30 different hypergraphs. The dataset is therefore made of a
total of 11400 experiments.
Table 1. Specification of the different parameters used to create the dataset.
Arcs Cardinality k
Order
Connectivity Rate
25
3
4
5
50
75 100 25
50 75 25
50
0.01, 0.03, 0.05, 0.1, 0.2, 0.3, 0.4, 0.5,
0.6, 0.7, 0.8, 0.9, 0.95, 0.97, 0.99
The disparity in the considered orders and number of experiments among the different cardinalities is due only to complexity reasons. In fact the proposed framework can be used for
larger experiments both in terms of cardinality and order, as
long as there is enough time and memory available.
The dataset has been built in the following way: one hypergraph for each experiment (from now on H1 ) has been randomly generated adopting the random graph model proposed
by Gilbert (1959). Given the following parameters: cardinality
k, order n, connectivity rate p, each k-graph is created with the
algorithm:
1. start from n isolated nodes;
2. select a k-tuple of nodes, and generate a random number
between 0 and 1. If the random number is smaller than
p then an edge connecting the selected nodes is added,
otherwise the nodes are left disconnected;
5
k
0.25
edges.
Once H1 has been built, the second graph H2 is generated starting from H1 and randomly perturbing the IDs of its nodes. The
k-graphs here created are ready to be matched. Using random
graphs to test the framework has two main advantages. First,
a wide variety of parameters combination can be extensively
tested, since we are not bound to any specific application. In
this way even structures that might be uncommon for some
application, but still of interest for others, are explored. Second, we can create an experimental system that is easily reproducible, and can therefore be used to compare our approach
with other algorithms.
When running the experiments, an immediate problem arises
during the creation of the association hypergraph. Since the
set of its vertices contains all the possible association of nodes,
the order and size increases exponentially with the order of the
structures to be matched. For this reason some pruning to the
possible couples of nodes, as far as it is possible without removing any information, has been developed. In fact we can easily
understand that not all the pair of nodes have sense. In particular, since we are trying to understand if the two hypergraphs are
identical, there is no use in pairing nodes with different degree.
Therefore, the vertex set of the association graph is constructed
as follow:
V = (i, j) ∈ V ′ × V ′′ : deg(i) = deg( j)
and the edge set has been defined as in definition 1. When
the two graphs are isomorphic, theorem 1 continues to hold,
since the isomorphisms preserves the degree property of vertices. However, using this simple heuristic, we greatly decrease
the number of nodes in the association graph, and the same goes
for the number of arcs, greatly simplifying the optimization
task. In fact, after pruning, in some parameters combination
we keep less than the 5% of the possible coupled nodes.
Once the association graph is ready, the optimization process
can start. Each experiment has been performed with the linear Baum-Eagon dynamic (Eq. (4)) and with its exponential
version (Eq. (5)) with the κ parameter ranging in {10, 25, 50}.
The algorithm was started in the barycentre of the simplex and
stopped when either the distance of two subsequent points was
smaller than a given threshold, set to 10−5 , or when a maximum
number of time-steps, equal to 600, has been processed. When
the algorithm stops, we check if a clique has been found: in
the negative case, the final point is randomly perturbed and the
algorithm is started again, using as initial point the randomly
perturbed x.
Perturbing is needed in order to avoid saddle points, that are
solutions in which the algorithm is not able to return a clique
because there are some symmetries in the graphs to be matched.
This results in more than one possible isomorphism, and the
algorithm does not have any way of choosing one solution with
respect to the other. This problem arises because the algorithm
starts the search for the barycentre of the simplex.
Figure 2 shows all the steps done by the algorithm to solve
the symmetric isomorphism problem of figure 1. A perturbation
is clearly visible at step 26. After the perturbation the symmetry is broken and the algorithm is able to ”choose” the correct
Components of state vector
3. repeat step 2 for each of the possible
n
0.2
0.15
Perturbation
0.1
0.05
0
0
10
20
30
40
50
60
70
Iterations
Fig. 2. Evolution through time of the components of the state vector x(t)
from the hypergraphs in figure 1 using the linear dynamic in Eq. 4. A
perturbation can be seen at iteration 26, after which the algorithm is able
to choose which associations to keep which to discard.
pairings, thus reaching the global optimum and finding the maximum clique. The perturbation is repeated for at most 4 times,
then the algorithm is stopped independently from the fact that
it has returned a clique or not.
Once the optimization is finished we have to understand if the
returned state vector represents a correct isomorphism. In order to automatically check the results, the vector is thresholded
against a small value close to 0 (the value 10−5 has been used),
thus selecting only some couples. Then we verify if the returned
couples represent a clique in the association hypergraph, and, in
case they do, we verify if the clique number is equal to the number of nodes of the original structures. In fact, from theorem 1,
there is the guarantee of finding the isomorphism only if the returned couples represent a maximum clique. Since H1 and H2
are of the same order by construction (otherwise we wouldn’t
have any isomorphism), there can not be an isomorphism concerning a number of nodes that is larger than the size of the
vertex set of these two graphs. All the experiments have been
run on a workstation equipped with an Intel Core i7-6800K at
3.40GHz with 128GB of RAM, and the code for the framework
has been implemented in C.
5.2. Experimental results
In terms of correctness, the proposed framework, has returned impressive results: as we can see in figure 3, all the
isomorphisms have been properly found, with the algorithm returning 100% of the nodes exactly coupled, for all the 11400
experiments, independently of the dynamic that has been used
and independently of the cardinality k, the number of nodes n
and the connectivity rate p.
The mean CPU time needed to run the optimization, considering the best performing dynamic for each set of parameters,
is shown in figure 4. As we can see all the curves have the same
behaviour, showing that the algorithm is extremely slow when
dealing with very sparse or very dense graphs, while it is way
faster when solving matching between graphs with mean density. This is due to the fact that the association hypergraphs in
the extreme connectivity rates are way bigger and denser than
the ones in the median connectivity rates, thus giving birth to
more complex polynomials to be optimized.
With no surprise, we see that dealing with smaller hypergraphs results in shorter execution times, nevertheless the behaviour of the curves according to all the other parameters is
exactly the same independently on the cardinality or on the order of the original hypergraphs. We can also notice than, as
it could be expected, keeping the cardinality k fixed, execution
6
Fraction of Correct Experiments K=3, N=50
Fraction of Correct Experiments K=3, N=25
Fraction of Correct Experiments K=3, N=75
1
1
1
0.5
0.5
0.5
0
0
0
0.010.030.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.950.970.99
0.010.030.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.950.970.99
Fraction of Correct Experiments K=3, N=100
Fraction of Correct Experiments K=4, N=25
0.010.030.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.950.970.99
Fraction of Correct Experiments K=4, N=50
1
1
1
0.5
0.5
0.5
0
0
0
0.010.030.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.950.970.99
0.010.030.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.950.970.99
Fraction of Correct Experiments K=4, N=75
Fraction of Correct Experiments K=5, N=25
0.010.030.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.950.970.99
Fraction of Correct Experiments K=5, N=50
1
1
1
0.5
0.5
0.5
0
0
0.010.030.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.950.970.99
0.010.030.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.950.970.99
Linear dynamic
Exponential dynamic KAPPA=10
0
0.010.030.05 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 0.950.970.99
Exponential dynamic KAPPA=25
Exponential dynamic KAPPA=50
Fig. 3. Image has been changed Fraction of correct results for all the different experiments tested. 100% of all the performed experiments, independently
of the different combination of parameters, have returned the correct isomorphism in all the dynamic used. On the x-axis we have the different connectivity
rates, on y-axis we have the fraction of correct results.
takes longer for hypergraphs of bigger order; vice versa, keeping the order n fixed, the higher the cardinality the longer the
running time. In general, the exponential dynamic always out-
Evolution of vector state, K=100
0.4
0.3
0.2
Average Computation Time Optimization Step
10
k=3, n=25
k=3, n=50
k=3, n=75
2
k=3, n=100
k=4, n=25
k=4, n=50
k=4, n=75
k=5, n=25
k=5, n=50
0.1
0
Average CPU time (in seconds)
0
100
200
300
400
500
600
Fig. 5. Evolution of the state vector when matching two hypergraphs with
k = 3, n = 75 and p = 0.7 using the exponential dynamic with κ = 100. The
value of κ is too big and the dynamic is not able to converge.
10 1
10 0
10 -1
10 -2
0.01
0.03
0.05
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
0.95
0.97
0.99
Expected Connectivity
Fig. 4. Mean CPU timeneeded to run the optimization algorithm for finding
isomorphism on hypergraphs with different cardinalities and orders. Only
the best performing dynamic is drawn for each combination of parameters.
Y-axes are in logarithmic scale.
performs the linear one, thus resulting to be really attractive,
from a computational point of view. However it introduces a
parameter to be tuned, the κ in equation 5. The value for κ has
been searched initially in a logarithmic scale {1, 10, 100} and
further refined in the proximity of a good value. With κ = 100
we obtained a strongly oscillatory trend in the elements of the
state vector that prevented the dynamic to converge (see figure
5). On the other hand, with κ = 1 the dynamic was nearly as
slow as the linear one, thus not giving any incentive in using
the exponential form. With κ = 10 we achieved a faster and
convergent behaviour, hence we refined the space for this parameter between 10 and 100.
Three different values of κ (10, 25, 50) have been used on all
experiments. In nearly every parameters configuration, the best
performing dynamic that has been inserted in figure 4 is always
the exponential with κ = 50.
The only exception is when we have k = 3 and n = 25. In
this case the graphs to be matched, and therefore the related
association hypergraph, are really small, and the exponential
dynamic with κ = 50 oscillates too much. The best performing
dynamic becomes therefore the exponential one with κ = 25.
Figure 6 shows the evolution through time of the state vector in one of the experiments with k = 3 and n = 25, with
all the different dynamics.The first thing that is clearly visible
is that all the dynamics have the same behaviour, perturbing
exactly one time the state vector before finding the correct isomorphism. As we could have expected, the linear dynamic is
the slowest one, while all the exponential dynamics are faster.
However we can see that the value κ = 50 is too high: even
though the correct isomorphism is always found, this dynamic
oscillates too much and it needs more iterations in order to converge to a final solution, thus requiring more time than all the
other dynamics. This shows that the value of κ is strictly dependent on the problem under examination, and a single value
cannot be used on all the experiments. Thus, in a real setting
with a specific problem, the parameters of that problem have to
be analyzed and evaluated, and the value of κ has to be tuned
according to those specific values.
6. Conclusions
In this paper, we tackled the hypergraph isomorphism problem by generalizing the notion of association graph to the
higher-order case of uniform hypergraphs. Given two uniform
k-graphs, we proved that maximal cliques in their association
hypergraph, are in one-to-one correspondence with their possible isomorphisms. Furthermore, in our extensive experimentation, we successfully employed the simple Baum-Eagon inequality and its exponential variant to find maximum cliques.
7
Fig. 6. Comparison of the evolution of the state vector for one experiment
with cardinality k = 3, order n = 25 and connectivity p = 0.99. Even
though all the dynamics converge to the correct isomorphism after exactly
one perturbation, the number of iteration needed to stop the algorithm
is different. It is worth noticing that the value κ = 50 is too large for the
exponential dynamic, since it oscillates a lot, thus needing a lot of iterations
to converge.
We reported impressive results: in 11400 experiments on randomly generated hypergraphs of different cardinalities, orders
and connectivities we have always obtained 100% of correct
isomorphisms, emphasizing the ability of the proposed dynamical system to escape from local optima. Alongside the experimentation, we have shown that for the extreme connectivity
rates (sparse and dense graphs) very large association hypergraph are generated needing more resources to run the optimization. In such cases, the use of a sparsification technique
and the exponential dynamics might lower the time and space
needed to perform the experiments, but at the price of setting the parameter κ. In our future work we plan to explore
the more general and complex task of sub-hypergraph isomorphism, and to investigate the regularized formulation proposed
in Rota Bulò and Pelillo (2009).
References
Albarelli, A., Bergamasco, F., Rossi, L., Vascon, S., Torsello, A., 2012. A
stable graph-based representation for object recognition through high-order
matching, in: Proc. 21th International Conference on Pattern Recognition
(ICPR), pp. 3341–3344.
Bai, L., Escolano, F., Hancock, E., 2016. Depth-based hypergraph complexity
traces from directed line graphs. Pattern Recognition 54.
Barrow, H.G., Burstall, R.M., 1976. Subgraph isomorphism, matching relational structures and maximal cliques. Inf. Process. Lett. 4, 83–84.
Baum, L.E., Eagon, J.A., 1967. An inequality with applications to statistical
estimation for probabilistic functions of markov processes and to a model
for ecology. Bulletin of the American Mathematical Society 73, 360–363.
Baum, L.E., Petrie, T., Soules, G., Weiss, N., 1970. A maximization technique
occurring in the statistical analysis of probabilistic functions of markov
chains. The annals of mathematical statistics 41, 164–171.
Blakley, G.R., 1964. Homogeneous nonnegative symmetric quadratic transformations. Bulletin of the American Mathematical Society 70, 712–715.
Cho, M., Lee, J., Lee, K.M., 2010. Reweighted random walks for graph matching, in: European Conference on Computer Vision (ECCV), Springer. pp.
492–505.
Duchenne, O., Bach, F., Kweon, I., Ponce, J., 2011. A tensor-based algorithm
for high-order graph matching. IEEE Transactions on Pattern Analysis and
Machine Intelligence 33, 2383–2395.
Garey, M.R., Johnson, D.S., 1979. Computers and Intractability: A Guide to
the Theory of NP-Completeness. W. H. Freeman & Co.
Gilbert, E.N., 1959. Random graphs. The Annals of Mathematical Statistics
30, 1141–1144.
Hou, J., Pelillo, M., 2018. A game-theoretic hyper-graph matching algorithm,
in: Proc. 24th International Conference on Pattern Recognition (ICPR), pp.
1012–1017.
Jin Chang, H., Fischer, T., Petit, M., Zambelli, M., Demiris, Y., 2016. Kinematic structure correspondences via hypergraph matching, in: Conference
on Computer Vision and Pattern Recognition (CVPR), pp. 4216–4225.
Karp, R.M., 1972. Reducibility among combinatorial problems, in: Complexity
of computer computations. Springer, pp. 85–103.
Kozen, D., 1978. A clique problem equivalent to graph isomorphism. ACM
SIGACT News 10, 50–52.
Lee, J., Cho, M., Lee, K.M., 2011. Hyper-graph matching via reweighted random walks, in: Conference on Computer Vision and Pattern Recognition
(CVPR), pp. 1633–1640.
Leordeanu, M., Zanfir, A., Sminchisescu, C., 2011. Semi-supervised learning
and optimization for hypergraph matching, in: International Conference on
Computer Vision (ICCV), pp. 2274–2281.
Mirzaalian, H., Hamarneh, G., Lee, T.K., 2009. A graph-based approach to skin
mole matching incorporating template-normalized coordinates, in: Conference on Computer Vision and Pattern Recognition. (CVPR), pp. 2152–2159.
Motzkin, T.S., Straus, E.G., 1965. Maxima for graphs and a new proof of a
theorem of Turán. Canad. J. Math. 17, 533–540.
Nguyen, Q., Gautier, A., Hein, M., 2015. A flexible tensor block coordinate ascent scheme for hypergraph matching, in: Conference on Computer Vision
and Pattern Recognition (CVPR), pp. 5270–5278.
Pelillo, M., 1997. The dynamics of nonlinear relaxation labeling processes.
Journal of Mathematical Imaging and Vision 7, 309–323.
Pelillo, M., 1998. A unifying framework for relational structure matching,
in: Proc. 14th International Conference on Pattern Recognition, (ICPR), pp.
1316–1319.
Pelillo, M., 1999. Replicator equations, maximal cliques, and graph isomorphism. Neural computation 11, 1933–1955.
Pelillo, M., 2003. Computational complexity and the elusiveness of global
optima. Limitations and future trends in neural computation. NATO science
series 186, 71–94.
Peng, Y., Peng, H., Tang, Q., Zhao, C., 2016. An extension of the motzkinstraus theorem to non-uniform hypergraphs and its applications. Discrete
Applied Mathematics 200, 170–175.
Rota Bulò, S., Pelillo, M., 2009. A generalization of the motzkin–straus theorem to hypergraphs. Optimization Letters 3, 287–295.
Rota Bulò, S., Pelillo, M., 2013. A game-theoretic approach to hypergraph
clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence
35, 1312–1327.
Sandi, G., Vascon, S., Pelillo, M., 2018. On association graph techniques
for hypergraph matching, in: Structural, Syntactic, and Statistical Pattern Recognition - Joint IAPR International Workshop, (S+SSPR), Beijing,
China, pp. 481–490.
Xiao, B., Hancock, E.R., Wilson, R.C., 2009. Graph characteristics from the
heat kernel trace. Pattern Recognition 42, 2589–2606.
Yan, J., Zhang, C., Zha, H., Liu, W., Yang, X., Chu, S.M., 2015. Discrete hypergraph matching, in: Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1520–1528.
Zass, R., Shashua, A., 2008. Probabilistic graph and hypergraph matching, in:
Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8.
Zhang, H., Ren, P., 2017. Game theoretic hypergraph matching for multi-source
image correspondences. Pattern Recognition Letters 87, 87–95.
Zhang, R., Wang, W., 2018. Second- and high-order graph matching for correspondence problems. IEEE Transactions on Circuits and Systems for Video
Technology 28, 2978–2992.
Zhang, Z., Bai, L., Liang, Y., Hancock, E., 2016. Joint hypergraph learning and
sparse regression for feature selection. Pattern Recognition 63.
Zhang, Z., Hancock, E., 2012. Hypergraph based information-theoretic feature
selection. Pattern Recognition Letters 33.