Generating Random Regular Graphs: J. H. Kim V. H. Vu August 9, 2006
Generating Random Regular Graphs: J. H. Kim V. H. Vu August 9, 2006
Generating Random Regular Graphs: J. H. Kim V. H. Vu August 9, 2006
J. H. Kim
V. H. Vu
August 9, 2006
Abstract
Random regular graphs play a central role in combinatorics and theoretical computer
science. In this paper, we analyze a simple algorithm introduced by Steger and Wormald
[10] and prove that it produces an asymptotically uniform random regular graph in a
polynomial time. Precisely, for xed d and n with d = O(n
1/3
), it is shown that the
algorithm generates an asymptotically uniform random d-regular graph on n vertices in
time O(nd
2
). This conrms a conjecture of Wormald. The key ingredient in the proof
is a recently developed concentration inequality by the second author.
The algorithm works for relatively large d in practical (quadratic) time and can be
used to derive many properties of uniform random regular graphs.
1 Introduction
One of the most important and interesting models for random graphs is the model of random
regular graphs. The study of this model has been pursued by a large number of researchers
for decades. Random regular graphs have been playing a crucial role in theoretical computer
science, especially in the theory of expanders. For instance, it is now known that a uniform
random d-regular graph has asymptotically the best possible expansion rate, namely, the
second eigenvalue of it is almost surely (2+o(1))
Department of Mathematics, UCSD, La Jolla, CA 92093. Research supported in part by grant RB091G-
VU from UCSD, by NSF grant DMS-0200357 and by an A. Sloan fellowship. Email:[email protected]; website:
www.math.ucsd/vanvu.
1
Question. How do we generate a uniform random regular graph?
Besides its obvious practical importance, the above question is also critical from the theo-
retical point of view, as it is dicult to study properties of random regular graphs (or any
random objects) without knowing how to generate them.
By denition, the most straightforward method is making a list of all (simple) regular
graphs and then choosing one uniformly at random from the list. This, unfortunately, could
never be done in practice, as the number of regular graphs is huge (see Section 3 for an
asymptotic estimate).
A better approach is to follow the conguration model proposed by Bollobas [1], and
Bender and Caneld [2]. In order to generate a uniform random d-regular graph on n
vertices, we consider a family of n sets of size d. Each set may be regarded as a set of
d copies of each vertex. All together there are nd points, or nd copies of the vertices.
Draw a uniform random perfect matching on these copies and connect two vertices i and
j if the matching contains an edge consisting of copies of i and j. It is easy to see that
the resulting multigraph is d regular and the distribution, conditioned on the graph being
simple, is uniform. It is known that if d is a constant then the probability of being simple
is bounded by a positive constant, uniformly in n. This, however, is no longer true if d is
large, for instance logn or n
a
, so that most of the time one gets a non-simple graph. More
precisely, it has been shown [12] that the probability is about exp(
d
2
4
) , which tends to 0
extremely fast in d. Even for a modest parameter such as d = 14, one, in expectation, has
to try e
49
> 10
20
times until he or she obtains a simple graph. This clearly rules out the
possibility of using this approach in practice unless d is very small. From the theoretical
point of view, the conguration model is not very useful if d logn, as one has to beat the
failure probability exp(
d
2
4
). For more discussion about this point, we refer to Wormalds
survey [12]. In [7], McKay and Wormald gave an algorithm with polynomial running time
which generates uniform random regular graph with degree d = O(n
1/3
). However, Steger
and Wormald [10] pointed out that this algorithm is rather complicated to implement.
For many practical and theoretical purposes, it is usually sucient to generate random
d-regular graphs which are asymptotically uniform. After all, most statements we want to
prove about a random model are of an asymptotic nature. It has turned out that allowing
asymptotically uniform distributions makes the problem more feasible.
A natural way to generate an asymptotically uniform random regular graph is to use the
Markov chain technique. This was done by Jerrum and Sinclair [4], using their algorithm
for generating random perfect matchings in dense graphs. On the other hand, although
the algorithm is proved to be in P, its running time is (n
10
), which is only theoretically
polynomial. Moreover, it seems to us that it is very hard to prove properties of random
regular graphs based on this algorithm.
More recently, a faster and very practical algorithm was analyzed by Steger and Wormald
[10], following an earlier work of Ruci nski and Wormald [9]. This algorithm is a natural
renement of the conguration model algorithm of Bollobas, and Bender and Caneld.
Again let us consider a set V of nd points which is the union of n disjoint sets of size d. We
want to construct a perfect matching which does not give rise to loops and parallel edges.
We achieve this by building the matching one edge at a time. First of all, we never pick
2
an edge which creates a loop. Namely, we never pick an edge with both ends in the same
set of size d. Assume that a bunch of edges are picked. In the next step, we only pick an
edge which does not create a parallel edge, namely, we ignore all the edges connecting two
sets I and J after some i I and j J are connected for the rst time. An edge is called
suitable if it creates neither a loop nor a parallel edge. When we pick an edge, we pick it
with respect to the uniform distribution over the set of all suitable edges. At the end, we
obtain a random perfect matching, which provides a simple, d-regular random graph. The
implementation of the algorithm is straightforward and its running time is only O(nd
2
) (see
[10]), which is sub-quadratic for d n
1/2
. The only matter here is the distribution of this
random graph.
Steger and Wormald [10] proved that if d = o(n
1/28
), then the distribution in question
is asymptotically uniform. Wormald [12] conjectured that the distribution is still asymp-
totically uniform for any d signicantly less than n
1/3
.
The rst purpose of this paper is to prove this conjecture. This way, we obtain a very
fast method to generate asymptotically uniform random regular graphs with degree up to
n
1/3
, for any positive constant . Our result also has a considerable theoretical value,
as we can rely on the algorithm to prove many highly non-trivial properties of uniform
random regular graphs. In fact, it was shown, using a weaker version of this result, that
if logn d n
1/3
, then the random regular graph G
n,d
behaves very much like the
(non-regular) Erdos-Renyis random graph G(n, p) of the same density [6]. Among others,
it follows immediately that in this range of d, G
n,d
almost surely contains a hamiltonian
cycle, since the Erdos-Renyi graph with density signicantly larger than logn/n has this
property. The interested reader is referred to [6] for a precise statement.
The other purpose of the paper is to introduce a further application, via the proof,
of a recently developed concentration inequality by the second author [11] , which can be
applied for functions with large Lipchitz coecients, a situation where standard tools such
as Azumas and Talagrands are ineective. This concentration result plays a critical role
in the proof. We will partly follow the somewhat natural approaches in [10] and then use
the concentration result together with more sophisticated analysis.
In the whole paper, Pr denotes probability and E denotes expectation. 1
X
is the
characteristic function of the event X: 1
X
= 1 if X occurs and 1
X
= 0 otherwise. The
asymptotic notation is used under the assumption that n tends to innity. All logarithms
have the natural base.
2 The algorithm and the result
Let N(d, n) be the number of d-regular graphs on n vertices. McKay and Wormald [8] have
shown that for d = o(n
1/2
)
N(d, n) =
(nd)!
(
1
2
nd)!2
nd/2
(d!)
n
exp
_
1d
2
4
d
3
12n
+O(
d
2
n
)
_
Consider the set S(n, d) of all d-regular graphs on n points equipped with the uniform
distribution. The (uniform) probability of each graph in S(n, d) is p
u
= 1/N(d, n).
3
Let us now describe the algorithm:
Algorithm A.
(I) Start with a set U of nd points (nd even) partitioned into n groups of size d.
(II) Repeat the following until no suitable pair can be found. Choose two random points i
and j in U and if ij is suitable, pair i with j and delete them from U.
(III) Create a graph G with an edge from r to s if and only if there is a pair containing
points in the r
th
and s
th
groups. If G is regular, output it, otherwise return to step (I).
Throughout the paper, Pr
A
(G) denotes the probability that the output of A is a par-
ticular graph G. In [10], Steger and Wormald analyzed the above algorithm and proved the
following theorem.
Theorem 2.1 If d = o(n
1/28
), then for every simple d-regular graph G on n vertices
Pr
A
(G) = (1+o(1))p
u
.
Our goal is to extend the above result for larger d. We shall prove
Theorem 2.2 For any positive constant < 1/3 the following holds. For any d n
1/3
and any simple d-regular graph G on n vertices
Pr
A
(G) = (1+o(1))p
u
.
The rest of the paper is devoted to the proof of Theorem 2.2. The next two sections
describe the main ideas of the proof. In particular, we shall state a series of properties
which imply Theorem 2.2 (see Section 4). The essential technical ingredient, the new large
deviation inequality, will be presented in Section 5. The remaining sections of the paper
are devoted to the proofs of the properties stated in Section 4.
3 Preliminaries
It is well-known (and easy to check) that for each d-regular graph G, there are exactly (d!)
n
dierent simple perfect matchings on U which give rise to G. (A perfect matching is simple
if it gives rise to a simple regular graph.) So to prove Theorem 2.2, it suces to show that
if d satises the assumption of the theorem, then for any simple perfect matching M
Pr
A
(M) = (1+o(1))
(
1
2
nd)!2
nd/2
(nd)!
exp
_
d
2
1
4
_
. (1)
Consider an ordering M = x
1
, . . . , x
nd/2
where x
m
are the edges of M. Assume that the
rst m edges of M are obtained and let G
m
(M) be the graph formed by the projection of
these edges. Thus G
m
(M) is a subgraph of G. To count the number of suitable edges, notice
that there are
_
nd2m
2
_
ways to form an edge. However, an edge is suitable if and only if it
does not join two vertices that come from the same group or two vertices that come from two
already adjacent groups. The number of edges of the rst type is
[1]
m
(M) =
u
_
dd
u
2
_
and
the number of unsuitable edges of the second type is
[2]
m
(M) =
u
G
m
(M)
v
(dd
u
)(dd
v
),
4
where d
u
is the degree of u in G
m
(M). Set
m
(M) =
[1]
m
(M)+
[2]
m
(M). Then the number
of suitable edges is
_
nd2m
2
_
m
(M). It follows that
Pr
A
(M) =
nd/21
m=0
1
_
nd2m
2
_
m
(M)
. (2)
Set
[1]
m
=
1
2
(nd2m)
2
(d1)
nd
and
[2]
m
= (nd2m)
2
m(d1)
2
n
2
d
2
. It will be useful to think of
[i]
m
as a sort of expectation of
[i]
m
with respect to a random choice of M. Dene
m
=
[1]
m
+
[2]
m
.
A routine, but somewhat tedious, calculation shows that for d = o(n
1/3
) (one may refer
Lemma 3.6 of [10] for the details)
nd/21
m=0
_
nd2m
2
_
_
nd2m
2
_
m
= exp
_
d
2
1
4
+o(1)
_
. (3)
Also notice that
nd/21
m=0
_
nd2m
2
_
=
(nd)!
2
nd/2
. (4)
Dene
f(M) =
nd/21
m=0
_
nd2m
2
_
m
_
nd2m
2
_
m
(M)
.
Using (2, 3, 4), it is clear that to prove (1) it suces to show
MS(M)
f(M) = (1+o(1))(nd/2)!, (5)
where S(M) denotes the set of all orderings of the edges of M. Notice that S(M) has
exactly (nd/2)! elements, so (5) is equivalent to saying that the expectation of f(M) with
respect to the uniform distribution on S(M) is 1+o(1), that is,
E
_
f(M)
_
= 1+o(1). (6)
In [10], this is shown for d = O(n
1/28
). To prove (6) for d = o(n
1/3
), we shall show the
upper bound and the lower bound separately;
E
_
f(M)
_
1+o(1) (7)
and
E
_
f(M)
_
1o(1). (8)
The proof of (7) is rather dicult and occupies most of the rest of the paper, including
Sections 3,4 and 5. The proof of (8) follows easily from a lemma developed for the proof of
(7) and is presented in Section 7.
The general idea for proving (7) is the following: we rst partition S(M) into many
classes according to the order of magnitude of f(M). Next, we upper bound the measures
5
of these classes with respect to the uniform distribution on S(M). There will be one main
class which contributes 1o(1); all other classes together contribute o(1).
The partition is non-trivial and bounding the measures is not at all easy. We need
very strong tools and the key one is a recent concentration result, proved in [11]. This
result provides a very tight control on the deviation tail
m
(M)
m
, where M is sampled
randomly from (M). Consequently, we obtain a suciently good bound on the measures
of the classes in the partition.
4 Partition
Let = log
0.01
n. Set
0
= logn and
i
= 2
i1
for all i = 1, 2, . . . , L, where L is the
smallest integer such that
L
cd
2
logn, where c is a suciently large constant. For each
0 m nd/2, we will dene a function T
m
so that the sequence {T
m
(
0
) < T
m
(
1
) . . .
T
m
(
L
)} satises certain properties. We rst present the properties and then dene T
m
.
Let 1/3 be an arbitrarily small positive constant and let S
(M)
_
= 1o(1). (9)
and
E
_
f(M)1
S(M)\S
(M)
_
= o(1), (10)
where the expectations are taken over the uniform random ordering of M.
Here and later 1
S
(M)
is the characteristic function of the event M S
(M): 1
S
(M)
= 1
if M S
(M) and 1
S
(M)
= 0 otherwise. Similarly, 1
S(M)\S
(M)
is the the characteristic
function of the event M S(M)\S
(M).
To bound the contribution of S
(M), we partition S
m
(M)
m
< T
m
(
i
) for all m,
and A
m
(M) T
m
(
0
)+
m
0
logn+
m
,
6
for M A
0
and m (nd
0
)/2. Since A
0
=
_
K
j=1
B
j
_
C, we have
S
(M) = A
L
i=1
A
i
\A
i1
_
K
j=1
B
j
\B
j1
_
C. (11)
For (9), let us consider
E
_
f(M)1
S
(M)
_
= E
_
f(M)1
A
_
+
L
i=1
E
_
f(M)1
A
i
\A
j1
_
+
K
j=1
E
_
f(M)1
B
j
\B
j1
_
+E
_
f(M)1
C
_
. (12)
We will show that E
_
f(M)1
C
_
1+o(1) and the other terms are o(1). These estimates
are direct consequences of the following properties for the uniform random ordering of M:
Property 4.1 For all M C, f(M) 1+o(1).
Property 4.2 (a) Pr(A
i
\A
i1
) exp((
i
)), for all 1 i L
(b) f(M) exp(o(
i
)) for M A
i
\A
i1
.
Property 4.3 (a) Pr(B
j
\B
j1
) exp((2
j/2
logn)) for all 1 j K
(b) f(M) exp(O(2
3j/4
)) for M B
j
\B
j1
.
Property 4.4 (a) Pr(A
) exp(5d
2
logn),
(b) f(M) exp(4d
2
logn) for all M A
.
Now we dene the critical parameters T
m
(a). The heart of the proof is to show that T
m
(a)s
satisfy certain properties that easily imply Properties 4.1-4.4.
Denition 4.5 First set q
m
= (nd2m)/nd and p
m
= 1q
m
for all m = 0, 1, . . . , nd/21.
Next, let
m
(a) = c
_
a(nd
2
q
2
m
+a
2
)(dq
m
+a)
m
(a) = c
_
a(nd
3
q
2
m
+a
2
)(d
2
q
m
+a)
m
(a) = c
_
a(nd
3
q
3
m
+a
3
)(d
2
q
2
m
+a
2
)
m
= 8nd
3
q
3
m
,
where c is a suciently large constant. Dene
T
m
(a) =
_
min
_
m
(a)+
m
(a)+
m
(a),
m
(a)+
m
(a)+
m
_
if nd2m a
a
2
/ otherwise
7
Property 4.0 follows from the denition. The function T
m
arises naturally from the large
deviation consideration in the next section. Technically speaking, T
m
has been set so that
we can conveniently derive the large deviation parts, or parts (a), of Properties 4.2-4.4 from
a general concentration result. On the other hand, it turns out that this denition of T
m
does satisfy the second parts of the properties (see Section 5).
The proofs of the properties will be presented in the next two sections. These properties
together with (12) imply that E(f(M)1
S
(M)
) 1+o(1). In Section 8, we prove (10) and
(7). The proof of (8) is presented in Section 7. This proof follows relatively easily from
certain estimates in Section 6.
5 Large deviation
In this section, we prove the large deviation part of Properties 4.2-4.4. The most dicult
proof is that of Property 4.2 and the vital tool for this proof is the following concentration
result, proved by the second author in [11]. This result renes an earlier result by the
authors in [5].
Consider independent random variables t
1
, . . . , t
n
with arbitrary distributions on the
interval [0, 1]. For a polynomial Y = Y (t
1
, . . . , t
n
) of degree k and a multi-set A of size
at most k,
A
Y denotes the partial derivative of Y with respect to A. For instance, if
Y = t
2
1
t
2
2
t
3
+t
5
4
and A
1
= {1, 2}, A
2
= {1, 1, 3}, then
A
1
(F) = 4t
1
t
2
t
3
and
A
2
(F) = 2t
2
2
,
respectively. If the set A is empty then
A
Y = Y . For all 0 j k, let
E
j
(Y ) = max
|A|j
E(
A
(Y ))
be the maximum expectation of a partial derivative of order at least j of Y .
Theorem 5.1 Let Y be a polynomial of degree k with positive coecients at most 1. For
any collection of positive numbers E
0
> E
1
> . . . > E
k
= 1 and satisfying
E
j
E
j
(Y )
E
j
/E
j+1
+4j logn, 0 j k1,
the following holds
Pr
_
|Y E(Y )|
_
E
0
E
1
_
2e
()
,
where the constant in () depends on k.
In our present application, t
i
s are i.i.d binary random variables and each monomial
in the polynomials of interest is a product of distinct t
i
s. While partial derivatives are
convenient to use symbolically, the reader could also interpret the quantities E
j
(Y ) in a
combinatorial way as follows. For j 1, E
j
(Y ) is the maximum average eect of a group
of at least j atom variables. In other words, changing the value of any group of at least j
variables would change Y , in expectation with respect to the random variables outside the
group, by at most E
j
(Y ).
The crucial advantage of Theorem 5.1 is that the average eects are usually much less
than the maximum eect (or maximum martingale dierence) one needs to consider when
apply an Azuma type inequality. This advantage thus enables us to often derive a tighter
concentration result, compared to what one has from Azumas. (It would be useful for
8
the reader to try to use Azumas inequality instead of Theorem 5.1 and see what he or
she obtains.) Theorem 5.1 and many variants are discussed in details in [11], which also
contains a variety of applications.
Since a sum of independent random variables can be seen as a polynomial of degree one,
one can also view Theorem 5.1 as an extension of Chernos bound.
5.1 Proof of Property 4.2, part (a)
We need to show
Pr(A
i
\A
i1
) exp((
i
)).
Since
i
= 2
i1
, T
m
(
i
) 8T
m
(
i1
) by denition. Given the denition of the set A
i
and the fact that
i
logn, it suces to prove the following two lemmas
Lemma 5.2 For any m such that nd2m
i
Pr
_
m
(M)
m
1
8
min
_
m
(
i
)+
m
(
i
)+
m
(
i
),
m
(
i
)+
m
(
i
)+
m
__
exp((
i
)).
Lemma 5.3 For any m such that nd2m <
i
Pr
_
m
(M)
2
i
/4
_
exp((
i
)). (13)
Part (a) of Property 4.4 also follows instantly from Lemma 5.2 by adjusting the constant
c in the denition of
L
.
Proof of Lemma 5.2 It is more convenient to consider the following model: G
p
m
is the
random subgraph of G = G
nd/2
(M) obtained by keeping each edge of G with probability
p
m
= 2m/nd, independently. A standard calculation shows that with probability at least
1/nd, G
p
m
has exactly m edges. Since
i
lognd, to prove the lemma, it suces to show
that for each m
Pr
G
p
m
_
m
(M)
m
1
8
min
_
m
(
i
)+
m
(
i
)+
m
(
i
),
m
(
i
)+
m
(
i
)+
m
__
exp((
i
)).
(14)
Till the end of this proof, we x m and i and ignore the subscripts i and m in all other
relevant quantities; Pr means Pr
G
p
m
, and means
i
.
Notice that has been dened (on purpose) to be exactly the expectation of (G
p
) (and
[1]
,
[2]
the expectations of
[1]
(G
p
),
[2]
(G
p
), respectively). Therefore, (14) is a corollary
of the following fact
Lemma 5.4 We have
Pr
_
[1]
(G
p
)E(
[1]
(G
p
)) /8
_
exp(()) (15)
Pr
_
[2]
(G
p
)E(
[2]
(G
p
)) min(
+
8
,
+
8
)
_
exp(()).
9
For each edge e of G, dene a random variable t
e
as follows: t
e
= 1 if e is not chosen
in G
p
and 0 otherwise. Obviously, the t
e
s are i.i.d. binary random variables with mean
q = q
m
= (nd2m)/nd. Clearly,
[1]
=
1
2
eu,fu
e=f
t
e
t
f
.
First, we have E(
[1]
)
1
2
nd
2
q
2
; there are n choices for u, each u provides at most
_
d
2
_
< d
2
/2 pairs e, f and each product t
e
t
f
has expectation q
2
. Next, for each t
e
,
t
e
[1]
=
f:fe=,f=e
t
f
; there are 2(d1) < 2d terms in the sum, so the expectation of a partial
derivative of order 1 is less than 2dq. Finally, any partial derivative of order 2 of
[1]
is 0
or 1. So
E
0
(
[1]
) max(
1
2
nd
2
q
2
, 2dq, 1), E
1
(
1
) max(2dq, 1)
and E
2
(
[1]
) = 1. Now set E
0
= nd
2
q
2
+2
2
, E
1
= 2dq+ and E
2
= 1. Since logn, it is
easy to show that the conditions of Theorem 5.1 are satised. On the other hand, setting the
constant c in Denition 4.5 large enough, one can guarantee that
E
0
E
1
/8. Theorem
5.1 yields
Pr
_
|
[1]
(G
p
)E(
[1]
(G
p
))| /8) Pr
_
|
[1]
(G
p
)E(
[1]
(G
p
))|
_
E
0
E
1
_
exp(()),
completing the proof of (15).
For the proof of the second statement, we need to verify the following two inequalities
Pr
_
[2]
(G
p
)E(
[2]
(G
p
))
+
8
_
exp(()). (16)
and
Pr
_
[2]
(G
p
)E(
[2]
(G
p
))
+
8
_
exp(()). (17)
Consider the set X of all triples (e, g, f) where e, g, f are edges of G and they (in this order)
form a path of length 3 in G. A short consideration shows that
[2]
=
u
G
p
v
(dd
u
)(dd
v
)
can be expressed as
(e,g,f)X
t
e
t
f
(1t
g
) = Y
1
Y
2
,
where Y
1
=
(e,g,f)X
t
e
t
f
and Y
2
=
(e,g,f)X
t
e
t
f
t
g
. (Cf. Lemma 3.5 of [10]). A routine
argument (similar to the one presented for
[1]
) shows that E
0
(Y
1
) max (nd
3
q
2
, 2d
2
q, 1)
and E
1
(Y
1
) max(2d
2
q, 1). Set E
0
= nd
3
q
2
+2
2
, E
1
= 2d
2
q+ and verify that they satisfy
the conditions of Theorem 5.1. By adjusting the constant c in Denition 4.5, we can assume
that
E
0
E
1
/8 (where is the constant in Theorem 5.1). Theorem 5.1 yields
Pr
_
|Y
1
E(Y
1
)| /8
_
Pr
_
|Y
1
E(Y
1
)|
_
E
0
E
1
_
exp(()). (18)
For Y
2
, one can show that E
0
(Y
2
) max(nd
3
q
3
, 2d
2
q
2
, dq, 1), E
1
(Y
2
) max(2d
2
q
2
, dq, 1)
and E
2
(Y
2
) max(dq, 1). We dene E
0
= nd
3
q
3
+3
3
, E
1
= 2d
2
q
2
+2
2
and E
2
= dq +
10
and verify the conditions in Theorem 5.1. Again by adjusting the constant c in Denition
4.5, we can assume that
E
0
E
1
/8 (where is the constant in Theorem 5.1). Similarly
to (18), we have
Pr
_
|Y
2
E(Y
2
)| /8
_
Pr
_
|Y
2
E(Y
2
)|
_
E
0
E
1
_
exp(()). (19)
(18) and (19) imply (16); (17) follows from (18) and the following simple observation
[2]
E(
[2]
) |Y
1
E(Y
1
)| +E(Y
2
)
|Y
1
E(Y
1
)| +nd
3
q
3
= |Y
1
E(Y
1
)| +/8
Our proof of Lemma 5.3 is thus completed.
Proof of Lemma 5.3. In this lemma, nd2m is small so that G
p
is very dense. Consider
its complement G
q
. Let N
0
(u) = N
G
p
(u){u}. Then
(G
p
)
u
d
G
q
(u)
vN
0
(u)
d
G
q
(v).
If (G
p
)
2
/4 then one of the following should hold
G
q
has more than
1
4
2
edges (20)
For some u,
vN
0
(u)
d
G
q
(v) /
3
. (21)
The probability that (20) holds is at most
_
nd/2
1
4
_
q
1
4
_
2end
q
_1
4
. (22)
Recall that q /nd; it follows that the right hand side in (22) is at most
exp(
1
4
2
) = exp(()) (23)
For (21), notice that there are at most d
2
edges in G that contain at least one vertex in
N
0
(u), and each edge contributes at most two in the sum
vN
0
(u)
d
G
q
(v). In particular,
Pr(
vN
0
(u)
d
G
q
(v) /
3
)
_
d
2
/2
3
_
q
/2
3
n
_
ed
2
/2
3
q
_
/2
3
n
_
2ed
4
n
_
/2
3
exp((
3
logn))
= exp(()), (24)
as we have chosen log
1/3
n.
11
5.2 Proof of Properties 4.3 and 4.4, parts (a)
This proof is more or less identical to the proof of Lemma 5.3. Again we omit the sub-index
m, but let us remark that we are interested in only those m in the range nd2m
0
.
Since
0
=
2
logn log
2
n, there are only o(log
2
n) such indices. Therefore, it is enough
to prove the bound for each individual m.
Similarly to (20) and (21), if (G
p
) 2
j1
then one of the following should hold
G
q
has more than 2
j/2
/2 edges
For some u,
vN
0
(u)
d
G
q
(v) 2
j/2
.
The rest of the proof can be easily worked out along the lines of (22, 23, 24).
By repeating the proof of Property 4.2 for the special case i = L, we have that
Pr(A
) exp((
L
)) = exp(( cd
2
logn)).
where c is the constant in the denition of
L
. The hidden constant in does not depend
on c, so by setting c suciently large, we can have (cd
2
logn)) 5d
2
logn and
Pr(A
) exp(5d
2
logn),
as claimed.
Remark. In fact, by increasing c we can have that
Pr(A
) exp(ad
2
logn),
for any constant a. This fact will be important in the proof of (10).
6 Bounds
In this section, we prove the second half of Properties 4.1, 4.2, 4.3 and 4.4. All the proofs
here are quite elementary and are presented here for the sake of completeness.
Recall
f(M) =
nd/21
m=0
_
1+
m
(M)
m
_
nd2m
2
_
m
(M)
_
.
Since
_
nd2m
2
_
(1/2)
1
m
(M) for M S
(M), we have
f(M)
nd/21
m=0
_
1+
(3/)max(
m
(M)
m
, 0)
(nd2m)
2
_
. (25)
12
6.1 Property 4.2, part (b)
It is sucient to show that for any M A
i
nd/21
m=0
max(
m
(M)
m
, 0)
(nd2m)
2
= o(
i
)
To this end, we omit the sub-index i and use the short hand g(m) for
max(
m
(M)
m
,0)
(nd2m)
2
.
Since g(m) = O(1) for any m,
/
1/2
nd2m=2
g(m) = o(), so we only have to consider the
sub-sum starting from nd2m = /
1/2
. Notice that the numerator in g(m) is at most
T
m
(), so due to the denition of T
m
() we have
nd
nd2m=/
1/2
g(m)
nd2m=/
1/2
2
(nd2m)
2
+
nd2m=
m
()+
m
()+
m
(nd2m)
2
+
nd/2
nd2m=
2
m
()+
m
()+
m
()
(nd2m)
2
. (26)
To bound the three sums on the right hand side, we require a series of elementary estimates,
presented below. The need for these estimates will be justied in the rest of the proof. In
the following we use the fact that q
m
= (nd2m)/nd
nd
nd2m=2
(nd
3
q
3
m
)
1/2
(nd2m)
2
=
1/2
n
nd
nd2m=2
1
nd2m
= O
_
1/2
n
_
nd/2
x=2
1
x
x
_
= O
_
1/2
n
nd
_
. (27)
nd
nd2m=2
(nd
2
q
2
m
)
1/2
(nd2m)
2
= O
_
n
1/2
nd
nd2m=2
1
nd2m
_
= O
_
n
1/2
lognd
_
(28)
nd
nd2m=2
(
3
dq
m
)
1/2
(nd2m)
2
= O
_
3/2
n
1/2
nd
nd2m=2
1
(nd2m)
3/2
_
= O
_
3/2
n
1/2
_
(29)
nd
nd2m=
2
(nd2m)
2
2
_
nd
x=
x
2
= o(). (30)
(27, 28, 29) imply the next three estimates, respectively.
13
nd
nd2m=2
(nd
5
q
3
m
)
1/2
(nd2m)
2
= O
_
1/2
_
d
3
n
_
(31)
nd
nd2m=2
(nd
3
q
2
m
)
1/2
(nd2m)
2
= O
_
d
1/2
lognd
n
1/2
_
(32)
nd
nd2m=2
(
3
d
2
q
m
)
1/2
(nd2m)
2
= O
_
3/2
d
1/2
n
1/2
_
(33)
Furthermore,
nd
nd2m=2
(
3
nd
3
q
3
m
)
1/2
(nd2m)
2
O
_
3/2
_
d
n
_
(34)
by (27), and
nd
nd2m=2
2
dq
m
(nd2m)
2
= O
_
2
n
1
nd2m
_
= O
_
2
logn
n
_
(35)
nd
nd2m=w
2
3
(nd2m)
2
= O
_
3
_
w
2
x
2
x
_
= O
_
3
2
_
= o() (36)
w
2
nd2m=2
nd
3
q
3
m
(nd2m)
2
=
w
2
nd2m=2
nd2m
n
2
= O
_
4
n
2
_
. (37)
Important remark. Notice
L
= O(d
2
logn), so for d = o(n
1/3
/
1/3
log
1/2
n) the
right most formula in (27)-(37) is o(). In fact, only in (37) we need this bound on d. The
stronger assumption d n
1/3
is required only for an estimate in Section 8.
Due to the basic inequality
A+B
A+
m
() = O
_
(nd
3
q
3
m
)
1/2
+(nd
2
q
2
m
)
1/2
+(
3
dq
m
)
1/2
+
2
_
m
() = O
_
(nd
5
q
3
m
)
1/2
+(nd
3
q
2
m
)
1/2
+(d
2
q
m
)
1/2
+
2
_
m
() = O
_
((nd
5
q
5
m
)
1/2
+(
3
nd
3
q
3
m
)
1/2
+(
4
d
2
q
2
m
)
1/2
+
3
_
.
14
It follows from (27, 28, 29, 30) that
nd/2
nd2m=
m
(nd2m)
2
= o(). (38)
Similarly, it follows from (30, 31, 32, 33) that
nd/2
nd2m=
m
(nd2m)
2
= o(). (39)
Notice that in (38, 39), the sum start from . We need the bound nd2m w
2
only
for (see (36)). Using (34, 35, 36), we have that
nd/2
nd2m=
2
m
(nd2m)
2
= o(). (40)
(38, 39, 40) shows that last sum of (26) is o(). To handle the second sum, given (38) and
(39), we need only show
nd2m=
m
(nd2m)
2
= o().
This follows trivially from (37) and the fact that we can set so that
2
3
= o(n
2
).
Clearly, the rst sum of (26) is O(
2
w
1/2
0
nd2m=2
g(m) = 0. It remains to show that if
m
(M)
m
T
m
(
0
) for all m such that nd2m
0
then
nd/2
nd2m=
0
g(m) = o(1).
Clearly it is sucient to prove that
nd/2
nd2m=
0
T
m
(
0
)
(nd2m)
2
= o(1). (41)
The proof of this is similar to the proof in the previous subsection, with a slight modi-
cation. Instead of (36) and (37), we shall use
nd
nd2m=w
3
0
3
0
(nd2m)
2
= O
_
3
0
_
w
3
x
2
x
_
= O
_
3
0
3
0
_
= o(1)
15
w
3
0
nd2m=2
nd
3
q
3
(nd2m)
2
=
(nd2m)
n
2
= O
_
6
0
n
2
_
= o(1),
respectively. In all other estimates, the right most formula is o(1) for =
0
. We invite the
reader to work out the rest of the proof.
6.3 Properties 4.3 and 4.4 , parts (b)
By the denition of the set B
j
,
nd2m=2
g(m)
nd2m=2
2
j
(nd2m)
2
2
j
,
completing the proof.
Part (b) of Property 4.4 was proved in [10].
7 Proof for the lower bound
In this section we prove (8). Due to (13, 18, 19)
Pr
_
|
m
m
|
m
(
0
)+
m
(
0
)+
m
(
0
)
_
= o(1) (42)
(recall
m
=
1
2
(nd2m)
2
(d1)
nd
+(nd2m)
2
m(d1)
2
n
2
d
2
), for all m such that nd2m
0
.
Consider an ordering M where |
m
m
|
m
(
0
) +
m
(
0
) +
m
(
0
) for these m. We
have
f(M)
nd
nd2m=
3
0
_
1(3/)
m
(
0
)+
m
(
0
)+
m
(
0
)
(nd2m)
2
_
nd2m=
3
0
nd2m=2
_
1(3/)
m
(nd2m)
2
_
(43)
Part of the proof of (41) shows that
nd
nd2m=
3
0
m
(
0
)+
m
(
0
)+
m
(
0
)
(nd2m)
2
= o(1). This
implies that the rst product is 1o(1). The second product is also 1o(1) since
m
=
O
_
(nd2m)
2
(
1
n
+
m
n
2
)
_
and
0
= o(log
2
n). These together with (42) yield that
E(f(M)) 1o(1), (44)
completing the proof.
Remark. The proofs in this and the previous sections yield the following corollary, which
will be useful in the next section.
16
Corollary 7.1 By setting the constant c in the denition of
L
suciently large, we have
E
_
exp
_
10
2
nd/21
m=0
max(
m
m
(M), 0)
(nd2m)
2
__
= 1+o(1). (45)
In fact,
10
2
can be replaced by any constant.
To prove (45), it suces to notice the following:
In the proofs of the (b) part of the Property 4.2, we actually bound
exp(
nd/21
m=0
max(
m
m
(M), 0)
(nd2m)
2
)
instead of f(M). We rst upper bound f(M) by
nd/21
m=0
_
1+
(3/)max(
m
(M)
m
,0)
(nd2m)
2
_
(see (25)). Next, we upper bound this product by
exp
_
(3/)
nd/21
m=0
max(
m
m
(M), 0)
(nd2m)
2
_
.
Then we actually prove that the sum in the exponent is o(
i
). This, together with
the (a) part of Property 4.2, implies that the contribution of the sets A
i
\A
i1
in
the expectation in (45) is o(1). The extra factor 10/
2
does not really matter since
10/
2
o(
i
) is still o(
i
).
Similarly, one can show that the contribution from the sets B
i
\B
i1
is also o(1) and
the contribution from C is 1+o(1). Here the factor 10/
2
is swallowed by the extra
logn term in the (a) part of Property 4.3.
To bound the contribution from A
2
d
2
logn). Following the
proof of the (b) part of this property we have that
exp
_
10
2
nd/21
m=0
max(
m
m
(M), 0)
(nd2m)
2
_
exp(
40
2
d
2
logn).
Thus, the contribution of A
is at most exp(
50
2
d
2
logn+
40
2
d
2
logn) = o(1).
8 Proof of (10)
Let us recall the denition of S
(M): S
(M) which
contains those M where the condition
m
(M) (1/2)
_
nd2m
2
_
is violated for some m.
17
It is known ([10] p.383) and not hard to prove that
m
(M) d
2
(nd2m)/2.
In particular, if nd2m (1/2)
1
d
2
+1, then
m
(M) (1/2)
_
nd2m
2
_
. Thus, the
condition
m
(M) (1/2)
_
nd2m
2
_
could be violated only when m is relatively close to
nd/2, namely, m nd/2(1/2)
1
d
2
. Let S
i
(M), i = 1, ..., (1/2)
1
d
2
, be the set of
all orderings M of M such that
m
(M) (1/2)
_
nd2m
2
_
m < nd/2i
and
m
(M) > (1/2)
_
nd2m
2
_
for m = nd/2i.
Since
i=1
n
i
= o(1), in order to prove (10), it suces to show that
E(f(M)1
S
i
) (1+o(1))n
i
for all i = 1, ..., (1/2)
1
d
2
.
Notice that
_
nd2m
2
_
m
(M) is at least nd/2m, for nd/2m edges in M remain to
be selected. This yields
_
nd2m
2
_
_
nd2m
2
_
m
(M)
nd2m
and hence
nd/21
m=nd/2i
_
nd2m
2
_
_
nd2m
2
_
m
(M)
2
i
i! 2i
i
,
where the last inequality 2
i
i! 2i
i
can be veried by induction. Moreover, the fact that
m
(M) (1/2)
_
nd2m
2
_
for all m < nd/2i yields
nd/2i1
m=0
_
nd2m
2
_
m
_
nd2m
2
_
m
(M)
=
nd/2i1
m=0
_
1+
m
(M)
m
_
nd2m
2
_
m
(M)
_
exp
_
5
nd/2i1
m=0
max(
m
(M)
m
, 0)
(nd2m)
2
_
.
Therefore,
f(M)1
S
i
= 1
S
i
nd/21
m=0
_
nd2m
2
_
m
_
nd2m
2
_
m
(M)
2i
i
1
S
i
exp
_
5
nd/2i1
m=0
max(
m
(M)
m
, 0)
(nd2m)
2
_
,
and Holders inequality gives
E(f(M)1
S
i
) 2i
i
E
_
1
S
i
exp
_
5
nd/2i1
m=0
max(
m
(M)
m
, 0)
(nd2m)
2
__
2i
i
E(1
S
i
)
1/2
E
_
exp
_
10
2
nd/2i1
m=0
max(
m
(M)
m
, 0)
(nd2m)
2
__
/2
.
18
Since (45) gives
E
_
exp
_
10
2
nd/2i1
m=0
max(
m
(M)
m
, 0)
(nd2m)
2
__
/2
= 1+o(1),
we only need to show that
2i
i
Pr(S
i
)
1/2
(1+o(1))n
i
.
Let m = nd/2i and (u) =
M,m
(u) be the set of all neighbors of u (excluding u) in the
graph generated by rst m edges of M. Since
m
(M) =
1
2
u
(dd
u
)
v(u){u}
(dd
v
1
{u=v}
)
and
_
nd/2m
2
_
=
1
2
u
(dd
u
)
v
(dd
v
1
{u=v}
),
m
(M) > (1/2)
_
nd/2m
2
_
implies that there is a vertex u with dd
u
> 0 such that
v(u){u}
(dd
v
1
{u=v}
) (1/2)
v
(dd
v
1
{u=v}
),
or equivalently
v(u){u}
(dd
v
)
2
v
(dd
v
1
{u=v}
)
=
2
(nd2m1) i. (46)
Notice that d
u
= |(u)| and dd
u
> 0 means |(u)| < d. Moreover, any of the last i edges
of M that is not entirley in (u) contributes at least 1 in the rst sum of (46). Hence
the number of such edges is at most i, or all but at most i of the last i edges of M are
entirely in (u). Let j = d|(u)| and l be the number of edges entirely in (u). Then it
is required that j 1 and l (1)i. The probability that dd
u
> 0 and (46) without
the two middle steps occur for a (xed) vertex u is upper bounded by
j1
l(1)i
_
d
j
__
(
dj
2
)
l
__
nd/2d(
dj
2
)
ijl
_
_
nd/2
i
_ ,
and hence
Pr(S
i
) Pr
_
such a u
_
n
j1
l(1)i
_
d
j
__
(
dj
2
)
l
__
nd/2d(
dj
2
)
ijl
_
_
nd/2
i
_ .
Using
_
d
j
_
d
j
j!
,
__
dj
2
_
l
_
(d
2
/2)
l
l!
,
19
_
nd/2d
_
dj
2
_
i j l
_
(nd/2)
ijl
(i j l)!
and, as i = O(d
2
) = o((nd)
1/2
),
_
nd/2
i
_
= (1+o(1))
(nd/2)
i
i!
,
we have that
Pr(S
i
) (1+o(1))n
j1
l(1)i
_
i
j l
_
_
2d
nd
_
j
_
d
2
nd
_
l
= (1+o(1))n
j1
l(1)i
_
i
j l
_
_
2
n
_
j
_
d
n
_
l
,
where
_
i
j l
_
=
i!
j!l!(ijl)!
. Since the summand decreases at least geometrically in the ratio
O(id/n) = o(1) as j or l increases, the last double sum asymptotically equals the (rst)
term with j = 1 and l = (1)i. So
Pr(S
i
) (2+o(1))i
_
i 1
(1)i
_
_
d
n
_
(1)i
.
Since 1/3, we have
_
i 1
(1)i
_
_
d
n
_
(1)i
2
i1
_
d
n
_
(1)i
_
2
3/2
d
n
_
(1)i
,
which together with i (1/2)
1
d
2
and d n
1/3
implies that
2i
i
Pr(S
i
)
1/2
(4+o(1))i
_
d
2
1/2
_
2
3/2
d
n
_
(13/2)
_
i
(4+o(1))i
_
2
3/2
1/2
n
13(13/2)
_
i
(1+o(1))n
i
,
completing the proof.
9 Remarks and Open Questions
To start this section, we would like to make some remark concerning the sharpness of our
analysis. The assumption d n
1/3
was used at two places. The rst is in the proof of (37)
(see the remark in Section 6), where we need to assume that d = o(n
1/3
/log
1/2
n). The
second place is the end of the proof of (10) (see the last paragraph of Section 8), where
we need to assume that d n
1/3
. Our feeling is that the arguments in Section 8 are
somewhat more exible, so there might be a chance to improve the calculation here. On the
other hand, improving the analysis in Section 6 seems like a bigger challenge. Thus, while
20
we think that our current analysis might be tightened to yield a slightly better bound, say
d n
1/3
/polylogn, we do not see how one can obtain d n
1/3+
.
The next, and natural issue is whether one could really expect the bound d n
1/3+
from by the proposed algorithm (Algorithm A). Wormald [12] conjectures that n
1/3
may
be the right threshold for this algorithm. Except Jerrum-Sinclair result (mentioned in the
introduction) there is no other result, as far as we know, about generating random regular
graphs with degree larger than n
1/3
.
In many cases (in both theory and practice), one would be satised with a random sample
whose probability is comparable to the uniform probability. In other words, the distribution
is not uniform, but has a constant torsion factor. Technically, we want Pr
A
(G)/p
u
= (1)
for any d-regular graph G. In fact, it is already useful to have Pr
A
(G)/p
u
= (1) for most
d-regular graphs G. The formal question we would like to pose is:
Question. For which d does Pr
A
(G)/p
u
= (1) hold for all d-regular graph G on n vertices,
with a possible exception of a set of measure o(1).
Finally, the random graphs created by Algorithm A, regardless their distribution, form
a quite reasonable model for random regular graphs. The study of this model has been
suggested by several researchers [9, 12]. In a recent paper [6], the present authors proved
that random graphs created by Algorithm A behave very much like Erdos-Renyi graphs
G(n, p) of the same density, for all logn d n
1/3
/log
2
n. By comparing with Erdos-
Renyi graphs, one can easily compute several parameters (such as the chromatic number)
of the random graphs in the new model.
References
[1] B. Bollobas, A probabilistic proof of an asymptotic formula for the number of labelled
regular graphs. European J. Combin. 1 (1980), no. 4, 311316.
[2] E. Bender and R. Caneld, The asymptotic number of labeled graphs with given degree
sequences, J. Combinatorial Theory Ser. A 24 (1978), no. 3, 296307.
[3] J. Friedman, A proof of Alons second eigenvalue conjecture, preprint.
[4] M. Jerrum and A. Sinclair, Fast uniform generation of regular graphs, Theoret. Comput.
Sci. 73 (1990), no. 1, 91100.
[5] J. Kim and V. Vu, Concentration of multivariate polynomials and its applications,
Combinatorica 20 (2000), no. 3, 417434.
[6] J. Kim and V. Vu, Sandwiching random graphs, Adv. Math. 188 (2004), no. 2, 444469.
[7] B. McKay and N. Wormald, Uniform generation of random regular graphs of moderate
degrees, Journal of Algorithms 11 (1990) 5267.
[8] B. McKay and N. Wormald, Asymptotic enumeration by degree sequence of graphs
with degrees o(n
1/2
), Combinatorica 11 (1991), no. 4, 369382.
21
[9] A. Ruci nski and N. Wormald, Random graph processes with degree restrictions, Com-
bin. Probab. Comput. 1 (1992), no. 2, 169180.
[10] A. Steger and N. Wormald, Generating random regular graphs quickly. (English. Eng-
lish summary) Random graphs and combinatorial structures (Oberwolfach, 1997),
Combin. Probab. Comput. 8 (1999), no. 4, 377396.
[11] V. Vu, Concentration of non-Lipschitz functions and applications. Probabilistic meth-
ods in combinatorial optimization, Random Structures Algorithms 20 (2002), no. 3,
262316.
[12] N. Wormald, Models of random regular graphs. Surveys in combinatorics, 1999 (Can-
terbury), 239298, London Math. Soc. Lecture Note Ser., 267, Cambridge Univ. Press,
Cambridge, 1999.
22