1 (Log Q) 1.9828m
1 (Log Q) 1.9828m
1 (Log Q) 1.9828m
THOMAS WRIGHT
arXiv:2111.14054v5 [math.NT] 19 Oct 2023
1. Introduction
One of the most exciting recent breakthroughs in number theory has been the
much-celebrated progress toward the twin prime and Polignac conjectures. The
current wave of excitement was sparked by Zhang [Zh], who proved in 2013 that
where Hm is the smallest number such that there are infinitely many intervals of
length Hm containing at least m + 1 primes. Maynard and Tao [Ma] were later
able to establish a slightly different method that would eventually (with the help
of Polymath 8b [Po]) reduce this bound to
H1 ≤ 246.
Moreover, Maynard and Tao (as well as Baker and Irving [BI], Stadlmann [St], and
the Polymath project) were able to bound Hm for larger values of m as well; they
gave bounds for m = 2 to 5, as well as a general upper bound for Hm . Those results
have been iteratively improved from the initial work of Maynard and Tao - we list
here the best current results, as found in [St]:
H2 ≤ 396, 516,
H3 ≤ 24, 407, 016,
H4 ≤ 1, 391, 051, 532,
H5 ≤ 77, 510, 685, 234,
Hm ≪ e3.8075m .
There has also been a great deal of work done on these results under the assump-
tion of certain conjectures. In particular, if one assumes the Elliott-Halberstam
conjecture, one can find even better results (the bounds below are listed in [Po],
1
2 T. WRIGHT
Before we do that, however, we recall the two theorems that give us the best
known upper bounds for Siegel zeroes. The first is the best effective bound; the
second is the best ineffective bound:
Theorem. (Siegel 1935) For any ϵ there exists a constant C(ϵ) such that
β < 1 − C(ϵ)q −ϵ .
In particular, Heath-Brown found that for any such q, the number of twin primes
(p, p + 2) with q 250 ≤ p ≤ q 500 is as conjectured in the Hardy-Littlewood conjec-
tures2.
Unfortunately, Heath-Brown’s methods do not appear to generalize to tuples
beyond pairs. However, with the new results on prime tuples, it seems natural to
ask whether Siegel zeroes might be of help in the more general prime tuple case.
This is the motivating question of our current paper.
1This quote appears in Terence Tao’s blog article “Heath-Brown’s theorem on prime twins and
Siegel zeroes,” published 26 August 2015.
1
2These bounds on q have since been widened by Tao and Teräväinen to q 20.5+ϵ ≤ p ≤ q η 2 ,
log q
where η = 1−β
[TT].
4 T. WRIGHT
3. Main Result
We assume a somewhat stronger condition on the exceptionality of the Siegel
zeroes to prove the following:
Main Theorem 1. Fix an A > 2, and let r = 554, 401. Assume that there
are infinitely many D and χD such that for each D, there exists a real sD with
L(sD , χD ) = 0 and
1
sD > 1 − .
(log D)rr +A
Then for any m ≥ 1,
Hm ≪ e1.9828m .
For smaller values of m, we also find the following:
Main Theorem 2. With the same assumptions about Siegel zeroes as given in the
previous theorem, we have
H2 ≤ 264,
H3 ≤ 49, 192,
H4 ≤ 439, 812,
H5 ≤ 3, 775, 860.
We note that all of these are better results than the assumption of Elliott-
Halberstam, although our H2 result is still a lesser bound than the one given by
generalized Elliott-Halberstam. In the case of m = 1, our methods give H1 ≤ 12,
which, while on par with Elliott-Halberstam, is obviously majorized by Heath-
Brown’s result and hence is not further discussed here.
4. Methods
The key result we will use will be one of Friedlander and Iwaniec [FI03], who
proved the following:
Theorem 4.1. (Friedlander-Iwaniec 1992) Let χD be a real character mod D. Let
233
x > Dr with r = 554, 401, let q < x 462 , and let (a, q) = 1. Then
π(x) aD r
(1) π(x, q, a) = 1 − χD + O L(1, χD )(log x)r ,
ϕ(q) (q, D)
where the constant is absolute and computable.
233
We note here that while Friedlander and Iwaniec claim this theorem for q < x 462 ,
their work actually proves this theorem for
58(1− 1 )
r
q < x 115 = x.5043469... ,
which is a slightly better bound (since 233/462 = .504329...). Friedlander and
Iwaniec simply chose the above to simplify the statements of their main theorems;
however, this additional leeway will slightly improve our results3.
Now, Theorem 4.1 is true for any choice of χD or D, but it is only nontrivial
when L(1, χD ) is small, which means that the character must be exceptional. Note
3See [FI03], Page 2049 for more information; Friedlander’s and Iwaniec’s simplification of
results occurs between (5.11) and Proposition 5.2. Friedlander and Iwaniec themselves note that
they have shrunk the bounds slightly to make the result easier to state.
PRIME TUPLES AND SIEGEL ZEROS 5
that if s is a Siegel zero then L(1, χD ) ≪ (1 − Re(s)) log2 D by mean value theorem
(since L′ (s, χD ) ≪ log2 D on the interval between the Siegel zero and 1), so the
hypothesis on Siegel zeroes in the Main Theorem is sufficient to prove that the error
term here is less than the main term.
In their paper, Friedlander and Iwaniec highlight the fact that if q and D are
relatively prime, one can find equidistribution of primes in arithmetic progressions
233 1
mod q as long as q < x 462 , which is a better result than the x 2 −ϵ that is achievable
under GRH. In our case, however, the more interesting result is what happens when
D|q and χD (a) = −1:
Corollary 4.2. Fix A and r as in Main Theorem 1, and assume that the conditions
on Siegel zeroes in that theorem are satisfied as well. Let D|q and χD (a) = −1. If
q < x 115 (1− r ) then
58 1
2π(x) π(x)
π(x, q, a) = +O
ϕ(q) ϕ(q)(log x)A−2
In other words, the number of primes is double what one would normally expect
in a congruence class. This is the key insight that will allow our main theorems to
be proven.
In order to explain this insight, let us define θ′ to be the level of distribution of
the primes, which is to say the supremum of the exponents t for which
X π(x) x
max π(x, q, a) − ≪
q≤xt
(a,q)=1 ϕ(q) logB x
for every B > 0.
In [Ma] and [Po], the authors define a quantity Mk as the supremum of the
quotient of two integrals (see (3)-(5) below for a more specific definition). For
practical purposes, however, this quantity gives information about the ratio between
S2 , the sieved sum over the primes, and S1 , the sum of the sieve weights themselves.
The work of [Ma] establishes that for us to find tuples of m + 1 primes in intervals
of length k, we require
2m
Mk > ′ .
θ
In our case, let θ denote the largest value for which (1) holds for every q ≤ xθ . If we
choose all of our primes from a congruence class a modulo D for which χ(a) = −1
then by the corollary above, we will have double the expected number of primes;
thus, the size of S2 is also doubled, and hence we only require
m
Mk > .
θ
While our θ isn’t as large as θ′ would be under Elliott-Halberstam (which allows
any θ′ < 1), the excess of primes has the same effect as would doubling the value of
θ′ . Since our θ is also greater than 1/2 according to Theorem 4.1, this means that
our results actually surpass those assuming Elliott-Halberstam.
5. Remarks
Obviously, no one has made conjectures as to what the largest possible value for θ
might be for (1) or Corollary 4.2 to hold, since most mathematicians do not believe
there are Siegel zeroes at all. In the paper of Friedlander and Iwaniec, the authors
58
encounter an obstruction at θ = 115 (1 − 1r ) from the error term and then another
6 T. WRIGHT
one at θ = 2/3 from the main term. The authors remark that while overcoming
the first bound might be tractable, the second one seems a bit more difficult to
improve. The reason for this is that the first bound is derived from a paper on
ternary divisor sums [FI85] where the main result has since been improved; the
improved version is not yet known to be amenable to the methods of [FI03], but
it is not hard to imagine that one could find a way to apply the improved result.
The second bound, however, comes from the famous Weil bound for Kloosterman
sums; as such, any improvement over 2/3 would likely require either a method of
estimating the main term that avoids using this bound or else an improvement on
the Weil bound itself.
On the other hand, conjectures about primes in arithmetic progressions (such
as Montgomery’s conjecture on arithmetic progressions [Mo1], [Mo2] or Elliott-
Halberstam) say that the equidistribution of primes in classes should apply modulo
′
q < xθ for every θ′ < 1. Let’s imagine for a moment that one could make a similar
conjecture about non-equidistribution of primes mod q in the presence of a Siegel
zero; in other words, assume that (1) and Corollary 4.2 could be proven for q < xθ
for every θ < 1, and assume that there are infinitely many Siegel zeroes as required
by the Main Theorems. Then one would find the following4 bounds for Hm :
H1 = 2
H2 ≤ 12,
H4 ≤ 270,
H6 ≤ 52, 116,
′
Hm ≪ e(1+ϵ )m ,
where the last line holds for any ϵ′ > 0.
The result for H1 is notable here, since it would be a resolution of the twin prime
conjecture. Note that this result for H1 does not require θ to go all the way up to
1; if the distribution held for θ = .722, we could recover the twin primes result of
Heath-Brown [HB].
6. Outline
The proof, for the most part, will follow the framework of Maynard and Tao.
The differences are as follows:
1.) Obviously, instead of invoking Bombieri-Vinogradov or Elliott-Halberstam,
we invoke the result of Friedlander and Iwaniec. Since we have an explicit error
term mod q for each q, we are not required to do any fancy Cauchy-Scwhartz
machinations; we can simply plug in an estimate for π(x, q, a) and then sum this
term up over all allowable choices of q.
2.) If we wish to ensure that the number of primes is double what Dirichlet’s
theorem would predict, we must shift the prime tuple so as to put all of the terms
in classes a mod q for which χD (a) = −1 and D|q. Since D is small relative to x,
this causes little issue in our analysis. This shifting is covered in section 7.
From here, the proof mostly follows the ideas described in [Ma], albeit with
W a bit bigger than in the original paper. We note that [Ka] and [BFM] outline
4The numbers on this list come from [Po], Table 3, where the numbers chosen are simply the
lowest k for which Mk > m + 1.
PRIME TUPLES AND SIEGEL ZEROS 7
the process for dealing with a larger W , and the effect on the final calculations is
minimal.
Note that since g is prime and (n′ + hi , gD′ ) = 1 for all i, χD (D′ y + n′ + hi ) can
only equal 0 if g|D′ y + n′ + hi ; for each choice of i, this will occur exactly once
8 T. WRIGHT
So if H > k2 k−1
then there must exist a tuple where all of the χD (D′ y + n′ + hi ) =
−1, which is as required.
Now, we can rewrite H as
g Y
X k
H= (1 − χD′ (D′ y + n′ + hi )χg (D′ y + n′ + hi ))
y=1 i=1
g Y
X k
= (1 − χD′ (n′ + hi )χg (D′ y + n′ + hi ))
y=1 i=1
For each i, let us write ϵi = χD′ (n′ + hi ). Again, these ϵi are all ±1 since (D′ , n′ +
hi ) = 1. So
g Y
X k
H= (ϵi − χg (D′ y + n′ + hi ))
y=1 i=1
where the Qj (y) are polynomials of degree ≤ k that are square-free when viewed
over Fg , and the Sj are distinct subsets of {1, 2, · · · , k}. By triangle inequality,
k
X g Y k 2X −1 Y Xg
H≥ ϵi − ϵi χg (Qj (y))
y=1 i=1 j=1 i∈Sj y=1
k
g
X −1
2X g
X
≥ 1 − χg (Qj (y))
y=1 j=1 y=1
The first absolute value is clearly equal to g; for the second, we can use the Weil
bound for character sums over Fg , finding that
g
X √
χg (Qj (y)) ≤ (k − 1) g.
y=1
So
√
H ≥ g − k2k−1 g.
This is clearly larger than k2k−1 for large values of g. So there must exist a y ′ ≤ g
such that χD (D′ y ′ + n′ + hi ) = −1 for all i. We then set
l ≡ D ′ y ′ + n′ (mod D)
to complete the theorem. □
PRIME TUPLES AND SIEGEL ZEROS 9
and let
W = DV.
Because our tuple is admissible, we know that there exists a congruence class
v mod V such that all of the terms in the tuple are coprime to V . So for l as in
Lemma 7.1, define v0 such that
v0 ≡ v (mod V )
v0 ≡ l (mod D)
As is standard, for a well-chosen function λ, we let
2
X X
S1 = λd1 ,··· ,dk ,
n≡v0 (mod W ) di |n+hi
X≤n≤2X
and
2
X k
X X
S2 = 1P (n + hi ) λd1 ,··· ,dk .
n≡v0 (mod W ) i=1 di |n+hi
X≤n≤2X
This is essentially the sum that was considered in [Ma]. The only difference
here is that W is of size X ϵ for some ϵ < 1r . However, such alterations have been
addressed in [Ka], [BFM], and elsewhere; the only difference is a slight change in
the allowable level of distribution.
Obviously, it will be helpful if we have a definition for λ. To this end, let Sk be
the set of all piecewise differentiable functions F : [0, ∞)k → R where the support
of F lies on the simplex
k
X
Rk = {(t1 , ..., tk ) : ti ≤ 1}
i=1
and
X λd1 ,··· ,dk−1 ,1 λe1 ,··· ,ek−1 ,1
= (1 + o(1))B 1−k Jk (F ).
ϕ([dj , ej ])
d1 ,···dk−1 ,e1 ,··· ,ek−1
[d1 ,e1 ],··· ,[dk ,ek ],W coprime
Proof. This is essentially the evaluation of (5.2) and (5.18) of [Ma]. That paper
only deals with very small W ; however, the method applies easily to larger W , and
others have done exactly this - see e.g. (5.4) and (5.20) of [Ka]. □
9. The Sum S
From here, we evaluate the two sums. Define Sk to be the set of all functions F
that satisfy the criteria listed above, and define
Let
Pk (m)
m=1 Jk (F )
(5) Mk = sup .
F ∈Sk Ik (F )
Then we have the following:
Theorem 9.1. For a given k, choose F ∈ Sk . Let S1 and S2 be as above, and let
D, X, and R be as defined at the beginning of Section 8. Then
X −k
S1 = (1 + o(1)) B Ik (F )
W
and
k
X X (m)
S2 = (2 + o(1)) B 1−k Jk (F ),
ϕ(W ) log X m=1
where the o(1)-term goes to zero as D → ∞.
log R
Since log X = θ2 , S2 can be rewritten as
k
X −k X (m)
S2 = (θ + o(1)) B Jk (F ).
W m=1
PRIME TUPLES AND SIEGEL ZEROS 11
The proof here is similar to the proofs in [Ma] or in any number of papers that
are based on [Ma]. We split the proof into two parts, one for each sum, below.
The usual trick is to expand out the square and reverse the order of summation,
giving
X′ X
S1 = λd1 ,··· ,dk λe1 ,··· ,ek 1.
d1 ,··· ,dk n≡v0 (mod W ),X≤n≤2X
e1 ,··· ,ek [di ,ei ]|n+hi
where the prime indicates that the sum requires the [d1 , e1 ], · · · , [dk , ek ], W to be
pairwise coprime. The di and ei must all be coprime to W (by choice of v0 );
moreover, ([di , ei ], [dj , ej ]) = 1 since Q a prime p|hi − hj implies p|W . We can then
replace the inside
Q sum with X/(W [di , ei ]) + O(1). Since λ ≪ 1 and is only
supported on di ≤ R, we have
X′ X
λd1 ,··· ,dk λe1 ,··· ,ek Q + O(1)
W [di , ei ]
d1 ,··· ,dk
e1 ,··· ,ek
X′ X
= λd1 ,··· ,dk λe1 ,··· ,ek + O(R).
Q
W [di , ei ]
d1 ,··· ,dk
e1 ,··· ,ek
Again, we expand out the square and reverse the order of summation, giving
X′ X
= λd1 ,··· ,dk λe1 ,··· ,ek 1P (n + hm ).
d1 ,··· ,dk−1 ,1 n≡v0 (mod W ),n∼X
e1 ,··· ,ek−1 ,1 [di ,ei ]|n+hi
k ≫ e(1.98276)ρ .
From equation (149) in [Po], the minimum diameter H(k) of a k-tuple can be
bounded by
H(k) ≤ k log k + k log log k − k + o(k),
So
Hm ≪ e1.9828m .
which is Main Theorem 1.
Here, the authors set α = T . In some sense, this theorem can be seen to build
on Zhang’s original idea that one can often derive better results by restricting the
analysis to smooth moduli.
[T ]
More specifically, the authors of [Po] seek to maximize Mk by treating it prob-
abilistically, eventually finding (see (114) of [Po])that for any arbitrary r > 1,
Z r−X1 −···−Xk−1 ! [T ]
2 m 2 Mk r
(6) g(t) dt ≤ P (X1 + · · · + Xk ≥ r)
0 k
where the Xi are independent random variables taking values in [0, T ] with probabil-
ity distribution m12 g(t)2 . The values of µ and σ for each variable in this distribution
are calculated as in the statement of the theorem, and one can see that the goal is
then to choose our c, T , and τ that maximizes Mk [T ] in (6) above.
[Po] defines the parameters c, T, τ in terms of new parameters β and η (and our
previously defined k), where
η
c :=
log k
β
T :=
log k
τ = 1 − kµ
[Po] then sets about finding the best choices of β and η to push Mk over the various
magical marks that will guarantee a certain number of primes.
β and η can be chosen arbitrarily. To find optimal choices for β and η, we
performed a Matlab computation where we began with the β and η chosen in [Po]
for the nearest tuple and then let the variables “walk” until we found apparent
maxima. The results are below:
PRIME TUPLES AND SIEGEL ZEROS 15
Theorem 13.1. Given the following values for β, η, and k, we can find the fol-
lowing for Mk :
Noting that
3
5.9484 > = 5.94828...
.504346916
4
7.9310494 > = 7.9310487...
.504346916
5
9.9138119 > = 9.9138109...
.504346916
we see that these choices of k are the desired ones.
As in [Po], we will let H(k) denote the minimal diameter hk −h1 of an admissible
k-tuple. Andrew Sutherland was kind enough to compute the minimal tuples5 for
the three numbers above:
M53 ≥ 3.986213
2
and 3.986213 is greater than .504346916 = 3.96552.... So assuming the existence of
infinitely many strong Siegel zeroes, we have
H2 ≤ 264,
which falls between the H2 found if one assumes EH (270) and that found if one
assumes GEH (252).
15. Acknowledgements
I would like to thank Andrew Sutherland for his help in computing the tuples
and for alerting me to the work of [St], as well as catching a couple of important
discrepancies between [Po] and the Polymath website. I would also like to thank
Beau Christ and Jamie Wright for their technological help. Finally, I would like to
thank the referee for numerous helpful comments and suggestions.
References
[BI] R.C. Baker, A.J. Irving. Bounded intervals containing many primes, Math.
Z. 286 (2017), 821—841.
[BFM] W.D. Banks, T. Freiberg, J. Maynard. On limit points of the sequence of
normalized prime gaps, Proc. Lond. Math. Soc., (3) 113 (2016), 515–539.
[Da] H. Davenport. On character sums in a finite field, Acta Mathematica 71
(1939), 99–121.
[FI03] J. Friedlander and H. Iwaniec, Exceptional characters and prime numbers
in arithmetic progressions, Int. Math. Res. Not. 37 (2003), 2033—2050.
[FI85] J. Friedlander and H. Iwaniec, Incomplete Kloosterman Sums and a Divisor
Problem, Ann. of Math., Second Series, Vol. 121 (2) (1985), 319–344.
[Fo] K. Ford, Large prime gaps and progressions with few primes, Riv. Math.
Univ. Parma , vol. 12 (1) (2021), 41—47.
[HB] D. R. Heath-Brown, Prime twins and Siegel zeros, Proc. London Math. Soc.
(3) 47 (1983), no. 2, 193–224.
[Ka] D.A. Kaptan, A note on small gaps between primes in arithmetic progres-
sions, Acta Arith., 172 (2016), 351—375.
[La] E. Landau, Über die Klassenzahl imaginär-quadratischer Zahlkörper, Nachr.
Ges. Wiss. Göttingen (1918), 285–295.
[Ma] J. Maynard, Small gaps between primes, Ann. of Math. 181 (2015), 383—413.
[Mo1] H. L. Montgomery, Primes in arithmetic progressions, Michigan Math. J. 17
(1970), 33–39.
[Mo2] H. L. Montgomery, Topics in Multipicative Number Theory, Lecture Notes
in Mathematics 227, Springer-Verlag, Berlin (1971).
[Po] D. H. J. Polymath, Variants of the Selberg sieve and bounded gaps between
primes, Research in the Mathematical Sciences 1:12 (2014), 1–83.
[St] Stadlmann, J. On primes in arithmetic progressions and bounded gaps be-
tween many primes, arXiv:2309.00425.
[TT] T. Tao and J. Teräväinen, The Hardy-Littlewood-Chowla conjecture in the
presence of a Siegel zero, preprint, https://arxiv.org/pdf/2109.06291.pdf.
[Zh] Y. Zhang, Bounded gaps between primes, Annals Math 179 (2014), 1121—
1174.