Area, lattice points, and exponential sums

LONDON MATHEMATICAL SOCIETY MONOGRAPHS NEW SERIES Series Editors H. G. Dales Peter M. Neumann LONDON MATHEMATICAL S...

Author: Martin Neil Huxley

91 downloads 1767 Views 5MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

LONDON MATHEMATICAL SOCIETY MONOGRAPHS NEW SERIES

Series Editors

H. G. Dales Peter M. Neumann

LONDON MATHEMATICAL SOCIETY MONOGRAPHS NEW SERIES Previous volumes of the LMS Monographs were published by Academic Press, to whom all enquiries should be addressed. Volumes in the New Series will be published by Oxford University Press throughout the world.

NEW SERIES 1. Diophantine Inequalities R. C. Baker 2. The Schur Multiplier Gregory Karpilovsky 3. Existentially Closed Groups Graham Highman and Elizabeth Scott 4. The Asymptotic Solution of Linear Differential Systems M. S. P. Eastham 5. The Restricted Bumside Problem Michael Vaughan-Lee 6. Pluripotential Theory Maciej Klimek 7. Free Lie Algebras Christopher Reutenauer 8. The Restricted Burnside Problem 2nd edition Michael Vaughan-Lee 9. The Geometry of Topological Stability Andrew du Plessis and Terry Wall

10. Spectral Decompositions and Analytic Sheaves J. Eschmeier and M. Putinar 11. An Atlas of Brauer Characters C. Jansen, K. Lux, R. Parker, and R. Wilson 12. Fundamentals of Semigroup Theory John M. Howie 13. Area, Lattice Points, and Exponential Sums M. N. Huxley 14. Super-Real Fields H. G. Dales and W. H. Woodin

15. Integrability, Self-Duality, and Twistor Theory L. Mason and N. M. J. Woodhouse 16. Categories of Symmetries and Infinite-Dimensional Groups Yu. A. Neretin

Area, Lattice Points, and Exponential Sums M. N. Huxley College of Cardiff University of Wales

CLARENDON PRESS 1996

OXFORD

Oxford University Press, Walton Street, Oxford OX2 6DP Oxford New York

Athens Auckland Bangkok Bombay Calcutta Cape Town Dares Salaam Delhi Florence Hong Kong Istanbul Karachi Kuala Lumpur Madras Madrid Melbourne Mexico City Nairobi Paris Singapore Taipei Tokyo Toronto and associated companies in Berlin Ibadan Oxford is a trade mark of Oxford University Press

Published in the United States by Oxford University Press Inc., New York © M. N. Huxley, 1996

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without the prior permission in writing of Oxford University Press. Within the UIX exceptions are allowed in respect of any fair dealing for the purpose of research or private study, or criticism or review, as permitted under the Copyright, Designs and Patents Act, 1988, or in the case of reprographic reproduction in accordance with the terns of licences issued by the Copyright Licensing Agency. Enquiries concerning reproduction outside those terms and in other countries should be sent to the Rights Department, Oxford University Press, at the address above. This book is sold subject to the condition that it shall not, by way of trade or otherwise, be lent, re-sold, hired out, or otherwise circulated without the publisher's prior consent in any form of binding or cover other than that in which it is published and without a similar condition including this condition being imposed on the subsequent purchaser. A catalogue record for this book is available from the British Library

Library of Congress Cataloging in Publication Data Huxley, M. N. (Martin Neil) Area, lattice points, and exponential sums/M. N. Huxley. (London Mathematical Society monographs; new ser., 13) Includes bibliographical references and index. 1. Exponential sums. 2. Lattice theory. 3. Functions, Zeta. I. Title. II. Series: London Mathematical Society monographs; new ser., no. 13. 1996 512'.73--dc20 QA246.7.H89 95-38370

ISBN 0 19 853466 3 Typeset by Technical Typesetting Ireland Printed in Great Britain by Biddies Ltd., Guildford, Surrey

To Trish, Nicholas, and Clare

Contents

Notation

xi

Introduction

1

Part I Elementary methods The rational line 1.1 Height 1.2 The Farey sequence 1.3 Lattices and the modular group 1.4 Uniform distribution 1.5 Approximating a given real number 1.6 Uniform Diophantine approximation

1.

Polygons and area 2.1 Counting squares 2.2 Jarnik's polygon 2.3 The discrepancy of a polygon 2.4 Fitting a polygon to a smooth curve 2.

The integer points close to a curve 3.1 Introduction 3.2 The reduction step 3.3 The iteration 3.4 Swinnerton-Dyer's method: the convex polygon 3.5 Swinnerton-Dyer's method: counting quadruplets 3.6 Expanding the result 3.7 Points on the curve 3.

The rational points close to a curve 4.1 Major and minor sides of the polygon 4.2 Duality 4.3 A linear form often near an integer 4.

Part II The Bombieri-Iwaniec method Analytic lemmas 5.1 Bounds for exponential integrals 5.

87 87

Contents

viii

5.2 Partial summation for exponential sums 5.3 Rounding error sums 5.4 Poisson summation 5.5 Evaluating exponential integrals 5.6 Bilinear and mean square bounds

89 93 98 104 114

Mean value results 6.1 The simple exponential sum 6.2 The lattice point discrepancy 6.3 The mean square discrepancy

126 126 128

The simple exponential sum 7.1 Motivation 7.2 Major and minor arcs 7.3 Poisson summation 7.4 The large sieve on the minor arcs 7.5 A preliminary calculation 7.6 Long major arcs

142

8.

The exponential sum for the lattice point problem 8.1 Major and minor arcs 8.2 The major arcs estimate 8.3 Poisson summation on the minor arcs 8.4 The large sieve on the minor arcs

167 167 170 172 178

Exponential sums with a difference 9.1 Major and minor arcs 9.2 The major arcs estimate 9.3 Poisson summation on the minor arcs 9.4 The large sieve on the minor arcs

185 185 187 188 193

6.

7.

9.

Exponential sums with modular form coefficients 10.1 Modular forms 10.2 The Wilton summation formula 10.3 Farey arcs 10.4 Wilton summation on the Farey arcs 10.5 A large sieve on the Farey arcs 10.6 Towards a mean square result 10.7 Jutila's third method 10.

135

142 144 149 152 158 160

197 197

200 208 210 213 220 225

Part III The First Spacing Problem: ìnteger' vectors The ruled surface method 11.1 Preparation: divisor functions 11.2 Families of solutions 11.3 A comparison argument 11.

235 235 241 249

Contents

ix

The Hardy-Littlewood method 12.1 Integrals that count 12.2 The minor arcs 12.3 The major arcs 12.4 Extrapolation

255

The First Spacing Problem for the double sum 13.1 Families of solutions 13.2 Good and bad families 13.3 The problem with a perturbing term

272

12

13.

255 259 262 270

272 278 283

Part IV The Second Spacing Problem: `rational' vectors The First and Second Conditions 14.1 The Coincidence Conditions 14.2 Magic matrices 14.3 The Second Condition 14.4 A family of sums

287

Consecutive minor arcs 15.1 Parametrizing rational points 15.2 Coincidence over a short interval 15.3 Linearizing the Fourth Condition 15.4 Coincidence over a long interval 15.5 Extending the Taylor series

307

The Third and Fourth Conditions 16.1 Counting coincident pairs of minor arcs 16.2 Sums with congruence conditions 16.3 Eliminating the centres of the arcs

329

14.

15.

16.

287 289 295 300

307 311 315 320 323

329 336 339

Part V Results and applications Exponential sum theorems 17.1 The simple exponential sum 17.2 Exponential sums with a parameter 17.3 A congruence family of sums 17.4 Sums with T large

349

Lattice points and area 18.1 Exponential sums 18.2 Integer points and rounding error 18.3 Lattice points inside a closed curve 18.4 A family of lattice point problems 18.5 Rounding error and integration

372

17.

18.

349 357 361 363

372 377 384 393 397

Contents

x

Further results 19.1 Exponential sums with a difference 19.2 A major arc estimate 19.3 Exponential sums with a large second derivative

406

Sums with modular form coefficients 20.1 Exponential sums 20.2 Mean value theorems 20.3 The modular form L-function

423

21. Applications to the Riemann zeta function

438

21.1 Introduction 21.2 The order of magnitude in the critical strip 21.3 The mean square 21.4 Gaps between zeros 21.5 The twelfth-power moment

438 441 443 447 451

22. An application to number theory: prime integer points 22.1 Preparation 22.2 Type I double sums 22.3 Type II double sums 22.4 Prime numbers in a smooth sequence

452

19.

20.

406 413 415

423 429 435

452 456 459 463

Part VI Related work and further ideas 23. Related work

467

23.1 Integer points close to a curve 23.2 The Hardy-Littlewood method 23.3 Other Farey arc arguments 23.4 Higher dimensions 23.5 Exponential sums with monomials

467 470 471 473 475

Further ideas 24.1 Comments on the method 24.2 Subdivision without absolute values

478

References

484

Index

491

24.

478 479

Notation GENERAL The lower case letters a to w denote integers unless stated otherwise. The letter p always denotes a prime number, and q always denotes a positive integer, usually the denominator of a rational number. The exceptions are as follows:

dy/dx etc., are derivatives. e and i are used for the usual mathematical constants when these occur. r, s, and t are used for variables where the notation is traditional: r for the

polar coordinate; s for arc length and for an exponent, and in Fourier transform formulae as in Dirichlet series; t for a real variable, usually `time', sometimes the imaginary part of a complex exponent s = o, + it; u and v are sometimes used for variables. x and y are always real variables, and z is a complex variable. Upper case letters are used for sets, matrices, and for real numbers (which may be integers). Coefficients (functions of one or more integer variables) are

usually denoted a(... ), b(... ), or c(... ). Functions of one or more real or complex variables are e(... ), f (... ), F(... ), g(... ), G(... ), h(... ), H(... ), k(... ), or K(...) and sometimes u(... ), v(... ), and w(... ). Greek letters are used for real and complex numbers and variables.

Theorems and lemmas are numbered uniquely. Theorem 18.3.2 is the second result in Chapter 18, Section 18.3. The equations in its proof are numbered (18.3.7) onwards, indicating that they occur in Section 18.3 of Chapter 18.

NUMBER-THEORETIC NOTATION The relation m I n (`m divides n (without remainder)') between m and n

means that m is positive and that n is a multiple of m. The relation a = /3(mod m) (à is congruent to /3 modulo m') between real numbers, usually integers, means that a - /3 is an integer n and that m I n. For integers m and n, (m, n) denotes the highest common factor (defined for m and n not both zero) and [m, n] denotes the lowest common multiple (defined for m and n both non-zero).

The usual functions dr(n), µ(n), cp(n), and o,(n) of number theory are defined as follows:

Notation

xii

The number of ways of writing n as a product of r positive integers is called dr(n); we write d(n) for d2(n), the number of factors of n. The Mobius function µ(n) is zero if n has a repeated prime factor, 1 if n = 1 or if n has an even number of prime factors, all distinct, and is - 1 if n has an odd number of prime factors, all distinct. The Euler function cp(n) counts the number of expressions 1/n, 2/n,..., n/n which are in their lowest terms. The sum of the factors of n is called v(n). For a real number x, [x] is the integer part of x, the unique integer n in x - 1 < n <x, [[x]] is the nearest integer to x, the unique integer in x - z < n5 x + 2(W) is x - [[x]], depending only on x modulo one, and 114 is the

distance of x from the nearest integer, the absolute value of ((x)). The rounding error or row-of-teeth function is

p(x) = [x] -x+ i = - ((x - 2)). ANALYTIC NOTATION We write

e(x) = exp27rix = exp2iri((x)), a complex exponential that maps the reals modulo one to the unit circle. We use the following order-of-magnitude notation. The Vinogradov symbol

F 0, is complicated, partly because the argument itself is complicated, and partly because the two main parameters H and N are each chosen to equalize two or more terms in the upper bound. There is even some moonshine: the Fourier integral in the last Poisson

summation is an incomplete Bessel function integral. For modular form coefficients it has integral order, for the lattice point problem it has order one-half, and for the simple exponential sum, order one-third. Bessel functions of integral order occur in the symmetric analytic method, but only for the special curves, the ellipse and hyperbola. Bessel functions arise in the representations of matrix groups, but order one-third is unexpected. This book is accessible to graduate students beginning research. The reader is assumed to know some real and complex analysis, and Fourier theory up to the function (sin 7rx)/Trx. Some ideas from plane geometry are used, such as the radius of curvature, and the duality of points and lines. For the applications to modular forms, prime numbers, and the Riemann zeta function, standard theorems are quoted. No special knowledge of number theory is required. All technical lemmas on exponential integrals or on the average number of factors of integers that are needed are proved. This book can be read as an introduction to research in analytic number theory. There

are no set exercises. The researcher should keep asking: `What is this about?'; `How is it done?'; `Why is it done this way?'; `Can't you use instead ... ?'; Ìsn't this the same as ... T.

Introduction

4

The expert will find the components of the new method presented in one

place for the first time, with suggestions as to how it relates to other approaches, and what direction it might take in the future. The results for sums in one variable in Chapters 17, 18 and 19 have been summarized for ease of reference as tables of exponents. Chapter 5 collects the most useful lemmas on exponential sums and integrals of one variable. A quick impression of the new method may be taken by reading Chapter 7 on the simple exponential sum, and Chapters 11 and 14 which begin the discussion of the two spacing problems. The list of references includes all the recent work (to 1993), and all the sources consulted in preparing this book, whether cited in the text or not. Thanks to Voronoi, van der Corput, and I. M. Vinogradov, without whom twentieth-century number theory would not be the same; to Davenport, a better teacher than I will ever be; to Bombieri, Hooley, Rankin, Selberg, and Swinnerton-Dyer for help and inspiration; to K. Ramachandra and the Tata Institute of Fundamental Research for the opportunity to work on this book; and to S. W. Graham, D. R. Heath-Brown, A. Ivic, H. Iwaniec, M. Jutila, G. Kolesnik, E. Kratzel, H. L. Montgomery, Y. Motohashi, C. J. Mozzochi, W. G. Nowak, 0. Trifonov, N. Watt, and everyone else who supplied ideas and comments.

Part I Elementary methods,

1

The rational line 1.1

HEIGHT

The rational numbers form a countable set, dense in the real numbers. They are set in the real line like stars in the sky `to illuminate the mystery of the

continuum' (a remark attributed to Borel). In any region, if magnified sufficiently, faint stars appear. To explain this bright and faint analogy, we need the concept of height.

A rational number a is uniquely represented as a fraction a/q in its lowest terms, that is, by a pair of integers a and q, with q positive and the highest common factor (a, q) equal to one. The height is defined by

H(a) = max(IaJ, q) There are three rational numbers of height one: 0, 1 and - 1 (unless infinity is counted as 1/0), and for each H z 2, the number of fractions of height H is 4cp(H), where cp(H) is Euler's function. The rational numbers are partially

ordered by height, and totally ordered when we put an ordering on the rationale of fixed height, for instance their ordering as real numbers. Each rational number of height H appears in the ordering in a position H

53+4F, (h-1)=2H2-2H+3. h=1

Thus the rational numbers are countable. We use the expressions `small rational number' and `large rational number' to mean rational numbers whose height is small or large. This usage agrees with the common expression à small integer'. The small rationale are the

bright stars in the analogy, the large rationale are the faint stars, and the irrational numbers form the background darkness of the continuum. The height gives a metric on the positive rational numbers with distance function defined by

d(a, /3) = log H(a//3 ), so that the translation identity

d(ay,lay)=d(a,a), and the triangle inequality d(a, y) < d(a, /3) + d(13, y)

The rational line

8

hold. Any two distinct rationals are a distance at least log 2 apart. It is fairly easy to see that any set of n distinct positive rationals contains a pair which is at least log p apart, where p is the largest prime up to n. Zaharescu (1987), answering a question of R. L. Graham, has shown that for large n there is a

pair which is at least log n apart. This result is probably true for all n (to verify it for small n means checking a finite list of possible exceptional sets). This chapter is about the interaction of the height and the metric I a - '61

on the real line. In later chapters we shall use the structure of sets of rationals of bounded height close to a given real number. To a first approximation the height distance and the real distance are independent, but there is an analogue of gravitational lensing, in which the images of faint stars are displaced away from a nearby bright star.

1.2 THE FAREY SEQUENCE The Farey sequence of order Q (in the strict sense) consists of the rational numbers of height at most Q between 0 and 1, arranged in their order as real numbers. We usually use the Farey sequence in contexts where only the fractional part of a real number matters. The extended Farey sequence of order Q consists of all the rational numbers with denominator less than or

equal to Q, the lifting of the strict Farey sequence from R/71, the reals modulo integers, to the real numbers R. We write .9(Q) for the extended Farey sequence.

Although .9(Q) is a subsequence of .9(Q + 1), the numbering of terms differs, and consecutive terms of .9(Q) may be separated in .9(Q + 1). Farey

was one of the first to notice the two properties in the next lemma, and brought them to the attention of the great Cauchy. Lemma 1.2.1 (Basic properties of the Farey sequence)

(a) If e/r and f/s are consecutive terms in .9(Q), then

f/s - e/r = 1/rs,

ft - es = 1.

(b) If e/r and f/s are consecutive terms in .9(Q), then they remain consecutive in .A(R) for Q ::g R < r + s, but are separated in .9(r + s) by the Farey

mediant (e + f)/(r + s). Proof First we show that property (a) implies property (b). If fr - es = 1, and

a/q lies between e/r and f/s, then

a/q - e/r = (ar - eq)/qr > 1/qr, and

f/s - a/q = (fq - as)/qs Z 1/qs.

The Farey sequence

9

Adding, we have

1/rs = f/s - e/r>- 1/qr+ 1/qs, so that

q>-r+s. The Farey mediant (e + f)/(r + s) does lie between e/r and f/s. We find that

f/s - (e + f)/(r+s) =1/s(r+s), (e +f )I(r + s) - e/r = 1/r(r + s), and thus no other rational number with denominator r + s can lie within the open interval (e/r, f/s). This calculation verifies property (a) for the pairs e/r, (e + f)/(r + s) and (e + f)/(r + s), f/s, which are consecutive in .3(r + s), and implies that (e + f)/(r + s) is in its lowest terms.

By induction we see that we can construct the whole Farey sequence between 0 and 1 from the consecutive pair 0/1, 1/1 in .9(1) by repeatedly taking Farey mediants, so property (a) holds throughout. If 1/0 counts as a D rational, then 1/1 is itself the Farey mediant of 0/1 and 1/0. Definition We say that e/r and f/s are consecutive Farey fractions if they are consecutive fractions of some Farey sequence .9(Q), for instance with

Q=r+s- 1.

Our next lemma describes the set of rationale between two consecutive Farey fractions. Lemma 1.2.2 (Self-similarity) If e/r, f/s are consecutive Farey fractions, then for Q r + s the members of .9(Q) lying between e/r and f/s are the fractions (eu + ft)/(ru + st) with t z 1, u z 1, (t, u) =1 and ru + st s Q.

Proof By the inductive construction of the Farey sequence in the proof of Lemma 1.2.1, all the members of .9(Q) between e/r and f/s can be written as (eu + ft)/(ru + st). Conversely for t > 1, u z 1

f/s - (eu +ft)/(ru + st) =u/s(ru + st), (eu + ft)/(ru + st) - e/r = t/r(ru + st). If (t, u) = 1, then there are integers v and w with Iw

ul-1'

so that

ew+fv rw+sv

eu+ftl ru+st -

f l

s

elly r 11w

t1=1 u

implying that the fraction (eu + ft)/(ru + st) is in its lowest terms.

The rational line

10

We often need an upper bound for the number of members of .g(Q) in a short interval. Lemma 1.2.3 (Short interval upper bounds) Let I be an interval of length A. Then

E

150Q2+1,

a/q e.9(Q)n 1 1

- 2 and w is also an integer vector. Each rational number a/q corresponds to a pair of primitive integer vectors (q, a) and (-q, -a), and each primitive integer vector (x, y), except (0, ± 1), corresponds to a rational number y/x. An infinite gradient can be regarded as the rational number 1/0. We consider the parallelogram in Figure 1.2 with vertices 0 (0, 0), A (r, e), B (r + s, e + f) and C (s, f ). The area of the parallelogram is fr - es. Lemma 1.2.1 corresponds to the fact that when OABC has area one, then there is no other lattice point within the closed parallelogram OABC, and we can tile the plane with copies of OABC translated by every integer vector (m, n). The

lattice with basis vectors OA, OC has the same lattice points, but its lattice lines are in different directions. Changing the basis without changing the lattice points corresponds to an automorphism of the group of vectors given

by the matrix I sf

er).). The full automorphism group consists of all integer

matrices with determinant ±1. We want areas to be preserved, not reflected, so the order of the basis vectors matters, and we use matrices of determinant +1, the group SL(2, Also, ,since rational numbers form the gradients of

vectors, the matrix I -o (

_° I acts trivially. The group which affects the

FIG. 1.2

The rational line

12

gradients of vectors is PSL(2, Z), called the modular group, the quotient of SL(2, Z) by its centre ( ±I), represented by 2 X 2 integer matrices of determinant one, but with I _d) and I a d )) both representing the same group element. ` ` The matrices of the modular group are classified by the trace a + d, which is invariant under conjugacy. Since we can change signs, we take the trace to be non-negative. Matrices with trace less than two are called elliptic, those with trace two parabolic, elations, shears or transuections, those with trace

greater than two hyperbolic. The modular group acts as rigid motions on two-dimensional hyperbolic space. In this action a hyperbolic matrix is a rigid motion along an axis, an elliptic matrix is a rotation about a finite point, and a transvection is a rotation about a point at infinity. Conjugacy corresponds to changing the origin of coordinates. This geometric action is parametrized by the action

ay

as+b ca+d

on the complex numbers. The real axis parametrizes the directions at infinity. We interpret Lemma 1.2.2 as mapping the positive rationals t/u one-to-one

onto the rationals between e/r and f/s by the matrix I Sf er ). In Lemma 1.2.1 we have

r+s) = (s r)(0 1)' e+f e f e 1 0). (r +s r) (s r)(1 1 (s

The semigroup of matrices in SL(2, 71) with non-negative entries is generated

by the transvections T =

(10

1) and U = (1

o

). These matrices T and U

generate the full modular group. The usual generators are S and T, where

S= (0

-10) =T_1UT-i

1.4 UNIFORM DISTRIBUTION The uniform distribution of a sequence of numbers is a basic concept in analytic number theory. Dirichlet proved his prime number theorem by showing that the sequence of prime numbers was uniformly distributed among the cp(q) possible arithmetic progressions modulo q. In this book we

usually consider the sequence of real numbers x =f(n), where f(x) is a smooth function. We expect x,, to be uniformly distributed modulo one.

Uniform distribution

13

Definition A sequence of real numbers xn is uniformly distributed modulo one if, for every e > 0, however small, there is an N(e) such that, for every M>_ N(e), and every subinterval I of the unit interval [0, 1], then the number M(1) of integers n in 1, ... , M, whose fractional parts x,, - [ xn ] lie in I, differs from M times the length of the interval I by at most Me. We call the difference M(I) - M I I I the discrepancy of the interval I. Uniformity conditions are required when one approximate process or limit operation occurs inside another one. If g(x) is a continuous function satisfying the periodic condition g(x + 1) = g(x), then the uniform distribution of x,, modulo implies that

M M n=1

1

g(x)

(1.4.1) 0

as M tends to infinity. Periodic functions belong to Fourier theory, and Hermann Weyl showed that a sequence x,, is uniformly distributed if and only if (1.4.1) holds for the trigonometric functions sin 21rhx and cos 2.7rhx for

each integer h. We use the notation e(x) = exp 21r i x

for complex exponentials (when x is a rational a/q, then e(a/q) is a qth root of unity). In this notation the Weyl criterion becomes M

F, e(hx,,)->0 n=1

as M tends to infinity for each non-zero integer h. Weyl's criterion for uniform distribution will be proved in Lemma 5.3.3. In practice Weyl bequeathed us the principle that one may average away a variable, provided that certain exponential sums are small enough; what

bound is actually required for the exponential sum, that depends on the context. In this section we test the Weyl criterion for the rational numbers in (0, 1], enumerated as in Section 1.1.

Lemma 1.4.1 (Ramanujan's sum)

F e Aqq

a=1 (a,q)=1

clh clq

cµ(

where µ(n) is the Mobius function. Proof Since

E µ(d)=0 din

The rational line

14

if n is greater than one, and the factors of the highest common factor (a, q) are just the common factors of a and q, the sum required is q

E

q1

a=1 din

q/d

b=1

dlq

where we have written a = bd. 1 he sum over b is zero unless h is a multiple of q/d so, writing c = q/d, we get /

F, µ(d) d = E F, cµ(d) _ > cp q clh

dIq

q/dlh

clh clq

d

cd=q

1 ).

`

Lemma 1.4.2 (Weyl's sum for the rational numbers) Let Q be a positive integer, and let x1, ... , x v be the rational numbers of height at most Q in (0,11. Then

N= 2Q2) +O(QlogQ)=

3Q2

+O(QlogQ)

and for each integer h N

dM(Q/d),

E e(hxa) dlh

1

where M(x) = Ea

x

µ(n) is the sum function of the Mobius function.

Proof For the first assertion, by Lemma 1.4.1 with h = 0, Q

q=1

Q

q

N= >

1= E F, F, cµ(d)

F, a=1 (a,q)=1

q=1

c

d

cd=q

Q

_ E E d µ(d) _ E

µd d)

d5Q

q=1 dIq

Q

E

q

q=1 q = 0 (mod d)

p(d) d

d5Q Q2

°r°

E

2 d=1

2 1

Q2 + O(Q) d

u(d) + d

2

O

I µ(d)1

Q2

d=Q+1

d

2

+ Q X-

dsQ

I µ(d)I d

The error terms are O(Q log Q) in order of magnitude. The infinite series is

1/i(s) at s = 2, and (2) takes the special value ire/6.

Approximating a given real number

15

For the second assertion, the sum can be rearranged to give

E

q=1

F, a=1 (a,q)=1

F, EdjA

elah q

d

q=1 dlh

dlq

Ed dlh

µ(r). r5Q/d

Lemma 1.4.2 gives the Weyl sum in the case when the enumeration x1, ..., XN corresponds to the complete set of rationals of height at most Q. If N is such that xN has height Q, but so does xN+1, then we must add O(Q) to the results of Lemma 1.4.2 to correct for the terms omitted. Since I M(x)I <x, we have, for non-zero h, N

Q Q + 1 since the mediant is not in 9(Q). Similarly if x > (e + f)/(r + s), then we take a/q = f/s. 0 The first proof is a variation of Dirichlet's pigeon-hole principle, that if Q + 1 letters are delivered to Q persons, then somebody has at least two letters in their pigeon-hole. The pigeon-hole principle is used like compactness in analysis. The second proof can be made constructive by the continued fraction rule, which, given a real number x, constructs a sequence of rational numbers, alternately below and above x. If x is rational, then this algorithm finds the

Approximating a given real number

17

lowest-terms form of x, and then terminates. In particular, if x is given as a ratio m/n, then the continued fraction rule is simply Euclid's algorithm to find the highest common factor d = (m, n) and integers u, v with n m

ul =d.

In our notation r = m/d, q = n/d, and r/q, v/u are consecutive Farey fractions.

The continued fraction rule is an iteration. Let

ro=ao,

ao = [x},

qO = 1,

and, if x :k a0, then let

x1 = 1/(x - ao). Then 1

ro

0

qO

and

x=

rox1+l gox1 + 0

For the second step, let

a1= [xl],

(ql)

0)(

f qO

1

and, if x1 0 al, then let

x2 = 1/(x1 -a1). Then r1

ro

ql

qO

= 1.

and

x=

r1x2 + ro g1x2 + qO

For the general step, let

(rn) =rrn qn

I11 qn

1

rn-2

-1

qn-2

and, if xn # an, then let

xn+1 = l/(xn -a,,).

1 ( an)

The rational line

18

Lemma 1.5.2 (Continued fractions)

At each step in the iteration given above

we have

r_1

rn

qn- 1 qn

_

(-1)n,

(1.5.1)

and

rnxn+l

(1.5.2)

gnxn+l +qn-1 lies between rn/qn and rn_ /qn-1, which are consecutive fractions of A q,,). The fraction with the even suffix is the smaller of the two. If x is irrational, then the sequence r,,/qn tends to x with 1

1

gnqn+l

2n

Proof We see that I

rn-1

rn

rn-1

rn-2

qn-1

qn

qn-1

qn-2 I'

and the first assertion (1.5.1) follows by induction. Substituting

xn+1 -an+1 = 1/xn+2 in (1.5.2) gives

x=

(rnan+l +rn-1)xn+2+rn (gnan+l +qn-1)xn+2 +qn

proving (1.5.2) for all n >- 2 by induction. We find that rn-1

-x=

qn-1

(-1)nxn+1 qn-1(gnxn+l +qn-1)

and rn

(-1)nxn+1

qn

gn(gnxn+l + qn-1)

Both these differences have the same sign, which is that of (-1)", since xn + 1, q,, and qn _ 1 are positive. The fractions rn/qn and rn - 1/qn are consecutive (in the appropriate order) because of (1.5.1) and Lemma 1.2.1. For the third assertion we note that qn+1 ?qn +qn-1z2gn-1, gnqn+l ? 2gngn-1,

and by induction gnqn+l ? 2"gogl

? "2

Uniform Diophantine approximation

19

Since x lies between r,,/qn and

we have 1

The matrices rn-1 qn-1

rn

qnJ

which occur in the construction have determinant ± 1, and rn-1

r,,

qn-1

qn

=

rn-1 qn-1

1an)

1

q,, -2 I\0

rn-2 1Ta,,

_

l qn-1 (r 21

qn-2J

in the notation of Section 1.4. Similarly we have rn+1 qn+1

rn qn

_

rn - I

rn

qn-1

qn

1

(a,

0 1

_

n-1

- (qn-1

)s_1Ts. r

We note that, although S-1 = -S as matrices, in the action of the modular group PSL (2, Z), S-1 = S.

1.6 UNIFORM DIOPHANTINE APPROXIMATION Although the average gap between the fractions of .9'(Q) is in order of magnitude 1/Q2, there is a gap of length 1/Q between 0 and 1/Q, and there are long gaps about other small rational numbers. In Borel's analogy, a bright star is surrounded by an area in which the stars are fainter than usual, as if the images of nearby stars have been displaced away from the bright star. If criterion (A), accuracy of approximation, is the most important, then, given an accuracy S (with 0 < S < 1), we need a set of reference fractions such

that every real number is distant at most S from one of the reference fractions. One way to do this is to divide the real line into intervals of length at most 3, and to choose the smallest rational number (smallest in height) in each interval to be a reference fraction. If the extended Farey sequence .9'(Q) has SQ2 < 1, then no two fractions of 9(Q) fall into the same short interval, so that all the fractions of .9'(Q) are reference fractions. Some intervals will

give reference fractions with denominator greater than Q. Choosing the smallest rational in each interval is what is called a `greedy algorithm' in combinatorics. The following construction gives an explicit set of reference fractions. We

start with the Farey sequence .9'(Q) for Q = [2/1/S ] and refine it to a sequence of rationals at most S apart. If e/r and f/s are consecutive Farey fractions with

If/s-e/rI<S,

The rational line

20

then we leave the interval [e/r, f/s] alone, but if S < I f/s - e/rl = 1/rs, then we fill the gap. Suppose that r > s. We insert fractions e,

e+ft,

f

1

ri

r+st;

s

s(r+st;)

which are adjacent to f/s in their respective Farey sequences. We choose t; by to = 0, and for i z 1 r + st; = 2/isS + O(1), the sequence stopping at t; when 1

s(r+st;)

5 S.

We note that, since 2r - 1 z r + s > 2/15, we have

r+st;>rz1/,/S. We have enlarged the Farey sequence AQ) to a sequence of reference fractions. The gaps between consecutive fractions have lengths between 8/4 and S. Our next lemma, on the greedy construction and a variant of it, is taken from Huxley and Watt (1988). Lemma 1.6.1 (Uniform Diophantine approximation). Suppose that we are given a real interval J of length 0, divided into subintervals Jk of length at least S,

where 0 < 6 < 1. In each interval Jk we pick the rational number ak/qk with least denominator. Let Q be any positive integer. Then the inequality qk 5 Q holds for at most OQ2+1 values of k, and qk Z Q holds for at most 80

8

62Q2 + SQ values of k.

If we pick instead the rational number ak/qk in Jk with qk least, subject to qk z 1/3, then qk z Q holds for at most 240

12

S2Q2 + SQ

values of k.

Proof The first assertion is part of Lemma 1.2.3. Intervals for which qk > Q contain no fraction of .1(Q - 1), so they lie between some pair of consecutive

Uniform Diophantine approximation

21

fractions e/r and f/s of 9(Q - 1). Suppose that s is the smaller donominator; then s < r, r >- Q/2. If there are n values of k for which the interval Jk lies strictly betwen e/r and f/s, then

n6- Q that belong to a fraction f/s outside J. All other intervals Jk with qk >- Q belong to some fraction f/s within J. Thus by Lemma 1.2.3 we find that qk >- Q occurs for at most 2 2-+2 E 6Q ss2/sQ

E f

2 SsQ

8i

< S2 Q 2

+8

SQ

values of k. In the case when choices of ak/qk with qk < 1/6 are not allowed, we may suppose that Q2 z 4/6, or the result follows from the trivial bound L/S + 1

for the number of subintervals. We must add the number of intervals Jk containing a fraction a/q in 9(1/%6) (which must be unique) and no other member of 9(Q - 1). Then q 5 Q/2. Suppose that a/q lies between e/r and f/s in .9(Q - 1). Since the mediant (a + e)/(q + r) is not in F(Q - 1), we have q + r >- Q, so that r >- Q/2. Similarly we have s z Q/2. We deduce that

SSf/s - e/r= 1/qr+ 1/qs M1, then all points inside M1C are inside M2C. Thus K(M) and K(M) + L(M) are increasing functions of M. We show that the limit of K(M)/M2 exists as follows. Let n be a positive integer. Then each unit square inside the curve MC corresponds to n2 unit squares inside nMC, and each unit square cut by MC corresponds to n2 unit squares, some of which are cut by nMC. Thus we have K(nM) >_ n2K(M),

L(nM) 5 n2L(M).

If L(nM)/n2M2 tends to zero as n tends to infinity, then K(nM)/n2M2 tends to a limit, the area inside the curve. We show that the area exists when C is a convex curve, of length A say. The curve nMC has length nMA, and the x-coordinate changes by at most nMA/2 on the curve nMC. Hence at most nMA/2 + 1 lattice lines parallel to the y-axis are cut by the curve nMC (Fig. 2.3), since the curve has to go there and back again. Similarly there are at most nMA/2 + 1 lattice lines parallel to the x-axis which are cut by the curve.

As the curve nMC is described in the sense of t increasing, it passes

Counting squares

27

N

nM.U2 FIG. 2.3

through L(nM) lattice squares. If the curve never cuts a lattice line, then L(nM) = 1. If L(nM) > 1, then the curve nMC enters each new square by cutting a lattice line. Since nMC is a convex curve, it meets lattice lines in two distinct points, or possibly in a closed interval. In the case of a closed interval, we take the endpoints of the interval to be the two points in which the lattice line is cut by the curve; they may coincide. Then L(nM), the number of squares cut, is at most the number of times that the curve nMC cuts a lattice line in this sense, and L(nM) < 4(nMA/2 + 1) = 2nMA + 4.

We see that L(nM)/n2M2 tends to zero, and the common limit, A, of K(nM)/n2M2 and (K(nM)+L(nM))/n2M2 does exist. To show that the limit is independent of M, we use the fact that if n and r are integers with rM M1"d+E and a certain number of derivatives not vanishing. Bombieri and Pila (1989) took these ideas further, getting the bounds

R(M) > A2/3 >> A1/3.

Proof Let the vertices of the Jarnik curve be PI,..., PR. Let Cr be the circumcircle of the triangle Pr-1PrPr+1 (Pa is identified with PR, PR+1 with P1). The triangle Pr_1PrF.+I has area i by Lemma 1.2.1, and sides of length

between 1 and HV. Either Pi_1Pr or PrPi+1 is a vector with one entry at least H/2. Let Or be the angle 1 _1PrPr+1 (Fig. 2.6), K, be the length of Pr Pr+1, la'r be the length of Pr_1'r+1, and let pr be the radius of the circle Cr. By elementary geometry 1

2

1A'r

= Kr- 1 K,sln 6r,

sin Or e = 2Pr'

so that Pr = Kr-1 KrPA'r s Kr-1Kr(Kr-I + Kr) 5 4H3V.

The circles Cr and Cr+1 meet at P. and at Pr+1. We now take circles Dr of radius 2 pr, such that Cr and Dr touch internally at P. (Fig. 2.7). The circles

Dr and Dr+ 1 meet outside the circles Cr and Cr, 1, so that there is an intersection between P. and Pr+ I and outside the Jarnik polygon. We can

Dr+l

FIG. 2.7

The discrepancy of a polygon

33

join P and P,+ 1 by a smooth curve E, which has the same tangent and radius of curvature at the point P, as the circle D, and has the same tangent and radius of curvature at the point Pr+ 1 as the circle D,+ 1, and has radius of curvature at most 4HVV. When we subtract the equation of Dr from the equation of D,+ and change coordinates, then this corresponds to finding a 1

function g(x) with g(0), g'(0) and g"(0) all zero, g(x), g'(x) and g"(x) non-negative and bounded on 0 < x 51, and taking prescribed values at x = 1. The length of Er is less than the length of the polygonal path from P, to P,+ 1 formed by the tangents to the circles Cr and Cr, 1 at Pr and P,+ 1,

respectively. The angle between the tangent at Pr and the line P,P,+1 is equal to the angle P,Pr_1Pi+1, which is less than IT- Or. If H is large, then Ors 31x/4, the tangents cut at an angle greater than 7r/2, and the length of Er is Ar 5 (sec 0, + sec 0,.+1)K,:!,r+ 1) K, S 2K,'!1.

The Jarnik curve is formed from the curves Er, having continuous radius of curvature, and length

A= EA, 1, or if the equation has the form x = y, then we assume that

we treat y as the independent variable. Let E be the region between the lines x = t + 2, x = u + 2, y = v + i and y = ax + /3, a quadrilateral whose R, and Pn_1Pn, extended if necessary. The quadrilateral may be self-intersecting. We take area and sides lie along the lines Rn_1Qn_1, RnQn, Q,,

lattice points above the line y = v + i as counting positively, those below as counting negatively. Then the signed number of lattice points in D differs from the signed number of lattice points in E by at most two, and the area within D differs from the area within E by at most four, the difference being

contained within lattice squares containing Q,,-, and Q, and adjacent squares above or below these two. The area of E is 2 a-v-)dx=(u-t)

(a(t+ u+1) 2

1

t+

1

+/3-v-2Ji

and the number of lattice points is u

u

m=t+1

1+1

F, ([am+/3]-v)= L(p(am+/3)+am+/3-v-Z) U

_ L p(am+/3) t+1

+(u -t)

(a(t+u+1) 2

+/3) -v-Z

Polygons and area

38

Hence the signed number of lattice points in the pentagon D differs from the area of D by u

E p(am+/3)+O(1). r+1

We put together the bounds given by Lemma 2.3.2 for each such pentagon to obtain the error estimate in counting squares for a convex polygon. Theorem 2.3.3 Let D be the region inside a convex plane polygon P1P2 ... P. as follows. 1P,, (with P0 = PN) define the bound 1, For each edge

Let L be the length of the edge P1P,, and a be its gradient, with continued fraction a = ao + 1/(a1 + ), and convergents r,/q;. Let k be the largest integer with qk 5 L + 1. Then

B(Pi-1,

=ao+a1 + ... +ak + (L + 1)/qk

In this notation the discrepancy 0 of D, the difference between the number of integer points in D and the area of D, satisfies N

0
>-1

E B(u) - 0. Then the number of solutions of (3.1.1) with F(x) = G(x) R 0,

-1-1/r

if -K 28 (u1 + U2).

Since u1 and u2 are positive integers, u1u2 > (u1 + u2)/2, and

d21>(S(u1+u2)-1)2-1.

(3.4.4)

Swinnerton-Dyer's method: the convex polygon

55

We see that d21 (and similarly d;+1,i) cannot be negative. The integer points Pi, (mi, ni), close to the curve form a convex polygonal line. The gradients vi/ui form a decreasing sequence. No four vertices form a

parallelogram, so the differences m, - mi, n1 - ni together determine the points Pi and P,, we use this in the next section. We call PiPi+1 a minor side

of the polygon if neither P;_1 nor P;+2 lies on the straight line PiPi+1. A major side of the polygon consists of a maximal sequence Pi, Pi+ 1,

, Pi+, of

collinear consecutive vertices, along some line y = (ax + b)/q with a, b, q integers, and a/q in its lowest terms. By (3.4.2) the gradient a/q lies in an interval I with length j

AL+

AL

There are at most two intervals of x on which TFlMax+blS6.

(3.4.5)

q

The integer points Pi,..., Pi+, fall into one or other of these intervals. A proper major side is a maximal sequence Pi, Pi+ 1, ... , Pi +, of at least three collinear consecutive vertices on a line y = (ax + b)/q, lying within an interval of x on which (3.4.5) holds. The integers mi, ... , mi+r are in arithmetic progression with common difference q. We see from (3.4.3) that a proper major side has length at most K, with qt- 0 for which 2` > k1 > 2`-1

(3.5.10)

Here r takes at most log N/ log2 + 1 values. We treat the trailing triples similarly, according to the size of k3. Suppose that a1, a2, a3 and k1, k2, k3 are leading triples in the same class (3.5.10). From (3.5.7) and the analogous determinant formed with the k, we have c31(a2k1 - a1k2) = c21(a3k1- a1k3), c31(a2k3 - a3k2) = c32(a1k3 - a3k1). We see that the integer

e = a1k3 - a3k1

(3.5.11)

is divisible by

_

C31

C31

(C211 C31) ' (C31, C32)

_

C31

C31

(c21, C311 C32)

ac

Eliminating c21 from (3.5.8) and its analogue for the ki gives C32(C31+C32)(a2k2-a2k;)

1B

(4.1.7)

2 8'

and at most 2Q(2C0)1"3 M

(4.1.8)

values of r with ar+1

qr+ 1

ar+2

qr+2

ar+2

qr+2

ar

qr

ar+1

qr+l

ar

qr

1

>_

2grgr+lqr+2

(4.1.9)

2

for some e between 0 and M. Corollary

The number of rational points on the curve which can be written as

(a/q,b/q) with QSq 1/2grgr+lqk ? 1/8Q2gk Hence the determinant D.Jk has size I D;JkI

= 2g1gjqk(area PiPi Pk) > d/4.

In Lemma 4.1.2 we have either I80iikl z d/8,

so that (4.1.16) holds, or, for some e a, ai

q,

ak

qi

ai

qk

ak

qk

AF"(/M)

qi

a,

q,

2gIgjqk

d

8)

giving

a,

a;

ak

a1

ak

a1

qi

q,

qk

qt

qk

q,

d 32CiiQ3 '

which implies the remaining inequalities.

Lemma 4.1.7 (Separation of major sides)

j-i=dz2,

Let P,P1 be a major side, with aj a;

- - = A,

q1

q1

and equation lx + my + n = 0. Then a,+1 q1+ 1

-a

>_min

q1

F 2m

d

d

i

6 4S Q2' 1 6 AQ 2

(4.1.18)

and also aj+1

ai

q1+1

q1

->min

d

d

192 SQ2 ' 48C Q2

m ( 2S )

(4.1.19)

Proof Since the line contains d + 1 rational points with x values at least m/4Q2 apart, we have the inequalities A Z dm/4Q2,

d :!!g 4Q2A/m.

The rational points close to a curve

70

If the second possibility (4.1.17) of Lemma 4.1.6 holds, then

- a,12

(aj+i qt+i

q,

d

d2m

J Z 32C Q3A

128COQ5A2'

which gives (4.1.18). For the bound (4.1.19) we modify the proof of Lemma 4.1.6. Let e = [d/2] > d/3. By (4.1.15) of Lemma 4.1.5 we have as/q3 - ar/qr 5 AA'

either with r = i, s = i + e or with r = i + e, s = i + d. We argue as in Lemma 4.1.6 with i, j, k replaced by r, s, i + d + 1. Again we have

e 5 4Q2µ/m.

µ >_ em/4Q2,

(4.1.20)

Corresponding to (4.1.17) we have ar

ai+i ( qj+i

e

2

-, Z 32C Q3µ qr

elm

>

elm 512C2Q' '

128C OQSµ2

giving the second case of (4.1.19).

Lemma 4.1.8 (Vertical major sides) most 366MQ2 rational points.

O

The major sides with m = 0 contribute at

Proof In the case m = 0 we have (1, n) = 1, so x = -n/l, y = b/q with 1 1q .

Let q = kl. Then ly lies in an interval of length 281/Q, with denominator k ::g 2Q/l. By Lemma 1.2.3, there are between 3 and 281 4Q2 Q

+1 12

88Q + 1 5 128Q 1

1

choices for b/q for each fixed n/l. Also, n/l lies in an interval of length M, and 15 2Q, so by Lemma 1.2.3 again, vertical major sides contribute at most

Fl128Q 1

:!g 128Q(2QM+ 1) 5 366MQ2.

0

n/[

Lemma 4.1.9 (Good major sides) Each individual major side contributes at most 72

(2 S CQ3

(4.1.21)

m

rational points. For A z 1, the major sides with m0 0 and A 5 4A a

(CQ)

(4.1.22)

Major and minor sides of the polygon

71

contribute in total at most 144

2SCQ3

0

0

) + 192ASMQ2

(4 . 1 . 23)

rational points, and for B S 1/S the major sides with m >- 1/B8

(4.1.24)

588B1/2CSMQ2

(4.1.25)

contribute in total at most rational points.

Corollary The number of rational points on major sides is at most 606B112CSMQ2.

Proof We use the notation of Lemma 4.1.7. Since d >- 2, the number of rational points on a major side satisfies d + 1 5 3d/2, and by (4.1.20) in the proof of Lemma 4.1.7 we have

d < 3e S 12Q2µ/m,

(4.1.26)

which gives the first assertion (4.1.21). For the other bounds we adapt the argument of Lemma 4.1.3. We number the major sides satisfying (4.1.22) from

1 to K, and call their endpoints P,(k), Pj(k), with dk =j(k) - i(k). We put Xk = ai(k)/qi(k) Since A + 1) >- j(k) and i(k + 2) > j(k) + 1, the bound (4.1.18) of Lemma 4.1.7 gives xk+ 2 - xk ?

as(k)+1

a'(k)

dk

- qi(k) - 64A S Q2 qj(k)+1 2

Thus K-2

K-2

E dks64ASQ2 k=1

(xk+2-xk)5128ASMQ2. 1

The bound (4.1.23) follows when we use (4.1.26) for the final values k = K - 1

and K, and the inequality dk + 1 < 3dk/2. Similarly, we number the major sides satisfying (4.1.24), and define Xk likewise. By (4.1.19) of Lemma 4.1.7

xk+2 -xk z dk/96B112C8Q2. and by (4.1.26) 2 SCCQ3

d k 5 48BS

48S

2BCQ3 :!9 - 96B112CSMQ2

for each k, since OM2 Z 1. This gives the bound (4.1.25). The corollary


72

follows when we give B its maximum value 1/8 in (4.1.25), and add the bound of Lemma 4.1.8 for the number of rational points on vertical major

0

sides.

4.2 DUALITY We need the notion of the function g(y) complementary to f(x). It

is

convenient to have f(x) defined for all x, with f"(x)> 0, although we only

use values of x in the range 0 to M. The function F(x) is defined for 0 < x< 1, where it satisfies (4.1.4). Suppose that (4.1.4) holds on a range a - /3. This gives a function F(x) defined for all x, twice continuously differentiable, satisfying the inequality (4.1.4), and agreeing with the original definition for

0<x< 1. We continue to put f(x) = TF(x/M). The inverse function h(y) of f'(x) is defined for all y. We put

g(y) =yh(y) -f(h(y)), so that

g'(y) =h(y) +yh'(y) -f'(h(y))h'(y) =h(y), and

g"(y) =h'(y) = 1/f"(h(y)). We can easily check that f(x) is the function complementary to g(y). Figure 4.1 shows the relation between the complementary functions f(x)

and g(y) in the special case when f(x) vanishes at the origin. They are represented as areas bounded by the curve y =f'(x).

g(y)

fl FIG. 4.1

Duality

73

Lemma 4.2.1 (The dual curve) On a straight line lx + my + n = 0, there are at most two disjoint intervals with the property that every point on them has

l y -f(x)l5 S/Q.

(4.2.1)

If I is such an interval, containing two rational points Pi, Pj satisfying (4.1.1), with

A=ai ---aiqi q1

then min XE!

l

28

m

AQ

- + f'(x)

,

(4.2.2)

and

g(

l

n

m

m

2C3S2

S

Q+AA2Q2

(4.2.3)

Proof Since f"(x) > 0, the points x with

f(x)5 -(lx+n)/m+S/Q form an interval, which may be empty. The points with

f(x) S -(lx+n)/m-S/Q form a subinterval, which again may be empty. The difference of these sets is one or two intervals on the real line.

Let U, (xo, yo), be the point on the curve y = f(x) where the gradient is

-I/m. Then xo=h(-l/m), so that Yo + lxo/m + n/m = n/m -g(-l/m). In the one-interval case, the value xo must lie within the interval, so lxo

Yo+m-

+

n

m

S

S Q.

In the two-interval case, the value xo does not lie in either interval. We use the rational points Pi and P. By subtraction

f

so, for some

aj

ai

-f(gi

ll

+

l

a,

ai

m q1 qi between ai/qi and at/qj, we have, by the mean value qt

theorem, 1

MO +In This completes the proof in the one-interval case.


74

In the two-interval case, the point xo does not lie in either interval. A second application of the mean value theorem gives 28

A CIA-xols

Q.

lies between a./qi and a,/q1, whilst xo does not, then either ai/qi or a!/q, lies between l; and xo. Thus for P, (a/q, b/q), equal to either P, or Pj, Since

we have

la

-x

IS

2CS

AAQ'

V

and Taylor's theorem about xo gives

b qS6=

x +(q -x)

(4

2

for some q. Since f'(xo) _ -1/m, and la + mb + nq = 0, we see that lxo

I

YO +

nI

2C3S2

S

m + m 5Q+-z-

which completes the proof of the lemma.

Lemma 4.2.2 (Points close to the dual curve) The number of rational points on major sides is O(C2SMQ2 + C7/651/2A1/6MQ11/6 )

Proof For technical reasons we want an upper bound for the length A of a

major side (measured in the x-direction). If A exceeds the bound µ of Lemma 4.1.5, then we argue as in Lemma 4.1.7. Let the points on the major

side be Pi, ..., Pi+d. Then for e = [d/2] either Pi,..., Pite or Pi+el

) Pi+d

span a distance at most µ in the x-direction. Instead of considering the whole

major side, we take the shorter of the two halves, which contains at least (d + 1)/2 rational points, including the endpoints. Hence we can suppose that A 5 p., provided that we double the final estimate for the number of rational points. We use Lemma 4.1.8 for major sides with m = 0, and Lemma 4.1.9 with A = C for major sides with length A satisfying A 5 4S

(---) K

.

(4.2.4)

We use Lemma 4.1.9, with a value of B to be chosen, for major sides with m > 1/BS. Let N be the length of the interval taken by f'(x) for MS x5 2M, so that N< CAM. By (4.2.2) of Lemma 4.2.1, major sides for which (4.2.4) is false have 1/m within a distance AQ 1 ( 28 A

52

I\

J« 1i cM

2Cm AQ)

Duality

75

of a value of f'(x). Thus l/m lies in an interval of length M' 0, the major sides contribute

O I C19/9SMQ2I CQT I E+ C11/661/2A1/6MQ3/2log 1 J (QT)

= OI C10/36MQ2

E

+ (CA)1/3MQ)

(4.2.8)

rational points. The implied constants depend on e.

Proof We can assume that (4.2.9)

Q >_ 2C/e2,

since the result follows from Lemma 4.2.2 if (4.2.8) is false. As in Lemma 4.2.2, we use Lemmas 4.1.8 and 4.1.9 with A = C for major sides with m = 0, m >_ 1/BS or with (4.2.4) false. These cases give the terms of (4.2.7). Let N be the length of the interval taken by f'(x) for M 5 x 5 2M, as in Lemma 4.2.2. By (4.2.2) of Lemma 4.2.1, major sides for which (4.2.4) is false have 1/m within a distance 2S

eAM

1

AQ2

(2CmQ) < 4C 11

eN 4

of a value of f'(x), since N>_ OM/C> 1/C, and (4.2.9) holds. Thus 1/m lies in an interval of length

M'M. We write

0=T/M2. Let f(x) = TF(x/M), and let R be the number of points (x, y) which can be written as (a/q, b/q) with a, b, q integers, 0:5 x 5 M, Q:5 q < 2Q, and with

Iy-f(x)I=

b

-f(a

S

)I < Q 9 Then, as either M, Q, or T tends to infinity, we have the bounds R 0 on (a, 13). Let g(x) be real, and let V be the total variation of g(x) on the closed interval [ a, /3 ] plus the maximum modulus of g(x) on [ a, /3 ]. Then

fpg(x)e(f(x))dxl5

4V

a

Proof As in Lemma 5.1.2 we need only consider the pure exponential integral. If f'(x) changes sign at a point y in the interval (a, )3), then f'(x) z 8A for x Z y + 8, and I f'(x)I z SA for x < y - S. We use fY+se(f(x))dxl y-s f." < 1/ iTSA + 28 + 1/irSA,

f pe(f(x)) dx 5

8

e(f(x)) dx

+l f p e(f(x))dx y+s

where we have used Lemma 5.1.2 on the first and third integrals. We take

Partial summation for exponential sums

89

S = 1/ (ara) . If y does not exist, or y 5 a + S, or y>- 6 - S, then there is a similar subdivision of the range of integration which leads to a stronger inequality.

Lemma 5.1.4 (rth derivative test) Let f(x) be real and r times differentiable on the open interval (a, f3) with f (')(x) >- µ > 0 on (a, G). Let g(x) be real, and let V be the total variation of g(x) on the closed interval [ a,13 ] plus the maximum modulus of g(x) on [a, R ]. Then

f Rg(x)e(f(x))dxl
- T/C2M2,

so that f'(x) is monotone with A = f'(a) < f'(x) < B = f'(/3). Let g(x) be a real function of bounded variation V on the closed interval [ a, l3 ]. Then g(n)e(f(n)) = a_-5n_-5P

ASrSB

g(xr)e(f(xr) - rxr+ ) g fd(xr)

+O(V+Ig(a)D

77

+log(B-A+2)1

Evaluating exponential integrals

109

where xr is the unique value in a <x 5 /3 with

f'(Xr) = r. Proof First we consider the unweighted sum. We apply Lemma 5.5.2 to the integrals J

Re(f(x) -rx)dx

in the truncated Poisson summation formula (Lemma 5.4.3) with y = Xr. First we consider the exceptional cases when Lemma 5.5.2 gives less information than the Second Derivative Test (Lemma 5.1.3). This occurs when Xr is within a distance O(M/ FT) of the endpoints a or (3. Now -xr)f"()

1 =f'(xr+]) -f'(xr) _ (xr+l for some

between xr and xr+ 1. Hence

(5.5.9) xr+1 -xrXM2/T. The number of values of r with xr within O(M/ FT) of either endpoint is

MT

1, the Riemann zeta function is given by 1

W

(s)= F S. n=1 n

By Lemma 5.4.1 with a = T, b - oo,

(s)

T-1

1

ns

1

T-

1 1

1

+

1

1

+f

2TS

dx+ f

T xs

Tl-S

1

ns + 2TS +

s-1

00 s(s + 1)u(x) X

T

S+z

( s(s + 1)o (x) dx. xs+z + JT

dx (5.5.14)

The integral on the right converges absolutely for Re s > -1. Hence (5.5.14) gives a meromorphic continuation of C(s), and we may use it as the definition of C(s). From the proof of Lemma 5.4.2 we see that

u As + 1)v(x) IT

xs+2

s(s + 1)(1 - cos2.7rrx)

u

dx = fT

2a2r2xs+2

r=1 s

-

1

12

1

US+1 )

( TS+1

E s(s + 1) r=1

1((

21rzrz I TU

/

dx

t

1

xv+zI\eI`rx-

t

+eI -rx- Z-logx T2

1

« Tv + r=1

dx

1 «Tv, r2 Tv+z r 1

1

by the First Derivative Test (Lemma 5.1.2). We let U tend to infinity.

Step two We use the truncated Poisson summation formula of Lemma 5.4.3,

Analytic lemmas

112

with a weight function g(x) =x-° put in as in Lemma 5.5.3. For ease of writing, we consider the case of t positive. We have T

T

1

Fns=E fNe

1

1-

x°dx+O

log T

N°

where the sum is over t

t

1

1 2arT-45r52i7N+4.

The term r = 0 gives T'-g

N'-$

1-s'

1-s

the first term cancels in (5.5.14). The integral with r = [t/27rN + 1/4] is

divided into blocks with x running from Nk to Nk+l, chosen so that

t

Nk+ I < 2Nk for each k. By the Second Derivative Test (Lemma 5.1.3)

fNk"e (

Nk

rx-

2?r

Nk

logx )I

x dx
- T/C2M2. Suppose also that f'(x) changes sign from negative to positive at a point x = y with a < y < P. If T is sufficiently large in terms of the constants Cr, then we have

I Rg(x)e(f(x)) dx -g(a)e(f(y)+8) +g(/3)e(f(/3)) f'"(y) 21rif'(9) +O

T4U11

+-

g(a)e(f(a)) 21rif'(a)

1

N/2(

+O

3) The implied constants are constructed from the constants Cr. 5.6

1

U (1

(T

+

N/2)

BILINEAR AND MEAN SQUARE BOUNDS

An analytic number theorist, it is said, is someone who is very good at using

Cauchy's inequality. Often in analytic number theory we come across a double sum MZ

N2(m)

F,

F,

m=M1 n=Ni(m)

g(m)h(n)e(f(m, n)).

(5.6.1)

Bilinear and mean square bounds

115

Following Montgomery and Vaughan, we classify double sums as type I if either sequence of coefficients is smooth. If h(n) has bounded variation, then we can estimate the inner sum in terms of pure sums. If both coefficient sequences are irregular, then we use Cauchy's inequality. We state the first lemma for a sum slightly more general than (5.6.1). Lemma 5.6.1 (Type II sums) M2

N2(m)

F,

F,

2

g(m)h(m, n)e(f(m, n))

m=Mi n=Nl(m) M

Ig(m)I2) M2 n1

h(m,nl)h(m,n2)e(f(m,n1)-f(m,n2))

E

x

m=M1

n2

N(m)S n U112

Zm)zn1,n2

Proof We use Cauchy's inequality on the sum over m; the second factor is M2

2

N2(m)

E h(m, n)e(f(m, n))

E

m=M1 n=NI(m) rearranged.

In our next lemma a sum in one variable is treated as a double sum. The method is `divide and conquer'; it is equivalent to splitting the range into short intervals, using Cauchy, and then averaging over the possible points of dissection.

Lemma 5.6.2 (Differencing step) Let f(x) be a real function, defined for M< x:5 N, and let w(M), w(M + 1), ... , w(N) be any coefficients. Let D 5 (N - M + 1)/2 be a positive integer. Then N

E w(m)e(f(m))I

(N-M+2D-1)

2 A

N(S) >_

=2

3/2

j

i

f

S

)

f

2 2

2

e(sx - six)sinc2 j

Sx

dx

2 2

S

I.F'(x)I sinc 2 dx Z 2

Sx

C3

IF(X)12

7r/2

dx,

and in the other direction /

z

N(S)5Ei F(2) since1sS 21) ` / / 2

4

26 f E E e(sjx-six)A(2Sx)dx

'72S

0

= 2 C28 11

2S

j

i

f

S

2

IF(x)I A(26x)dx

SIF(x)IZ dx.

p

The k-dimensional case goes similarly.

Lemma 5.6.6 (Double large sieve)

Suppose xWlW,..., x(m) and yU),

are real vectors in k dimensions with

-Xi/2 5 x;"') S Xi12,

-Y/2 < y, ") 5 Y,/2

, y(N)


121

fori=l,...,k; m=1,...,M; n=1,...,N. Let Ei=1/Xi, Si=X,/(X;Y+1), and put

A(x) = max(1 - Ixl,O). Then 2

E ambne(x(m) Y(")) m

k

4k

17.

(2) A(S)B(e) [1 (X;Y + 1),

(5.6.3)

n

where k

x(m) - x(r) k

A(s) _ E E IamlIarl [l A

r

r

m

L

k

B(E)= '+

bn b

r

n

1

si

1

t

(

AI

y(rn)

E'

I.

y,(r)

Proof As in the last lemma we drop the suffices and treat the onedimensional case. Since E/2

ify- E/2

e(sx) ds = e(xy)

e(Ex/2) - e(- Ex/2)

= Ee(xy)sinc Ex,

21rix

we have

E E ambne(xmyn) E E m

n

m

n

am bn

fYn + E/2

e slnc Exm y,- E/2

= 1(Y+ 0/2

e(sxm) ds

F,

-EAme(sxm)

(Y+ 0/2 m

IYn-sIs /2

where am

Am -

E sine Exm

I AmI S

2E

laml.

By Cauchy's inequality we have ambne(xmyn)

11: m

SI1I2,

n

where I1 and I2 are the integrals

I

z

(Y+E)/2

2

b

I (Y+E)/2I IYn-SISE/2 "

_

bnbrmax(E - IYn n

I1 = f

ds

yrI,O) = EB(E);

r (Y+ E)/2

2

I F, Ame(Sxm)I ds

-(Y+E)/2 M

b,, dS,

Analytic lemmas

122

with 00

7T2

I1 < f 4 since

2

6sl>Ame(sXm)I ds m

00

7r2

rXk -Xm

AmAk4SAI m

k 7T2

IAmAkI m

ir 4 16

k 1

SEZA(S)

46

S

)

maxi`1-

IXk -Xm)

7r4 (XY+ 1) 16

S

,0

A(S).

E

For the k-dimensional case we use Cauchy's inequality inside a k-fold

0

integral with respect to s1, ... , sk.

To show why Lemma 5.6.6 is called the `large sieve', we deduce from it the simplest inequality of sieve theory, the `Brun-Titchmarsh Theorem'. Hardy and Littlewood (1923) found the first result of this type using Brun's selective version of Eratosthenes' method, rejecting multiples of small primes. Later Selberg changed the treatment from linear to quadratic, finding a signed weight function which is large at primes, and summing its square over all the integers in the interval. Linnik meanwhile found sieve upper bounds using his

dispersion method. The basic idea of the dispersion method is to prove uniform distribution by a mean-to-max method, relating the absolute discrepancy to the mean square of the discrepancy on subintervals. Halberstam and Richert (1974) give a detailed account of sieve methods. We give two proofs of the Brun-Titchmarsh theorem from Lemma 5.6.6, the first one of Linnik's type, the second one of Selberg's type. The two proofs are dual in the sense of bilinear forms (see Lemma 10.6.3).

Lemma 5.6.7 (Primes in an interval) There is a constant C such that, out of any M(Z 4) consecutive integers, at most CM/log M of them have no prime factor less than YrM-.

Corollary If the endpoints of the interval are greater than VM_, then at most CM/log M of the integers in the interval are prime.

First Proof Suppose that the vectors y(°) are E-well spaced, in the sense that, if n r, then for some i we have IyI "> -y(r)l > E;.

Then all terms in the sum B(e) with no r are zero, and B(E) _ , Ib"I2. n

(5.6.4)


123

We choose bn by

k= E ae(x(m) . y(n)). m

The left-hand side of (5.6.3) is B(E)2, so we have k

4k

B(E) _

ame(x(n') .y(n))I2 S (

I

2)

m

n

A(S) F1 (X;Y + 1).

5.6.5)

Now we specialize k to be one (we drop the suffices i), and the points y(n)

to be the fractions of the Farey sequence AQ) with - i < a/q 5 m - (M + 1)/2, so that

X=M-1,

$=1-1/M.

e=1/(M-1),

Y=1,

x(n') _

Since S < 1, we have 12.

A(S) = E la

The minimum distance between members of AQ) for Q >- 2 is 1/Q(Q - 1), so the spacing condition (5.6.4) holds with Q = [vrM-] (the lemma is trivial for

M 4).

Suppose that the interval goes from K + 1 to K + M. We put am =1 if K + m has no prime factor < Q, zero otherwise. When yn = a/q, then

y )= Eame - m-

ame(x

2

=ea(K+M+1))Ea e(a(K+m) q

m

2

m

q

Let

(a)

Eame

q

a(K+m) q

m

which depends only on a/q modulo one. We can now write (5.6.5) as a

Ir 4

2

E E IS(Iq 5-MEIamI2, q=1 amodq 16

(5.6.6)

m

(a,q)= 1

a form of the large sieve familiar to number theorists.

Let R = E am be the number of non-zero terms in S(a/q). Then using Ramanujan's sum (Lemma 1.4.1) we have

S(a)=Eam a mod q

(a,q)=1

q

m

E e\ a mod q

a(K+ m) q

(a,q)=1

= mLam clqE c/ cl(k+m)

(q)= Fla. µ(q)=p p(q)R, m

Analytic lemmas

124

since the only integer c which is a factor both of q and of K + m is one. By Cauchy's inequality (this corresponds to Linnik's dispersion step) we have

F,

p.2(q)R2 5 cp(q)

a mod q

(a,q)=1

Dividing by cp(q) and substituting in (5.6.6), we have 2

R

µ2(q)

ir4MR

cP(q)

16

F,

q--,Q

15 .6.7)

The last step is to estimate the sum in (5.6.7) from below. The Mobius function µ(q) is 0 unless q is a product of distinct primes, Pi, ... , p, say, and then

µ2(q)

1

1

r

cp(q)

cp(q)

Pi...P,

1

1

(1

Pu

1

)

r( II 1+-+ zR +... _ -,n Pi PI Pr 1 1

1

1

1

taken over all integers n whose prime factors are all taken from the set (Pi,. .. , p,). Hence

µ2(q) q ,Q

1

97(q) -

L

n

taken over all integers n for which the product of the distinct prime factors of

n is an integer < Q. Thus µ2(q) q

Q (q)

Q

1

nz

Q+1

dx x

= log(Q + 1)

iz log M.

(5.6.8)

We deduce the result with C = ir4/8. Second proof Similarly, if the vectors x(m) are 6-well spaced, then 4k

2

bne(x('n) y("))1 "I

n

5

k

(2) B(e) F1 (X;Y + 1).

We take k = 1, x(m) and y(") as in the first proof, and choose b by

µ(q) a M+ 1 9P(q)e(q(K+ 2 ))

(5.6.9)


125

where y(") is the fraction a/q. The points y(") are e-well spaced, so that

B(e) _

µ2(q) _

':

Q µ2(q) P(q)

q=1 amodq PT(q) (a,q)=1

Thus (5.6.9) gives 2

µ(q)

E F, m q=1 (P(q) amodq

e

7r 2M Q µ2(q)

r a(K+m)) q

5

1

(a,q)=1

16

E 1

cp(q)

(5.6.10)

which is Selberg's sieve inequality (with a worse constant). The sum over a is the Ramanujan sum again. In particular, if K + m has no prime factor 5 Q, then the sum over a is µ(q). The left-hand side of (5.6.10) is µ2(q)

> R 1

2

q'(q)

and we complete the argument using (5.6.8) as in the first proof.

The form of the large sieve in Lemma 5.6.6 is due to Bombieri and Iwaniec (1986a). The application in Lemma 5.6.7 follows Mathews (1973). Lemma 20.2.2, which bounds the number of times that an exponential sum is large, is also a sieve result in the widest sense. The other lemmas were either known

to van der Corput, or are later developments of his ideas. For example, Lemma 5.5.2, from Huxley (1994c), corresponds to the leading terms only of van der Corput's asymptotic formula (1935, 1936), without taking f(x) to be infinitely differentiable. There are many forms of Lemma 5.5.2, with different hypotheses about the function f(x), for example, Lemma 10 of Heath-Brown

(1983), Lemma 2 of Karatsuba (1987), and Lemma 3.4 of Graham and Kolesnik (1991). The more specialized Lemmas 10.6.1, 10.7.1 and 21.4.3 are also of van der Corput type.

6 Mean value results 6.1 THE SIMPLE EXPONENTIAL SUM The idea in this chapter is to investigate a sum S by introducing a real parameter x, and then evaluating E2 = f 1IS(x)I2 dx. 0

Apart from its own interest, the integral has three uses. 1. Limitation results. Since IS(x)I >_ E for some value of x, any uniform upper

bound for IS(x)I must be greater than E. These results are called omega

theorems from the notation F = O(G) for `F does not have order of magnitude strictly smaller than G'. 2. Upper bounds. Sometimes there is a mean-to-max argument, using special properties of S(x), as in the Tauberian theorems of convergence theory. 3. Large values results. If x1, ... , xR is a well-spaced set of points at which IS(x,)I is large, then R itself cannot be too large. The large sieve (Lemma 5.6.6) is often used in this way. Theorem 6.1.1 Let f(x) be a real function with R continuous derivatives for M 5 x 5 2M, for which there is a size parameter T and a constant C >_ 1 such that If (')(x)I 5 CrT/M' (6.1.1) and

if W(x)I z T/CSMS

(6.1.2)

hold for certain values of r and s. Then f2 I

ZM1

2

dt=M+O(CM2

L e(tf(m))

Tg M

M

provided that R Z 1 and (6.1.2) holds with s = 1, and 1 f1

2M-1

2

E e(f(m +x)) dx=M+O

C2M2 log M C9M4 T

+

T2

,

M

provided that R >_ 3 and (6.1.1), (6.1.2) hold with r = 2,3 and s = 2, and f (3)(X)

The simple exponential sum

127

does not change sign. The term O(M4/T 2) in the second result may be improved if (6.1.1) holds for larger values of r.

Proof We have

f

2

12Mf 1

2

f2

dt =

e(t f (m)) M

e(t(f (m) - f (n))) dt m

M+

n 1

m

O n

f(m)f(n)I

m#n

by the First Derivative Test (Lemma 5.1.2). Since

f (m) - f (n) = (m - n)f'(e ) for some

between m and n, the error term must be
IkI, then we treat y as the independent variable, and argue similarly. We split the range at Mat and M)31, where u =Mat corresponds to the tangent angle +yo it/3, and u = M/31 corresponds to the tangent angle 1 = 7r/3. On the range Mf31 to M/3 we use various forms of the first derivative test. Since we have F" = 1/p cos3 41,

F' = tan 41,

3 sin 4-

F(3)

pcos54

dp

1

p2cos341

dcr'

integrating by parts introduces improper integrals as 41 tends to it/2. We split the range of integration at u = M/32, corresponding to 41 = 4/2 with

cos +lr2 =1/ (kM) . Then, by the First Derivative Test (Lemma 5.1.2),

fMp e(kMF(t/M) + ht) Mpg

21rik

dt
M,

f1DM(u, v) du dv >

provided that R z 4. The implied constants depend on the curve C.

Proof We drop the suffix M. By Parseval's formula for Fourier series we deduce that

f111 0

0

(N(u, v) -AM2)2 du dv =

F, F, Ia(h,k)12 h

k

(h,k)#(0,0)

(R -1)2NT.

(7.2.6)

As we shall see, R gives the usual order of magnitude of the denominator q when we approximate the real number f" (x)/2 by a rational number a/q. From (7.2.4) and (7.2.6) we have R2 >- N. (7.2.7) The Bombieri-Iwaniec method improves the classical bounds for exponential sums when R N. (7.2.8) We assume that M and T are sufficiently large (in terms of the constants Cr)

for R and N to exist and be large themselves. It is convenient to express orders of magnitude in terms of M, N, and R instead of M, N, and T. For

(1-SW Sx5(2+8)M we have N4f (4)(x) - 1 be a real number. In the notation of Lemma 7.4.2, we have Y)1)2

E(1) e(-x(h) 1

a/q

ABH5 NR2V(1
> 1 for i = 1, 2, 3, whilst X4Y4 = 2r (2 H)

QN

VQ27

We ought to treat the range Q >> N, the flanks of the major arcs, separately, taking out the term in h1/2 in (7.4.1) by partial summation (Lemma 5.2.1),

and applying the large sieve to three-dimensional vectors. However, the largest contribution comes from ranges with Q x R. We have 4

F1 (X;Y+1)

«l1+

Q1

f 4

1


158

Lemma 5.6.6 allows us two coefficient sequences. We take the coefficients on the vectors x(h) to be 1 in case 1, and 1 or 0 in cases 0 and 2, according to whether the vector x(h) occurs in the sum E('). We take the coefficients on the y vectors to have unit modulus, and argument opposite to that of the sum

r(l) e(-x(h) .y), h

so that we get the sum of moduli. We have

5i 2Zr and

M'

N3 Zr

< < 17T

1

1

X

(7.6.9)

.v . I

I µgN3I

3g µg3zrl

We have

r- K

-h(N3)=(qzr-N3)h"(6)^µgN3zr-

q

N

q

for some 6 between N3 and qzr. Hence (7.6.9) implies

r - K - qh'(N3) e(hg(n)) H N,

H3 N3

where

H)I

HN

HN

C«I1+HII+HN)+HN«1. II

13

llllllway

Taking moduli in this than

means that we cannot expect any estimate better

M

N

HM (HN)
) form a coincident pair if their corresponding entries differ by numerically less than 1

1

1

FK

8KLV' 4L (2K) ' 4L ' 8L The result is still true for a family of sums Si of the form (8.1.1), taken over subintervals (depending on i) of H< h < 2H, and of M < m < 2M, formed with functions fi(x) satisfying (8.1.3) uniformly in i, when we replace b(j) by Ni, j), y(J) by y('°>), and the sum over j by a sum over i and j. The sequence of rational numbers aj/qj corresponding to the minor arcs may also depend on i.

proof This is Lemma 5.6.6 with

X= (8KLV, 4L (2K) , 8L,16L/V), and 1

Y= 1,2max

(µq3)

1

,1,max

.

(l-+-q3)

we can take Y = Y3 = 1 since the rational numbers a/q and ab/q are only defined modulo one. By (8.1.3) and (8.1.4) we have

2) 3)
-1/C, IF(')(x)I 5 Cr, (9.1.2) x 5 2, the upper bound holding for r = 2, 3, 4, 5, and the lower bound

Exponential sums with a difference

186

for r = 2, 3 always, and for r = 1 if M>> FT. We write d1 for d/dx and f1 for d1 f, etc., so that (9.1.2) gives Idi-1f(x,Y)I< C Mr

Id,-1f(x,Y)I < 2MT

r

for M+ 2H - 1 <x < 2M - 2H and the corresponding values of r. We suppose that M is sufficiently large, and that integers N and R can be chosen with 15 R :!g H:!' N < M to satisfy 2R2NT >-

C3M3

> 2(R -1)2NT,

(9.1.3)

2C3{H + 1 2K - 1, and has a pole of order 2 or 3 at s = 2K - 1. The pole has order 2 for forms of even weight and congruence level one.

Corollary

When the pole has order 2, then there is a constant B2 with L

1=1

I b(l)4

lzk- 1 5 BZ (log2L)z

for any positive integer L.

Either Lemma 10.3.2 or 10.3.3 can be used to improve the smoothing error in Lemma 10.3.1.

10.4 WILTON SUMMATION ON THE FAREY ARCS We apply the truncated Wilton summation formula (Lemma 10.2.6), and evaluate the resulting exponential integrals.

Wilton summation on the Farey arcs

211

Lemma 10.4.1 (Explicit Wilton summation) We have (when f" (x) > 0) M2

E wj(m)b(m)e(f(m)) j=1 m=M

_ E 1F, F, b(l)el - 8111 wj(xj + O BMk/2

I MZRZ

N3

)hil

M

(x, )e(gj± (xj ))

11/(2s-2)

+ RZ

)(logM)12),

)

(10.4.1)

where xji and x,-.I are the stationary phase points satisfying

f'(x)=a± 1 (I), qj qj V x

(10.4.2)

the functions g j W, g,-., ( x) are given by

ax

2

qj

qj

(IX)

g (x)=f(x)- '--+-

+8+8,

the weight functions hit (x), h7 (x) are given by

(x/l

)ck-1)/2

(f" (x) ± (l/x3)1/2/2qj)

(4lxgf )1/4

and the length of the reflected sum K1 is the smallest integer with

k1+1 _ 18M/R2. Proof First we fix j, and write a/q for aj/qj. We apply the truncated Wilton summation formula with a(m), b(l) replaced by

b(m)e(am/q), q-kb(l)e(-al/q) as in Lemma 10.2.3. The size of the cut-off K depends on the derivatives ax

dxr

wj(x)e f(x) - q

which are sums of terms involving r,

q

with

ro+r1 +2r2+

+tr1=r.

We have w,(ro)(x) , and

uj =

f'(u12)=ai/q1,

x+-o,

and A0,..., A4 are constants constructed from the constants C1.

Proof Using Lemma 5.6.1, we bound the square on the left of (10.5.10) by 2L-1 lb(l)12

lkh,(l)ht(1)e(gi(l) -ge(l))

1k

1

t

The first factor is O(B2 log L) by Lemma 10.2.1. We estimate the sum over 1 in the second factor using Lemmas 5.1.1 and 5.4.3. The weight function

H(l) = lkhi(l)h1(l) is allowed to depend on i and j. We need estimates for the size and total

variation of H(l) that are uniform in i and j. The bound for dxj,/dl in Lemma 10.5.1 shows that the weight function H(1) is f times a slowly varying function of 1, with

H(1)^

Mk 1NR2 Q

l M1 ,

and the total variation of H(l) has the same order of magnitude. After Lemma 5.1.1, we consider sums e(g,(1) - g,(1))

(10.5.15)

A large sieve on the Farey arcs

217

over subintervals of the range L < I < 2L. We have d ge(l)

dl

_ - gjV(x1l) -

-

and 1

xj

2qj

(13)

d2

dl2

ge(l)

qj

)

dx

lxj, 1

dl

1

1

2qj

,

By Lemma 10.5.1 we have kdl

a, 1 uj gi(1>+- -+-+

d

qj f

dl

32C2 NR

+q

71

2

(10.5.16)

15MQ2

A5N2R4

1

2ql

q1

aj

uj

1

ge(l) + q,

a uif, (ui)

2

MaQs

0; O(NE) solutions;

x log N solutions.

In practice, the nature of the problem seems to change as soon as we add an inequality. The algebraic geometry becomes less relevant. It may be that most solutions of the inequality satisfy not just the given inequality, but a

corresponding equation, and they lie on an algebraic variety of smaller dimension. Many divide-and-conquer arguments go from an inequality to an

equation in this way. For the equations (11.1.1) - (11.1.4), if a and 0 are small enough, then we expect the diagonal solutions to predominate; these are the solutions with g1,. .. , g, equal to hl,..., h, in some order. We start with two variables. Let d(m) denote the divisor function as in Section 2.1 of Chapter 2, the number of ways of writing m as a product of two positive integers. Lemma 11.1.1 (Binary quadratic forms) Let n be a positive integer. Then the number of pairs of integers x, y (of either sign) with x2 - y2 = n is 2d(n) if n is odd, 2d(n/4) if n is a multiple of four, and zero if n is twice an odd integer. The number of pairs of integers x, y with x2 - xy +y2 =n is

6 E X(d), din

where X(d) is the Dirichlet character mod 3 taking the values X(d) = 0, ± 1 according to X(d) = d (mod 3). Corollary If a and b are integers with a2 < 3b, then the number of triples of integers hl, h2, h3 (of either sign) with

hl+h2+h3=a, h; +h2 +h3 =b is

(d

(6-3X(a2))

XI deven,

(11.1.6)

dl(3b-a2)

Proof If x2 - y2 = n, then x + y = e, x - y = d for some integers d and e of the same sign, and either d or e both even, or d and e both odd. Conversely, each such pair of d and e gives a solution. When n is odd, then any pair of integers with de = n gives a solution. When n is even, then if de = n, either d or e must be even. If n is twice an odd number, then the other factor must be

Preparation: divisor functions

237

odd, and we cannot construct a solution. If n is a multiple of four, then any pair of even integers d and e with de = n gives a solution. The equation x2 - xy + y2 = n is treated in the same way in principle, but the factorization is x + py = 6, x + p2y = e, where p is a complex cube root

of one, with p2 =1- p, and 6 and a are algebraic integers of the form a + bp, where a and b are integers (see Hardy and Wright 1960, Chapter 12).

The lemma contains the theorem that every prime number of the form 6m + 1 can be written as x2 - xy +y2. For the corollary we eliminate h3 to get

2h; + 2h2 + 2h1h2 - 2ah1- 2ah2 = b - a2, r and we note that the substitution

x=h1+2h2-a,

y=h2-h1

(11.1.7)

makes

x2 - xy +y2 = 3hi + 3h2 + 3h1h2 - 3ah1 - 3ah2 + a2 = (3b - a2)/2. (11.1.8)

Conversely, if x and y satisfy (11.1.8), and if a is a multiple of three, then (x +y)2 is a multiple of three, and so is x +y, and we get integer values for h1 and h2 from (11.1.7). If a is not a multiple of three, then

2(x +y)2 - -a2 = 2a2(mod3). Half the solutions have x + y = a, and the other half have x + y = -a. Only the solutions with x + y = - a give integer values for h and h2 in (11.1.7). 1

We deduce the formula (11.1.6).

In the next few lemmas we see that the sum over d in Lemma 11.1.1 is not too large, especially if we can average over n. Thus when h4, .... h 91, ... , gr have been chosen in (11.1.1) and (11.1.3), then the last three variables take

care of themselves. We need the more general divisor function d,(n), the number of ways of writing the positive integer n as a product of r positive integers. We give the standard bounds. The averages in Lemma 11.1.3 may be replaced by asymptotic formulae. The leading terms in the asymptotic formu-

lae may be obtained by elementary arguments as in Lemma 11.1.3, or by counting lattice points as in Chapter 2, or by using the Dirichlet series generating function

d,(n) ns

Lemma 11.1.2 (Bounds for divisor functions) We have

d,(mn) 5 d,(m)d,(n),

(11.1.9)

with equality if the highest common factor (m, n) is one, and

d,(n)ds(n) S d,s(n).

(11.1.10)

The ruled surface method

238

For any S > 0, there is a constant B(r, 8) with

dr(n) SB(r, 6)NS.

(11.1.11)

Proof If m = k1k2 ... kr,

and n = 1112 ... lr, then mn = g1g2 ... qr, with q1= k,!1. If (m, n) = 1, then q, determines k. _ (q,, m) and 1, = (q,, n). If (m, n) > 1, then there are several sets of factors k1,.. . , kr and 11, ... , Ir which give the same products q1, ... , qr. This proves (11.1.9). If n = k1k2 ... kr = 1112 ... ls, then we construct rs numbers q1j whose product is n as follows. For any factor e of n, we put ml(e) = (k1, e), and, for i > 2, m1(e) = (k1k2 ... k1, e)/(k1k2 ... k;-1, e). Then q,1 = m,(11), and, for j > 2, q,1= m1(1112 ...11)/m,(1112 ... 11_ 1).

The fraction q,1 has the form

(kk',Ii')(k,1) (k, ll')(kk',1)

with k=klk2

k1_1, k' =k1, 1=1112

1i_1, 1' =1,. Let d=(k,l), k=du,

1= dv. Then (u, v) =1 and

(uk', vl') (uk', vl') q11= (u, vl')(uk', v) (u, l')(k', v) ' which is an integer. It is easy to check that the rs numbers q,, determine

kl,...,kr and When n is expressed as a product of powers of distinct primes

n=pSI...pkk, then

dr(n) = fl dr(p,'). For p a prime, dr(ps) is the same as the number of ways of typing a sentence

of s words on paper with r lines. This is the number of ways of typing r + s - 1 things, each either a word or a carriage return, with s words and r - 1 carriage returns. Thus d"(ps) _

(r+s - 1! Y1H--, we have r )=1

r

($t - h))=

rj (x2 - y?) = O(?1Hr). )=1

Interchanging the roles of x; and y) gives the corresponding inequality with C] g, and h) interchanged.

We can now deal with the case r = 4. This account is based on Watt (1989a). We give the simpler argument which loses a factor H. The proof has a divide-and-conquer structure. We divide the integer solutions into families, and count the number of families which have many solutions. This brings in another important idea, the Riesz interchange, which is the discrete analogue of the transformation

f13If(x)Idx= f/3 a

suplfxI flf(x)Idydx= fy=o

x=a y=o

f

dx dy,

where S(y) is the set of values of x for which I f (x)I ? y. This idea is useful when we can describe the set S(y). Theorem 11.2.5 The number of solutions of (11.1.1), (11.1.2), and (11.1.3) with r = 4 in the range (11.1.5) is

O(H4 + 6Hs+E) for any e > 0. The implied constant depends on e.

Proof By Lemma 11.2.3, the solutions fall into families. There are O(H4) solutions belonging to the families that contain at most 768 solutions.

Families of solutions

247

Families with more than 768 solutions are divided into blocks according into which range L to 2L - 1 the number of solutions falls, where L = 21 is some power of 2. We note that there are at most 24H4 trivial solutions in which 91, ... 194 form a permutation of h1, ... , h4. If (after renumbering) we have g4 = h4 and g3 = h3, then the solution must be trivial, since

gi

+g2 hi + h2, =

g1 + 92 = h1 + h2,

and so g1g2 = h1h2, and

(h1 -g1)(h1 -g2) = h; - (h, +h2)h1 0. Then either h1= g1 (whence h2 =g2), or h1=g2 (whence h2 =g1). If one solution in a family is trivial, then all solutions are trivial. Hence we can refer to trivial or non-trivial families. We estimate the number of non-trivial families with at least L solutions of

(11.1.3). We can normalize the family (renumbering if necessary) so that 0,g,z0,h4=0. By Lemma 11.2.4 with r = 4, 91929394

(H-r)r ways, and then have g1, ... , gr a permutation of h1, ... , hr in r! ways; these are diagonal solutions. For the second bound, we divide four-dimensional space into boxes B,,, and we let n; be the number of vectors x(h) (in the notation of Lemma 7.4.2) in

each box. Two vectors (distinct or not) falling into the same box give a solution of (11.1.1)-(11.1.5). Let I be the number of boxes. Then Cauchy's inequality gives (Ln,)z

0, with constant depending on e. Proof Equation (11.1.1) implies (11.3.1) with r= 1, and (11.1.1) and (11.1.3) together imply (11.3.1) with r = 2. Since 5

(x - a1)...(a - a5) =x5 + E (-1)rxs-r(a1,..., a5), r= 1

then we have

I(x-h1)...(x-h5) - (x-g1)...(x-g5)I S7SH5 for HS x < 2H. In particular 1(g; -hI)...(g; - hs)I-_ 1,

We put

f3=(b+K)/q,

We suppose that U >> H, since the case U > 1/H4 or Ig(4)(x)I >> 1/H5 at each point of J. Suppose that J has length L. In both cases we have Ig(3)(x)I >>L/H5, I g" (x)I >>L2/H5 somewhere on J. The length of J is O( (o-K1Y5) ),

which tends to zero as promised. We estimate trivially on all intervals whose length is O(H1/2U1/6). This leaves O(log H) long intervals to consider. We apply Poisson summation (Lemma 5.4.6) to get ah2 + bh e

+g(h)

q

G(a, b - r; q) qA-rSgB+-',

f e(g(x) - rx) dx !

q

log(gB-qA+2)), with

qB+qA 1,

(12.3.26)

then N10(6, A) 0 is arbitrary, but the implied constants depend on e.

Proof We fit Lemmas 12.1.2, 12.2.2, and 12.3.2 together to estimate the integral in (12.1.2). We have

0 H'

1 N10(2U«

U(Ql/4 +

U4/3

Hs+e(1+

0

N

For

H

k

+

k

V`-

1

2 k; +

J

k; + kj

1

2k

(k;kj)

2

so that d

2dt

1

logo

1

(klk4) + (

(k1k3)

k2 - k1)( k3 + (k1k2k3k4)

1

1

(k2k4)

(k2k3)

k4 ) >> IM1-M21 z

K

Hence within a fixed family, the values of t for which (13.1.3) holds form an interval of length ,K4 O I m1(m1 - m2)(m1- m3)I

which gives the bound (13.1.12). Similarly considering the values of 14/12 leads to the bound (13.1.11).

0

The First Spacing Problem for the double sum

276

Lemma 13.1.4 (Solutions of all four equations) Suppose that k1,. .. , k4, ll, ... ,14 satisfy (13.1.1)-(13.1.5). Define 0 and .p by

kl +12 k2 -13 k3 -14 k4 = OLFK,

ll

11/ kl + 12/ k2 - l3/ k3 -14/ k4 = cpLFK (so that 0 and cp are 0(17)). Then

max rjIki-kjl P, I ql >- Q.

O

14.2 MAGIC MATRICES One of Bombieri and Iwaniec's deepest insights was that the first condition (14.1.1) was linear in the action of the modular group on primitive integer vectors, described in Section 1.3 of the first chapter. To simplify the notation, we write r/q = l a/ql. Lemma 14.2.1 (The magic matrix) in their lowest teens with r

Suppose that r/q and r'/q' are rationals r' 0, then since

neither k/m nor (l - k)/(n - m) lies in J, but I/n, which is in J, lies between these two rationals, then we have

1-k

k

2SA<m -- n-m

1

m(n-m)'

so that SAmn < 1;

this inequality holds trivially if m = 0. We rewrite (14.2.6) as r

r'

1

9 , q, _ - + O(6A). n

(14.2.12)

The First and Second Conditions

294

Then (14.2.2) and (14.2.3) give

an - c1= n9+ O(SAnK) = O(SAnK), do + c1= n/O + O(SAnK) = O(SAnK). Hence by (14.2.11)

m

ce

ce

n

n

n

am - ck= -(an-cl)- - =m6- - +O(SAKm), and similarly ce

ck+dm =m/9+ - +O(SAKm). n

The magic matrix is conjugate to

d')

(c'

n)-1(c

=(k

l n1

(m

d)

akn + bmn - ckl - dim - akm - dm2 + ck2 + dkm

aln + bn2 - c12 - din

-aim - bmn + ckl + dkn

1

(an - cl)(ck + dm) - mn

c

-(am-ck)(ck+dm)+m2

(an-cl)(cl+dn)-n2 -(am-ck)(cl+dm)+mn

Here mn

a'

c

SAnK

+O

Icl
Q H1H12-H11H2

10_3

(F111)

A family of sums

305

In the construction of Chapter 10 for P5 Q M F112 H H12 H11H2 4T2 x3(F11)" 1

and in the case P > Q 1

4M

H1H12 -H11H2

D3 x3(F11)3

Lemma 14.4.3 (Coincidences from a family of sums) Suppose that the family of sums is formed with functions F(x, y,) which satisfy (14.4.1) and the conditions of Lemmas 14.3.1, 14.3.5, and 14.4.1. Then the number of coincidences in the First and Second Conditions among minor arc vectors from the whole family

of sums withP> 1 and A2 PQ >> 1 by Lemma 14.1.1.

In Lemma 10.6.5 we need the maximum number of coincidences involving a particular minor arc. Lemma 14.4.4 (Coincidences with a fixed minor arc) Under the hypotheses of Lemma 14.4.3, the number of coincidences involving a fixed minor arc with

P> N/M (15.2.8) by the condition N3 rQ'

t

We can rewrite (15.3.11) as

a'u

6'

v

t + R, - t 5 Q' for some integer v, where 6' «D4. We have set up the conditions of Theorem 4.3.3 with parameters a', (3',

6', M', N', and R'. Either R' ::g max(86'Q',1285'M'Q'2) + 1, (15.3.13) which is impossible if G(u/t) is less than the first term in the minimum in (15.3.1) of the lemma with B3 sufficiently large, or else all the solutions lie on a rational line ey = cx + d, with c, d, and e integers with no common factor, and l

ea

86'Q' rG « N2, - c sR'-1

(15 . 3 . 14)

l

and

1e13-dl
> 1 is (16.2.5)

SPQ >> k.

Lemma 16.2.1 (Type 1 matrices acting on congruence families) For a congruence family of sums of the form (16.2.3) or (16.2.4), satisfying (16.2.5), the number of coincidences between pairs of minor arc vectors indexed by rational

numbers a/gwith PS al > 1. Choosing Q = 256r + 1 gives r >> N, contradicting the assumption r < NIB, if the constant B1 has been chosen sufficiently large. Hence the points do lie

on a straight line, which we can write as Ix + my + n = 0 with integer coefficients satisfying m - 5 different minor arcs, then we replace (R2 + rs)2 by R4/L2.

Proof For an upper triangular matrix (

I

B), the Second Condition in the

form (16.3.12) gives (aq)

h "(a)

=3

µ

-

1

µ

2 3

(f l3)(g(a/q +B)) 4B

=-3

f (3)(g(a/q))

f (4)(g(g)) (f(3)(g(, )))3

for some q. Upper triangular matrices occur only for M < FT, when we have assumed that IF(4)(x)I is non-zero, so that /

h"(allxA=IBIM2R 4

(16.3.14) q

The Third and Fourth Conditions

346

Lower triangular matrices

('

° ) only occur for M >- FT, when we have

assumed that 3(F(3))2 - F" F(4) is non-zero, and a similar calculation gives

xA=ICIM.

(16.3.15)

We have set up the conditions of Lemma 4.2.1, with h(x) in place of f(x),O(A4) in place of 3, and the distance A between the rational points a;/q; satisfying

A >> min(1/R2,1/rs). We have Ik(1) +nl > L/R2, and in (4.1.14) of Lemma 4.1.5 we can take both factors to be of size >> L/R2, so that

L2/R4 «04/i Q, and we have

A4/Q «A4R4/OLZQ2, and we still omit the term 04/Q. As in Chapter 15, we shall apply these lemmas with either e/r or f/s as the rational number of smallest denominator that is a value of f" (x)/2 on the minor arc. In the second case, e/r is the rational number of second

smallest denominator, and (e - f)/(r - s) is outside the minor arc, but adjacent to f/s in some Farey sequence. In both cases we find that rs 5 3R2.

The construction of Chapter 8 gives corresponding results, but with different derivatives and factorials in (16.3.1) and (16.3.3), and, for example, O5 _ 1 for M>> Ts/1z The conditions (17.1.11) ensure that (17.1.18), (17.1.19), and the conditions of Chapter 7 can be satisfied by a suitable choice of N and R in the ranges (17.1.10). For M > T'/ 12, we take V = 1, and choose N and R so that 1 x O1P2 x M2/N4, which puts N and R into the ranges (17.1.16). The conditions (17.1.17) ensure that (17.1.18), (17.1.19) and the conditions of Chapter 7 can be satisfied by a suitable choice of N and R in the ranges (17.1.16).

We can now substitute the estimates for A and B into Lemma 7.4.3, then the result into Lemma 7.4.2, and add up the cases. A complication is that the bound for A5 changes when Q x N1- E, and that we must change the bound for B from Lemma 16.1.3, valid for Q Q,, then a valid upper bound is given by the order of magnitude for Q x R, multiplied by R'".

Proof In Lemma 7.2.2, the largest power of Q which occurs in W(Q) is Q-1. Hence in Lemma 7.4.2 there is a factor q-3r/2-1+s outside the expression from the large sieve. When Modification 2 operates, then we can insert a

factor R2/Q2, since the sum over a/q contains >> Q2/R2 approximating polynomials corresponding to each minor arc. In the expression ABH5NR2V 3

Q

(17.1.24)


353

of Lemma 7.4.3, we can ignore the factor 1 + Q/N for small Q. Since H NQ/R2, and the largest exponent of H in the bound for A is 2r - 3 + E, then the largest exponent of Q in (17.1.24) is 2r - 1 +a+ e for Q > N. We can now work out the largest exponent of Q in Lemma 7.4.2 as a + E

2

-$-

r + 7

2

for Q > N. Hence a positive choice of 6 is possible under the conditions given. If the upper bound for ABV has a discontinuity at Q0Yby a factor Qo, then we define an exponent 0 by (Qo/R)B ^ Q'7,

and insert an extra factor (Q/R)B into (17.1.24). We can still choose a if (17.1.23) holds: otherwise we use the larger estimate with the factor Q'I over the whole range Q >> R.

We can now give bounds for the simple exponential sum S of Chapter 7. Further complications arise when we consider a family of sums Si. Theorem 17.1.4 Let F(x) be a function four times continuously differentiable on the interval 1 <x :!g 2, whose derivatives satisfy the following conditions:

I F(')(x)I 5 Cl

(17.1.25)

I F(')(x)I >_ 1/C1

(17.1.26)

for r = 3, 4,

for r = 3, where Cl is a positive constant. Let 6:5 1 be a positive real number, and let M2

e(TF(mM)),

S= ME

where M and M2 are positive integers, T is a large real number, and M <M2 < (1 + 5)M < 2M < T. Suppose that for some C2 >- 1 either case 1 or case 2 holds:

Case 1 M 5

and (17.1.26) holds for r = 4 also;

Case 2 M >- C21 FT and (17.1.25) and (17.1.26) holds for r = 2 also and IF" (x)F(4)(x) for some positive constant C3.

-

3(F(')(X))21

>- C3

(17.1.27)

Exponential sum theorems

354

If M is sufficiently large in terms of C1, then we have S _ 2, C5 (which may depend on S), a and /3 such that, for a class C of functions closed under the addition of linear terms and under linear changes of variable, we have in the notation of Theorem 17.1.4 ISI C4T ", we have

ISI52C5MTa-a.

Exponential sums with a parameter

357

Proof For M5 T" we pick an integer q > 1 so that

gMST"

J

1/57

(T8

>> f 133

M2 T + (Mz + 7 Jlog M,

(17.2.5)

1

We also have, for M > M112T 1/6 >> M2/3.

Hence DR Z M113, and the estimate for the original sum using R differencing steps must be >> Ml- 1/3X28+'

For T large, we take R large, so we are far from the root mean square size

V.

Lemma 17.4.1 (Smoothing the differences) Let D, L, Q, and R be positive integers. Let A be a set of integers lying between L and 2L - 1. Suppose that for each 1 in A we are given R positive integers d1(l),... , dR(l) with

d1(l)d2(l) ... dR(l) =1, d1(l) + +dR(l) 0, then the conditions of Theorems 17.1.4 and 17.4.2 hold for each R. We write M = T a. The bounds take the form S _ 1). The classical notation states bounds as S _ 1 when (18.1.6) holds. When (18.1.6) is false, then we choose V= 1, which puts N and R into the ranges (18.1.9), satisfying (18.1.21), and also satisfying (18.1.18) in the range (18.1.11). With V chosen so that A1P2 x 1, the last term in (16.1.10) of Lemma 16.1.3

dominates the first term if 1 « L)2 24 3Q/P X Q213R4/H513M.

(18.1.22)

The extremal choice of N and R is (18.1.13), and V>_ I when (18.1.12) holds.

Exponential sums

375

When (18.1.12) is false, then we choose V= 1, which puts N and R into ranges (18.1.15), satisfying (18.1.22), and also (18.1.18) in the range (18.1.17). We have to satisfy (8.1.4)-(8.1.7), so that

R :s- H5 N/64C,

(18.1.23)

R >- 2C H _+1

(18.1.24)

,

N<M/10.

(18.1.25)

The requirement (18.1.23) gives the conditions (18.1.4), (18.1.8), (18.1.10), (18.1.14), and (18.1.16). We find that, up to a constant factor, (18.1.24) follows from (18.1.23) and the condition A1Q2 >> 1 which our construction always satisfies. The constant B2 is chosen large enough for (18.1.24) to hold with

the right constant of proportionality. Also (18.1.25) follows easily from (18.1.18) if M and T are sufficiently large. The most difficult verification is (18.1.18) for the choice (18.1.7), where we use both lower bounds (18.1.6) and (18.1.10) in different ranges. Similarly, both (18.1.12) and (18.1.14) are used to verify (18.1.18) for the range (18.1.13). 0 Theorem 18.1.2 Let F(x) be a real function three times continuously differen-

tiable for 1 <x5 2, and let g(x), G(x) be bounded functions of bounded variation on 1 5 x S 2. Let C0,. .. , C6 be real numbers z 1. Let H, M, and T be large parameters. Suppose that

IF(')(x)I < C,

(18.1.26)

forr= 1,2,3, IF(')(x)I

1/C,

(18.1.27)

for r = 1, 2, and that either Case 1 or Case 2 holds:

Case l M5 C0T 1I2 and (18.1.27) holds for r = 3 also, Case 2 M >- Co 'T 1/2 and IF'F(3) - 3F" 21 Z 1/C4.

Let S denote the sum

S= E g( H

H)

m=M

G(M)e(M

F(M)).

Then there are positive constants B1 and B2, depending on C0,. .. , C6, and on the functions g(x) and G(x), such that for M in the range CS 1T49/114 SM5 C5T651114

and H satisfying

H C-1 T 1/2. Let N(S) denote the number of solutions of T

(18.2.1)

m IIM F(M )IIS8

with M:5 m 5 2M - 1. Then in both cases/ (18.2.2)

N(S) 2, followed by a rigid motion. Then, for any embedding of E in the Euclidean plane, the number of integer points in E is M)3151146),

AM2 + O(IM46/73(log where I is a number depending on the curve C, but not on Moron the position or orientation of E.

Theorem 18.3.3 In Theorem 18.3.2, suppose that the pieces C, are four times differentiable, in the sense that p is twice continuously differentiable with respect

to the tangent angle 41. Suppose also that p is either constant or has a finite number of maxima and minima on each piece Ci, and that d log p/d 4' has a bounded number of maxima and minima between any two consecutive extreme values of p on Ci. Then we may take

I= E r' pk/73' i

k

Lattice points inside a closed curve

389

where the inner sum is over local maxima Pik (including endpoints, and counting constant values only once) of the radius of curvature on Ci, provided that M is so large that the bounds M > 1 / p and M40

63 Z 1+

(log M)"

1

P

2

dP

2 103

d/,)

1 1

hold on each curve C1.

Proof of Theorems 18.3.2 and 18.3.3 We have to choose local coordinates on each piece Ci so that the conditions of Theorems 18.2.1 and 18.2.2 apply. The new coordinates are u and v in Lemma 18.3.1, with

u = r (x cos /3 +y sin f3 ),

v=W(ycos/3-xsin so, for (x, y) on the curve MCi, we have dv

- = tan(4,- /3),

(18.3.7)

du

d2v

due d3v

du3

1

r1/epcos3(f-/3) 3sin(4r-/3) 1 dp rp2cos5(ap-p) rp3cos4(q-/3) dq, 3sin(4i- f3-A) rp2

ip - /3 )cos A '

(18.3.8)

(18.3.9)

where the angle A is defined by

dp 3p d+/ 1

tan A =

We also have

dv d3v - 3 (d2v 2 du du3 due

3cos(4,-/3 -A) rp2 cos5(r/l - /3 )cos A '

(18.3.10)

We prove Theorem 18.3.2 by a convexity argument. The values of 4 on a

particular piece C. of the curve form a closed interval K. For any 4 in K there is an open interval of values of f3 for which the expressions in (18.3.8) and (18.3.9) are bounded away from zero and infinity. We pick a(41) to be the midpoint of this interval. For fixed 4 = 4i0, we take f3 = a(410) and

consider (18.3.8) and (18.3.9) for this P. Since p and A are continuous functions of +/, there is an interval of values of 41 about 1//0, open relative to the interval K, on which neither expression (18.3.8) nor (18.3.9) is zero or infinite. We call this interval I( /3 ). Each 41 inK lies in its own open interval

Lattice points and area

390

I(a(t/i)). There is a finite subcollection I1,...,IR of these intervals which covers K; we suppose that the subcollection is minimal, and numbered in order, so that I, nIr+2 is empty, but I, fl Ir+ is non-empty. We form closed intervals Jr by taking the midpoint of I, fl I,+ as the common endpoint of J, and Jr+ 1. Then J1,..., JR are closed intervals, disjoint except for endpoints,

that cover K, each corresponding to some angle /3 = ar. Let Sr be the smallest distance from an endpoint of Jr to the corresponding endpoint of Ir. The derivatives in (18.3.8) and (18.3.9) are finite and non-zero for 0rEJr+[-6r/4,6r/41. 1 /3-arl 0, uniformly for MN)")1"(K+1)

N< T < MN(MANK(log

(18.5.7)

Then M2

R=

.

M

m

p(NF( M))
_ N >_ N4, and for N8 >_ N >_ Nlo if either of these ranges is non-empty, M(62Q-llq-33)/(62Q-2)N11/(62Q-2)(log N)6(5g2+6)/(31Q-1)

R
0, C > 0, and so t > 0. We see from Lemma 14.3.2 that t is bounded. We use Lemma 16.3.5 with rs «R2. If a coincidence extends over L z 1 consecutive minor arcs, then Lemma 13.4.2 allows us to replace A2 by O(A2/L2) in Lemma 14.3.2, so that t > FT and V chosen so that 1 > FT V.

Corollary A bound of the form (19.3.2) valid for a certain class of functions and a certain size range for (log N')/(log M') implies two other bounds of the same type, valid for related classes of curves and certain size ranges. In the first bound, the exponents (Ki, A,) are (1/13, 9/13), (1/5, 2/5), and two terms

2+3K+2A 14+23K+24A (22+33K+32A' 22+33K+32A)'

12-2K (23+3K+A' 22+3K+A 3+3K+A

corresponding to each term M'AN" in (19.3.2). In the second bound, the exponents (K;, A.) are (1/5, 7/15), (0, 3/4), and two teens

4+8K+9A 12+18K+17A 24+38K+39A'24+38K+39A)' corresponding to each term in (19.3.2).

1-K 13+3K+A

(11-K' 22-2K

Further results

420

Lemma 19.3.3 (Iteration step for the double sum) The choices of N in Lemma 19.3.1 give bounds for the sum S of Chapter 8:

S > fT .

Corollary A bound of the form (19.3.2) valid for a certain class of functions and a certain size range for (log N')/(log M') implies another bound of the same type, valid for related classes of curves and certain size ranges. The exponents (Ki, A.) are (1/6, 1/2), (1/2, 0), and two terms

1+2K+A 1+5K+6A (4+9K+9A' 4+9K+9A)'

3+4K+A

3-3K

(12-K-A' 12-K- A)

corresponding to each term M'AN" in (19.3.2).

The proofs of Lemmas 19.3.2 and 19.3.3 follow those of Theorems 17.1.4

and 18.1.2. The corollary to Lemma 19.3.2 uses Lemma 18.5.2, and the corollary to Lemma 19.3.3 follows the proof of Theorem 18.2.1. The two cases

of Lemma 19.3.3 give the same exponents in (19.3.2) with M' and N' interchanged.

The new exponents from the iteration are not very sensitive to the values of K and A, and are useful only for a certain range of (log N')/(log M'). In practice this range is a short interval of the longer range in which the bounds are valid. We start the iteration from the bounds of Table 18.1, except that we use Theorem 18.2.1 in place of Theorem 18.2.2 and Lemma 18.5.2 for the

range 83/635 a5 643/466. Moreover, we can use the analogue of Theorem 18.2.1 for a family of exponential sums, using t as the parameter. This is the only range in which we can use t as a parameter. The bounds lower down

Exponential sums with a large second derivative

421

Table 19.1 Bounds for integer points close to a curve Iterated

Steps

D DD DDDC

23

23

41

73

73

32 =

17

35

7

3

41

396348

79

79

17

17

32

309023

DCC

DC

69787

3118

6577

396348

49115058

146195

146195

14735

14735

309023

38213083 -

15603

40525

3143

8083

49115058

1590221

84191

84191

16837

16837

38213083

1229946 =

1251

3306

115

342

1590221

3799577

6829

6829

679

679

1229946

2859937 =

3118

6577

598

333

3799577

8366056

14735

14735

1589

1589

2859337

6185661 =

3143

8083

675

1571

8366056

269897

16837

16837

3363

3363

6185661

197842

342

4

43

269897

27

679

679

64

64

197842

19 =

598

333

29

62

27

1375774

1589

1589

140

140

19

939695 =

675

1571

4

43

1375774

44151

3365

3365

64

64

939695

29740 =

4

43

44151

19

64

64

29740

12

29

62

19

374

140

140

12

227

89

356

374

872

641

641

227

423 =

C

1.2812... 1 . 2826

...

1 .2853

...

1 .2929

...

1.32 88...

1

.3525

...

=1.3573.. .

115

CC

C

=

27548

DDCC

DDC

from

Exponents K, A

Range for (log N)/(log M) from to

1

.

4211 ...

1

.

4641 ...

1 . 4846

...

=1.5833...

=

1

.

6476 ...

2 0615 .

...

Table 18.1 already use Theorem 17.2.2, with a parameter from the differencing step or from the Fourier series for the row-of-teeth function. Tables 19.1 and 19.2 give some further bounds for integer points close to a curve and for exponential sums. In the first column D means the double sum, C the simple sum, and A the differencing step, so that DCC means Lemma 19.3.3 applied to the result of Lemma 19.3.2 applied to one of the bounds that uses Theorem 17.2.2, labelled C(1) in Table 18.1. As usual, rows bracketed together correspond to an upper bound with two or more terms. Estimates for exponential sums in other ranges can be obtained by the A (differencing)

Further results

422

Bounds for exponential sums

Table 19.2

Iterated

Steps

Exponent (3

C

from

Range for a = (log M)/(log T) from to

89 + 285a

106822

139817

570

246639

246639

= 0.5669...

2387 + 17972 a

115

342

1033325

106822

27290

679

679

2642746

246639 =

2819 + 19177«

598

CDC

0.4331...

333

699371

1033325

2642746 =

29855

1589

1589

1647930

11897 + 88442 a

675

1571

156527

699371

134680

3363

3363

370694

1647930

CCC

= 0.4244...

113 + 897a

4

43

263

156527

1345

64

64

638

370694

CC

CC

0.4288...

= 0.4222...

491 + 3624«

29

62

143

263

5530

140

140

349

638 -

569 + 1053a

29

62

307

143

2800

140

140

641

349

1273 + 2484«

89

356

68682

307

6410

641

641

171139

761

AC 1

0.4122...

= 0.4097.. . = 0.4034.. .

4 + 103a

12

68682

128

31

171139 =

29 + 173a

227

12

280

601

31 =

0.4011...

0.3871.. .

or B (reflection) steps as in Table 17.1. We get some improvement near the values of a for which the steps in Table 17.1 change from A'C to A'+ 1C. For most values of a, the saving obtained from the iteration is less than the saving from using Theorem 17.2.2 instead of Theorem 17.1.4 after a sequence

of differencing steps. The iteration works best for exponential sums with a = (log M)/(log T) near 5/12, where the third derivative is still small, but the second derivative has order of magnitude larger than one. This range of a provides the title for this section.

20 Sums with modular form coefficients 20.1

EXPONENTIAL SUMS

We obtain the main results of Jutila (1987a et seq.) for the sum M2

S = E b(m)e(f (m))

(20.1.1)

M

of Chapter 10, where E b(m)e(mz) is a cusp form of even weight k for the full modular group. Lemma 20.1.1 (Choosing parameter sizes) Suppose that the conditions of Lemma 14.3.1 hold. Then, in the notation of Lemma 10.5.1, we have R 13/2

B(01(V ), 02(V ), 03(V ), 04(V )) > MR2, and z

P

04(V) X VT

M LQ2

+ PQ

( M) >> i,3(V).

To avoid double suffices, we write yi for the value of the parameter on the Farey arc with index i. We have U.

-=1+O(03), ui and

f11(u3,Yi)

= 1 +O(D4).

f1i(uj ,Yy)

Since F12 and F112 are supposed bounded and non-zero, f11(u?,Yi)

f11(ui3, Yj)

1 XIYi -y,IX

f1(ui,Yi) z fl(u; ,Y;)

-1

Sums with modular form coefficients

428

Now

f1(ul,Y1) =fl(u),Yj) +b, so that bQ

i,Yi) P - fl(u),yj) .f1(uz

-1 «lyi-Yjl+L3

_ 1/C1

for r = 1, 2, where C1 is a positive constant. Let G(x) be four times continuously

differentiable for all x, and zero outside the range 1 5x 5 2. Let b(m) be the coefficients of a cusp form of even weight k for the modular group, and let 2M-1

/

M=M

`

S(t) = m b(m)GI M)el tF( )), `

/

where M is a positive integer. Let C2 be a positive constant, and let T be a real number with M5 C2T. Then f2TIS(t)I6

dt V. For each power of 2, V, we have

I(V)
N/M or N'/M). Then S(t) is larger by a factor VM_ than the sum we want, so when (19.1.9) of Theorem 19.1.3 holds, then the squared integral for each block is O(ff). Blocks with M small are estimated as in Theorem 18.2.3. There are O(log T) blocks, so Cauchy's inequality gives O(A log2 T) for the whole integral.

Much deeper arguments give the corresponding mean value theorem T

Jo

+(2y-1)T+E(T),

with an explicit estimate for the error term E(T). We use the machinery of Heath-Brown (1979), following Heath-Brown and Huxley (1990).

Lemma 21.3.2 (Truncated mean square expansion) Suppose that 15 A S T1/3. Let t - 21rm2

E

g(t) = 2

ms (t/2a)

(m/n)"

G(t) = 2 m

n

m 1

(mn) ilogm/n

exp

02 log2 m/n 4

the conditions of summation in G(t) being

0 < I log m/ni 5 (log T)/0, mn -K>0 on[-/3,/3], and for 25k5r+2 If (k)(x)I < /31-kIf'(x)I. Then

1

e(f(x))cosr

R R

-x

dx
1, p prime, A(m) = 0 if m is not a prime or a prime power. The basic property of these weights is E A(d) = log m. (22.1.4) dim

We use the prime number theorem in a weak form (see Davenport 1967 and Hardy and Wright 1960 for the two standard proofs). Lemma 22.1.1 (Prime number theorem) For P > 2 we have

E A(m) =P+O msP

P log P

Let I be a subinterval of 1 <x < 2, chosen so that no combination of derivatives of F(x) occurring in the proof vanishes on I. We work with the integers in the interval MI, the set of points of I multiplied by M; let the endpoints of MI be M1 and M2. Let E(2) denote a sum over integers m for which (22.1.2) has a solution. Then (2)

E A(m) _ MI

A(m)(f(m + 1) -f(m)) MI

+ L A(m)(p(-f(m + 1)) - p(-f(m))).

(22.1.5)

MI

By the prime number theorem and partial summation, the first term in (22.1.5) becomes

f(M2)-f(M1)+0 logM

(22.1.6)

.

On the left of (22.1.5) (2)

(2)

E A(m)

F, log p + OI

MI

r

A(m)(22.1.7) /

m52M,mnotprime

P,-Ml

in this chapter p is a variable of summation restricted to prime numbers. The second sum in (22.1.7) is

E E log P< p

rz2

log 2 M

log p

log P

p'S 2M

>

e(-hf(m)) - e(-hf(m + 1))

f(m+1)

2,irih

f(m)

e(-ht) dt

=e(-hf(m))

ff(m+1)-f(m)e(-ht)dt. 0

There is some constant c with

f(m + 1) -f(m) > Q, so that p >> M112. This can be achieved by interchanging q and r within each block of the sum if necessary. With the combinatorics of Section 22.1 we take z close to M112, so that w is bounded, to achieve the same effect. For p >> M 1/2, the necessary condition (22.3.10) holds for M
. Then (23.5.5) implies

a-2 a; ( A

1

6

m

H44

111-

za

+0 0+ M J

(23.5.7)

)

Fouvry and Iwaniec study (23.5.7) by exponential sums. We can also use Farey sequence methods. For fixed A, the fraction m2/m1 lies in a short

interval. We take e/r to be the rational number of least height in this interval, and let f/s, f'/s' be its neighbours in the Farey sequence .9(r). Now m2/m1 lies in one of two reference intervals. For m2/m1 between e/r

and f/s, we have m1= ru + st, m2 = eu + ft for some integers t, u as in Lemma 1.2.2, but the common factor (t, u) = (m1, m2)'may be greater than one. We can also write

A = (e +f9)/(r+s9), where 0 may be negative. Substituting in (23.5.7), we find that (t - Ouxru + st) lies in an interval whose length is proportional to A. This is an integer point

close to a curve problem. The trivial estimate O((6 + 1XL + 1)) (in the notation of Chapter 3) gives

O(OM2 +H4/M2 +M/r), and by Lemma 1.2.3, we have r _ 2 of Lemma 24.2.2 are elementary, but the case c = 1 depends on counting the number of points on an algebraic curve over finite fields.

The terms of the Kloosterman sum correspond to matrices

of the modular group, and Kloosterman sums occur in the Fourier coefficients of modular forms. Kuznietsov (1980) followed ideas of Selberg to express sums of Kloosterman sums explicitly in terms of the Fourier coefficients of modular forms.

Further ideas

482

Lemma 24.2.3 (Kuznietsov's trace formula) For a, b positive integers, g(x) a continuous piecewise differentiable function of compact support q 2

(ab))

g(2

(ab)

a )kk-1/2

1)k

°°

+k=1 E

21T

b (-)

K(a, b; q) + Sab f J.

x

g(x) dx x

2

(cb(G2k a) - Sab)(2k -1) f J2k-1

4 (ab) bt(a)bt(b)g(K,)

c

1

+

cosh in.1

j=1

(2)

x)

g(x) dx

T;t(a)Tj,(b)g(t)

?r J- (1 + 2it)( 1 - 2it)

dt,

where G2k,a is a modular form of weight 2k, the ath Poincare series, and c is its nth Fourier coefficient (suitably normalized), b,(n) is the nth Fourier coefficient of the jth non-holomorphic Maass wave form (suitably normalized), and a + Kj is the corresponding eigenvalue, and Ta(n) is a normalized divisor function

F, E

T.(n)

d

()a,

e

de=n

Sab is 1 for a = b, 0 otherwise, and (x2

1T i

g(t)= 2sinhirt f (J2it(2)

where JS(y) is the Bessel function. In the same notation q 2 vV-Wab)

-

g

2

K(-a, b; q)

(ab)

4 (ab) b,(a)b,(b)g(Kt) j=1

cosh?rK,

T,t(a)T;t(b)g(t)

1

+ 1rL

dt,

with

g(t)= 2 cosh1rt f Kh2it(x)g(x)dx, where

Khf(y) _

2

W

fo exp(-y cosh t)cosh st dt

IT

1

=

1

I

exp(-2(' +u))u-s

du

is a less familiar Bessel function, the Hankel function of pure imaginary argument (Jeffreys and Jeffreys 1962, cap. 21).

Subdivision without absolute values

483

The two transforms g(t) and g(t) are analogous, since we can express Khs(y) in terms of 13(y) - I_S(y). The Kloosterman sum is a discrete analogue of the definition of Bessel functions of integral order as integrals round the unit circle. We call Lemma 24.2.3 a trace formula, as the right-hand side can be regarded as the normalized trace of an intertwining operator in representation theory. The lemma is proved using the Fourier theory of functions on the hyperbolic plane invariant under the action of the modular group PSL(2, 7L)

as rigid motions, and the Poincare series coefficients enter as residues at trivial poles.

Bombieri and Iwaniec (1986a) suggest that a full solution of the Second Spacing Problem will involve Lemma 24.2.3. We have calculated directly with

the matrices rather than with their Fourier theory. There are three integers a, b, and q in the Coincidence Conditions of Part IV. Perhaps 3 X 3 matrices will be needed. The representation theory of SL(3, 71) is still being investigated.

Are we on the right road to showing that the discrepancy of a smooth closed curve of linear dimensions M is O(MI/z+E)? Subdividing the curve according to rational gradients must be right. Perhaps the number of Farey

arcs should be ME, and we take about 1/e terms of the Taylor series. Alternatively, the y vectors may be so uniformly distributed that the contri-

butions of different minor arcs cancel. The Kuznietsov trace formula is appropriate here because SL(2, 7L) is the automorphism group of the lattice

points, a natural part of the problem. Should the representation theory of SL(n,7L) for n >_ 3 appear in a two-dimensional problem? In all events, we have reached the final formula: Ìf you want any more, you must sing it yourself.

References The numbers flush right following a reference are the chapter or section numbers in which the reference is cited, or to which it is related. Atkin, A. O. L. and Lehner, J. (1970). Hecke operators on 170(m). Math. Annalen, 185, 134-60. 10,20 Atkinson, F. V. (1949). The mean value of the Riemann zeta function. Acta Math., 81, 10, 21.5 353-76. Baker, R. C. (1986). Diophantine inequalities. Oxford University Press. 5

Baker, R. C. and Harman, G. (1991). On the distribution of apk modulo one. Mathematica, 38, 170-84.

23.3

Baker, R. C., Harman, G., and Rivat, J. (1994). The Piatecki-Shapiro Theorem. J. Number Theory, 50, 261-77. 22 Balasubramian, R. (1978). An improvement of a theorem of Titchmarsh on the mean square of I C(1/2 + it)I Proc. London Math. Soc., 36, 540-76. 21.4 Bombieri, E. (1987). Le grand crible dans la theorie analytique des nombres, 2nd edn. Asterisque, Paris. 5.6

Bombieri, E. and Iwaniec, H. (1986a). On the order of C(1/2 + it). Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4),13,449-72. 5,7,8,14,16,17,21,24.2 Bombieri, E. and Iwaniec, H. (1986b). Some mean value theorems for exponential sums. Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 13, 473-86.

7.5,12

Bombieri, E. and Pila, J. (1989). The number of integral points on arcs and ovals. Duke Math. J., 59, 337-57.

23.1

Branton, M. and Sargos, P. (1994). Points entiers au viosinage d'une courbe plane a tres faible courbure. Bull. des Sciences Maths. (2) 118, 15-28. 3, 23.1 Cassels, J. W. S. (1953). A new inequality with application on the theory of Diophantine approximation. Math. Annalen, 126, 108-18. 5,6 Cassels, J. W. S. (1959). An introduction to the geometry of numbers. Springer, Berlin. 11.2

Chaix, H. (1972). Demonstration elementaire d'un theoreme de van der Corput. Comptes RenduesAcad. Sci. Paris, A 275, 883-5.

2

Chandrasekharan, K. and Narasimhan, R. (1967). On lattice-points on a random sphere. Bull. Amer. Math. Soc., 73, 68-71. 6 Colin de Verdiere, Y. (1977). Nombre de points entiers dans une famille homothetique de domaines de R. Ann. Sci. Ec. Norm. Sup. (4)10, 559-76. 6 Corput, J. G. van der (1920). Uber Gitterpunkte in der Ebene. Math. Annalen, 81, 1-20. 2,5 Corput, J. G. van der (1921). Zahlentheoretische Abschatzungen. Math. Annalen, 84, 53-79. 5 Corput, J. G. van der (1922). Verscharfung der Abschatzung beim Teilerproblem. Math. Annalen, 87, 39-65. 3,5 Corput, J. G. van der (1923). Neue zahlentheoretische Abschatzungen. Math. Annalen, 89, 215-54.

5

References

485

Corput, J. G. van der (1935). Zur Methode der stationaren Phase I. Compositio Math., 1, 15-38. 5.5 Corput, J. G. van der (1936). Zur Methode der stationaren Phase II. Compositio Math., 3, 328-72. 5.5 Corput, J. G. van der and Pisot, C. (1939). Sur la discrepance modulo un I. Proc. Kon. Ned. Akad. Wetensch., 42, 476-85. 5.3, 5.6 Davenport, H. (1967). Multiplicative number theory. Markham, Chicago. 5.4,21,22.1 Deligne, P. (1974). La conjecture de Well. Inst. Hautes Etudes Sci. Publ. Math., 53, 273-307. 10.3, 20.1 Deshouillers, J. M. (1976). Nombres premiers de la forme [n`]. Comptes RenduesAcad. Sci. Paris, A 282, 131-3. 22.1 Deshouillers, J. M. (1985). Geometric aspect of Weyl sums. Elementary and Analytic Theory of Numbers. Banach Centre Pub. 17 (Polish Sci. Pub; Warsaw) 75-82. 2 Deschouillers, J. M. and Iwaniec, H. (1982). Kloosterman sums and Fourier coefficients of cusp forms. Inventiones Math., 70, 219-88. 24.2 Everest, G. R. (1989). A `Hardy-Littlewood' approach to the S-unit equation. Compositio Math., 70, 101-18. 23.3

Faltings, G. (1983). Endlichkeitssatze filr abelsche Varietaten fiber Zahlkorpern. Inventiones Math., 73, 349-66.

11.1

Filaseta, M. (1988). An elementary approach to short interval results for k-free numbers. J. Number Theory, 30, 208-25. 3.2 Filaseta, M. and Trifonov, O. (1996). The distribution of fractional parts with applications to gap results in number theory. Proc. London Math. Soc. To appear. 23.1 Ford, L. R. (1938). Fractions. Amer. Math. Monthly, 45, 586-601. 1.2 Fort, C. (1973). New lands. Ace, New York. 15.2

Fouvry, E. and Iwaniec, H. (1989). Exponential sums for monomials. J. Number Theory, 33, 311-33.

23.5

Gel'fond, A. O. and Linnik, Yu. V. (1965). Elementary methods in analytic number theory. Rand McNally, USA.

6.3

Good, A. (1982). The square mean of Dirichlet series associated with cusp forms. Mathematika, 29, 278-95. 20.3 Good, A. (1983). Local analysis of Selbeig's trace formula. Lecture Notes in Mathematics, 1040. Springer, Berlin. 21.1 Graham, S. W. and Kolesnik, G. (1991). Van der Corput's method for exponential sums. London Math. Soc. Lecture Notes, 126. Cambridge University Press. 3,5,7,14,17.4,18.2,19.2,23.4

Hall, R. R. and Tenenbaum, G. (1986). The average orders of Hooley's 0r functions, II. Compositio Math., 60, 163-86. Hall, R. R. (1988). Divisors. Cambridge University Press.

11.1,18.5 11.1

Halberstam, H. and Richert, H.-E. (1974). Sieve methods. Academic Press, London. 5.6

Hardy, G. H. (1915). On the expression of a number as the sum of two squares. QuarterlyJ. Math. (Oxford), 46, 263-83.

10.2

Hardy, G. H. and Landau, E. (1924). The lattice points of a circle. Proc. Royal Soc., A 105, 244-58.

10.2

Hardy, G. H. and Littlewood, J. E. (1923). Some problems of `partitio numerorum'; III: on the expression of a number as a sum of primes. Acta Math., 44, 1-70. 5.6,12.1

Hardy, G. H. and Ramanujan, S. (1918). Asymptotic formulae in combinatory analysis. Proc. London Math. Soc. (2) 17, 75-115. 7.1,12.1

486

References

Hardy, G. H. and Wright, E. M. (1960). An introduction to the theory of numbers, 4th 1,11, 22.1 edn. Oxford University Press. Heath-Brown, D. R. (1978). The twelfth power moment of the Riemann zeta function. 21.5

Quarterly J. Math. (Oxford), 29, 443-62.

Heath-Brown, D. R. (1979). The fourth power moment of the Riemann zeta function. 21.3 Proc. London Math. Soc., (3) 38, 385-422. Heath-Brown, D. R. (1983). The Pjateckii-Sapiro prime number theorem. J. Number Theory, 16, 242-66.

5.5, 22

Heath-Brown, D. R. and Huxley, M. N. (1990). Exponential sums with a difference. 9,19.1, 21.3 Proc. London Math. Soc. (3) 61, 227-50. Herz, C. S. (1962a). Fourier transforms related to convex sets. Annals of Maths., 75, 6.2, 23.4 81-92. Herz, C. S. (1962b). On the number of lattice points in a convex set. Amer. J. Math., 6.2, 23.4 84,126-32. Hlawka, E. (1950). Integrale auf konvexen Korpern. Monatshefte Math., 54, 1-36, 6.2,18.5, 23.4 81-99. Hlawka, E., Schoissengeier, J., and Taschner, R. J. (1991). Geometric and analytic number theory. Springer, Berlin. Hobson, E. W. (1957). Theory of functions of a real variable. Dover, New York.

23 6.3

Huxley, M. N. (1972). On the difference between consecutive primes. Inventiones 20.2 Math., 15, 164-70. Huxley, M. N. (1975). Large values of Dirichlet polynomials III. Acta Arithmetica, 26, 435-44. 20.2 Huxley, M. N. (1985). Introduction to Kloostermania, Elementary and Analytic Theory of Numbers. Banach Centre Pub. 17, (Polish Sci. Pub., Warsaw), 217-306. 24.2 Huxley, M. N. (1987). The area within a curve. Proc. Indian Acad. Sci. (Math. Sci.), 97, 2 111-16. Huxley, M. N. (1988). The fractional parts of a smooth sequence. Mathematika, 35, 292-6. 3 Huxley, M. N. (1989a). Exponential sums and the Rieman zeta function, in Theorie des Nombres, C.R. C.I. TN. Laval 1987. de Gruyter, Berlin, 417-23. 3,17 Huxley, M. N. (1989b). The integer points close to a curve. Mathematika, 36, 198-215. 3

Huxley, M. N. (1990). Exponential sums and lattice points. Proc. London. Math. Soc. (3), 60, 471-502. Corrigenda, ibid. (3) 66, 70. 8,18 Huxley, M. N. (1991). Exponential sums and rounding error. J. London. Math. Soc. (2), 43, 367-84. 17.4,18

Huxley, M. N. (1992a). A note on short exponential sums, in Proc. Amalfi Conf. Analytic Number Theory. University Press, Salerno, 217-29. 14,17 Huxley, M. N. (1992b). Integer points in a domain with smooth boundary. in Seminaire de theorie des nombres Paris 1989-90. Birkauser, Basel, 93-111. 18.3

Huxley, M. N. (1993a). Exponential sums and the Riemann zeta function IV. Proc. London Math. Soc. (3), 66, 1-40.

4,15,17, 21.2

Huxley, M. N. (1993b). Exponential sums and lattice points II. Proc. London Math. Soc., (3), 66, 279-301. Corrigenda, ibid. (3), 68, 264.

18

Huxley, M. N. (1994a). The rational points close to a curve. Ann. Scuola Norm. Sup. Pisa Cl. Sci, (4) 21, 357-75. 4 Huxley, M. N. (1994b). A note on exponential sums with a difference. Bull. London Math. Soc., 26, 325-7. 13.3,19.1, 21.3

References

487

Huxley, M. N. (1994c). On stationary phase integrals. Glasgow Math. J., 35, 354-62. 5.5

Huxley, M. N. (1995a). A mean value theorem for exponential sums. J. Austral. Math. Soc., A, 59, 304-7. 6.1 Huxley, M. N. (1995b). The mean lattice point discrepancy. Proc. Edinburgh Math. Soc., 38, 523-31. 6

Huxley, M. N. and Kolesnik, G. (1991). Exponential sums and the Riemann zeta function III. Proc. London Math. Soc. (3), 62, 449-68. Corrigenda, ibid. (3), 66, 302. 11,12

Huxley, M. N. and Kolesnik, G. (1996). Exponential sums with a large second derivative. To appear. 16.3,19.3 Huxley, M. N. and Sargos, P. (1995). Points entiers au voisinage d'une courbe plane de class C". ActaArithmetica, 59, 359-66. 23.1 Huxley, M. N. and Trifonov, O. (1996). The square-full numbers in an interval. Proc. Cam. Philos. Soc., 119,201-8. 3, 23.1 Huxley, M. N. and Watt, N. (1988). Exponential sums and the Riemann zeta function. Proc. London Math. Soc. (3), 57, 1-24. 1.6, 7,14,17 Huxley, M. N. and Watt, N. (1989a). The Hardy-Littlewood method for exponential sums. Coll. Math. Soc. Jdnos Bolyai, 51, Number Theory. Budapest 1987. North Holland, Amsterdam, 173-91. 7,14,17

Huxley, M. N. and Watt, N. (1989b). Exponential sums with a parameter. Proc. London Math. Soc. (3),59,233-52. 14.4,17 Huxley, M. N. and Watt, N. (1994). The number of ideals in a quadratic field. Proc. 18, 24.1 Indian Acad. Sci. (Math. Sci.), 104, 157-65. Ireland, K. and Rosen, M. (1982). A classical introduction to modem number theory. 10.3 Springer, Berlin.

Ivi6, A. (1985). The Riemann zeta function. Wiley, New York.

21

Iwaniec, H. (1980). Fourier coefficients of cusp forms and the Riemann zeta function. 21.5 Seminaire de Theorie des Nombres (Bordeaux) 1979/80. Iwaniec, H. and Mozzochi, C. J. (1988). On the divisor and circle problems. J. Number Theory, 29, 60-93. 5, 8,13,18 Jarnik, V. (1925). Uber die Gitterpunkte auf konvexen Kurven. Math. Zeitschrift, 24, 500-18. 2,3

Jeffreys, H. and Jeffreys B. S. (1962). Methods of mathematical physics, 3rd edn. Cambridge University Press. 7.3,8.3,10.2,21.2,24.2 Jutila, M. (1985). On exponential sums involving the divisor function. J. reine angew. 10, 20.3 Math., 355, 173-90.

Jutila, M. (1986). On the approximate functional equation for C '(s) and other Dirichlet series. Quarterly J. Math. (Oxford) (2), 37, 193-209.

5.5,10, 20.3

Jutila, M. (1987a). Lectures on a Method in the Theory of Exponential Sums. Tata Institute Lectures in Maths. and Physics 80. Springer, Bombay.

10,20

Jutila, M. (1987b). On exponential sums involving the Ramanujan function. Proc. Indian Acad. Sci. (Math. Sci.), 97, 157-66.

10,20

Jutila, M. (1989). Mean value estimates for exponential sums, in C. R Joumees Arithmetiques Ulm 1987. Lecture Notes in Mathematics 1380. Springer, Berlin, 120-36.

10,17.4, 20

Jutila, M. (1990a). The fourth power moment of the Riemann zeta function over a short interval. Coll. Math. Soc. Jdnos Bolyai, 51, Number Theory. Budapest 1987. 10,13.3,14.4, 21 North Holland, Amsterdam, 221-44.

488

References

Jutila, M. (1990b). Mean value estimates for exponential sums II. Arch. Math., 55, 267-74. 10,20 Jutila, M. (1990c). Exponential sums related to quadratic forms, in Proc. Canadian Number Theory Conference Banff 1988. de Gruyter, Berlin.

10,20

Jutila, M. (1991). Mean value estimates for exponential sums with applications to 10,20,21 L-functions. Acta Arith., 57, 93-114. Jutila, M. (1992). Transformations of exponential sums, in Proc. Amalfi Conf. Analytic 10, 24.2 Number Theory. University Press, Salerno, 263-70. Karatsuba, A. A. (1981). On the distance between adjacent zeros of the Riemann zeta function lying on the critical line. Proc. Steklov Inst. Math., 3, 51-66, from Trudy 7.6,21.4 Mat. Inst. Steklov, 157. Karatsuba, A. A. (1987). Approximation of exponential sums by shorter ones. Proc. 5.5 Indian Acad. Sci. (Math.-Sci.), 97, 167-78. Kendall, D. G. (1948). On the number of lattice points inside a random oval. Quarterly 6 J. Math. (Oxford), 19,1-26. Kendall, D. G. and Rankin, R. A. (1953). On the number of points of a given lattice in 6.2,23.4 a random hypersphere. Quarterly J. Math. (Oxford) (2), 4, 178-89. Korobov, N. M. (1992). Exponential sums and their applications. Kluwer, Dordrecht. 5,23 3,5,23.4 Kratzel, E. (1988). Lattice points. Deutscher Verlag Wiss., Berlin. Kratzel, E. and Nowak, W. G. (1991). Lattice points in large convex bodies. Monatsh. 23.4 Math., 112, 61-72. Kratzel, E. and Nowak, W. G. (1992). Lattice points in large convex bodies H. Acta 23.4 Arith., 62, 285-95.

Kuznietsov, N. V. (1980). Peterson's hypothesis for cusp forms of weight zero and Linnik's hypothesis; sums of Kloosterman sums. Mat. Sbomik, 111, 334-83. 14.2,21.1,24.2 Landau, E. (1915). Uber die Gitterpunke. in einem Kreise (II). Gottinger Nachrichten 10.2 (1915), 161-71.

Lax, P. D. and Phillips, R. S. (1967). Scattering theory for automorphic functions. Academic Press, London. Linnik, Yu. V. (1941). The large sieve. DokladyA.N.S.S.S.R, 30, 292-4.

21.1 5.6,22.1

Littlewood, J. E. (1912). Quelques consequences de l'hypothese que la fonction i(s) de Riemann n'a pas des zeros dans le demi-plan R(s) > 1/2. Comptes RenduesAcad. 21.1 Sci. Paris, 154, 263-6. Littlewood, J. E. (1986) Littlewood's miscellany (ed. B. Bollobas). Cambridge Univer7.4 sity Press.

Liu, H.-Q. (1993). On the number of Abelian groups of given order (supplement). Acta Arith., 64, 287-96.

23.5

Liu, H.-Q. and Rivat, J. (1992). On the Pjateckii-Sapiro prime number theorem. Bull. 22, 23.5 London Math. Soc., 24,143-17. Logan, I. and O'Hara, F. (1983). The complete Spectrum ROM disassembly. Melbourne 18.5 House, Richmond, Surrey. Matthews, K. R. (1973). Bilinear forms and the large and small sieves. J. Number 5.6 Theory, 5, 16-23. Meurman, T. (1988). On exponential sums involving the Fourier coefficients of Maass 10,20 wave forms. J. reine angew. Math., 384, 192-207.

Meurman, T. (1990). On the order of the Maass L-functions on the critical line. Coll Math. Soc Janos Bolyai, 51, Number Theory. Budapest 1987. North Holland, 10,20 Amsterdam. 325-54.

References

489

Meurman, T. (1992). A simple proof of Voronoi's identity. Asterisque, 209, 265-74. 10 Montgomery, H. L. (1971). Topics in multiplicative number theory. Lecture Notes in Mathematics. 227. Springer, Berlin. 5,21 Montgomery, H. L. and Vaughan, R. C. (1974). Hilbert's inequality. J. London Math. Soc. (2), 8, 73-82. 5

Moreno, C. J. and Shahidi, F. (1983). The fourth moment of the Ramanujan Tfunction. Math. Annalen, 266, 233-9. 10.3, 20.1 Muller, W. and Nowak, W. G. (1991). Lattice points in planar domains: applications of Huxley's discrete Hardy-Littlewood method, in Number-Theoretic Analysis II. Springer Lecture Notes in Mathematics, 1452, pp. 139-64. Springer, Berlin. 23

Nowak, W. G. (1985a). On the average order of the lattice rest of a convex planar domain. Math. Proc. Cam. Philos. Soc., 98, 1-4. 6.3 Nowak, W. G. (1985b). An 11-estimate for the lattice rest of a convex planar domain. Proc. Royal Soc. Edinburgh, 100A, 295-99. 6.3 Ogg, A. (1969). Modular forms and Dirichlet series. Benjamin, New York.

10

Piatecki-Shapiro, I. I. (1953). On the distribution of prime numbers in sequences of the form [f(n)]. Mat. Sbomik, 33, 559-66.

22

Phillips, E. (1933). The zeta function of Riemann: further developments of van der Corput's method. Quarterly J. Math. Oxford, 4, 209-25.

5

Rankin, R. A. (1939). Contributions to the theory of Ramanujan's function T(n) and

similar arithmetical functions II: the order of Fourier coefficients of integral modular forms. Proc. Cam. Phil. Soc., 35, 357-72. 10.2, 20.1 Rankin, R. A. (1977). Modular forms and functions. Cambridge University Press. 1.3,10 Sargos, P. (1995). Points entiers an voisinage d'une courbe, sommes trigonometriques courtes et pairs d'exposants. Proc. London Math. Soc., (3) 70, 285-312. 3,19.3 Selberg, A. (1942). On the zeros of Riemann's zeta function on the critical line. Arch. for Math. og Naturvid., B 45, 101-14 (Collected Works 1, 142-55). 21.4 Selberg, A. (1956). Harmonic analysis and discontinuous groups in weakly symmetric

Riemannian spaces with applications to Dirichlet series. J. Indian Math. Soc., 20, 47-87 (Collected Works I, 423-63). 21.1,24.2 Sierpinski, W. (1906). Sur un probleme du calcul des fonctions asymptotiques. Prace Mat.-Fiz., 17,77-118. 2 Swinnerton-Dyer, H. P. F. (1974). The number of lattice points on a convex curve. J. Number Theory, 6, 128-35. 2,3 Titchmarsh, E. C. (1951). The theory of the Riemann zeta function. Oxford University Press.

5,21

Trifonov, O. (1988). On the number of the lattice points in some two-dimensional domains. Doklady Bulgar. Akad. Nauk, 41, 11, 25-7. 5,18.3 Uchiyama, S. (1972). The maximal large sieve. Hokkaido Maths. J. 1, 117-26. 8.4 Vaughan, R. C. (1977). Sommes trigonometriges sur les nombres premiers. Comptes RenduesAcad. Sci. Paris, A 285, 981-3. 22 Vaughan, R. C. (1981) The Hardy-Littlewood method. Cambridge University Press. 8,12, 23.2 Vinogradov, I. M. (no date; circa 1954). The method of trigonometric sums in the theory of numbers (translated K. F. Roth and A. Davenport). Interscience, London. 5,22 Voronoi, G. (1903). Sur un probli me du calcul des fonctions asymptotiques. J. reine angew. Math., 126, 241-82. 2 Voronoi, G. (1904). Sur une fonction transcendente et ses applications a la sommation de quelques series. Ann. Ecole Norm. Sup. (3) 21, 207-67, 459-533. 6,10, 21.5

490

References

Watt, N. (1988). An elementary treatment of a general Diophantine problem. Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 15, 603-14 Watt, N. (1989a). A problem on semicubical powers. Acta Arith., 52, 119-40.

11 11

Watt, N. (1989b). Exponential sums and the Riemann zeta function II. J. London 11,12,17 Math. Soc., 39, 385-404. Watt, N. (1990). A problem on square roots of integers. Periodica Math. Hung., 21, 13 55-64. Watt, N. (1992). A hybrid bound for Dirichlet L-functions on the critical line, in Proc. Amalfi Conf. Analytic Number Theory. University Press, Salerno, 387-92.

16.2,17.3

Weil, A. (1967). Uber die Bestimmung Dirichletscher Reihen durch Funktional10.1 gleichungen. Math. Annalen, 168,149-56. Weil, A. (1971). Dirichlet series and modular forms. Lecture notes in mathematics, 189, 10.1 Springer, Berlin.

Wilton, J. R. (1929). A note on Ramanujan's arithmetic function T(n). Proc. Cam. 10.2 Phil. Soc., 25, 121-9. Wilton, J. R. (1932). Voronoi's summation formula. Quart. J. Math. Oxford (3), 26-32.

10.2

Zaharescu, A. (1987). On a conjecture of Graham. J. Number Theory, 31, 80-7.

1.1

Index

analytic coincidence conditions 315-20, 327, 339-46 approximate functional equation 111-13, 432,

436,441-3,447-8 arc of coincidence 313, 321, 327-8, 341-2 area in intrinsic coordinates 393 Atkin-Lehner newforms 210, 435 Atkinson's summation formula 451

degenerate cases 81 Deligne, P. 210, 425 Deshouillers, J. M. `452 diagonal solutions 236 terms 363 differencing 45-6, 65 step 115-19, 135, 185, 257, 259-60, 363-5, 383, 406, 416, 421-2, 461, 473, 474 Dirichlet approximation theorem 16, 83, 229, 259, 473

Baker, R. C. 473 Balasubramanian, R. 447 Bombieri, E. 2, 3, 4, 125, 158, 160, 289, 467-9, 478, 483

Bombieri-Iwaniec method 142-196, 235-361, 372-383, 479 Borel, E. 7 Brun-Titchmarsh theorem 122-5 Burgess D. A. 135

boundary condition 405 characters 100-101, 135, 236, 362, 432 divisor problem 2, 28, 239-40, 382-83 pigeonhole principle 16, 42-3, 48, 56, 335 series 2, 237, 432-3, 435, 438-441, 447 discrepancy of lattice points 33-41,128-141, 372, 384-8, 398, 474, 479, 483 of sequences 97-8

divide-and-conquer 2, 42-3, 48, 56, 81, 83, 115, 143, 158, 236, 240, 255, 263-4, 307, 311, 447,

Cassels, J. W. S. 116, 242 Cauchy, A. 8 inequality discussed 114-16, 121-2, 125 centre of Farey arc 145, 169, 186, 339 Chaix, H. 41 circle method 143, 152, 255-71, 470-2 see also Hardy, G. H. problem 2, 31, 203 coincidence conditions 156-7, 216, 231, 235, 272, 283-4, 287-346, 349, 416, 426-9, 478,

479

divided differences 46, 65 divisor concentration 240-1, 367 function 29, 203, 237-41, 367 problem 2, 28, 239-40, 382-83 duality of points and lines 73-4, 137-8 of bilinear forms 122, 224

483

detection 119-20, 255 commensurable subgroups 198

Erdos-Turan theorem 96-7 Euler constant 239-40

complementary function 72, 110, 321, 341, 345, 383, 415

product 113, 435, 438-41 Everest, G. 472 exponent pairs 151-2, 369-70 exponential integrals defined 87 evaluated 104-14 exponential sums bounded 99,353-61,365-71,415-22 defined 87 non-standard 475

congruence family 336-9,349,357,361-3,392, 478

continued fraction 16-19, 398 Corput, J. G. van der 4, 41, 43, 93, 116, 125 iteration 43, 62, 116-17, 135, 152, 383, 406, 474

lemmas 87-120, 221-2, 225, 448 counting squares 1-2, 25-31, 41, 131, 167 cusp forms 200 Davenport, H. 4, 101, 453

Faltings, G. 235

Index

492

families of solutions 241-48, 249-51, 272-83 of sums 179, 194, 219-20, 300-306, 333-4, 353, 357-61, 393-7, 410-13, 420, 424-9, 458, 478

Farey arcs 2, 143, 167-70, 185-7, 208-9, 255, 336-7, 372, 425-30, 445, 470, 473, 479-80, 483 sequence 8-10, 19-23, 208, 346, 439, 470, 477

see also major arcs, minor arcs

Fejer kernel 93-4 Filaseta, M. 45, 470 First Derivative Test stated 88, 104, 113-14, 448

First Spacing Problem 3, 56, 158, 235-85, 307,

372,406,471,476-7 flanks of major arcs 151 Ford circles 10 Fortean function 312, 324, 327, 342, 343 Fouvey, E. 475, 477

gardening 152,166,183-4,195-6,225,228-30, 412

Gauss circle problem 2, 31, 203 sums 100-4, 149, 162 Gel'fond, A. 0. 41, 135, 452 generating function 198, 438, 470 Gershgorin's eigenvalue bound 224 Good, A. 436, 439 Graham, R. L. 8 Graham, S. W. 4, 432 with Kolesnik, G. 2, 43, 116, 125, 152, 297, 383, 474

Halasz, G. 433 Halberstam, H. 122 Hall, R. R. 240, 399 Hardy, G. H. 2, 31, 122, 143, 203, 256, 439, 453

Hardy-Littlewood circle method, see circle method Harman, G. 473 Heath-Brown, D. R. 2, 4, 125, 185, 406, 444, 451, 464

height 7 Herz, C. S. 128, 474 Hlawka, E. 128, 405, 474 Hobson, E. W. 130 Hooley, C. 4 divisor concentration 241-2, 367

intrinsic equation of curve 24-5, 384, 388-393 inversion step 108-10, 116-17, 149, 176, 191-2, 193, 383, 422, 474-5 Ireland, K. 210

iteration between rational points and lines 76-80 by major arc method 166, 413-15 of points close to curves 43, 48-53 of resonance curves 370, 415-22 van der Corput's 43, 62, 116-17, 137, 154, 383, 406, 474 Ivic, A. 4, 440 Iwaniec, H. 2, 3, 4, 125, 167, 372, 451, 471, 475, 477; see also

Bombieri-Iwaniec

Jarnik, W. 30-33, 57 Jeffreys, H. and B. S. 150, 177, 203, 441, 482 Jordan curve theorem 25 Jutila, M. 2, 4, 208, 225, 284, 423, 435, 436, 437, 451

Karatsuba, A. A. 125, 160, 451 Kendall, D. G. 135, 474 Kloosterman refinement 471, 479

sums 481-3 Koebe's function 440 Kolesnik, G. 2, 4, 255, 297, 339, 416, 474, 478, see also Graham S. W. Kratzel, E. 4, 43, 474, 475 Kuznietsov, N. V. 290, 439, 481-83 L-function 435-7, 440-1, 451 Landau, E. 31 large sieve 3, 120-125, 126, 166, 214-20, 224-5,372,470 applied 152-8, 178-84, 193-6, 231, 329, 475-6

lemmas 120-25, 224-5 large values results 126, 431-2 Lax, P. D. 440 limitation results 126, 248-9, 334-5, 474 Lindelof hypothesis 441 Linnik, Yu. V. 41, 122, 124, 135, 454 Liu, H.-Q. 452, 475 Logan, I. 398 magic matrix 289-308, 320-1, 329-335, 415-18, 423-8, 430 major arcs in Bombieri-Iwaniec method 148-51, 160-66, 168-72, 186-8, 377, 409, 480 flanks of 151 in Hardy-Littlewood method 143, 255, 256-7, 262-68, 270 long 160-66, 356, 413-15, 450 for S-suits 473 Sargos splines 469 as sides of polygon 55, 67-72 Mangoldt, H. von 438, 453, 464 Matthews, K. R. 125 mean-to-max 117-18, 122, 135, 136, 185, 410

493

Index

Mellin transform 91, 117-18, 181-2, 195, 438-40 Minkowski, H. 242 minor arcs in Bombieri-Iwaniec method 143-9, 151, 168-70, 172-8, 186-7, 188-93, 336, 377, 410, 414, 480

in Hardy-Littlewood method 143, 255, 256-62, 270 Sargos splines 469

as sides of polygon 55, 67 MSbius function 13-14, 63-4, 123-5, 439, 454 transformation 199 Modification One 147, 149, 168, 172, 315 Two 147, 149, 156, 169, 181, 186, 195, 351, 352, 374, 376 Three 169

reflection step, see inversion step resonance curves 320-21, 345, 415-19 Riemann hypothesis 15, 210, 435-6, 439-40, 447

zeta function 2, 3, 64, 111-13, 160, 237, 432, 435, 438-451

Riesz interchange 218, 246, 267, 330-2, 405, 457

Rivat, J. 452, 475 Rosen, M. 210 row-of-glasses function 95 row-of-teeth function 2, 33, 93, 131, 167, 379-81 rounding error function 2, 33, 93, 131, 167, 379-81 sums 93-8, 131, 167, 363, 372, 379-83, 385-8,397-405

modular group 11-12, 197, 289, 439-40, 481, 483 forms 2, 3, 197-9, 256, 290, 435, 451, 481-83

Sargos splines 469

monomial function 50-2, 117, 165, 369, 414, 416,473,475-7 Montgomery, H. L. 115, 213, 455 Mordell, L. J. 440 Moreno, C. J. 210, 425, 437 Motohashi, Y. 4 Mozzochi, C. J. 2, 4, 167, 372

Second Spacing Problem 3, 158, 287-346, 363, 372, 476, 483 Selberg, A. 4, 481 sieve 122, 125, 439

Nowak, W. G. 4, 135, 474

Sierpinski, W. 41, 167, 372 sieve

trace formula 439-40 zeta function 439-40 Shahidi, F. 210, 425, 437 short interval means 118-19, 221-4, 226-7,

408-10,435,443-7 general 454

O'Hara, F. 398 partial summation lemmas 89-93, 181-2, 225, 408

Phillips, R. S. 440 Piatecki-Shapiro theorem 452, 463, 475 Pila, J. 467-9 Pisot, Ch. 116 Poisson summation formulae 93-104 used 2,103,109,110-12,149-50,173-8,

large, see large sieve Selberg's 122, 125, 439 simple asymptotic 63 simple asymptotic sieve 63 Simpson's rule 397 stakanchik 95 stationary phase integrals 105-8, 133-4, 150, 177, 212, 263

Swinnerton-Dyer, H. P. F. 4, 30, 43, 45-6, 49, 50, 54-62, 197, 467

189-93, 257, 260, 263-4, 320, 363, 381,

383,409,419,432,438,474-5 polynomial approximation 2, 142-6, 161, 173,

188-9,384,469 prime numbers 1, 15, 122-5, 438-9, 447, 452-64, 473

Tenenbaum, G. 240, 399 trapezium rule 31, 98, 128-9, 397 Trifonov, O. 4, 470 Type I double sums 115, 455-60 Type II double sums 115, 213, 455, 459-62, 475

Rademacher's binary decomposition 180 Ramachandra, K. 446 Ramanujan's hypothesis: bounds 210 hypothesis: multiplicativity 440 sum 13-14, 123, 125 work on partitions 143, 256 Rankin R. A. 4, 200, 210, 425, 433, 436, 451

reference fractions 19-20, 308, 312, 329-30, 332

Uchiyama, S. 180 uniform Diphantine approximation 19-23, 221 uniform distribution 12-15

van der Corput, see Corput, J. G. van der Vaughan, R. C. 115, 143, 152, 256, 454, 455, 464, 471

Index

494

Vinogradov, I. M. 4, 41, 93, 143, 152, 256, 454,

Wright, E. M. 453

471

Voronoi, G. 4, 41, 128 polygon 2, 40-1, 167, 372 summation formula 128, 203, 451

x-vectors 154, 178, 193, 231 y-vectors 154, 178, 194, 231

Waring's problem 3, 152, 471 Watt, N. 2, 4, 20, 241, 246, 272, 297 Weil, A. 199, 481 Weyl's criterion 13, 96-7 Wilton summation 199, 200-10, 451 translated 440 used 210-13

Zaharescu, A. 8 zeta functions 440-1 Minakshisundaram-Pleijel 440 Riemann 2, 3, 64, 111-13, 160, 237, 432, 435, 438-451 Selberg 439-40

Area, lattice points, and exponential sums

Area, lattice points, and exponential sums

Area, Lattice Points and Exponential Sums (London Mathematical Society Monographs)