|
|
||||||||||||||||||||||||||
| I have been getting a lot of feedback from readers, including of course many questions and requests for elucidation. To avoid having to repeat myself in individual e-mail responses, I am going to use this page as a link I can refer people to. Here I shall list some FAQs (i.e. frequently-asked questions), with my responses. General questions about the book are up front. Then questions that refer to specific pages are in order by page number. Other questions and suggestions are welcome (though if you think you have spotted an error in the text, please check first with my "Errata" page to see if I have already logged it). E-mail me via the link on my home page. Q. I am a total mathophobe. I hate equations and formulas and stuff. Can I get anything out of your book? A. Ah, "the fear of all sums"! Well, I hope mathophobes (I have a feeling that should actually be "mathephobes," but Merriam-Webster's Third is no help here**) will be able to get something out of my book, if not to the value of the full purchase price. I suggest you try the following plan.
** Though one of my readers, Evan McElravy, was: Mr. Derbyshire: I am myself a fair mathephobe. I do, however, know something of Greek and your hunch is correct. If you’ll indulge me a moment: There are two main
noun forms in Greek for the word, (1) hê mathê, and (2) to mathêma,
whose stem is mathêmat- Both are derived from the aorist stem of the
verb manthanein (math-). While this stem has no terminal vowel
("they learned" = emathon), and the final eta in the first form is
just an inflectional ending, in all the various other forms, like the second
from above, and the few derivatives (e.g., hê mathêteia =
"instruction from a teacher;" ho mathêtês = "student") there is an
intermediate eta. The big Liddell and Scott indicates no authentic Greek
compounds where math is the first element, but it seems certain that
there would be an eta there. Q. Will there be a paperback edition of Prime Obsession? A. Yes. Penguin Books (USA) have bought the paperback rights and started work on a paperback. Current (October 2003) schedule is for the paperback to come out in June 2004. Q. Can you send me, or post somewhere, the Mathematica notebooks you used for your calculations and diagrams? A. Mostly no. The book was written in a tearing hurry 12 to 15 months ago. Most of the math was done on the fly, and most of the notebooks have not survived in their original form. (I mean, I either cannibalized earlier ones to make later ones, or just lost them.) This is not actually a great loss to humanity. I am not a deep Mathematica user, and the code I developed was not particularly brilliant or imaginative. In most cases, you could re-create it yourself in a few minutes by just looking at the result diagram or calculation. For an illustration from the Mathematica notebooks that survive on my hard drive, here is the code behind the statement on page 284 that: "I asked my math software package, Mathematica 4, to generate a random 269x269 Hermitian matrix and compute its eigenvalues." << Statistics`DescriptiveStatistics` There was nothing much deeper than that. Q (page 4). In your "card trick," are you sure about that third card going forward one-sixth of a card length? Seems to me it should be one-eighth. A. I am baffled by the number of people who told me this, since I included a diagram at the top of page 5 showing, as clearly as it can possibly be shown, why the correct number is one-sixth. But OK, let's try your way. We push that third card forward one-eighth of a card length. What have we got in front of the "tipping point"? (I.e. the leading edge of the main part of the deck.) Well, we've got
That's a total of 11/8 card lengths (7 + 3 + 1 = 11) in front of the "tipping point." Now, 3 minus 11/8 equals 13/8. So out of those three top cards, 11/8 card lengths is in front of the "tipping point," while 13/8 card lengths are behind it. Which means that it is not a tipping point at all; we can still push a little further! One-sixth is the correct answer. If you are still not convinced, I can only suggest you take out a deck of cards and try it. You can easily distinguish between one-sixth and one-eighth of an ordinary card length. When it comes to the fourth card, the math changes slightly. After you push the top four cards out a distance one-eighth, the topmost card is now entirely beyond the tipping point. Its back edge is in fact one-twenty-fourth ahead of the tipping point. To confirm that this is, in fact, the tipping point, you therefore need to take moments into consideration. Obviously the "tipping effect" of that topmost card's mass is greater the further ahead of the tipping point it is. With the top four cards pushed out 1/2 plus 1/4 plus 1/6 plus 1/8, the arithmetic to check that the tipping point is correct now goes as follows.
To the right of the tipping point we have — Q (page 16). In Chapter 1.vii you say: "Consider the following sequence of numbers: 1/1, 3/2, 7/5, 17/12, 41/29, 99/70, 239/169, 577/408, 1393/985, 3363/2378,... Each fraction is built from the one before by a simple rule: add top and bottom to get new bottom, add top and twice bottom to get new top. That sequence converges to the square root of 2." Why? Can you prove this? A. Of course I can! That simple rule turns a/b into (a+2b)/(a+b). Or, by elementary algebraic manipulation, into 1 + b/(a+b). Or, by a wee bit more manipulation, to 1 + 1/(1 + a/b). Fix that in your mind: starting from a/b, one application of the "simple rule" gets you to 1 + 1/(1 + a/b). If I call a/b "R" (for "ratio") then one application of the rule gets you from R to 1 + 1/(1 + R). Where would two applications of the rule get you? Well, the first application gets you to 1 + 1/(1 + R), then the second of course gets you to 1 + 1/(1 + [1 + 1/(1 + R)]), or in other words, to 1 + 1/(2 + 1/(1 + R)). A third application of the "simple rule would then give you 1 + 1/(1 + [1 + 1/(2 + 1/(1 + R))]), or in other words: 1 + 1/(2 + 1/(2 + 1/(1 + R))). A fourth application of the rule gives 1 + 1/(2 + 1/(2 + 1/(2 + 1/(1 + R)))). A fifth give 1 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(1 + R))))). You can see where this is heading. A very large number of applications of that simple rule gets you from R to a number that looks like this: 1 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + .... + 1/(1 + R)))))...))))) (Mathematicians call this "a continued fraction.") Note that the size of R matters less and less as you keep iterating. If R was totally humongous, that final 2 + 1/(1 + R) would be just a teeny bit bigger than 2; if R was incredibly tiny, it would be an eentsy bit less than 3. As you proceed leftward, calculating as you go, the difference dwindles away to nothing. But what is that thing? What, to take it off to infinity, is 1 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + .... ....))))) ? Well, call it x, and rewrite it just slightly as 1 + 1/(1 + 1 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + .... ....))))) even putting in an extra parenthesis, if you like, to make things clearer: 1 + 1/(1 + [1 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + 1/(2 + .... ....))))]) Then if you look closely at it, you will see that the following thing is true: x = 1 + 1/(1 + x), which, if you rearrange it, says that x2 = 2. I repeat: this is true no matter what a/b you start with. In Prime Obsession I started with a = 1, b = 1, just for simplicity. (The other sequences in this section are a bit harder!) Q (page 33). You say there are only eight primes in the last hundred-block before a million, that is, from 999,901 to 1,000,000; and that there are only four in the last hundred-block below a trillion, from 999,999,999,901 to 1,000,000,000,000. Seems to me there must be hundred-blocks between a million and a trillion that have more than eight and/or less than four (or exactly eight or four), i.e., there are lots more than four hundred-blocks between a million and a trillion; the eight can't decrease only to four (unless, say, lots have seven, then lots have six, then lots have five). A. One thing I regret is not having put more emphasis on the randomness of the distribution of primes the way they sometimes clump together, sometimes spread apart, apparently unpredictably. That's what comes into play here. Consider that first hundred-block I mentioned, for instance, the one from 999,901 to 1,000,000. There are, as I said, 8 primes in that block. In the previous hundred-block, however — the one from 999,801 to 999,900 — there are only 4 primes. Stepping back through the hundred-blocks prior to that, the numbers of primes in each block are 6, 8, 6, 6, 7, 6, 5, 8, 10,... Stepping forward (that is, starting with the hundred-block from 1,000,001 to 1,000,100), the number of primes in each hundred-block is 6, 10, 8, 8, 7, 7, 10, 5, 6, 8,... You can see that the number of primes in a block of 100 numbers in the neighborhood of a million hovers around the "expected" value of 100 / log(1,000,000), which is 7.23824, but irregularly. Here we are in the presence of statistical truth. Same thing in the neighborhood of a trillion, where the "expected" number of primes in a hundred-block would be 100 / log(1,000,000,000,000), which is 3.61912. Starting ten hundred-blocks back from the one I quoted, and going a further ten blocks forward (so the first hundred-block here is from 999,999,998,901 to 999,999,999,000, while the last one is from 1,000,000,000,901 to 1,000,000,001,000), the tallies of primes in each hundred-block are: 2, 1, 3, 6, 3, 2, 6, 4, 2, 5, 4, 4, 6, 2, 4, 2, 4, 3, 5, 1, 6. Again, the tally is hovering around the "expected" value; the average is actually 3.57143. But the hovering is irregular. To fill this out even more comprehensively, here is a table of the number of primes in the five hundred-blocks before N and the five hundred-blocks after, for N equal to a thousand, a million, a billion, and a trillion. Here you see both the "thinning out" in the distribution of primes, and also the irregularity.
(Incidentally, looking at the lists of numbers in the second column of that table, I am inspired to wonder how big N would have to be before you had a reasonable chance — fifty-fifty, say — of getting a list of ten zeros. A quick back of the envelope calculation, making some primitive assumptions about the statistics involved, suggests that N would need to be around 1046... but that comes with no guarantees at all!) Q (page 105). I have seen it stated that the Euler Product is an "analytic version" of the Fundamental Theorem of Arithmetic, which guarantees a unique factorization of an integer into primes. It's not clear to me how this is so. Or how the zeta function would then be related to the Fundamental Theorem of Arithmetic. [Which states: "Any natural number greater than 1 can be represented in exactly one way as a product of primes." For example, the natural number 3,294 is 2 x 33 x 61, and cannot be factorized down to primes in any other way.] A. Well, the Euler Product is the infinite product: (1-1/2s)-1 times (1-1/3s)-1 times (1-1/5s)-1 times... for all the primes 2, 3, 5, 7, 11, 13,... Let's consider just one of those parentheses, the one for prime p. The parenthesis is (1-1/ps)-1. Is there some other way to write that? Sure there is. Look at my Expression 9-2 on page 140. My parenthesis is just the left-hand side of Expression 9-2, with 1/ps in place of x. I can therefore replace it by the right-hand side of Expression 9-2, which, with x replaced by 1/ps, works out to 1 + 1/ps + 1/p2s + 1/p3s + 1/p4s +... So now I know that my infinite product, the Euler Product, is equal to (1 + 1/2s + 1/22s + 1/23s + 1/24s +...) times (1 + 1/3s + 1/32s + 1/33s + 1/34s +...) times (1 + 1/5s + 1/52s + 1/53s + 1/54s +...) times... for all the primes. An infinite product of parentheses, each one containing an infinite sum! However, if you survived my Chapter 15, this won't scare you at all. Look at the logic of Chapter 15.iv. To multiply out an infinite number of parentheses, you just
As in Chapter 15.iv, the only plucks you need bother with are those with an infinity of 1's and a finite number of not-1's. This one, for instance: 1 x 1/24s x 1 x 1 x 1/7s x 1/112s x 1 x 1 x 1 x 1 x... What would that be? It would be 1/(24s x 7s x 112s). The denominator works out to 13,552s. Now apply the Fundamental Theorem of Arithmetic. It follows that my plucking process will turn up every possible integer exactly once. No other pluck, for example, will give me 13,552. The only pluck that will give me 13,552 is the one I did, because there is no other way to factorize 13,552 into powers of primes! That's the Fundamental Theorem! So when I come to that last bullet point, "add up all the results," my sum will contain terms like this: 1/ns, with every positive integer n represented exactly once. So I just proved the Golden Key by plain arithmetic, though I had to know the Fundamental Theorem of Arithmetic to do so. If you want to say that this makes the Golden Key an "analytic version" of the Fundamental Theorem of Arithmetic, I won't argue with you. Q (page 114). "Unfortunately, there is no ordinary household function that can be used to express the integral of 1 / log t." Hey, if the integral of 1 / x is log x, why isn't the integral of 1 / log t equal to 1 / log(log t)) ? A. It is! Flip back to page 112, second paragraph, where I say: "...read as 'the integral of x to the minus fourth power, with respect to x, from 2 to 3.' (Don't worry too much about that 'with respect to x.' Its purpose is to declare x as the main variable we are working with...)" Well, here you bump up against the importance of that "with respect to x." The integral of 1 / x with respect to x is log x. The integral of 1 / log t with respect to log t is, indeed, 1 / log(log t)). That's why I said: "It is!" (I should more properly have said: "It might be!"... but allow me a little license for propaganda purposes.) The integral of 1 / log t with respect to t, however, is Li(t). It matters what you are integrating with respect to. Q (page 125). The Selberg-Erdős controversy. A. I intended, and thought I made it clear in my book, that I was going to keep this whole business firmly at arm's length, and not get involved. This is a topic that still excites passions. My only contact with it when writing the book, other than having read (and reviewed) the two biographies of Erdős, was a conversation with Atle Selberg in 2002. Selberg was still, 53 years after the event, plainly upset about it. Since Prime Obsession came out, I have had a number of letters and e-mails about this section. One correspondent took my account of it to be "scathing sarcasm," a description that baffles me. Anyone who thinks that the middle paragraph on page 125 is "scathing sarcasm" just doesn't get out much. I never intended it to be sarcasm of any kind, and remain resolutely neutral on the dispute. My own neutrality, however, need not hinder me in reporting the following simple fact: that most of my correspondents on this matter side with Selberg, notwithstanding the fact that Erdős was plainly liked and admired by just about everybody. The following, by way of example, is from Professor Emeritus Erik Hemmingsen of Syracuse University, and is reproduced with his permission. Prof. Hemmingsen first points out that while Selberg was indeed at the Institute for Advanced Study when the paper came out, he was actually at Syracuse while doing the work. Prof. Hemmingsen continues: Selberg was a visitor at the Institute during the academic year 1947-1948, when he came to the attention of one of my colleagues who was also a visitor that year. Syracuse University offered Selberg his first academic job in America, and he and his wife arrived in Syracuse just before the beginning of the Fall 1948 semester. They returned to Princeton in the summer of 1950. When I arrived at Syracuse in September 1947, Erdős was already there. He was an old friend from the Univ. of Pennsylvania who had a position there when I came as a graduate assistant in 1941. We were both there for several years, and he was very kind to me. Selberg was naturally pleased when he found his way to a proof of the Prime Number Theorem. The reasonable person to talk to about this work was a kindly older colleague who had a serious interest in number theory. Unfortunately this was a great mistake, so bad [sic] that there are people who credit Erdős with the proof. After Erdős died, there was an article in the Notices of the AMS [i.e. the American Mathematical Society] in which the author went so far as to call the Prime Number Theorem Erdős's finest work. I was totally disgusted and sat myself to write my own impression of the facts. That account is now on deposit with the Syracuse University Mathematics Library. Q (page 147). Several readers wondered the following thing about the "functional equation" near the top of page 147. Why doesn't it give Zeta(s) = 0 for positive even whole numbers, as well as negative ones? Suppose s is –5, for example. Then the left-hand side of the functional equation is Zeta(6), while on the right-hand side I have a term, the sine term, that is zero. So why isn't Zeta(6) equal to zero? A. Because of the factorial term on the right-hand side, that's why. As Figure 9-11 shows, the factorial function is "infinite" (that is, it has no defined value) for negative whole numbers. Well, if s is –5, you are multiplying the sine term, which is indeed zero, by (–6)!, which is "infinite." That's why I said, a bit further down the page, "the negative integers create problems..." Those problems can be resolved, but at the level I'm working with here, it is sufficient to assume that when multiplying zero by "infinity," you''' get an actual number, one that is not zero. Q (page 149). "The integral of 1/(1-x) is -log(1-x)." Not according to Mathematica, which gives -log(-1+x). A. Ah, the perils of Mathematica! Actually, I and Mathematica are both right, though my expression makes much more sense in the context, which has x in the range from -1 to +1. Mathematica, of course, cannot know that context. The thing to remember here is that the derivative of any constant number is zero. So the derivative of x3 is 3x2; but so is the derivative of x3+1, x3+2, x3+3,...x3+anything! This makes sense if you think graphically; all those functions have the same graph, but the graph is just shifted up or down vertically... which does not affect the slope of the graph at any argument x. Contrariwise, when we say that the integral of f(x) is g(x), we should really say (but we hardly ever bother) that the integral of f(x) is g(x) +C, for any constant number C. OK, now rewrite my statement: "the integral of 1/(1-x) is -log(1-x)" like this: "the integral of 1/(1-x) is -log(1-x)+C." Since C can be any arbitrary constant at all, put it equal to -log(-1). By the basic rule of logs, i.e. that log(A) + log(B) = log(AB), you now get the Mathematica form (because -1 times 1-x equals -1+x). I repeat, though, the Mathematica form is totally unsuitable for the context, in which -log(-1+x) is a complex number! (As, of course, is -log(-1), whose value is -pi*i. This "cancels out" the complex numbers Mathematica gives you, so you end up with the actual real numbers I'm actually working with here.) Q (page 172). "The number 0.12345678910111213141516171819202... has a clear pattern, and I could tell you in advance what the hundredth digit is, or the millionth, or the trillionth. (Wanna bet? They are 5, 1, and 1, respectively.)" How did you do that? A. By brute force. Let's take the trillionth digit as an example. It's not to hard to figure out the following scheme:
So the trillionth position is some digit of an 11-digit number. Which number? Which digit? Well, the first 11-digit number, which is 10000000000, begins at position 98,888,888,890. Subsequent to that, the 11-digit number 10000000000 + k will begin at position 98,888,888,890 + (11 times k). Now a trillion works out to 98,888,888,890 + (11 times 81,919,191,919) + 1. Therefore, positions 999,999,999,999 to 1,000,000,000,009 of the decimal contain "91919191919" (10000000000 + 81919191919, see?) It follows that the trillionth digit is "1." (This is what an amateur-mathematician friend of mine refers to as "breaking rocks." If there is some neat way to get this result, I'd like to know about it.) Q (page 173). "...any infinite sequence of numbers in R has a limit in R..." Say what? A. That "infinite" should of course read "convergent." Q (pages 213 and 218). How did you create Figures 13.6 and 13.7? A. The only way that came to mind (if you can think of a better one, please let me know!) was to generate a contour map of the following real-valued function of x and y: Re[Zeta[x+y*I]]*Im[Zeta[x+y*I]]. If you do a Mathematica ContourPlot, cramming down the vertical PlotRange to show only values very close to zero (I used PlotRange->{-0.000001,0.000001}), you get a diagram that, with a little hand-trimming, looks like Figure 13.6 or Figure 13.7. Most of my diagrams, by the way, needed some hand-finishing, which was done either by myself or by the patient graphics people at Joseph Henry Press. Q (page 217). In Chapter 13.vii you say that you have three copies of Harold Edwards's book Riemann's Zeta Function, but add: "that's a long story." So... what's the story? A. When I was working up the proposal for Prime Obsession, I started reading anything I could find about the Hypothesis, and asking questions of anyone I thought might know something. Time and again I read, or was told: "You simply must get a copy of Harold Edwards's book." So I decided to get a copy. It was long out of print; there had just been the one edition, in 1974. I went on the secondhand book websites. None of them had a copy, but I posted requests to be notified if one turned up. Then I just asked anyone I could think of if they had a copy they could sell me. I asked Harold himself; he was sorry, but he had no spare copies left. I asked his publisher; I asked everybody. No-one had a copy. In frustration, I went to one of the local universities (Hofstra, I think it was) and persuaded them to let me take out a library copy. Using my desktop scanner at home, I copied the whole book. It took me all day, but at last I had a copy of Riemann's Zeta Function, if only in loose-leaf. I returned the university copy to the library. No sooner had I got through doing that, of course, than one of those second hand book websites came through with a copy. I bought it; so now I had a copy in loose leaf, and an actual hardcover book. At the Courant conference in 2002, I met Harold Edwards. He told me that all the enquiries I made about getting a copy of his book had caught the interest of his publisher somehow, and they had decided to bring out a new edition. They had duly done so in 2001, a very nice paperback edition. Would I like one? So I got a copy of the 2001 paperback edition, inscribed by Harold.
Q
(page 266). You introduce the concept of "field" and
tell us that a field is formed "if the elements can be added,
subtracted, multiplied and divided, according to the ordinary rules of
arithmetic... The results of
all these operations must stay within the field."
So far, so good. Then
comes "the simplest of all fields, with only two elements, 0 and
1." In order for the results to "stay within the field"
1+1=0 and 0-1=1. How is this
consistent with the "ordinary rules of arithmetic"? A. Perhaps I should have spelled out the actual "ordinary
rules of arithmetic" I was referring to.
They are as follows.
a
+ 0 = a for all numbers a
a
x 1 = a for all numbers a
a
+ -a = 0
a
x 1/a = 1
a
+ b = b + a and a
x b = b x a
a
+ (b + c) = (a + b) + c and a
x (b x c) = (a x b) x c and a
x (b + c) = (a x b) + (a x c) All of those are true for the 0, 1 field. For example: 1 x (0 + 1) = (1 x 0) + (1 x 1). Both are equal to 1. The oddity in this field is that "-1" is actually equal to 1. But nothing in the above rules forbids that! Q (page 319). You lost me with p-adic numbers. Can you give a fuller explanation? A. Well... it's an advanced topic, and you should really read it up or google it. Here is the briefest, simplest possible explanation I can think of. First, pick a prime number, to make things clearer. I'll take p = 5. Now remember from page 97 that there is a "clock arithmetic" for any number bigger than 1. On a 12-hour clock, for example, the hours are numbered from 0 to 11. If you add 9 to 7 on this clock (i.e. if you ask the question: "What time will it be 9 hours on from 7 o'clock?") you get 4. Now write out the powers of 5: 51, 52, 53, 54, 55, 56,... These work out to 5, 25, 125, 625, 3125, 15625,... Now, giving one line to each, write out, first, all the hours on a 5-hour clock-face; then all the hours on a 25-hour clock-face; then all the hours on a 125-hour clock-face; and so on. Like this: 0, 1, 2, 3, 4 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,..., ..., 20, 21, 22, 23, 24 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,..., ..., 121, 122, 123, 124 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,..., ..., 621, 622, 623, 624 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,..., ..., 3121, 3122, 3123, 3124 ...... ...... (The list goes on for ever.) Now pick a number at random from that first row. I'll pick 3. Now pick a matching number at random from the second row. By "matching," I mean this: the number has to leave remainder 3 after division by 5. So you can pick from 3, 8, 13, 18, and 23. I'll pick 8. Now pick a matching number at random from the third row. By "matching," I mean this: the number has to leave remainder 8 after division by 25. So you can pick from 8, 33, 58, 83, and 108. I'll pick 58. Now pick a matching number at random from the fourth row. By "matching," I mean this: the number has to leave remainder 58 after division by 125. So you can pick from 58, 183, 308, 433, and 558. I'll pick 183. Now pick a matching number at random from the fifth row. By "matching," I mean this: the number has to leave remainder 183 after division by 625. So you can pick from 183, 808, 1433, 2058, and 2683. I'll pick 2683. Keep going... for ever. You now have a sequence of numbers, which I'll pack together into a neat parenthesis and call x. Here it is: x = (3, 8, 58, 183, 2683,...) That is an example of a 5-adic integer. Note "integer," not "number." I'll get to "number" in a minute. (Obviously you could do this for any prime p. I'm just using 5 as an example.) Here's another 5-adic integer: y = (0, 20, 95, 95, 1970,...) Now, you can add these numbers together: x + y = (3, 3, 28, 278, 1528,...) See how I did that? I just added according to the appropriate clock arithmetic in each position: 5-hour in the first position, 25-hour in the second, and so on. Notice that the result is another 5-adic integer, with each number in the correct relation to the number on its left. You can not only add: you can subtract and multiply, too. You can't necessarily divide, though. This sounds a bit like the regular integers, which you can likewise add, subtract, and multiply, but not always divide. (12 ÷ 7 = ?). It is what mathematicians call a ring. This is the ring of 5-adic integers. Its usual symbol is Z5. (That should be the "hollow Z" that I introduced in Chapter 11, but I don't have it in my symbol set here.) How many 5-adic integers are there? Well, in populating that first position you had a choice of five. Same for the second position, same for the third,... So the number of possibilities altogether is 5∞. What other set has this number? Well, consider all the real numbers between 0 and 1, and represent them in base 5 instead of the usual base 10. Here is the fraction 1/pi, for example, written in base 5: 0.1243432434442342413234230322004230103420024... Obviously, each position of this "decimal" (I suppose that should be something like "quinquimal") can be populated in five ways. Just like our 5-adic integers! So the number of 5-adic integers is the same as the number of real numbers between 0 and 1. This is an interesting thing. Here we have created a ring of numbers that behave like the integers; yet they are as numerous as the real numbers! (The ordinary integers are not that numerous. Google on "transfinite numbers" if you don't know about that.) And in fact, there is a way to define the "distance" between two 5-adic integers in a natural way; and it turns out that two 5-adic integers can be as close together as you please —entirely unlike the ordinary integers, two of which can never be separated by less than 1. So these 5-adic integers are kind of like the integers, and kind of like real numbers. Now, just as with the ordinary integers Z, you can go ahead and define a "fraction field" Q — the rational numbers — so with Z5 there is a way to define a fraction field Q5 in which you can not only add, subtract and multiply, but also divide. That is the field of 5-adic numbers. Q5, like Z5, has a foot in each camp. In some respects it behaves like Q, the rational numbers; in others, it behaves like R, the real numbers. Like R, for instance, but unlike Q (or Z5) it is complete (see Prime Obsession, p.173). That's about as much as I can say about p-adic numbers in a way that's elementary. I hope you can see, at least, that they are curious and interesting little creatures. Now, just as, starting from ordinary "clock" arithmetic, I went up one level of abstraction and got these peculiar p-adic numbers; so, by going up a further level of abstraction, starting from p-adic numbers, you can get adeles. I'm not going to attempt it here, but you can read up the topic in plenty of textbooks and websites. [Note that there are a number of different ways to approach p-adic numbers. If you go and read about them in a textbook or on a website, the author might take an utterly different approach to this one. Trust me, the destination is the same!] Q (page 333). What's that entire function you talk about in Chapter 21.iii? What does it look like? A. Like this: Xi(s) = (s / 2)! * pi(-s / 2) * (s - 1) * Zeta(s). Note the right-hand side has four parts. If you flip it round to show Zeta(s) in terms of Xi(s), that has four parts, too, obviously. This is why Expression 21.1 has four parts. Xi is an entire function. Its zeros are precisely the non-trivial zeros of the zeta function; that follows easily from the definition. And it can be written in terms of those zeros, as Xi(0) times all possible (1 - s / rho), for all the zeros "rho." Full details are in Chapter 1 of Harold Edwards's book on the zeta function, which I can't recommend highly enough. Q (page 344). Were the "Secondary term" figures in Table 21-1 actually calculated by summing Li(1000000^rho)? A. No. Trusting Riemann's formula (which has, after all, been proved true) I reverse-engineered them. However, I spot-checked by summing across the first 100,000 of Andrew Odlyzko's zeros (which are on his website) and eyeballing the sums to 10,000, 20,000, 30,000,..., 100,000, to satisfy myself that convergence was what I expected. The problem here, as I point out on page 342, is that convergence is "harmonic." That is, it is logarithmically slow, like the convergence of Expression 9-4. Even summing to 100,000 terms does not necessarily get you four decimal places of accuracy. I trusted Riemann and von Mangoldt (who proved Riemann's formula), and checked only to make sure I had not misunderstood them. Q (page 349). I'm not clear about the role of the Hypothesis in this chapter. Does the formula for J(x) (Expression 21-1) depend on the Hypothesis being true? A. No. To my mind, this is a serious weakness in my exposition, and I wish I had made it more clear. Riemann's formula (Expression 21-1 on page 328) does not depend on the truth of the Hypothesis. The formula is absolutely, indisputably true — Von Mangoldt proved that. There is a proof in Harold Edwards' book Riemann's Zeta Function. That formula is the main result of Riemann's 1859 paper. It was that formula he was heading for through the paper. As he said, and as I quoted (twice: on page x in the Prologue, and again on page 151 in Chapter 10), he "put aside the search for such a proof ... because it is not necessary for the immediate objective of my investigation." That "immediate objective" was my Expression 21-1. It does not need the RH. The great importance of Riemann's formula is that it uncovers the intimate link between (a) the distribution of the primes, and (b) the zeros of the zeta function. It tells us: If you want to know about the distribution of the primes, all the facts are buried away there in the zeros of the zeta function. If you then start to ask questions about the actual nature of the distribution — especially "big oh" questions about the size of the error term in the Prime Number Theorem Pi(N) ~ Li(N) — then the RH comes into play, since it lets you get the most precise possible results about the distribution of the primes. If the RH is not true, then the best results you can get will be less precise. That formula, however, is true in either case. Q (page 366, Endnote 10). Where did you get that? Can you prove it? A. I've been carrying it round in my head since college days. I suppose one of my instructors must have mentioned it. Here is a proof, which you can find in full in the London Mathematical Gazette, Vol. 74, No. 768 (June 1990), pp. 167-9, "Another surprising appearance of e," by Nick Mackinnon and 5Ma (? no idea) of Winchester College. Since I heard it in my college days (way before 1990), this can't be the first appearance of this result. There must be an earlier one. Anyway, here's a proof. To get started, ask: What's the probability that the sum of just two random numbers exceeds 1? Well, it's the probability that a point in the unit square, whose corners are at (0,0), (0,1), (1,0), and (1,1), satisfied the inequality x + y >= 1. This is of course 1/2; just draw the diagonal from (0,1) to (1,0), and it's the probability that (x,y) is in the upper right triangle. If you now go to three random numbers, the chance that their sum exceeds 1 is got by "cutting a slice" across the unit cube through the corners (1,0,0), (0,1,0), and (0,0,1). This slices off a pyramid, volume 1/6, and the probability x + y + z >= 1 is then 5/6. A little mild calculus, plus the principle of induction, easily shows that the super-pyramid sliced off an n-dimensional super-cube by a hyper-plane through its corners (1,0,0,0,...), (0,1,0,0,...), (0,0,1,0,...),... has volume 1/n! The probability that a total of 1 is exceeded after adding n random numbers is therefore 1 - 1/n! The probability that a total of 1 is exceeded after adding n random numbers but not before is therefore (1 - 1/n!) - (1 - 1/(n - 1)!), which simplifies to (n - 1) / n! The expected number of selections until a total of 1 is exceeded is therefore the sum, from n = 2 to infinity, of this: (n - 1) / n! times n That, it is easy to see, is e. You can also compute the variance, which turns out to be 3e - e2. Q (page 371, Endnote 28). "Take three positive whole numbers at random..." Huh? From what probability distribution? A. Yeah, yeah. There is no neat way to say this in the space of an endnote, for the readership I was aiming at. You can fill out the details. "Let N and K be large positive integers, and consider a uniform distribution across the range from N to N + K...." Q (page 390). What is this ExpIntegralEi function in Endnote 128? What's it got to do with the Li function?
Now, if you take the ordinary definition of Li(x) (which is: the integral from zero to x of 1 / log t) and do a couple of elementary substitutions (u = log t, then v = -u), you get Li(x) = Ei(log x).
In the complex realm, things are nastier. You need
to define both Li(z) and Ei(z) by analytic continuation (a
sort of analogy to the "domain stretching" I did in Chapter 9). It
turns out that in doing this you are presented with options as to how,
precisely, you want to define the function (analogous
— I'm sorry
In the complex realm, it is a fact about the Ei function that if you hold the real part of the argument fixed and let the imaginary part go off to infinity (in other words, if you compute Ei(a+bi) for some fixed a and ever more humongous values of b), the function value closes in on pi*i. Since log x goes off to infinity (albeit achingly slowly) when x does, it shouldn't be hard to believe that the same applies to Li(a+bi). Which is why those spirals in Chapter 21 close in on pi*i and minus pi*i. I don't have a textbook that gives a full account of the Li and Ei functions for complex arguments, but I shall try to find one and recommend it. (Or, if you know one, please email and tell me!) There is, incidentally, a very nice and fast-converging series expansion for Ei(t) when t is positive. Here it is: Ei(t) = γ + log t + t + t 2 / (2x2!) + t 3 / (3x3!) + t 4 / (4x4!) + ..., where γ is the Euler-Mascheroni constant (Endnote 19), equal to 0.577215664901532860606512... I don't know of any neat series for Li(t), though. Speaking very generally, log and the related functions are not series-friendly. Q. Numerous readers have asked me about Louis de Branges' supposed proof of the RH, which got some publicity in mid-2004. As a follow-up in the newspapers, there was some discussion about e-commerce being undermined if Louis' proof was sound. Here is my take on the whole business.
A. (1) I consider it highly unlikely that de
Branges has cracked the RH, and this opinion is widely shared. De Branges is
a bit of a joker & this looks like one of his old tricks. The only
reason mathematicians do not laugh out loud at him is that he did once,
after a similar record of crying "Wolf!" produce an actual wolf: his 1984
proof of the Bieberbach Conjecture (p. 383 of Prime Obsession).
However, Louis' paper is being scrutinized by mathematicians, and a decisive
verdict will come in in the next few months. |
|||||||||||||||||||||||||||