|
|
|||
| My National Review Online "Diary" column for June 2003 included a brainteaser about record rainfall years. Rainfall
Record Years In any given year, the
weather station in New York Citys Central Park observes a certain total
rainfall. Assume that one
years total rainfall is unrelated to any other years in
mathematical jargon, that total annual rainfall is an independent
random variable. Define a record year to be a year in which the rainfall
exceeds that of any preceding year for which measurements were kept.
Given that the Central Park measurements began in 1835, by which
date would you expect to have clocked up 20 record years?
(Clue: over the 160-year period up to 1994, there were six record
years.) Now consider the situation after two years of record-keeping. Either the second year surpassed the first, or else it didn't. Since these are random variables, the chance that it did is 50-50, i.e. one-half. Thus, after two years' record-keeping, we would have either one record year (probability 1/2), or two record years (probability 1/2). An equivalent way to say this is, that the "expected" number of record years is 1 + 1/2, or 1.5. You can also argue that second year this way: Since the chance that the second year will be best of the two is 50-50, there are just two equally probable possibilities, R-R and R-X, with "R" being a record year and "X" being a non-record year. The two equal probabilities include three record years, so the "expected" number of record years is 3/2. After three years of record-keeping, what have we got? There is one chance in three that the third year is the record for the three years. If the first two years were both records, we can think of the three configurations R-R-R, R-R-X, and R-R-X as being all equally probable. Same for the case where the second year wasn't a record: then the equally-probable configurations are R-X-R, R-X-X, and R-X-X. That's all the possibilities, six in all, with a total of eleven record years. "Expected" number of record years: 11/6, which is to say, 1 + 1/2 + 1/3. If you pursue this logic for a fourth year, you get an "expected" number of record years equal to 25/12, which is 1 + 1/2 + 1/3 + 1/4. So it goes: after N years of record-keeping, the "expected" number of record years will be 1 + 1/2 + 1/3 + 1/4 + 1/5 + ... + 1/N. Check: after 160 years of record keeping, we are told there were 6 record years. Well, 1 + 1/2 + 1/3 + 1/4 + ... + 1/160 is equal to 5.65551122493974187... A pretty good match between "expected" and "actual." Jargon. The sum 1 + 1/2 + 1/3 + 1/4 + 1/5 + ... + 1/N is important in math, important enough to have a name and a symbol of its own. The name is "the N-th harmonic number." The symbol is "HN." So now the problem resolves to this one: how big does N have to be before HN exceeds 20? Key fact. For large numbers N, an excellent approximation to HN is log N + gamma. Here "log" means the natural log, often written "ln," while "gamma" means Euler's constant, equal to 0.577215664901532860606512... So to get HN up above 20, I need log N + gamma to get above 20. In other words, I need log N to get above 19.422784335098467139393487... This happens around N = e19.422784335098467139393487..., which is 272,400,600.0594077768... In fact (I am going to cheat here, using Mathematica to get more precise results): H272400599 = 19.9999999979463783225916261697... H272400600 = 20.0000000016174421895581453330... Since record-keeping began in 1835, we can expect to have logged 20 record years by around A.D. 272,402,434. HN = log N + gamma + 1/2N - 1/12N2 + 1/120N4 - 1/252N6 + 1/240N8 -... Suppose, for example, you want to work out H10. As a matter of fact, H10 is 7381/2520, or as a decimal, 2.928968253968253... (the "968253" just keeps repeating for ever). Let's see what the formula gives us. log 10 = 2.3025850929940456840... log 10 + gamma = 2.8798007578955785446... log 10 + gamma + 1/20 = 2.9298007578955785446... log 10 + gamma + 1/20 - 1/1200 = 2.9289674245622452113... log 10 + gamma + 1/20 - 1/1200 + 1/1200000 = 2.9289682578955785446... log 10 + gamma + 1/20 - 1/1200 + 1/1200000 - 1/252000000 = 2.928968253927324576... log 10 + gamma + 1/20 - 1/1200 + 1/1200000 - 1/252000000 + 1/24000000000 = 2.9289682539689912430... You can see the convergence. For a low value like N = 10, there is no advantage to using the formula instead of just adding up fractions. For big N, however, the advantage soon becomes overwhelming. (The k-th term of the infinite series, counting 1/2N as the first, is - Bk / (kNk), where Bk is the k-th Bernoulli number. I shall explain about Bernoulli numbers some other time...) |
||||