No title

Letters to the Editor The Mathematical Intelligencer encourages comments about the material in this issue. Letters to t...

Author: M.L. Senechal | C. Davis (Editors in Chief)

111 downloads 365 Views 7MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form

DOWNLOAD PDF

Letters to the Editor

The Mathematical Intelligencer encourages comments about the material in this issue. Letters to the editor should be sent to the editor-in-chief, Chandler Davis.

But, then, other properties of life are

A Computer Scientist's View of Evolution

equally strange. Computer scientists

Granville Sewell's article (vol. 22 (2000),

find it hard to believe that a moderate

no. 4, 5-7) attempts to show difficulties

number of very slow components, neu

in evolutionary theory that are missed

rons, can be combined into a comput

by biologists. I have several reactions. 1. Philosophers

and

mathemati

cians (e.g., Brouwer and the intuition ists)

have

long

discussed

the

ing device that is able to perform pat tern

recognition,

understanding

of

logical natural

reasoning, language,

gap

etc.-and moreover can tolerate dam

between finite and infinite. Now com

age to a significant fraction of its neu

puting practice and computer science

rons.

have showed that another gap is of

3. Sewell ends by arguing that the

philosophical importance, namely, the

second law of thermodynamics is vio

gap between feasible and infeasible

lated by the development of life. Surely

"The U niverse is not o n ly stranger than we i mag i n e - it is stranger than we are capable of imag i n i ng." -J.B.S. H aldane polynomial

more careful wording is needed. Strange

A biologist in former times could

will reorganize the basic particles of Na

finite integers:

between

and exponential growth.

it may be that "basic forces of Nature

consider the number of molecules and

ture into libraries full of encyclopedias."

the number of years involved in evolu

We don't know any dynamical system,

tion as almost infinite. This is now

random or deterministic, that exhibits

clearly seen as wrong. These numbers

similar behavior. But there is no mathe

are negligible compared to 2n, where n

matical theorem (or clear theory) called

is the number of bits that might be al

"the second law of thermodynamics"

tered in a substantial mutation.

that prohibits it.

2. The analogy between the genetic code and code of a program sounds

A. Shen

convincing. However, from a computer

Institute for Problems of Information

scientist's point of view, the genetic code has very strange properties. As Sewell says,

if you mix different parts

Transmission Ermolovoi 1 9 K-51 Moscow GSP-4, 1 0 1 447

of a PDE-solver code, in superficial

Russia

analogy to mixing parents' genes, you

e-mail: [email protected]

will get something non-functional. We apparently must suppose that a small

How Anti-Evolutionists

mutation sometimes changes the per

Abuse Mathematics

formance of the genetic program sig

The Reverend William A Williams was

nificantly yet in self-consistent ways. If

not one of Darwin's bigger fans. In [21]

this seems miraculous, so do other

he wrote

properties of the genetic mechanism. Sewell is at pains to show that evolu

The evolution theory, especially as

tion is unbelievable; to me, its opera

applied to man, likewise is dis

tion-and for that matter its formation

proved by mathematics. The proof

from simple chemical processes-is

is overwhelming and decisive. Thus

even more unbelievable than he says.

God makes the noble science of

© 2001 SPRINGER-VERLAG NEW YORK. VOLUME 23, NUMBER 4, 2001

3

mathematics bear testimony in favor of the true theo ries and against the false theories. Needless to say, this will come as news to most biologists. The Reverend, writing in 1925, relied heavily on the au thority of the Bible in making his arguments. That same year saw biology teacher John Scopes hauled into a Ten nessee courtroom, charged with teaching scientific theo 1 ries that were in conflict with scripture. Modem critics of Darwinism take a more subtle approach, preferring to cloak their dubious religious arguments in the raiment of science. They call themselves Intelligent-Design Theorists (IDTs), the term "creationist" being now somewhat disreputable. Granville Sewell of the University of Texas at El Paso is one representative of this movement. In [19] he opined, bas ing himself on Michael Behe [ 1], "I believe there are two central arguments against Darwinism, and both seem to be most readily appreciated by those in more mathematical sci ences." The two arguments were that natural selection is not capable of building complex organisms, and that Darwinism is in conflict with the second law of thermodynamics. In making these arguments he simply ignored the vast litera ture addressing both subjects, so as to give the impression that logical fallacies obvious to you or me have somehow eluded our benighted colleagues in the life sciences.It is an arrogance typical of the ID movement; armchair philoso phers believing they can refute in a day what thousands of scientists have built over the course of a century. ID theorists offer a wide array of arguments in defense of their position, some of them explicitly mathematical.I will consider some of these arguments here. The hemoglobin in our blood is comprised of 574 amino acids arranged in a precise sequence.Any major deviation from this sequence leads to a nonfunctional molecule. We also note that there are twenty sorts of amino acids used by living organisms. Is it plausible that a mechanism based on chance, as Dar winism plainly is, could have produced hemoglobin? Mathematician David Foster doesn't think so.In [7] he offers the following: The basic argument from improbability:

The specificity of hemoglobin is described by the im probability of the specific amino acid sequence occur ring by random chance. Such specificity is capable of exact calculation in the permutation formula:

P=

N! -----

nr!n2! ...n2o!

...In the case of hemoglobin, and substituting in the above formula the specific numerical value of the solu tion, P 10654• =

Of course, N denotes the total number of amino acids in the sequence; while ni denotes the number of occurrences of the i-th amino acid. Hemoglobin is a dummy variable in this argument, any other complex organic molecule or system would have worked just as well. The logic is always the same: the n parts of the complex system are identified as the points of a probability space.This space is then equipped with the uniform distribution. The origin of the system is modeled as the event of choosing the appropriate nt- uple out of this space. If the system in question is at all complex, the prob ability of this event will invariably prove to be too small to be worth bothering with. This argument is a mainstay of creationist literature; it has been applied to DNA, the hu man eye, and the origin of life in [7], [12], and [14], re spectively, among many others.I will refer to it as the Ba sic Argument from Improbability (BAI). David Foster [7] is confused on many points (one of them being the difference between a permutation and a combination), but the most important error is the portrayal of Darwinism as fundamentally a theory of chance. Dar winism, as described in [9], has three components: 1. Organisms produce more offspring than can possibly survive. 2. Organisms vary, and these variations are at least partly heritable by their offspring. 3. On average, offspring that vary most strongly in direc tions favored by the environment will survive and prop agate. Favorable variation will therefore accumulate in populations. Part one is a simple empirical fact.Part two is the realm of chance; the genetic variations exhibited by an organism are random with respect to the needs of that organism.But part three is the antithesis of chance. Natural selection is a lawlike process.It is this aspect of Darwinism that gets left out of the BAl. Foster's argument assumes that evolution proceeds by "single-step selection." But if the preliminary stages of a complex system are preserved by selection, then com plexity can be explained as the end result of a step-by-step 2 process. Improving the BAI: Perhaps we could develop a more sophisticated probabilistic model of evolution. For example, Darwinism can be viewed as a Markov chain.The states of the chain are the genotypes3 of the organisms that have existed throughout history; the transition probabilities . are the chances of an organism with genotype E1 leaving offspring with genotype E2. Denote by C§ the set of all genotypes. Defme a function J.t: C§ X C§ __,. [0, 1] which denotes the degree of difference between two genotypes, say t:1 and t:2.

1 The question of whether Darwinism is genuinely in conflict with the Bible was not addressed at the trial. 2Popular-level treatments of the power of cumulative selection versus single-step selection, and Darwinian explanations of complexity can be found in the books by Dawkins [4] and [5]. 3The genotype of an organism is the sum total of its genes.

4

THE MATHEMATICAL INTELLIGENCER

If /.L = 0 then t:1 = E2. If /.L = 1 then E 1 and E2 share no genes. Let the random variable �(t) represent the state of the sys tem in time t. A central tenet of Darwinism asserts that the relevant genetic variations between parent and child are small relative to the size of the genome, so Prob{W + 1)

=

E J I �(t)

=

Ek} � 0 as p,(E J, Ek) � 1.

Let us take �(0) = Eo as representing the genotype of some ancient organism, one that is simple relative to the complexity we see today. The evolutionary path followed by the descendants of this organism trace out a path through our Markov chain, �(0)

�

�(1) � �(2) �

.

.

.

.

Given our present understanding of genetics, we can say that the future states of the random variable � are inde pendent of its past states, the hallmark of a Markov process. We need one more ingredient to transform our Markov chain into a model for Darwinism.Let f: C§ � IR associate to each genotype its fitness.4 Now let E be a genotype con taining a system composed of the parts p1, P 2 , ... , Pk; we will write E = {pi, . . . , Pkl· The state E is a descendant of the state Eo, which we assume did not contain the Pi· For selection to preserve the parts of the system as they ap peared, we must have the following: f(E)

> f({pl, P2,

·

·

·

,

Pk- I}) > · · · > f({pi}) > f(Eo)

The addition of each part must increase the fitness of the genotype. Further, we can assert thatfsatisfies some sort of additivity law, since each part of the system can be viewed as increasing the fitness of the system.Say: We can say that a particular state E is accessible to a Dar winian mechanism if there is a path through our chain on whichfsatisfies the above conditions. This line of argument is pursued by David Berlinski in [3]. So far it is simply a mathematical framework within which to model Darwinian explanations of complexity. The alleged refutation of Darwinism arises from the following definition: 1. A system {p1, p2, ..., Pnl is irreducibly com hereafter denoted IC, iff(E) = 0 for aU E E C§ such that Pi E E and PJ ti E for some 1 :=::; i, j :=::; n. If E is a state containing such an irreducibly complex system, then we wiU say that E is irreducibly complex. DEF1NITION

plex,

THEOREM 1. If t: is irreducibly complex then it is not ac cessible to a Darwinian mechanism.

Do IC systems exist in nature? Well, Berlinski's definition of IC is a mathematization of a definition given by biochemist Michael Behe in [1]. Behe defined a system as IC if it involves several parts working together to perform some function,

such that the removal of any part from the system results in the nonfunctionality of the machine. Examples of such sys tems are the human blood clotting cascade5, or the flagellae used for locomotion by some bacteria Thus, by taking p1, . . . , Pn to be the various parts of, say, the blood-clotting cascade, we have our example of a system satisfying Berlinski's defmition of IC.It follows that the natural world is replete with systems inaccessible to Darwinian pathways. It's an impressive argument, but wrong for at least three reasons. In [3] Berlinski claims that his definition of IC entails Behe's, but this is not correct. A system is IC in Behe's sense if the removal of one part of the system results in the non functionality of the system. It is IC in Berlinski's sense if the organism can derive no benefit from possessing only one part of the multipart system.These are plainly not the same. There are at least two sorts of explanation for how the in dividual pieces of an IC system can benefit an organism, even without the other parts of the system in place: 1. They might perform the same function in isolation as they do in the fmished system, but not as well.This mode of explanation is used by Miller in [17], in the case of the clotting cascade, and by Dawkins in [5] in the case of the vertebrate eye. 2. They might initially have performed a different function but have been later coopted for their present purpose. In [11] paleontologists Stephen Jay Gould and Elisabeth Vrba coined the term "exaptation" to describe this phe nomenon. Two examples are the evolution of the three bones in our inner ear from homologous bones in the reptilian jaw6 as described in [9], and the origin of the Krebs cycle7 as described in [16]. In 1996 Behe [1] made the audacious claim that the tech nical literature on evolution is silent with regard to the for mation of irreducibly complex systems. This charge was shamelessly repeated by Sewell in 2000 [19]-though Ken neth Miller [17] had meanwhile cited numerous examples from the technical literature to show this to be false. The point is that Berlinski's definition of IC is far more re strictive than Behe's. Thus, systems that are IC in Behe's sense are known to exist but are not inaccessible to Dar winian mechanisms. Systems that are IC in Berlinski's sense are inaccessible to Darwinian mechanisms, but are not known to exist. This is the most serious flaw in Berlinski's model, but there are two others worth mentioning. The first is that notions of irreducible complexity treat the parts of a complex system as if they are discrete entities that either exist in their complete, perfected glory, or do not exist at all. This is not realistic.The parts of a complex system become gradually differentiated

4The fitness of a genotype depends partly on the environment in which that genotype finds itself, but that is ignored for the moment. 5The details of the clotting cascade and a detailed discussion of its evolution can be found in [17]. This fine book contains a chapter refuting Behe's arguments. 6There is an extraordinary series of fossils documenting this change. 7This refers to the series of chemical reactions that releases energy from food.

VOLUME 23, NUMBER 4, 2001

5

over the course of many generations. Therefore, asking what happens to a system when one of its parts is summarily re moved is a question of little evolutionary importance. Finally, Berlinski's argument given here is one of a class of arguments based on the proposition that "genotype space" is too vast to be searched effectively by natural se lection acting on chance variations. Complex organisms represent islands of functionality in a sea of nonfunctional genotypes, you see. This brings us to the second difficulty with Berlinski's framework. His insistence that the fitness functionjbe properly increasing on any sequence of adja cent states in a Darwinian pathway ignores the possibility that mutations can be neutral. In other words, we might havej(Ej) j(EJ+t) for somej. The overwhelming majority of mutations are neutral in this sense. This vastly increases the number of genotypes that are accessible to Darwinian pathways. Two examples of the importance of neutral mu tations in molecular evolution are given by (6] and [15].8 =

Sewell also argued that Darwinism runs afoul of the laws of thermodynamics. Evolution requires a decrease in entropy over time, whereas a cherished principle of physics says that is impossible. Since Sewell recognizes that the second law applies only to closed systems (which the Earth is not), it is difficult to understand the difficulty. His claim that "natural forces do not cause extremely improbable things to happen" is pure gibberish. Does Sewell invoke supernatural forces to explain the winning numbers in last night's lottery? The fact is that natural forces routinely lead to local de creases in entropy. Water freezes into ice and fertilized eggs turn into babies. Plants use sunlight to convert carbon diox ide and water into sugar and oxygen, but Sewell does not invoke divine intervention to explain the process. Certainly the question of how the input of energy into the environ ment of the early Earth led to the creation of all that we see around us is a fascinating and important one. That ex plains the large number of scholarly articles published on the subject every year. But thermodynamics offers nothing to dampen our confidence in Darwinism. Thermodynamics:

introduction to population genetics: The ability of natural selection to craft complex adaptations out of chance variations is contingent upon two assumptions:

An

1. Beneficial mutations occur with sufficient frequency.

2.

A beneficial mutation, once it occurs in an individual, will spread through the population.

Biologists have developed mathematical models to aid in ad dressing these points. The subdiscipline of biology devoted to analyzing such models is called population genetics. I begin with a very simple model. Our genes are found in long strings, called chromosomes, in the nuclei of our cells. Typically we imagine a chromosome divided into individual

regions called loci. The bit of DNA found at a particular lo cus is referred to as an allele. Let us consider a single locus which, in each individual in the population, contains one of two alleles. Denote these alleles by A1 and A2•9 Assume that the species in question reproduces sexu ally and that the offspring inherit two copies of each gene, one from each parent. Then members of the population will either possess two copies of the A 1 allele, two copies of the A2 allele, or one copy of each. I will refer to these three cases as genotypes A1A�o A�2, and A1A2, respectively. Let us further assume that the A1 allele appears with frequency p in the population, and A2 appears with frequency q = 1 p. We can think of p and q as representing the probability that a randomly chosen allele is A1 or A2, respectively. 2. (Hardy-Weinberg) Let A1, A 2, p, and q be as above, and assume that the population mates randomly with respect to this allele. Then in the next generation the genotypes A1A1o A1A 2 , and A�z will appear with fre quencies p2, 2pq, and q2, respectively. THEOREM

Of course, this theorem is elementary. Given the sim plicity of the model, it is surprising that the Hardy-Wein berg law has proven invaluable in explaining observed data in wild populations. Next we try to quantify the effect of selection on the fre quencies of the alleles A1 and A 2. Imagine that the three possible genotypes initially appear with the frequencies de termined by the Hardy-Weinberg law. Then the extent to which a particular allele is represented in the next genera tion is proportional to its representation in the current gen eration and the probability that an individual possessing that allele survives long enough to reproduce. Let us de note the constant of proportionality by w. This constant is often referred to as the mean fitness of the population. Denote by Wij, with i, j E (1, 2}, the probability that an individual of genotype AiAJ survives to reproduce. If we now let f(A0J ) denote the frequency of genotype AiAJ in the next generation, we find

f(A 1A 1) =

p2wu {J)

'

f(A 1A2) =

2pqw 1 2 :¥>2) ' f(A -.1 {J)

=

q2W22 {J)

0

Since the sum of the three frequencies should be 1, set

w

=

p2wu + 2pqw 1 2

+

q2ltJ2z.

Let us denote by p' the frequency of the new generation. Then we can say

A1

allele in the

(Note that each A1A2 individual posseses only one copy of the A1 allele). So what can we say about the change in frequency of the A1 allele as time passes? One further calculation yields

8Berlinski presses his argument further by introducing ideas from the theories of finite-state automata and linguistics, but these arguments are no better than the ones considered here. "The following mathematical arguments are drawn from the excellent text by Gillespie [8]

6


!:lp = p'- p P2 wu + p qw 1 2 - p w w

4.Let ProbfixCP) denote the probability that the allele Ah appearing with an initial frequency of p, be comes fixed in a population of size N. Then THEOREM

=

- pq [p(wl l - WJz) + q(w 1 22 - W22)] p2 w1 1 + Zpqw12 + q W22

ProbfixCP)

_

The quantity pq is referred to as the genetic variation of the population. It is maximized when p Suppose now that the

A1

=

q = t.

allele confers a selective ad

vantage on the individuals that possess it. Specifically, as

wu > w12 > W2z. In that case we see that !:lp > 0, A1 will tend to increase in succeeding generations. By contrast, if A1 is at a selective disadvantage, so that w 1 1 < w1 2 < W2z , then we have !:lp < 0 and the frequency of A2 will tend to decrease. This ob sume that

indicating that the frequency of

servation can be expressed more succinctly in the equation !:l

P

=

pq 2w

( dw)' dp

and in words in the following theorem: THEOREM 3. (Fundamental Theorem of Natural Selection) Natural selection always increases the mean fitness of the population, and does so at a rate proportional to the ge netic variation.

A victory for evolution, right? Beneficial mutations will

=

- e-2Nsp , 1 - e-2Ns

1

where s denotes the selective advantage conferred by the allele A1. If we assume that A1 initially appears in a single individ �. Since s is assumed to be small, ual, then we will have p we can say e-s 1- s. If we then assume that N 0, we conclude that is large enough so that e-2Ns s. So most beneficial mutations are lost without Probfix(p) =

=

=

=

ever having a chance to become fixed in the population. Hoyle concludes from this that it is effectively impossible to string together a large number of beneficial mutations. Fred Hoyle is no kind of creationist.He doubts neither the truth of evolution nor the existence of a fully naturalistic ex planation for it. Indeed, he offers a rather imaginative alter native to Neo-Darwinism based on the premise that the Earth is periodically bombarded with storms of genetic material from outer space. For a brief discussion of why biologists are generally skeptical of this possibility, see

[18]

and [20].

Hoyle's argument is wrong for many reasons, the most fun damental being the absurdity of extrapolating to geologic time a mathematical model that is reliable only for short-term data. The dynamics of gene frequencies in wild populations

tend to become fixed in the population, and over long pe

are governed by so many variables that a mathematical model

riods will accumulate to produce complex adaptations.

for describing them in the long term is impossible. For ex

Not so fast. Randomness also has a role to play in the

ample, the selective value of a particular allele changes with

change of gene frequencies over time. For example, sup

the environment.The population size, and therefore the fre

pose a single individual in a population has a beneficial mu

quency of a particular allele within it, changes as subpopu

tation. The probability is only one-half that any particular

lations of animals migrate away from the ancestral stock An

child born to that individual will inherit the mutation.So it

imals interact with other animals, which are themselves

is entirely possible that the mutation will be flushed out of

evolving. Consider also that we have been focusing on one

the population before it has a chance to spread.

locus, when in reality the selective value of the allele at that

This is one example of a more general phenomenon

locus is certainly affected by the alleles at other loci.

called genetic drift. Selection is tending to cause beneficial

There are other problems. Early in his book Hoyle states,

mutations to spread through a population, while drift is

"...a considerable fraction of individuals born in every gen

tending to remove them. Perhaps a more sophisticated

eration exhibit some new mutation, the great majority being

model of population dynamics would have shown that drift

harmful in some degree." This premise is entirely false. As in

is powerful enough to overcome selection, thus effectively

dicated earlier, most mutations are neutral.And what of the

falsifying the Darwinian premise of complexity arising from

small probability that a beneficial mutation will become fixed

the gradual accretion of small, chance variations.

in a population? That only applies to very large populations.

This line of argument is pursued by physicist Fred Hoyle

Most evolutionists believe that periods of speciation, during

[13].

His starting point is the assumption that mutations

which directional evolutionary change accumulates very

are far more likely to be harmful than beneficial. How do

quickly, occur when small "founder" populations become

the handful of beneficial mutations avoid being swamped

geographically isolated from the ancestral stock

in

by the more numerous harmful ones? The answer, known

This leads us to the most insidious aspect of Hoyle's

for decades by population geneticists but presented as rev

work His book offers no index, no bibliography, and only

elation by Hoyle, is that the mechanics of sexual repro

the briefest mention of any other work in population ge

duction allow beneficial mutations to become "decoupled"

netics. Most of his book is spent rederiving old results, with

drift, which tends to deplete variation.

A lay reader will inevitably get the impression that the for

from the harmful ones. 10 But sexual reproduction leads to Hoyle then points to results like the following:

out giving any indication that they are not original to him. midable mathematical machinery employed by Hoyle, cou-

10A mathematical derivat1on of this fact can be found in any text on population genetics, [8] being a particularly good one.


7

pled with his dismissals of work that came before him, constitutes a devas tating attack on Neo-Darwinism. It doesn't.

[1 1 ] Gould, S.J. , and Vrba, E . , "Exaptation: A

on this topic. I consider that the main point in my article was the second one. Paleobiology 8 (1 982), 4-1 5. Mathematicians are trained to value [1 2] Hanegraaf, Hank, The F.A. C.E. that simplicity. When we have a simple, Demonstrates the F.A.R.C.E. of Evolution, clear proof of a theorem, and a long, Word Publishing, 1 998. Pseudomathematics: As an academic complicated counter-argument, full of dispute, all this is minor. But it plays in [13] Hoyle, Fred, Mathematics of Evolution, hotly debated and unverifiable points, Acorn Enterprises LLC, 1 999. public. ID theorists, much like the we accept the simple proof, even be 4] Huse. Scott M . , The Collapse of Evolution, [1 creationists before them, know they fore we fmd the errors in the compli 3rd ed., Baker Books, 1 997. will not convince scientifically cated argument. That is why I prefer [1 5] Huynen, Martijn A., "Exploring Phenotype knowledgeable people. Instead, they not to extend here the long-standing Space Through Neutral Evolution," Jour market their ideas to a public untrained debate over the first point, but to dwell nal of Molecular Evolution 43 (1 996) in both the methods and findings of further on the much simpler and 165-1 69. science. And all too often theirs is the clearer second point of my article, [1 6] Melendez Hevia, Waddell, Cascante, "The only viewpoint that is readily available. which is that the increase in order ob Puzzle of the Krebs Citric Acid Cycle: As When scientists are presented with served on Earth (and here alone, as far sembling the Pieces of Chemically Feasi subjects that invoke the terminology of as we know) violates the laws of prob ble Reactions, and Opportunism in the science to defend nonsense, like as ability and the second law of thermo Design of Metabolic Pathways During Evo trology or creationism, they use the dynamics in a spectacular fashion. lution, " Journal of Molecular Evolution 43 term pseudoscience. I suggest we need Evolutionists have always dis (1 996), 293-303. a similar term, pseudomathematics missed this argument by saying that the perhaps, to describe mathematical for [1 7] Miller, Kenneth R. , Finding Darwin's God, second law of thermodynamics only H arper Collins, 1999. malism used to promote bad argu dictates that order cannot increase in ments. As professional mathemati [ 1 8] Pigliucci, Massimo, "Impossible Evolution? an isolated (closed) system, and the Another Physicist Challenges Darwin , " cians, we all have an interest in Earth is not a closed system-in par Skeptic 8(4)(2001 ), 54-57. protecting the integrity of our subject. ticular, it receives energy from the Sun. We have an obligation to be aware of [1 9] Sewell, Granville, "A Mathematician's View The second law allows order to in of Evolution," The Mathematical lntelli how mathematics is being used in the crease locally, provided the local in gencer 22 (2000), 5-7. public square. When we see pseudo crease is offset by an equal or greater mathematics, we should not be afraid [20] Walsh, J. Bruce, "No Light from the Black decrease in the rest of the universe. Cloud," Evolution 54 (2000), 1 461 -1 4 62. to identify it. This always seems to be the end of the [21] Williams, William A, The Evolution of Man argument: order can increase (entropy Scientifically Disproved, in Fifty Argu REFERENCES can decrease) in an open system, there ments, Privately published, 1 925. [1 ] Behe, Michael, Darwin's Black Box, The fore, anything can happen in an open Free Press, 1 996. system, even the rearrangement of Jason Rosenhouse [2] Berlinski, David, "The Deniable Darwin," atoms into computers, without violat Department of Mathematics Commentary, June 1 996. ing the second law. Kansas State University [3] Berlinski, David, Gode/'s Question, in Mere It requires only a modicum of com Manhattan, KS 66506-2602, USA Creation: Science, Faith, and Intelligent mon sense to see that it is extremely Design, Wm. Dembski ed. , Inter Varsity [email protected] improbable that atoms should re Press, 1998. arrange themselves into mammalian [4] Dawkins, Richard, The Blind Watchmaker brains, computers, cars, and airplanes, 2nd ed. , Norton, 1 996. even if the Earth does receive energy [5] Dawkins, Richard, Climbing Mount Im Can Anything Happen in an from the Sun. We will see that the idea probable, Norton, 1 996. Open System? that anything can happen in an open [6] Dean, A.M . , "The Molecular Anatomy of an Critics of my Opinion piece "A Mathe system is based on a misunderstanding Ancient Adaptive Event," American Scien matician's View of Evolution" [1] have of the second law; that order can in tist 86, Jan-Feb 1998. focused primarily on my first point, crease in an open system, not because [7] Foster, David, "Proving God Exists, " The which deals with whether or not major the laws of probability are suspended Saturday Evening Post, December 1 999. evolutionary improvements can be when the door is open, but simply be [8] Gillespie, John H . , Population Genetics: A built up through many minor improve cause order may walk in through the Concise Guide, The Johns Hopkins Univ. ments. It is clear to me that they can door. Let us look first at a form of "or Press, 1 998. not, but this question is the traditional der" that is easy to measure. [9] Gould, Stephen Jay, An Earful of Jaw in front on which most battles over Dar Consider heat conduction in a solid, Eight Little Piggies, Norton 1 993. winism have been fought since 1859, R. If R is a closed system (no heat [1 0] Gould, Stephen Jay, Ever Since Darwin, and I did not imagine that my argu crosses the boundary), we can define Norton, 1977. ments would constitute the last word a "thermal entropy" in the usual way,

8


Missing Term in the Language of Form, "

to measure randomness in the heat dis

temperatures but carbon concentra

dias,

tribution, and show, using the second

tions identical to that in the rod, the

computers connected to laser printers,

science texts,

and

novels, or

law of thermodynamics, that the total

rod may import "thermal order" (ex

CRTs, and keyboards? If we take a

R can never decrease, and

port thermal entropy), but the "carbon

book of random letters and blow vow els into the front of the book (pretend

entropy in

will in fact increase until the tempera

order" will be unaffected. In the scien

ture distribution is uniform throughout

tific literature, thermal entropy is usu

letters can diffuse!) and suck them out

R. If R is open, the thermal entropy in R can decrease, but it is easy to show

ally referred to simply as "entropy," but

the back, we can import order into the

in fact there are many entropies (de

book, if randomness of the vowel dis

(see Appendix) that the decrease can

pending on what we choose to measure:

tribution is used to measure order.

not be greater than the entropy ex

see [2], p.xiii) and many kinds of order:

Vowels are essential for words, just as

R. Be

any macroscopic feature or property

solar energy is essential for life, but

cause a decrease in thermal entropy is

that is improbable from the microscopic

this process is not going to produce a

associated with an increase in "thermal

point of view can be considered order.

great novel: that is a different

order," this can be stated in another

For example, of all the possible config

order.

ported through the boundary of

kind of

way: in an open system, the increase in

urations that atoms could take, very few

If we found evidence that DNA,

order cannot be more than the order

would allow the transmission of pic

auto parts, computer chips, and books

imported through the boundary.

tures or air transportation of packages

entered through the Earth's atmos

According to the second law, then,

over long distances, so television sets

phere at some time in the past, then

the order in the universe is continually

and airplanes can be considered to be

perhaps the appearance of humans,

decreasing, but what is left of it at any

improbable, and to represent order.

cars, computers, and encyclopedias

time can be transported from one open

The second law predicts that-in a uni

on a previously barren planet could be

system to another. For example, if a

verse in which only natural processes

explained without postulating a viola

rod of uniform, moderate temperature

are at work-every type of order is un

tion of the second law here (it would

is used to connect a hot and a cold

stable and must decrease, as every

have been violated somewhere else!).

reservoir, the entropy of the rod will

thing

But if all we see entering is radiation

tends

toward

more

probable

decrease, as one end becomes hotter

(more random) states.But just because

and

and the other becomes colder. The

two things are both improbable does

clear that what is entering through the

meteorite

fragments,

it

seems

uni

not necessarily mean that the importa

boundary cannot explain the increase

formly distributed in the rod-some

tion of one (say, TV sets) into an open

in order observed here. Many scien

temperature

will

become

less

thing that would be extremely unlikely

system can explain the appearance

tists seem to have the idea that "en

to happen without help from outside.

there of the other (say, airplanes).

tropy" is a single number that mea

The rod is simply importing order from

Rather,

sures order of all types, so if entropy decreases locally when computers ap

the outside world, where order is now decreasing as the temperatures of the

If an increase in order is extremely

pear-no problem, entropy is increas

two reservoirs approach each other.

improbable

ing all over the rest of the universe, so

when

a

system

is

If we look at the diffusion of, say,

closed, it is still extremely improb

the total entropy is surely increasing,

carbon, in a solid instead of the con

able when the system is open, un

and the second law is satisfied.For ex

duction of heat, and take

U(x, y, z, t)

now to be the carbon concentration in

less something is entering which makes it

not extremely improbable.

"carbon entropy"

(Q

is just

U

now),

L. Hepler [3]

ment of civilization may appear con

stead of the temperature, we can re peat the analysis in the Appendix for

ample, S. Angrist and

write, "In a certain sense the develop

Although it is not as easy to quan

Lradictory to the second law....Even

tify the order associated with airplanes

though society can effect local reduc

showing again that in a closed system

and computers as the order associated

tions in entropy, the general and uni

(no carbon crosses the border) this en

with a carbon or temperature distribu

v:�rsal trend of entropy increase easily

tropy cannot decrease, while in an

tion, it is clear that life and human cre

swamps the anomalous but important

open system, the decrease in entropy

ativity are responsible for some very

efforts of civilized man."

cannot be greater than the entropy ex

large increases in order here.Contrary

What is the conclusion then-that

ported through the boundary. But it is

to common belief, however, the "ther

the explosion of new order on Earth has violated the laws of physics in a su

important to notice that now "entropy"

mal order" imported from the Sun does

measures the randomness of the dis

not help explain the formation of hu

pernatural way? Not necessarily: since

tribution of carbon, not heat, so the

mans, jet airplanes, TVs, and comput

the advent of quantum mechanics, the

amount of thermal entropy exported is

ers. If we add sunlight to the computer

laws of physics carmot be used to pre

not relevant to the change in carbon

model hypothesized in [1], would we

dict the future with certainty, and they

entropy in the solid. For example, if a

expect that the simulation would

now

do not really say that anything is ab

steel rod of uniform temperature and

predict that the basic forces of Nature

solutely impossible, they only provide

uniform carbon concentration is placed

would rearrange the basic particles of

us the probabilities. Thus one could ar

between two steel blocks of unequal

Nature into libraries full of encyclope-

gue that the origin and development of


9

Q is the heat energy density and

life may not have violated any of the

where

laws of physics-only the laws of prob

J is

ability. The conclusion is only this: con

law requires that the flux be in a di

the heat flux vector. The second

trary to what Charles Darwin believed,

rection in which the temperature is de

and contrary to the majority opinion in

creasing, i.e.,

science today, the development of in

J·VU�O

telligent life is not the inevitable or rea

(2)

sonably probable result of the right

(In fact, in an isotropic solid, J is in the

conditions, it is extremely improbable

direction of greatest decrease of tem

under any circumstances.

perature, that is,

J = -KVU.)

Note

that (2) simply says that heat flows from hot to cold regions-because the

REFERENCES

1 . G. Sewell, "A Mathematician's View of Evo lution, " The Mathematical lntelligencer 22 no. 4 (2000), 5-7. 2. R. Carnap, Two Essays on Entropy, Univer sity of California Press, 1 977.

laws of probability favor a more uni form distribution of heat energy. Now the rate of change of "thermal entropy," nition as

S, is given by the usual defi

(2), we see that the volume integral is nonnegative, and so

St 2::

-II J · n/U aR

From (4) it follows that 81

2:: 0 in an iso

lated, closed, system, where there is no heat flux through the boundary can never decrease. However, equation (4) still holds in an open system; in fact, the boundary integral in (4) represents the rate that entropy is exported across the bound ary (notice that the integrand is the outward heat flux divided by tempera ture). Thus in an open system, (4) be more than the entropy exported

Basic Books, 1 967.

through the boundary. Appendix. Consider heat conduction

Using

(3) and the first law (1), we get:

in a solid R, with (absolute) tempera

U(x, y, z, t).

R

law of thermodynamics (conservation

-V·J

where (1)

Granville Sewell Mathematics Department

The first

of energy) requires that

Qt =

aR

University of Texas El Paso El Paso, TX 79968

n is the outward unit normal on

the boundary aR. From the second law

USA e-mail: [email protected]

Beware Biomathematics I approached Feynman after one of his Cornell lectures in 1964 for advice about

how best

to

move into mathematical biophysics from engineering physics, as I had planned when choosing Cor nell. He cautioned against any such move, on grounds that biology is too much a matter of tricks and accidents of evolution, and too complex for useful mathematical representations. I believe that is correct, on average, but the rich diversity of living nature provides many niches for peculiar ques tions and aptitudes. Arthur T. Winfree The

Geometry of Biological Time

2nd edition (Springer, 2001), p. 660

10

(J·n =

0). Hence, in a closed system, entropy

means the decrease in entropy cannot

3. S. Angrist and L. Hepler, Order and Chaos,

ture distribution

(4)


c.m ott.1 ,;

If M athematicians Do Not Do It, Who Willt Daniel J. Goldstein

The Op inion column offers mathematicians the opportunity to write about any issue of interest to the international mathematical community. D isagreement and controversy are welcome. The views and opinions expressed here, however, are exclusively those of the author, and neither the publisher nor the editor-in-chief endorses or accepts responsibility for them. An Opinion should be submitted to the editor-in chief, Chandler Davis.

D

oes mathematics have any func tion in understanding biological phenomena? Of course it does, in that mathematics is the language of physics. Some mathematics is used to describe physical phenomena that were studied in classical physiology and genetics e.g., membrane biology, cardiovascular function, classical genetics-at a very macroscopic level. Classical macro scopic physiology and genetics (CMP/G) was the realm of biologists who were relatively insensitive or indifferent to molecular structure (which in any case, at the time could not have been tackled experimentally) and paid only lip service to biochemistry-they knew that some enzymology had to be thrown in to pacify the beasts. CMP/G generated black boxes and neat, elegant diagrams that convey (even today) a sense (false) of rationality and simplicity. These reduced models have tradi tionally tempted mathematicians inter ested in finding biological "laws." The problem is that the biological world is rather different from the neat rational izations and simplifications of CMP/G, and as soon as biologists went beyond the macroscopic depiction of physical phenomena, the effectiveness of math ematics collapsed. Genetics and molec ular chemistry-which together try to explain biological complexity in terms of interacting molecules-have regu larly shown the absurdity of "models" and "laws" deduced from concepts de rived from partial, mainly irrelevant, and biased information. These at tempts at mathematization were reac tionary on two grounds: first, they were based on the assumption that the bio logical world can be understood with out knowing its molecular structure and function; second, they implicitly accepted as truths the biggest biologi cal sins of CMP/G: teleonomy, the con cept of design, and the interpretation of evolution as the exclusive conse quence of adaptive selection. CMP/G, and the mathematical mod els derived therein, operate as if bio logical systems were the result of a "ra tional design" intended to maximize efficiency. This, of course, is utterly

false, and if mathematics has to do with a reality out there, the mathematical approximation to biological problems should start by recognizing that bio logical systems and objects are the re sult of accident and a curious mixture of adaptive and non-adaptive selection. To be sure, mathematics also helped in opening the biological black boxes, because the only tools available so far to determine molecular structure are two physical technologies, X-ray dif fraction and nuclear magnetic reso nance. Once modern genetics and mo lecular chemistry opened the black boxes of CMP/G, the already bewilder ing variety of the biological zoo in creased by the addition of ever stranger creatures. Biologists had to come to grips with this new expanded reality, and appreciate with awe the momen tous complexity hidden in tissues, cells, and extracellular structures; the interplay among thousands of intracel lular and extracellular macromole cules; and the astonishing heterogene ity of chemical signals that regulate the ensemble. The interactions among these gigantic collections are extraor dinarily difficult to describe, and it is utterly impossible to imagine a single "law" that can sensibly explain their collective, integrated behavior. Because the difficulties inherent in attempting an integrated approach seemed insurmountable, molecular bi ologists fell into the trap of trivial re ductionism and studied (as best they could) one molecule at a time. This re search strategy is good for description and for survival-there can be special ists in single proteins, and there are a lot of proteins out there. But there are deeper problems. For example, how can we know the total number of roles that a single protein plays in a cell or in a organism? Proteins are objects with multiple functions, and not all of the potential functions of a single pro tein species are exerted at the same time. Protein functions are context-de pendent, and biologists must approach the problem as art historians do when studying art objects, which also have multiple functionalities--aesthetic, sym-

© 2001 SPRINGER-VERLAG NEW YORK, VOLUME 23, NUMBER 4, 2001

11

bolic, or political-depending on time,

ematization, such as using topology to

ately needed, because the huge size and

place, and context. The chemical con

describe DNA knots, an application that

variegated nature of the information

text in which a protein exists (which

may or may not be useful in the future

delivered

conditions its functionality) changes in

for explaining physiological and bio

has changed radically the way we do bi

by

the

genome

projects

real time, yet our knowledge of these

chemical

and

ology, and the old reductionist tricks

fluctuating boundary conditions is piti

deriving predictive models of chromo

need to be complemented with inno vative approaches coming from mathe

phenomena

fully poor. Furthermore, protein struc

some structure, behavior, and regula

ture is not fixed: proteins undergo

tion. The (few) examples of this type

matics. Yet for this creative interaction

post-translational modifications, suffer

suggest a certain kind of laziness in the

to occur, mathematicians have to learn

limited proteolysis, associate with like

way in which mathematicians approach

enough molecular biology to be able to

molecules or with different protein

biology. They seem to decide which

grasp

species, and dissociate and even refold

"themes" are "mathematically viable" by

Mathematicians must be familiar with

in radically different ways as a function

their superficial resemblance to mathe

the kinds of objects that the biologists

of the chemical context.

matical

objects

and

the

real

biological

problems.

situations with

work with; must share the same genetic

All this is crucial for understanding

which they are familiar. Symmetries,

and molecular language; and must un

how biological systems function, be

packings, knots, sequences, and pat

derstand that biological objects are not

cause the genotype (the sum of genetic

terns occur in biology aplenty, and be

the result of design, that efficiency is a

information, whatever this may mean)

ing easily translated into mathematical

human value judgment and not une don

does not determine the phenotype (the

notation, they are defined as the areas

nee de

observable traits of an organism). The

of interface between mathematics and

the messy result of accidents and adap

la

nature,

and that evolution is

genotype encodes a collection of pro

biology. But, is there something in this

tive and nonadaptive selection. If math

teins, and the interaction of the encoded

beyond translation? Did these analogies

ematicians learn this language and un

proteins-with all the possible caveats

produce predictive models?

derstand the evolutionary process, they is

will be able to find many real biological

whether there are any mathematical

problems amenable to mathematical ex

physical and chemical reactivity)-is

objects that really behave in the same

ploration, and biology will reach another

what determines the phenotype.

way biological macromolecules

intellectual dimension. I think that the

(modification, fragmentation, associa tion, with the concomitant changes in

In

my

opinion,

the

question

do,

Occasionally, the genetic and molec

and that could help produce predictive

"Wigner-Gelfand principle," which as

ular dissection of an experimental sys

models of biological phenomena. In

serts the unreasonable ineffectiveness

tem allows the formulation of models of

this sense, the recent discovery of

of mathematics in the biological sci

universal explanatory and

predictive

Professor M. Livsic about the possibil

ences, should be reformulated. So far,

value. So far, in the fifty years of molec

ity of depicting DNA structure and

mathematics has been ineffective in the

ular biology, only three such models

replication

biological sciences because mathemati

have emerged: the Watson-Crick model

open systems may or may not open a

cians looked in the wrong places and

of DNA structure, the Jacob-Monod

new

with the wrong attitude. Mathematics

model of genetic regulation, and the

mathematics and biology.2

space

in terms

of space-time

of interaction

between

will be reasonably effective in the bio

logical sciences when mathematicians

Jacob-Monod-Wyman-Changeux-Perutz

Of course, the old problem still

allosteric model of enzyme regulation.

looms: Are these equivalences "real" or

become

Mathematics (aside from geometry) had

merely reflections of the fact that our

out there,

aware of the biological reality perceive the nature of the

no role in these momentous achieve

brains, whatever the way we see/ex

open mathematical problems hidden in

ments. These discoveries were the result

press the world, can produce only a lim

biological systems, and try to solve them

of solid thinking and strong invention in

ited number of metaphors? Yet the

(inventing/discovering some new math

structural chemistry, and the (then new)

power of metaphors is huge and should

ematics, of course).

not be dismissed with a shrug, as the ex

bacterial and phage genetics. that

traordinary interplay between mathe

Daniel J. Goldstein

"Mathematics is unquestionably effec

matics and physics so eloquently shows.

Departamento de Ciencias Biol6gicas

Some

commentators

think

tive in biology, for rationalizing obser

I am convinced that a creative inter

vations."1 This may be true, but it is not

action between mathematicians and bi

Universidad de Buenos Aires, Argentina

obvious. There are possibilities of math-

ologists is not only possible but desper-

[email protected]

' Arthur M. Lesk. Compared to What? The Mathematica/ lntelligencer 23(1 ):4 (200 1 ) . 2"Systems and genetics," in Proceedings of the Workshop Dedicated to Advances and Applications. Birkhauser, i n press.

12

THE MATHEMATICAL INTELUGENCER

the 60th

Birthday of Harry Dym (ed.

Facultad de Ciencias Exactas y Naturales

D.

Alpay, I . Gohlberg, and Y. Vinnikov) Operator Theory:

M at h e m a tic a l l y Bent

The proof is in the pudding.

Opening a copy of The Mathematical Intelligencer you may ask yourself

uneasily, "What is this anyway-a mathematical journal, or what?" Or you may ask, "Where am I?" Or even "Who am !?" This sense of disorienta tion is at its most acute when you open to Colin Adams's column. Relax. Breathe regularly. It's mathematical, it's a humor column, and it may even be harmless.

C o l i n Ad a m s , Editor

A Deprogrammer's Tale

but I understand why; I know the se ductive power of a beautiful proof, the appeal of a well-turned lemma. Larry had fallen prey in the usual

manner. Mter hearing the derivative explained in a lecture hall with 300 other students, he went to see the pro fessor during office hours. That's when they know they have you. You're one of the susceptible ones, looking for some meaning beyond the plug and

Colin Adams

chug problems. hey hooked him in calculus class.

A little chitchat, maybe notational,

Started slow. Didn't want to be too

a bit of history, Newton versus Leib

obvious. Gave him a little trig review,

nitz, that sort of thing, all seemingly in

T

some functional notation, and then in

nocuous. And then, when he least ex

troduced limits. Gave him lots of prob

pected it, the epsilon delta definition of

lems to work. Kept him busy to get his

a continuous function. Poor guy was

guard down. Then pow, hit him with

putty in the professor's hands. Before

the concept of the derivative. The raw

he could get his head back on straight,

power and simplicity of the idea, it was

the professor invited him to a depart-

1 200 peo p l e a year g et Ph . D.s i n math

in the U n ited States alone. overwhelming. How could he resist?

mental colloquium, followed by tea.

Who can? I know. I've been through it

Larry dutifully went, and although he

myself. Yes, that's right. I was one of

was blown out of the water by the ma

them once.

terial, he saw the others there, at rapt

I was a slave to mathemat

attention, and he felt he was among

ics. But unlike most, I escaped. And now my life is dedicated to helping others who were not as fortunate as

At tea, the department members ig nored Larry, feigning indifference to

I.

In this particular case, I was hired

Column editor's address: Colin Adams,

friends.

the freshman who was interested in

by the parents of one Lawrence De

math,

senex. One minute, Larry was pre-med,

wrapped up in their own research to

pretending

they

were

too

heading for a lucrative plastic surgery

care. But oh, if he only knew. They

practice in Cherry Hill, and the next

were watching his every move, as they

minute he was talking about earning a

scribbled on the blackboard and talked

Ph.D. in mathematics. All thought of fi

about this theorem or that with their

nancial gain went out the window. His

colleagues. He was a marked man, and

parents were horrified. Dreams of my

Larry didn't even know it.

son-the-doctor turned into nightmares

In cases like these, there is a small

of my-son-the-itinerant-mathematician.

window of opportunity, a short period

But me, I wasn't surprised when I heard

when a student can yet be saved. But

the tale. I'd heard it a hundred times be

you must act fast. Once students take

Department of Mathematics, Williams

fore. Believe it or not, 1200 people a

Real Analysis and Abstract Algebra,

College, Williamstown, MA 01 267 USA

year get Ph.D.s in math in the United

their fate is sealed. The window has


States alone. That sounds incredible,

been slammed shut and shuttered.

© 2001 SPRINGER-VERLAG NEW YORK. VOLUME 23, NUMBER 4. 2001

13

But Larry's parents had called me in time. He was taking Linear Algebra, the applied version. There was hope yet. I found him in the cafeteria with an untouched plate of tuna casserole and a copy of The Man Who Loved Only Numbers open in front of him. I gave him my winningest smile. "Erdos, huh? Mind if I join you?" He was clearly impressed and mo tioned to the seat across the table. "Like math, do you?", I asked. "Oh, yes," he said enthusiastically. "It's so beautiful." "Yes, it does have an appeal." "Have you ever seen the argument for the uncountability of the reals?", he asked. "That's really cool." The bubbly excitement, the glassy bright eyes. Oh, he was in deep. We talked math for a while. I played along. Euclid this, Euler that. Then I laid the trap. "Hey, my roommate and I are hav ing a birthday celebration for Karl Friedrich Gauss on Wednesday at my apartment. You're invited." Of course, he was thrilled. Suscep tible and trusting are two descriptions of the same attribute. He showed up right on time. It hadn't taken him long to pick up that c haracteristic of mathematicians. I let him in and locked the door behind him. Then everyone popped out, his par ents, his grandparents, a cousin, an aunt, his best friend from high school. "What's going on here?" he said, clearly at a loss. "This isn't a birthday party for Gauss." "No, it's not," I said. "Gauss was born in April. This is an intervention, Larry. These are the people who love you and they're here to help." He backed away. "Open the door. Let me go," he cried desperately. I blocked him. "Not until you hear what we have to say." He looked like Galois after the duel. The blood drained from his face. Must

14


have been wondering where his muse was now. His mother spoke first. "Bunchkins, bunchkins, have you thought about us? We love you, Pinchy, but good gra cious, what would the neighbors say? Mrs. Krawlick would revel in the news. Our son, a mathema, a mathema . . . , I can't say the word." She began to bawl uncontrollably. Larry's father held her. "Look at your mother. Look at what you are doing to her. She can't even say the word." "Poor, poor Erma," said his aunt, patting Larry's mother on the sleeve. "Larry, I can't believe you would do this. You seemed like you were a good kid. You used to watch television. You had a lemonade stand. What happened to you? My kids would never do this. Evan here, now, he is a dentist, aren't you Evan?" The cousin nodded yes. "And Cybil works in marketing for an ad agency. And I am proud of them both!" "What about Karen?" asked Larry. The aunt turned bright red. "How dare you mention her name in my pres ence." Evan laughed. "Karen has a masters degree in accounting." Not my area, but I sympathized. Larry's best friend spoke up. "Lis ten, Larry. The problem is, it's not cool to do math. Business degrees, they're cool. You know, Internet start-ups and all. Theater degrees, that's cool. You wear black clothes and talk about Pin ter. But math? It's not cool. Nothing is cool until everyone is doing it." Larry wrung his hands. "You don't understand. I don't have a choice. I am not choosing to do math ematics. Math has chosen me. When I saw that epsilon delta definition of con tinuity, it was like I had known it all my life. Here is what the professor was really talking about when he drew all these pictures. This is a rigorous defi-

nition. It felt so good. It's not up to me anymore." "Look, Larry," I said. "Do you want this to be you?" I showed him the pic tures of mathematicians, the addicts with their white pallor from sitting un der fluorescent lights for years at a time. Some were barely able to lift their eyes from the books in front of them as the camera clicked away. Their clothes, stained with coffee, made it clear they were unaware that fashion was an evolving concept. But he was unmoved. "That's ex actly what I want to be," he said. I sighed. "Okay, Larry, I have no choice." I strapped him into the Bar colounger and turned on the TV. I kept him there for two weeks; mostly re runs of the "Brady Bunch" and "Wel come Back Kotter." By the time we were done, spittle dripped from the side of his mouth. His brain had been washed clean. Unfortunately, it had been washed so clean that medical school was no longer an option. Larry did go on to a successful career with Seven Eleven, primarily mopping up the slushy spills at the Cherry Hill store. And I know that he's happier for it. But Larry's story is just one among many. These dangers are real. Do you know where your children are? Are you sure they are watching TV, and not sit ting in on a seminar, or leafing through a math text? If we are vigilant, we can prevent mathematics from spreading any fur ther. But we will need to fight the min ions of mathematics at every turn. We will need the entertainment industry to continue to hype style over intellectual curiosity. We will need to inundate children with the belief that being good at math is something to be ashamed of. We will need to convince everyone that there is nothing wrong with mathe matical illiteracy. So far, so good.

I\[email protected]§11£hlfiifj.lj,j11ii!,iihfj

Social Influences on Quantum Mechanicst - 1 Jane Cronin

This column is a forum for discussion of mathematical communities throughout the world, and through all time. Our ddinition of "mathematical community" is the broadest. We include "schools" of mathematics, circles of correspondence, mathematical societies, student organizations, and informal communities of cardinality greater than one. What we say about the communities is just as unrestricted. We welcome contributions from mathematicians of all kinds and in all places, and also from scientists, historians, anthropologists, and others.

Please send all submissions to the M athematical Communities Editor.

Marjorie Senechal, Department of Mathematics, Smith College, Northampton, MA 01 063, USA;


Marj o r i e Senechal , E d it o r

I

n an interesting article about the question of whether social and cul tural factors have affected the devel opment of quantum mechanics, M.B. Ruskai (Mathematical Intelligencer 23, no. 1, 23-29) concludes that quan tum mechanics "transcends social and cultural forces." There are, however, several such forces that have signifi cantly derailed or sidetracked the de velopment of quantum mechanics. The purpose here is to describe briefly some of these. The introduction of the Copenhagen interpretation by Niels Bohr and Werner Heisenberg gave rise to much controversy among very accomplished physicists. Erwin Schrodinger devised his cat-in-the-box thought-experiment to demonstrate his view that the Co penhagen interpretation was ridicu-

I

books. (See Jammer [9, pp. 247-248] and Mermin [11, p. 803].) When we consider the question of why the Copenhagen interpretation was thus accepted, the answer is sur prisingly unclear. First, of course, it should be pointed out that quantum mechanics was widely accepted in short order by physicists who applied it successfully to practical problems or extended the theory and who had little interest in the foundations of the sub ject. It was natural for them to stick with the first complete interpretation. But this does not answer the question of why the Copenhagen interpretation was accepted despite the serious ques tions raised by a number of physicists. The most important part of the answer seems to be Bohr's energetic support. His stature as a physicist, his agreeable

The introduction of the Copenh agen in terpretation by N iels Boh r and Werner Heisenberg gave rise to much controversy lous, and his biography suggests that he never changed his mind. Concerns about the nature of the observer and the collapse of the wave function led Eugene Wigner and others to quite dif ferent interpretations. (See, e.g., Rae [13, Chap. 1 1].) But the most important objection to the Copenhagen interpretation was the work of Albert Einstein and his col leagues [5], hereafter to be referred to as EPR. Einstein's concerns about the role of probability in quantum theory are far more profound than his oft quoted remark implies, and they can not be dismissed with a quip. Einstein and Bohr carried on a long, friendly dis cussion of their differences (cf. Jammer [9, Chapters 5,6]), and Bohr won out in the sense that the Copenhagen interpretation became ac cepted to the point that it entered text-

personality, and his persistence and de termination all combined to win the day for the Copenhagen interpretation. According to Murray Gell-Mann, "Bohr brain-washed a whole generation of physicists into believing that the prob lem had been solved." (See [7, p. 152] .) The Copenhagen interpretation was also supported indirectly by work of John von Neumann. As soon as the probability properties of the solutions of the Schrodinger equation (the wave functions) were introduced, it was nat ural to think of the possibility of hid den variables, i.e., variables describing the deeper structure of a given physi cal system. (For example, if a gas is de scribed in terms of temperature, pres sure, and volume, then the velocities of the individual atoms in the gas would be hidden variables.) In his book on quantum mechanics, von Neumann


15

[ 15] claimed to prove that there are no hidden variables in quantum mechan ics. (This result supported the Copen hagen interpretation, because the ab sence of hidden variables suggests that the wave function contains all possible information about the system it de scribes.) As described in Jammer [9. p. 265ff. ] , there was considerable discus sion of von Neumann's result, and in 1935, Grete Hermann [8] pointed out a deficiency in von Neumann's proof. However, Hermann seems to have been disregarded, and it was not until 1966 that John Bell [2] showed that von Neumann's proof was based on an as sumption that has been described by some writers as "silly." (See Mermin, [ 1 1, p.806.].) The direction of study of the foun dations of quantum mechanics from 1930 to the 1950s thus seems to have been strongly influenced by two non scientific or social forces: the prestige and persistence of Bohr and the pres tige of von Neumann. (Von Neumann was indeed a towering figure in twen tieth-century mathematics, but it does not follow that he was incapable of er ror.) From the point of view of gender issues, one might also ask if Grete Hermann's observation in 1935 would have been more seriously regarded if she had been named Georg Hermann. More nonscientific forces came into play in the reception of the work of David Bohm in 1952. In the accompa nying essay, Miriam Lipschiitz-Yevick describes these. In 1957, Bohm and Aharanov [4] de scribed an example of the EPR prob lem, which greatly clarified the prob lem and led the way to important further work The example is a thought experiment that reveals a puzzling point in quantum mechanics. For a careful description, see Rae [ 13, p.229]. Briefly and loosely put, the experiment involves two spin half particles. The to tal spin of the system is zero, but no in formation about the spins of the parti cles is given. The particles move apart until widely separated, after which the spin of one particle is measured. Measurement of the spin (actually a specific component of the spin) of the first particle causes the wave function

16


to "collapse" into an eigenfunction of the spin operator, and it follows that the same component of the spin of the second particle then is determined and is equal to the negative of the spin of the first particle. Thus, even though the two particles may be light-years apart, the measurement of the spin of the first particle has an immediate influence on the second particle: That is, the mea surement of the spin of the first parti cle causes the measurement of the spin of the second particle. This is an ex ample of what is called nonlocality (what Einstein called "spooky action at a distance"). It was anathema not only to Einstein but to most physicists edu cated in the twentieth century. (See Ballantine [ 1 , p.585] and Bell [2, p.20, footnote 2].) This example is so important to later work that its origins should be ex amined with some care. First, EPR, in which the example originated, raises serious questions about quantum me chanics, questions whose formulation required deep and penetrating analysis. The example of David Bohm and Yakir Aharanov clarified the original EPR thought-experiment to the point where the ideas became accessible and use ful to others, as we shall see. Actually, the example was introduced and de scribed in detail in Bohm's book on quantum mechanics [3], indeed in more detail than in [4]. In [3] , Bohm de scribed his example as a modification of the EPR experiment, which has "conceptually equivalent form" to that experiment. Bohm should receive credit not only for devising the exper iment but for doing the work at a time when it was widely thought that Einstein's questions about quantum mechanics had been laid to rest and the Copenhagen interpretation reigned supreme. (The reader who is curious about looking up references [3] and [4] needs a word of warning here. In [3], Bohm was still a supporter of the Copenhagen interpretation, but by the time [4] was written, his views had changed significantly. Indeed, his views had changed between the publi cation of [3] and the publication of his papers on quantum mechanics in 1952. The description of the example is given

in more detail in [3], but the signifi cance of the example is better sug gested in [4] .) The momentous next step was taken by John Bell. (For a detailed, up to-date account of Bell's work and later results based on his work, see Ballantine [ 1 ] . Here we give only a short, rough description.) Starting with the model of Bohm, Bell devised a thought-experiment from which can be derived (using no quantum mechanics) a testable conclusion called "Bell's in equality." He showed also that this in equality contradicts the predictions of quantum mechanics. (This result is called "Bell's theorem.") Since then, ac tual experiments modeled on Bell's thought-experiment have been carried out, and the experimental results agree with the predictions of quantum me chanics, and thus contradict Bell's inequality. It follows that there is dis agreement between quantum mechan ics and the hypotheses used to derive Bell's inequality. The only significant hypothesis used to derive the inequal ity seems to be locality (i.e., no nonlo cality). (See Ballantine [ 1 , p. 607ff.] for a careful discussion of this point.) The implication is therefore that quantum mechanics is nonlocal. Since the requirement of locality is motivated by special relativity, this suggests a possible incompatibility be tween quantum mechanics and special relativity. At present, these are deep and unresolved questions. It is worth remarking that Bell's work received the attention it de served only slowly. One reason for this may have been that his earliest work was concerned primarily with hidden variables, a subject that was, for various reasons, of little interest to physicists. Another reason for the de lay in acknowledging the importance of Bell's results may have been the fact that even in his earliest papers, he emphasized the importance of nonlo cality; and, as noted before, nonlocal ity was unacceptable to most physi cists. (The Bohm theory described in 1952 is nonlocal, but that fact was held against the Bohm theory when it was introduced.) There seems to be no doubt that

these results are very important to the foundations of quantum mechanics, and yet at each stage of their develop ment, there was resistance by the physics establishment. EPR was dis missed, Bohm was disregarded, and even Bell's work was acknowledged slowly. It would probably be impossible to make a numerical estimate of the time delay in the development of quan tum mechanics caused by this resis tance, but it seems unquestionable that such a delay occurred. Part of the reason for the resistance has already been mentioned: this work is concerned with the foundations of quantum mechanics. To physicists, bustling in their laboratories and con fident of the applications of quantum mechanics, the foundations are simply not interesting. It is therefore ironical that all this theoretical nattering has given rise to significant work in quan tum cryptography. Although still in the laboratory stages, this work shows considerable practical promise. (See [6] , [ 10) [ 12), [ 14].) Pairs of particles satisfying the conditions in the Bohm example (the particles are said to be entangled) are used to create unbreak able codes that can be used for the secure transmission of confidential material.

[3] Bohm, David, Quantum Theory, Prentice

A U T H O R

Hall, Inc. New York, 1 951 .

[4] Bohm, D. and Aharanov, Y. Discussion of experimental proof for the paradox of Einstein, Rosen, and Podolsky, Physical

Review ( 1 08) 1 070-1076, 1 957. [5] Einstein, A., Podolsky, B., Rosen, N., Can quantum mechanical description of phys ical

reality be considered

complete?

Physical Review (47) 777-780, 1 935. [6] Ekert, Artur K., Quantum cryptography based on Bell's Theorem, Physical Review JANE CRONIN

Letters (67) 661 -663, 1 991 .

Department of Mathematics

[7] Geii-Mann, Murray, The Nature of the Physical Universe, John Wiley & Sons,

Rutgers

Un i versity New Brunswick

Piscataway,

Inc . , New York, 1 976. Grundlagen der Quantenmechanik, Ab

NJ 08854-801 9 USA

[8] Hermann, G., Die naturphilosophischen


handlungen der Fries ' chen Schute (6) 75Jane Cron in (Scanlon) got her doc

1 52, 1 935. [9] Jammer, Max, The Philosophy of Quan

torate at the University of Michigan.

tum Mechanics, John Wiley & Sons, New

She has been at Rutgers

York, 1 974.

becom ing Emerita in 1 991 Her pri .

[1 0] Jennewein, Thomas; Simon, Christoph; Weibs,

Gregor;

since 1 965,

Weinfurter,

H arald;

mary field of research has been and remains singular perturbation theory

Zeilinger, Anton. Quantum cryptography

applied to models of neural activity.

with entangled photons, Physical Review

Readers

Letters (84) 4729-4732, 2000. [1 1 ] Mermin, N. David, Hidden variables and two theorems of John Bell, Reviews of

may

recall

her

book

Mathematical Aspects of Hodgkin Huxley Neural Theory, and her article in The lntelligencer 1 2 (1 990), no. 4 .

Modern Physics (55) 803-8 1 5, 1 993. [1 2] Naik, D. S., Peterson, C. G., White, A. G., Berglund, A. J . , Kwiat, P. G., Entangled state quantum cryptography: eavesdrop

REFERENCES

[1 ] Ballantine, Leslie C . , Quantum Mechanics, A Modern Development, World Scientific, Singapore, 1 998.

[2] Bell, J . S., Speakable and Unspeakable in Quantum Mechanics, Cambridge Univer sity Press, Cambridge, 1 987.

ping on the Ekert protocol, Physical

gled photons in energy-time Bell states.

Review Letters (84) 4733-4736, 2000.

Physical Review Letters (84) 4737-4740,

[1 3] Rae, Alastair I. M . , Quantum Mechanics, third edition, Institute of Physics Publish ing, Bristol, England, 1 992.

[1 4] Tittel, W., Brendel, J., Zbinden, H., Gisin, N . , Quantum cryptography using entan-

2000. [1 5] Von Neumann, J . , Mathematische Grund fagen der Quantenmechanik, Springer, Berlin, 1 932. (English translation, Prince ton University Press, 1 955).


17

Social [Bohm'sj themy, due to one of the greatest physicists of ou1· Why time, is practically ?.tnivm·sally ignm·ed, is an enigma which histmians of science offuture centuries wiU have to resolve. Influences on -Jean Bricmont, in "Cont1·e la philosophie de mecanique quantique, " Quantum M echanicst- 1 1 T Retrospect this

la

EDITOR's

OTE: And here we are in the next century, trying to resolve it.

his note is intended as a response

interpretation"), he rejected this inter

to Mary Beth Ruskai's comments

pretation in favor of a radically oppos

on David Bohm's quantum mechanics

Miriam Lipschutz-Yevick

1 995

(Mathmnatical InteUigencer,

vol.

23

ing one.2 This interpretation was to suggest3 that the EPR correlations

(2001), no. 1, 23-29, especially Appen

were to be ascribed to fluctuations in

dix B). In particular, I want to speak of

the

the social factors that inhibited the free

which his new theory had postulated;

discussion of his challenge to ortho

later he concluded rather that the cor

doxy.

relations were entirely the product of the quantum potential of his theory.4

Jane Cronin, in the note that pre

sub-quantum-mechanical

level

cedes this one, emphasizes the impor

Bohm's "hidden variables" theory

tance and relevance to later work of

was, in fact, an independent rediscov

Bohm's

EPR

ery and elaboration of the "pilot wave"

myself with his development, subse

Broglie, which he had presented at the

reformulation

of

the

Gedanken experiment. 1 I will concern quent to finishing his

ory,

Quantum The

of a "hidden variables" theory

theory of the French physicist Louis de Solvay Conference in

1927 to explain

the wave-particle duality. De Broglie

which, after a delay of several decades,

expressed

also gave impetus to a renewed inter

Schrodinger's "particle corresponds to

est in EPR and its potential applica

a wave packet" and Born's "psi func

Quantum Theory Bohm stood

tion yields probabilities only," leading

tions. In

his

disagreement

with

squarely on the side of Niels Bohr's cri

to renunciation of determinism for in

tique of EPR, and he used his modifi

dividual particles. De Broglie proposed

cation of this experiment to solidify the

instead that if we know the particle's

argument

initial position the psi function pre

against

Albert

Einstein's

conviction of the incompleteness of

cisely determines its trajectory. On the

quantum mechanics.

other hand, given an ensemble of non

Yet shortly after completing his text,

interacting identical particles with dif

after further discussions with Einstein

ferent initial positions, the psi function

and continuing the profound thought

determines the probability that an in

the

dividual particle will be in a volume of

Copenhagen philosophy and to under

he

had

devoted

to

clarifying

space at a given instant. The psi wave

standing the EPR paradox (Einstein

thus appeared simultaneously as a pi

had complimented him with "Yours is

lot wave

the best exposition of the Copenhagen

and a probability wave. "It does not

(Fiihrungsfeld

of Max Born)

1D. Bohm, Quantum Theory, Prentice-Hall, New York, 1 95 1 ; see p. 614. 2D. Bohm, A suggested interpretation o f quantum theory in terms o f hidden variables, I a nd II, Physical Re

view 85 (1 952), 1 65 and 1 80. 3D. Bohm and Y. Aharanov. Discussion of experimental proof for the paradox of Einstein, Rosen, and Podol sky, Physical Review 1 08 (1 957), 1 072.

4D. Bohm and J . B. Hiley, The Undivided Universe, Routledge, London, 1 993; p. 1 49.

18

THE MATHEMATICAL INTELUGENCEA © 2001 SPRINGER-VERLAG NEW YORK

seem to us that there is a need to re

diffraction phenomena. The Heisen

anywhere in the universe a single

nounce our belief in the determinism

berg uncertainty principle introduces

system which did not combine the

of individual physical phenomena (that

an additional indeterminacy in experi

three

is to say, the individual motion of par

ments intended to observe the actual

probability, and the wave-particle duality, then this system could be

elements

of

indivisibility,

ticles), and it is thus that our concepts,

position or momentum of a particle, re

elsewhere very similar to those of M.

sulting from an indivisible quantum be

used to make measurements on

Born, nevertheless appear to differ per

ing transferred in such

other systems which were more pre

an

observation

ceptibly."5 De Broglie abandoned this

from the observing apparatus to the

cise than the limits of precision set

interpretation and became a convinced

particle, thus changing its momentum

by the uncertainty principle, and as

proponent of the Copenhagen inter

and limiting the accuracy of the mea

a result, one of the most fundamen

pretation

surement. The precise limits on accu

tal predictions of quantum theory

racy are set by the fluctuations in the

could be contradicted. 7

as

a

result

of Wolfgang

Pauli's criticism at this Conference. The Copenhagen interpretation has

SchrOdinger field. The indeterminacy

been questioned not as "perverse" but

no longer has as a consequence the

as mysterious. J. S. Bell6 wrote,

non-existence

When I was a student I had much

of individual particle

Bohm's (and de Broglie's) interpre tation leads to precisely the same re

trajectories with well-defined positions

sults for all physical processes as does

and momenta. The abolition of the

the usual interpretation, as long as the mathematical theory retains its form. It

difficulty with quantum mechanics.

wave-particle duality returns us to the

It was comforting to find that even

classical status of probability as re

does, however, offer a broader con

Einstein had such difficulties for a

flecting imperfect knowledge of initial

ceptual framework which allows more

long time. . . . But in

1952 I saw the

conditions due to instability and com

general

impossible done. It was in papers by

plexity

Bohm's careful analysis in his text of

David Bohm. Bohm showed explic

causes imbedded in the context of the

the assumptions needed for the uncer

itly how parameters could indeed be

event under consideration. Probability

tainty principle to follow from the

introduced,

into

of

numerous

independent

mathematical

formulations.

non-relativistic

is no longer "intrinsic, " and a deeper

Fourier transform principle8 between

quantum mechanics, with the help

understanding of what goes on at the

.lx and Ap had been summarized as fol

of which the indeterministic de

sub-quantum-mechanical

scription could be transformed into

some day allow us to predict and per

sequence of the relation

a deterministic one. More impor

haps control the action of some of

tween the width of a wave packet, .l.x,

level

may

tantly, in my opinion, the subjectiv

these hidden vmiables, and reduce the

ity of the orthodox version, the nec

sway of probability at the quantum-me

essary reference to the "observer"

chanical level. Bohm,

could be eliminated. Bohm showed that the objections

lows: The uncertainty principle is a con

.lx Ak

�

1 be

and the range of wave numbers Ak of the

waves making up the packet, when we take into account the following quan

in presenting the Copen

tum-mechanical

principles:

(1)

The

hagen interpretation in his text, re

de Broglie relation between wave num

peatedly stressed that the uncertainty

ber

against de Broglie's earlier interpreta

principle is anchored in three

ele

and momentum: p = hlx = hk. (2) Whenever the position or momentum

tion could be overcome. The electron

ments:

(1) the wave property of mat

of a particle is measured, the result is a

is a particle pursuing a defmite trajec

ter;

the indivisibility of the energy

definite number.

tory subject to fluctuations caused by

and momentum transfers, and the re

"hidden variables" whose oscillations

lated particle properties of matter;

originate at a sub-quantum-mechanical

the lack of complete determinism.

level.

These

complex

and

(2)

(3)

unpre

dictable fluctuations are responsible

These three elements work together

for our need to resort to probability in

to form a unit that would fall apart

(3)

The wave function

!/!(x) determines only the probability P(x)

of a given position and the transformed function

(k)

determines only the pro�

ability P(k) of a given momentum.

A more general theory not consistent

with the usual interpretation is obtained

predicting the motion of electrons. The

if any one of them would be re

ensuing probability distribution devel

moved from any object in the uni

tent assumptions is abandoned:

ops and is derived from the wave func

verse. Thus all parts of quantum the

psi "field" satisfies the Schrodinger

tion

ory

equation.

of

the

Schrodinger

equation,

interlock in

such

a unified

if any of the following mutually consis (1) the

(2)

If we write

1/J = R

exp

which represents the action of a field

structure that it is very difficult to

(islh), then the particle is restricted to

guiding the particle's trajectory in such

conceive of our giving up any one

p

a way that the probability distribution

element,

semble of particle positions with a

will display typical interference and

whole quantum theory. If there were

unless we give up the

=

\l s(x);

(3) we have a statistical en

probability density P =

l !/JCx)J2.9

5L. de Broglie, Nouvelle dynamique des quanta, Comptes Rendus du Congres Solvay, 1 927, see pp. 1 1 4-1 1 6. 6J. S. Bell,

Speakable and Unspeakable in Quantum Mechanics, Cambridge University Press, 1 987, p. 1 60.

7Quantum Theory, p. 1 1 4

BThere is a tendency in many texts to label the Fourier transform property as "the Heisenberg Uncertainty Principle" without mentioning the restrictions from quantum theory for this label to apply.

90. Bohm, lac. cit. fn. 2, p. 374.


19

break down and where our sug

Bohm during the gestation and discov

(1), (2), (3) his

gested interpretation can lead to

ery period of his alternative interpre

hidden variables theory cannot be dis

completely different kinds of pre

tation and thereafter. I can testify that

tinguished experimentally from stan

diction . 1 1

Ruskai rejects Bohm's later claim that under assumptions

such pressures were there, on a dam aging scale once he departed from the

dard quantum mechanics because the Schrodinger equation holds. She refers

According t o Bohm, such kinds of pre

to the impossibility of deriving Heisen

diction might include the divisibility of

berg's hypothesis about transition prob

the quantum and hence the bypassing

Ruskai says that the publication of

abilities from the Schrodinger equa

tion. 10 This, however, does not settle

of the uncertainty principle; or, say, the

Bohm's controversial articles in the

study of the fluctuations at a sub-quan

Physical Review is evidence of the ob

the matter: Either the imputed non

tum-mechanical level responsible for

jectivity of the establishment toward

equivalence between the Heisenberg

the chaotic motion perceived as prob

one whom Einstein had labeled "the

and Schrodinger formalism has as a

abilistic behavior at the quantum-me

most promising young physicist." Yet

consequence that the wave equation is

chanical level.

his articles were received with a con

insufficient to ground all of quantum

Perhaps

spiracy of silence1 4 or summarily dis

if Bohm's ideas had not

orthodox Copenhagen view which he

had so strongly advocated in his text. 13

mechanics and thereby to validate the

been shunted aside, but accorded a

Copenhagen interpretation; or, in the

broad open forum of interest and dis

Bohm's article appeared during the

contrary case, any more general formu

cussion to sharpen and defend his

heyday of the House Committee on Un American Activities, and many mem

missed.15

lation in which the Schrodinger equa

views when they were still fresh, new

tion holds will equally well ground

experiments

ensued.

bers of the academic establishment

quantum mechanics. Is Ruskai asking of

Meanwhile, these theories offer an al

were reluctant to associate with vic

would

have

the Schrodinger equation that it allow

ternative to the mysteries associated

tims of this persecution. Quite a few

one to derive the transition probabilities

with the Copenhagen interpretation.

fmgered others to safeguard their own

Thus rather than being required to

positions. (In the same way, Bohm was

in order to play its ftmdamental role? Quite contrary to Ruskai's assertion

speak of superpositions in infmite-di

refused

that the proponents of Bohm's hidden

mensional Hilbert spaces leading to ex

Alamos because he had been named by

variables theory assert the impossibil

perimental results, one can speak of

one who later encouraged physicists to

ity of experimental verification as a

ensembles of trajectories leading to the

ignore his papers. 16) I recall at lunch a

clearance

to

work

in

Los

virtue rather than seeking new phe

same results.

J. S. Bell titled one of his

then young, upcoming member of the

nomena to explain or test this theory,

notes "Quantum field theory without

Institute saying that David Bohm had

we have this conclusion of Bohm's

observers, or observables, or measure

moved to the "lunatic fringe."

seminal paper:

ments, or systems, or apparatus, or

I remember the excitement and joy

wavefunction collapse, or anything like

David Bohm expressed to me-"I can't

that. "12

believe that I was the one to see

An experimental choice between

these two interpretations cannot be

The social context in which Bohm's

made in a domain in which the pres

theory was advanced was rife with

"think different" about quantum me

ent mathematical formulation of the

"hidden assumptions" in Ruskai's lan

chanics. He hungered for detailed re

quantum theory is a good approxi

guage. She dismisses the role of social

actions to his theory, for arguments

mation; but such a choice is con

pressures in guiding research and ad

and discussions with colleagues, dur

ceivable in domains such as those

herence to particular views, such as

ing his four years of exile in Brazil. It

associated with dimensions of order

the Copenhagen approach to founda

can hardly be said that societal pres

this! "-upon realizing that one could

of w - 13 em, where the extrapola

tional questions. I was one of those

sures guiding research directions were

tion of the present theory seems to

who were closely in touch with David

in no way a factor.

1 0See J ammer, The Philosophy of Quantum Mechanics, Wiley, 1 974, p. 289. 1 1 0 . Bohm, foe. cit. fn. 2, p. 391 . 1 2J. S. Bell, Phys. Reports 1 37 (1 986), 49-54. 1 3There, Bohm strongly disputed the possibility of hidden variables in many sections, Only at the very end does he accord them a very doubtful credence. See the dis cussion below. 1 4David Peat, Infinite Potential, Addison-Wesley, 1 997, the chapter "Brazil and Exile"; personal communication from Bohm and others at the time. 1 5Rosenfeld (quoted in Max Jammer, The Philosophy of Quantum Mechanics, Wiley, 1 974, pp. 279, 294) called Bohm's theories "empty talk" and "a short lived de

cay product of the mechanistic philosophy of the 19th century." Pauli said, "Old stuff dealt with long ago." A particularly scathing attack is in Heisenberg's essay in

Niels Bohr and the Development of Physics, McGraw-Hill, New York, 1 955, p. 1 8: "This objective 'description' reveals itself as a kind of 'ideological superstructure' which has little to do with immediate physical reality; for the 'hidden parameters' of Bohm's interpretation are of such a kind that they can never occur in the descrip tion of real processes if the quantum theory remains unchanged. In order to escape this difficulty, Bohm does in fact express the hope that in future experiments (e.g . , i n the range beyond 1 0 - 1 3) the hidden parameters may yet play a physical part, and that the quantum theory may thus b e false. Bohr, however, is wont t o say, when such hopes are expressed, that they are similar in structure to the sentence: 'We may hope, that it will later turn out that sometimes 2 great advantage to our finances. 1 6See fn. 1 4 .

20


+

2

=

5, for this would be of

Here is how Bell felt about it:

quantum mechanics is another such example. Bohm was a Marxist at the time he The essential idea was one that had wrote his text Quantum Theory, as been advanced already by de well as at the later time when he ad Broglie in 1927 in his "pilot wave" vanced his new interpretation. He was picture. But why then had Born not drawn to the Copenhagen interpreta told me of this "pilot wave"? If only tion by what seemed to him its dialec to point out what was wrong with tical nature: the complementarity of it? Why did von Neumann not con two potentialities-wave and parti sider it? More extraordinarily, why cle-each to be realized at the expense did people go on producing impos of its opposite. Ideology did not lead sibility proofs after 1952, and as re him to seek a deterministic synthesis; cently as 1978? When even Pauli, in his text he let no occasion pass to Rosenfeld and Heisenberg could deny the notion of hidden variables. produce no more devastating criti There are just two places in his text cism of Bohm's version than to where he leaves open the possibility of brand it as "metaphysical" and "ide determinism. In Section 6. 1 1 , he pro ological"? Why is the pilot wave pic poses a possible test involving a pro ture ignored in textbooks? Should it ton lens by which the uncertainty prin not be taught, not as the only way, ciple might be contradicted; and in his but as an antidote to the prevailing discussion of the WKB approximation, complacency? To show that vague he emphasizes that this procedure im ness, subjectivity, and indetermin plies definite trajectories and veloci ism are not forced on us by experi ties for individual particles. mental facts, but by deliberate It was this insight into the WKB theoretical choice? 17 method that crystallized his thoughts and led subsequently to a drastically Bohm's humanistic and philosophi different view of the Schrodinger equa cal convictions left a stamp on his tion. His new interpretation left an work Ruskai (following Heisenberg) 18 opening for possible future modifica objects to the fact that Bohm's theory tions of the theory at the sub-quantum destroys the symmetry between the po level-definitely affecting the "hard" sition and momentum representations. parts of the theory. Only at this time Bohm objected to the purely formal did David Bohm perceive that such a approach in terms of abstract repre new interpretation was indeed much sentations being taken as a sufficient more compatible with a materialist phi reflection of physical reality. He pre losophy, and come to regard the ferred to think problems through in a Copenhagen interpretation as mired in "physical" way, dealing with objective positivism. (His book Causality and material reality, and let the mathemat Chance in Modern Physics, published ics emerge from that. It is particularly several years later, clearly develops the inappropriate to claim that his ideas materialist underpinnings of the new were "outside the realm of physics." 19 interpretation.) Reverting to Loren Graham's exam The most conspicuous social force ple (Intelligencer, vol. 22 (2000), no. 3, on him in the 1950s was political per 31-36) intended to show social forces secution; as I recall it, that actually had affecting the "hard" as well as the "soft" a liberating effect. At the party cele parts of physical theories, let me ten brating the publication of Quantum tatively explore whether the genesis of Theory in the winter of 1951, he re David Bohm's hidden variables in marked to me with bittersweet irony

A U T H O R

MIRIAM LIPSCHUTZ-YEVICK 22 Pelham Street Princeton,

NJ 08540


Miriam Lipschutz-Yevick was born in Scheveningen (the name is so un pronounceable by foreigners that it was used as a password by the Dutch underground). She arrived in the USA in 1 940 as a refugee from the Nazis and has lived there since. Her doctorate is from MIT, 1 947; she was on the faculty of University Col· lege, Rutgers from 1 964 until her retirement.

She

probability and

has on

published her

in

invention,

"holographic logic." One of her dear est

nonscientific

grandchildren,

concerns

Aaron,

is

Ariela,

her and

Hannah, who appear with her in the accompanying photograph.

that perhaps he should be grateful to President Dodds of Princeton Univer sity and to the House Un-American Ac tivities Committee; for without the year's paid leave from the University while he was under indictment, he might never have come to "think dif ferent." Addendum 1. Ruskai writes, "With out the assumption of an external re ality it makes no sense even to discuss the concepts of science." Yes, but we must not confuse external reality, its

1 7J . Bell,

lac. cit., p. 1 60. 1 BHeisenberg, lac. cit. fn. 1 5, p. 1 9. 1 9"The posthumously published bock by Bohm and Hiley, cited in fn. 4, testifies that Bohm was fully informed on the latest experimental results relating to hidden vari· abies theories.

VOLUME 23. NUMBER 4, 2001

21

mathematical representation in theo

in the classical limit. This is how the

experiment. This could be due to

ries, and the interpretation of the lat

imaginary came into the wave equa

our ignorance or perhaps because

ter as explanations of phenomena.

tion. Schrodinger did not just intro

"God plays dice with the universe."

20

duce it, he needed it.

What sets de Broglie's pilot wave or Bohm's hidden variables against the

Addendum 2. It may be that there

orthodox view is not the abstract rep

is disagreement about the concept of

Probability rather comes about be

resentations

probability. Ruskai rests some of her

cause of objective contingencies and

that

are

mathematical

constructs, but the interpretation of

discussion on the notion presented by

not because of the subjective "we are

how they pertain to objective reality,

Faris in the Appendix of Wick's book.

unsure." Faris's attempt to have prob

21

"what the world is like." Remember

Faris writes at the beginning of this Ap

ability theory elucidate the "mysteries"

that both the de Broglie relation and

pendix,

may be merely transplanting them into

the Schrodinger equation were guided

(mimicking)

principle,

Probability comes about if we are

which implies agreement with reality

unsure of what will happen in an

by

the

correspondence

an

inconsistent

proba

bilistic formalism that cannot corre

spond to objective reality. 22

20See in this connection the return to the attitude "quantum mechanics works" in the article by Christopher A. Fuchs and Asher Peres, "Quantum mechanics needs no interpretation," Physics Today, March 2000. "What it does," these authors write, "is provide an algorithm for computing the probabilities for the macroscopic events that are the consequences of our experimental interventions." Bohm, like many of the early generation of discoverers of quantum mechanics, was searching for an "un· derstanding" beyond merely correct predictions of experiments based on algorithms. 21 0 . Wick, The Infamous Boundary, Birkhauser, 1 995. 22See, for instance, the chapter on Chance in Poincare, Science and Method, Dover, N.Y. 1 952; Miriam Upschutz-Yevick, "Probability and determinism," American

Journal of Physics (1 957), p. 570.

Evariste Galois

(1 81 1 -1 832)

Herbert E . Salzer

Evoked in every treatise on equations. Victim of violence, honored evermore As algebraist, and in human relations Rebel in spirit, radical to the core. Ingeniously you tamed the surds so cryptic, Symmetries, substitutions at one swoop. Thoughts you expressed in language too elliptic Epitomized the essential use of group. Groups and their subgroups grasped, new avenues Abound, new applications in the sequel. Lover of truth, firm against life's abuse. Original mind with courage hard to equal, Immortal creator of a new, productive Synthesis of inductive and deductive. 941 Washington

Brooklyn, USA

22


Avenue, Apt. 28

NY 1 1 225-2454

ll�fflJh§rr6hf¥1MQ.'i.i,ii!,ilh£j

Confusion About Bohm Mary Beth Ruskai

I

Marjorie Sen echal , E d it o r

I

n the preceding notes, Jane Cronin

It was never my intention to present

and Miriam Lipschiitz-Yevick raise a

anything approaching a complete ac

number of interesting issues in the his

count of the historical development of

torical development of quantum me

quantum theory, much less an evalua

chanics, particularly in regard to the

tion of social influences on the accep

work of David Bohm. However, they

tance of competing theories. Rather I

seem to have read my article outside

chose to illustrate specific issues with

of the context in which it was written,

examples from quantum theory. The

namely, as part of a set of articles in

rapid acceptance by physicists of a the

which it was agreed at the outset that,

ory which was paradoxical and "far

as Loren Graham wrote [ 10], "everyone

from every physicist's personal experi

agrees . . . that social, political, reli

ence" illustrates the extent to which

gious and philosophical ideas can of

convincing experimental evidence can

course affect what topics get studied

overcome social and cultural biases.

and what theories get conceived. . . . "

In quoting only a few words from my

Marjorie Senechal [ 18] reinforced this

concluding

theme in her introduction to the re

ously distorts its meaning to imply that

paragraph,

Cronin

seri

development

sponses by Michael Harris and me

I asserted that the

when she wrote, "We can agree at the

quantum theory was immune to social

outset

that

in

different times and

places scientific research has been

of

forces. Therefore, I repeat my con cluding sentences:

(and continues to be) directed by so ciety's carrots and sticks. . . . " Thus,

Few jigsaw puzzles fit together so

rather than "dismiss[ing] the role of so

neatly. We are forced to overcome

cial pressures" (in Lipschiitz-Yevick's

the biases arising from our experi

[23] words) I do not even consider

ence with the familiar macroscopic

them, for the simple reason that they

world of classical mechanics de

were not germane to the question I was

spite the challenge of resolving all

asked to comment on by the editors.

questions about the foundations of

That question was more complex

quantum theory. In the end, quan

and concerned the effect of social con lation of a physical theory and the

tum theory remains a human con struct subject, in principle, to so cial forces. But it is a theory so

process of "justification." I agree with

remarkable, so different from ordi

Graham that "different people with dif ferent views may formulate a theory in

nary experience, that it transcends social and cultural forces. (empha

quite different mathematical terms"

sis added)

text on both the mathematical formu

and that "social, political, religious, and philosophical ideas SOMETIMES af

That it is quantum theory itself, and not

fect scientists' evaluation of the evi

its development that I regard as "tran

dence for and against particular theo

scend[ing] social and cultural forces"

ries" (Alan Sokal, quoted by Graham

was further reinforced in the second

[10]; emphasis in original). However, I

paragraph of my Appendix A on gen

also argued that the need for "consis

der issues.

tency" between experiments, as well as

It is also important to clarify that I

the more commonly cited need for "re

use the term "Bohmian mechanics," as

producibility"

of

individual

experi

is now commonly done, to refer to the

ments, minimizes social influences in

theory developed by Diirr, Goldstein,

the fmal outcome.

al., in the past

et

15-20 years as reflected


23

by my reference to [2,5] rather than to

and Schrodinger formalism has as a

Bohm's original papers. This theory is

consequence that the wave equation

tem, and quantum mechanics has been

based on Bohm's work and can, I feel,

is insufficient to ground all of quan

shown

be regarded as part of his scientific

tum mechanics and thereby to vali

giant multi-particle systems as neutron

ally no such thing as an isolated sys

[14]

to accurately describe such

legacy; but it is not identical to his orig

date the Copenhagen interpretation;

stars. Ultimately, there is no reason not

inal formulation in every detail. Thus,

or . . . any more general formulation

to regard the entire universe as one gi

1

nothing I said was a "comment on

in which the Schrodinger equation

gantic

David

mechanics"

holds will equally well ground quan

Schrodinger equation. But rather than

tum mechanics. Is Ruskai asking of

resolving the matter, this merely re

Bohm was a brilliant and complex

the Schrodinger equation that it al

places one conundrum with another,

Bohm's

quantum

per se.

molecule

governed

by

the

person who made many important con

low one to derive the transition prob

about which there is an extensive lit

tributions to physics and to quantum

abilities in order to play its funda

erature.

theory. But it does not serve his mem

mental role?

Despite the failure to resolve fully the paradoxes associated with the pe

ory to insist that his theories were without flaws. Niels Bohr and Louis de

This question merits an answer and

culiar role of measurements and ob

Broglie had important and profound

commentary. It is

the critical issue. Schrodinger equation i-h -fJt i/J =

servables, the von Neumann formula

impacts on the development of quan

The

tion of quantum theory has withstood

HljJ describes the time development of an isolated system described by the

ertheless, the dilemma has given rise to

Hamiltonian H. In the usual Dirac/von Neumann formalism, 2 an observable is

which now include Griffith's "consis

Bohmian mechanics is ever verified or

represented by a self-adjoint operator

tent histories," and "spontaneous lo

falsified, Bohm unquestionably made

A, and additional axioms are needed to

calization," as well as Bohmian me

profound contributions. His reformula

describe the so-called measurement

chanics. An overview of the relation

tion of the EPR experiment (which

process. It is asserted that the only

between some of these theories and

tum mechanics by putting forth theo ries (e.g., the Bohr model of the atom) which proved, in the end, seriously flawed. Whether or not some variant of

the test of time and experiment. Nev a

number

of

alternative proposals

Cronin mentions and I will comment

possible result of measuring the ob

on in a subsequent article) and the so

servable associated with

called

suffice to give him an important place

A (where I have made the sim plifying assumption that A has only

in the history of quantum theory.

discrete spectrum). Moreover, when

Schrodinger equation is

the system is in the state

the proba

the conventional approach one needs

ak is

additional axioms about the measure

Aharanov-Bohm

effect

alone

A is an eigen

value of

ljf,

the measurement paradox was given recently in

[8,

9].

What is important in response to Lipschiitz-Yevick's question is that the

not enough. In

Is the Schrodinger

bility of obtaining the eigenvalue

Equation Enough?

1(1/J, cf>k/12

the correspond

ment process. In Bohmian mechanics,

Lipschiitz-Yevick points out that I re

ing eigenvector. It is here, in the so

ject the claim that Bohmian mechanics

called measurement process and not in

a single linear equation is replaced by a pair of non-linear equations. 3

where

cPk is

the Schrodinger equation, that the cannot be distinguished experimen

con

tentious probabilities arise.

However, in fairness, I must also admit that my previous article greatly

tally from standard quantum me

Now, at this point the reader may

oversimplified the situation when I

chanics because the Schrodinger

well be perplexed. Doesn't the system

said that the claim that Bohmian me

equation holds. She refers to the im

also interact with the measuring appa

chanics "can not

possibility of deriving Heisenberg's

ratus? Why not consider the original

from standard quantum theory . . . re lies on the fact that . . . the Schrodinger

[be distinguished]

hypothesis about transition proba

system and measuring device as a larger

bilities from the SchrOdinger equa

system subject to the Schrodinger equa

tion. This, however, does not settle

tion? It is

the matter: Either the imputed non

goes to the heart of the matter, and

need to derive that Dirac/von Neumann

equivalence between the Heisenberg

there is no simple answer. There is re-

measurement formalism from the non-

this

question which really

equation holds." Bohm, Diirr, Gold stein,

et al. were not only aware of the

1 1t is curious that Lipschutz-Yevick. who objects to my comments, makes no mention of Harris's remark that "Durr, Goldstein, et a/. may have constructed a consis tent deterministic account of quantum mechanics." By citing only works of Durr, Goldstein et a/., without even mentioning Bohm, Harris seems to leave the reader with the impression that they developed a new theory rather than building upon Bohm's seminal work. 2The widely used term "Copenhagen interpretation" is rather vague, and it is sometimes said that there is more than one variant of the Copenhagen interpretation. However, this interpretation is closely associated with a mathematical formulation put forth nonrigorously by Paul Dirac and in precise terms by John von Neumann. It is carefully and elegantly presented in his influential book Mathematical Foundations of Quantum Mechanics [20]. At least one author [21] makes a distinction between the "Copenhagen interpretation" and von Neumann's formulation, referring to it as the "Princeton interpretation." It is this latter mathematical theory that gained wide acceptance, and is often incorrectly referred to as the "Copenhagen interpretation." 3Lipschutz-Yevick also expresses some concern about the possibility of confusion between interpretations and mathematical formulations. Now, interpretations are quite subjective and almost surely subject to cultural forces. The most we can hope for is to insist on interpretations that are consistent with mathematical formulations and/or experiments. Therefore, I confine my discussion entirely to the mathematical theory associated with this pair of non-linear equations.

24


linear equations of Bohmian mechan ics; they presented cogent arguments for doing so. The Diirr-Goldstein ver sion, based on the concept of "quantum equilibrium," is sketched in [8] and de scribed in more detail in [5]. In the words of Sheldon Goldstein [7], Bohmian mechanics is "richer." However, attention has focused almost entirely on demonstrating that Bohmian mechanics yields the Schrodinger equation and a satisfactory explana tion of experiment consistent with the conventional theory. What has not been adequately explored are the ad ditional consequences of the non linearity. What About Experiments?

Before exploring the possibility of an experimental test of the mathematical reformulation of quantum theory now known as Bohmian mechanics, I want to clear up another point. Lipschiitz Yevick states that It is particularly inappropriate to claim that his [Bohm's] ideas were "outside the realm of physics." But what I actually said was quite different: It is curious that its proponents as sert the impossibility of experi mental verification as a virtue rather than seeking new phenomena to explain or test their theory. Whether or not the Bohmian view is useful, this seems to place it outside the realm of physics. (emphasis added) The word "this" has a clear antecedent which is not David Bohm, but the al leged "impossibility of experimental verification." This is hardly a novel point of view. Similar criticisms have been made about string theory (even by David Bohm [ 16]), because of the difficulty of experimental verifica tion. If one regards physics as an experimental science, then it is al most a tautology to assert that what is not experimentally verifiable is not physics. Of course, the list of ideas and topics, such as social implications,

which are relevant to physics is much test Bohmian mechanics. Quantum broader. theory has observable consequences at Lipschiitz-Yevick correctly points macroscopic scales. The non-locality out that Bohm's original paper con of Bohmian mechanics may have im cludes with the possibility of an even plications for quantum communica tual direct experimental test at do tion. Two of these are the possibility of mains less than 10- 13 em. However, his super-luminal communication and the biographer F. David Peat [ 16] also re security of quantum key distribution. ports (p. 269) that by the 1970s he "dis These issues will be discussed in a sub couraged such speculation, stressing sequent article. that his theory reproduced exactly all the predictions of conventional quan Reactions to Bohm's Theory tum theory." With few exceptions, his There is no doubt that, as Cronin and followers seem to have taken that ad Lipschiitz-Yevick point out, Bohm suf vice. fered for his political beliefs. At a min With recent advances in atomic and imum his exile in Brazil precluded him optical physics, it may now be possible from promoting his theory in person to test Bohm's theories directly. In via seminars and conferences. How deed, the recent work of Scully et al., ever, neither anti-communist paranoia summarized in [ 19] and verified inde in the United States in the 1950s nor pendently in a different experiment by the influence of the leaders of the Bohm's former collaborator Y. Ahara Copenhangen school provide a fully nov [ 1 ] , raises some serious questions. satisfactory explanation for the luke At a minimum [ 19], "A supporter of warm reception Bohm's theory re Bohmian mechanics would insist that ceived. The scientific objections, even the atom went along its Bohm trajec if not fatal, were also not frivolous. tory through one of the detectors, but Consider the reactions of Albert Ein left is mark in the other one," or [2], stein and Erwin Schrodinger, both of "there are 'measurements' of the posi whom were active and vocal oppo tion operator that are not measure nents of the Copenhangen interpreta ments of the actual position." tion. Lipschiitz-Yevick uses the term Einstein was neither vulnerable to, "sub-quantum-mechanical level" re nor a supporter of, McCarthyism. peatedly. Unlike terms such as "sub Moreover, he had supported and as atomic" which are well-defined, Lip sisted Bohm [ 16] in many ways. How schiitz-Yevick's term contains the ever, in a letter to Bohm's former stu hidden assumption that there is a lower dent, David Lipkin, Einstein wrote limit to the validity of quantum theory. ([ 16], p. 132), Although this may be the case, no such I do also not believe that the de lower bound has yet been observed ex perimentally. On the contrary, experi Broglie-Bohm's approach is very ments have now been performed down hopeful. If leads, f.i., to the conse quence that a particle belonging to to the level of 10- 10 atomic radii. (One a standing wave has no speed. This must be careful about the sense in which such distances are defined. This is contrary to the well-founded con viction that a nearly free particle figure is taken from [6] and actually should approximately behave ac refers to the wavelength of light asso cording to classical mechanics. ciated with accelerator experiments at the highest energy currently obtain able.) It seems that the domain of va Schrodinger, then director of the In lidity of quantum mechanics is now es stitute for Advanced Study in Dublin, tablished well below that which Bohm an ocean away from McCarthy and the envisioned at the time of his original House Committee on Un-American Ac tivities, was even less likely to be influ papers. However, I would like to emphasize enced by the political situation in the that microscopic observations are not United States. His reaction is described necessarily the only possible way to ([ 16], p. 132) in Bohm's own words in


25

an undated letter to Lipschiitz-Yevick. Schrodinger-4

tracted by a 1/r-potential for which the

objectivity of the establishment to

only stable trajectories are circular or

wards [Bohm]

bits. However, for charged particles, did not deign to write me himself,

Maxwell's theory of electromagnetism

but he deigned to let his secretary

implies that the acceleration in the tan

tell me that His Eminence feels that

gential direction would lead to radia

it is irrelevant that mechanical mod

tion of energy, resulting in the electron

els can be found for the quantum

spiraling into the nucleus. In quantum

theory, since these models cannot include the transformation theory,

theory, this is countered by the Heisen berg uncertainty principle. 5 To explain

which everyone knows is the real

the stability of a hydrogen atom in

heart of quantum theory. Of course,

Bohm's theory, one must assume that

His Eminence did not find it neces

the highly non-local quantum potential

sary to read my papers, where it is

conspires to permit exactly the deli

explicitly pointed out that my model

cate balance needed for a pair of op

not only explains the results of this

positely charged particles to remain in

transformation

equilibrium.

theory,

but

also

points out the limitations of this the

Finally, it is worth noting that even

ory to the special case where the

Goldstein, perhaps the strongest cur

equations are linear. . . . In Por

rent advocate of Bohmian mechanics,

tuguese, I would call Schrodinger

wrote [9], Unfortunately, Bohm's formulation involved

unnecessary

complica

ments indicate a considerable agree

tions and could not deal efficiently

ment between the two men. The formu

with spin. In particular, Bohm's in

lation of Schrodinger and von Neumann was based on

linear

transformations,

and Bohm's theory was decidedly non

vocation of the "quantum potential" made his theory seem artificial and obscured its essential structure.

imental evidence, it is hardly surprising

The issues surrounding the lack of ac

that each should prefer the theory he

ceptance of Bohm's theory seem com

had developed. Is this the picture of

un

studying the foundations of quan tum mechanics has long been far from the mainstream, it has never been suppressed. Bohm, Bell,

et al.

The papers of

were published in

reputable journals, . . . Reasonable people may disagree on the significance of a particular theory or in dividual's contribution. It is here, rather than in the physics per se, that questions of social influence are likely to arise. I have commented elsewhere, e.g., [17], on the role that gender sometimes plays. In a subsequent article, I will also dis

of the social and political climate on the development of the careers of individu als and the development of physics.

The articles by Cronin and Lipschtitz Yevick have stimulated me to think anew about a number of issues related to Bohmian mechanics, for which a full discussion requires clarification of some

linear. In the absence of decisive exper

burro or duas mulas?

It should be noted that even though

cuss the distinction between the effect

un burro. . . . Underneath the sarcasm, these com

is not supported by my statement

plex and provide fertile ground for his torians of science. Neither scientific

A subsequent objection, similar to

flaws nor social pressures alone seem

Einstein's, arose from the realization

to give fully satisfactory explanations.

that Bohm's theory implies that in the

technical issues regarding the EPR ex periment and non-locality. These will be discussed in a forthcoming article. Acknowledgments: It is a pleasure to

thank Edvamia Bahia for assistance with

Portuguese.

The

author

was

supported in part by National Science

ground state of the hydrogen atom, the

Conclusion

velocity of the electron is zero [22]. It

It is important to distinguish between

Foundation Grant DMS-0074566.

simply sits there, albeit at a random po

physics, which is an experimental sci

sition. To understand why some find

ence, and

this hard to swallow, it is worth re

The latter are most certainly

ob

1 54 i n Bohmian Mechanics and Quantum

calling that explaining the stability of a

jective. Thus, Lipschtitz-Yevick's asser

Theory: An Appraisal (ed. J. Cushing et a!.),

hydrogen atom is often regarded as one

tion that

physicists,

who are people.

not

REFERENCES

[1 ] Y. Aharanov, and L. Vaidman, pp. 1 4 1 -

Kluwer Academic, 1 996.

of the great successes of quantum the

[2] K. Berndl, M. Daurner, D. Durr, S. Gold

ory. The model of a hydrogen atom is

Ruskai says that the publication of

that of two oppositely charged parti

Bohm's controversial articles in the

mechanics," II Nuovo Cimento 1 1 08,

cles, one positive and one negative, at-

Physical Review is evidence

737-750 (1 995).

of the

stein, and N. Zangh, "A survey of Bohrnian

4The only discussion I could find of Bohm's work in Moore's biography [1 5] of Schrbdinger is a brief mention on p . 31 1 after a discussion of Schrbdinger's reaction to the EPR paradox in 1 935 that Bohm, among others, had eventually produced hidden-variable theories. However, the description on pp. 451-452 of events in the fall of 1 952, after Bohm's papers appeared, raises several interesting questions. In September, Schrbdinger wrote enthusiastically about a meeting planned for December to discuss the interpretations of quantum mechanics. However, in early October, he became seriously ill and was unable to participate in person . If his letter to Bohm was written during the months of Schrbdinger's illness and recovery, it would explain communicating via his secretary, which so offended Bohm. On the other hand, if Bohm was not invited at least to submit a paper to be read at the conference (if he were unable to travel), that was a serious oversight. 5The standard argument is that as the electron spirals into the nucleus, its position, and hence the uncertainty in its position, will become small; this then implies large momentum and large kinetic energy. In fact, this argument is flawed. However, an alternative argument following the same physical intuition can be formulated using Sobolev inequalities. See Lieb [ 1 3] for details.

26


[3] J. Cronin, "Social influences on quantum mechanics, 1 , " Mathematical lntelligencer

23, no. 4.

1 5- 1 7

[4] PAM. Dirac, The Principles of Quantum

The Flight from Science and Reason (New York Academy of Sciences, 1 996).

Mechanics (Oxford, 1 930). [5] D. Durr, S. Goldstein, and N. Zangh,

[1 2] M. Harris, "Contexts of justification," Math

"Quantum equilibrium and the origin of ab

ematical lntelligencer 23, no. 1 , 1 8-22

solute uncertainty," J. Stat. Phys. 67,

(2001).

[6] C. A. Fuchs and A. Peres. "Quantum the ory needs no 'interpretation,' " Phys. To

Mod. Phys. 48, 553-569 (1 976). [1 4] E. Lieb, "The stability of matter: From atoms to stars," Bull. AMS 22, 1 -49

day 53(3), 70-71 (March. 2000). [7] S. Goldstein, "Quantum philosophy: The flight from reason in science:" pp. 1 1 9-

M.

1 , 1 6-1 7 (2001 ).

0. Scully, "Do Bohm trajectories always

provide a trustworthy physical picture of particle motion," Physica Scrip ta T76,

4 1 -46 (1 998). [20] J. von Neumann, Mathematical Founda lation, Princeton University Press, 1 955).

[2 1 ] A. Whitaker, Einstein, Bohr and the Quan tum Dilemma (Cambridge University Press, 1 996). [22] D.

(1 990). [1 5] W. Moore, Schr6dinger: Life and Thought (Cambridge University Press. 1 989).

1 2 5 in [1 1 ] .

[ 1 9]

tion of Quantum Mechanics (English trans

[1 3] E. Lieb, "The stability o f matter," Rev.

843-907 (1 992).

no.

lntelligencer 22, no. 3, 3 1 -36 (2000). [1 1 ] P. R. Gross, N. Levitt, and M. W. Lewis,

(2001).

tification," Mathematical lntelligencer 23,

display social attributes?". Mathematical

[8] S . Goldstein, "Quantum theory without ob

[1 6] F. D. Peat, Infinite Potential: The Life and

servers- Part One," Phys. Today 51 (3),

Times of David Bohm (Addison-Wesley,

Wick,

The

Infamous

Boundary

(Birkhauser, 1 995).

[23] M. LipschUtz-Yevick, "Social influences on quantum mechanics, I I , " Mathematical ln

telligencer 23, no. 4, 1 8-22 (2001 ).

1 997).

42-46 (March, 1 998). [9] S. Goldstein, "Quantum theory without

[1 7] M. B. Ruskai, "Are 'feminist perspectives'

observers- Part Two," Phys. Today 51 (4),

in mathematics feminist?" pp. 437-441 in

38-42 (April, 1 998).

[1 1 ] .

[1 0] L. Graham, "Do mathematical equations

Department of Mathematics University of Massachusetts Lowell Lowell, MA 01 854 USA

[1 8] M. Senechal, "Between discovery and jus-


NEXUS NETWORK ]OURNAL ---- · ----

Architecture and Mathematics

1

The Nexus Network journal makes a significant contribution to scholar hip about the relationships between architecture and mathematics throu g h regular pub lication (quarterly on the I n ternet and ann ually in print) of research papers, book reviews, conference reports, reports on student projects and bibliographies related to architecture and mathematics. It also erves as a means of communication for the exchange of ideas and information between the biennial Nexus conferences on architecrure and mathematics.

uus NETwoRK jouRNAL

Volume 1 ( 1 999) I 200 pp., oftcover I € 20.00 ISS 1 590-5896 ISB 887923-22 1 -5

90 illus.

Volume 2 (2000) ISS 1 590-5 896 I S B N 887923-250-9 Successive volumes due each April.

Subscribe today! Thl' Nexus Network ]oumal [http://www. ncx us j o u rrul . co m ] . For morl' inf(mn.Itlon .1 hour thl' Nexw Network fournal .md the Nexus conferences, e-nuil Kim \\(!ill i.uns, l· ditor in Chief [ k. wi lli.!ms�1'lcont:t. it]


27

G. G. LORENTZ

Who D i scovered Ana yti c Sets? n answer to this question, which I will call Question 1 , requires the study of afascinating segment of the history ofmathematics, connected with the names of P. S. Aleksandrov (1896-1 982), F. Hausdorff (1868-1942), N. N. Luzin (1883-1 950), and M. Ya. Suslin (1894-1919). Analytic sets are also called A-sets or Suslin sets. I have chosen the term "analytic sets" because of its neutral character. In 1915-16, Luzin was a young professor at Moscow Uni versity. Aleksandrov and Suslin were his students. Luzin was an excellent mathematician. Even more important was the inspiration that he conveyed to his students, starting this way the astonishing ascent of Moscow mathematics. In what follows, I shall use the original papers [Aleksan drov, 1916; Hausdorff, 1916; and Suslin, 1917]. In 1915, Aleksandrov, in Moscow, and Hausdorff, in Bonn, were separated by the front line of World War I. In dependently, they proved the continuum hypothesis for Borel sets B in �n , which asserts that each B either is count able or has the power of the continuum. Both men used a representation, by means of closed or open sets, of all Borel sets B of a transfinite class VA,, g < fl. Each developed his representation incompletely, only as far as it was useful for the proof. Their formulas or methods, which depended on g, went only in one direction, from B to closed (or open) sets An, n = 1, 2, . . . . And they could not be inverted; that is, they were not defined for arbitrary An. A new student of Luzin, Suslin joined the investigation in 1916. As Luzin described it (see [4]), a natural question for himself and his two students was to describe Aleksan drov's representation formally. Of course, it would have

28

THE MATHEMATICAL INTELLIGENCER © 2001 SPRINGER-VERLAG NEW YORK

been desirable to find a solution that would produce all Borel sets and nothing else, and therefore would be inde pendent of transfinite numbers; I will call this Question 2. Partial answers were given by Suslin [15]. He proposed to relabel a simple set sequence {An} in a "crazy way" as a "Suslin tree":

This is possible because the set of all natural numbers n = 1, 2, . . . and the set of all finite sequences v0 = (n1, . . . , nk) of natural numbers are each countable. We write v = (nb . . . , nk, . . . ) for infinite sequences, and v0 < v if v0 is a beginning of v. Suslin defmed the set operation (2)

B= =

Y {An 1 n An1,n2 n

· · · n An� > . . . , nk n · · · }

U n Av0, v v0< v

calling it the A-operation. The union is extended over all (uncountably many) sequences v. For closed A v0 , the op eration (2) generates all Borel sets, but also many non-Borel sets. This was a partial answer to Question 2.

Sets produced by (2), the analytic sets, created a sen sation in set theory. Even formula (2) was unusual, con taining an uncountable union. Up to then we shunned unions of this type, for they could easily lead to undesir able non-measurable sets. Hausdorff called (2) an so oper ation. For him, u, o stood for countable unions and inter sections, respectively; s, d stood for uncountable unions and intersections. Thus, (2) cannot be written as (3) for this is a uo, not an so operation. The important development of 1927 was the second edi tion of Hausdorffs Mengenlehre [6] with a masterful pre sentation of the theory of Borel and analytic sets in metric spaces. He called these sets "Suslin sets." In the following period, general set theory became fashionable. In the West, books by Luzin, H. Hahn, K. Menger, and K. Kuratowski joined Hausdorff in his assignment of priorities. This fash ion was also featured in a few Soviet publications. The au thors of the historically important long memoir about set operations, Kantorovich and Livenson [8) could not be called unfriendly to Aleksandrov. But they claimed that "the first known (not elementary) analytic [set] operation is the A-operation of Suslin. With it he introduced a new and wider class of sets, viz., the A-sets. " Aleksandrov's friend Andrei Kolmogorov gave a very balanced and fair testi mony. In his review of set theory in the book, Mathematics in the USSR for 1 5 years [ 12] we read: "Suslin applied procedures of Aleksandrov's 1916 paper to discover a new class of sets of fundamental importance-the A-sets" (p. 38), and "the theory of A-sets has been fast developed by Suslin's methods" (p. 45). As we shall see later, Kolmogorov's for mulation is a good description of the Suslin-Aleksandrov controversy, except that it disregards Hausdorffs contri bution. I am not a stranger to analytic sets. In the 1930s I en joyed the geometric exposition of the theory by Luzin [ 1 1 ) , preferring it to the dry formulas of Hausdorffs book But K. Zeller and I had to use the Hausdorff version when we wanted to apply it to summability. The Riemann convergence set R(.'!l) : = {s} of a series .'!1: I'l an with real terms consists of all sums s = Lk= l ank of convergent rearrangements of .'!1. The familiar Riemann's theorem describes all possible R(.'!l): this set can be empty (for instance if an ...f+ 0); it can be any one-point set (if I I an i < oo) ; and it can be the whole real line. Now let C be a series summability method defined by a matrix. Replacing convergence of Iank by its C-summabil ity in the above definition, we get the Riemann C-set R(C, .'!1) of C and .'!1. To find the sets R(C, .'!1) for a given C is ex tremely difficult. But we proved (Lorentz and Zeller, [ 10]) that the set of the R(C, .'!1) for all C and .'!1 coincides with that of all analytic sets of the line. This was probably the first time analytic sets were used to resolve a concrete problem of analysis.

From the early 1920s, Aleksandrov occasionally claimed the A-operation as his. We now have new sources of in formation about the priority questions; they are pointing in opposite directions. Aleksandrov's reminiscences [2] were published in Uspekhi Mat. Nauk, a journal that he edited until his death. A second source is the book [4) which con tains complete stenographic reports of Luzin's 1936 trial at the Soviet Academy of Sciences. Believed lost or destroyed by the participants, a copy was found in the Academy's archives in 1993. The published volume contains enlight ening commentaries by eminent Russian historians of mathematics, S. S. Demidov and others. Here I shall ex amine only a small, but central and illustrative sector of the trial, the Luzin-Aleksandrov controversy about analytic sets. Luzin suffered political persecutions at two critical pe riods of his life. In 1930, after returning from a long and fruitful sojourn in Paris, he was attacked by E. Kol'man, a leading member of Moscow's Party Council and a profes sor at the Communist Academy (see Shields [ 14)). With horror, Luzin saw his older friend Egorov disappear into prison and die shortly afterwards. Kol'man denounced the activity of Egorov and his friends Luzin and P. A Florin skil as "fascist-tainted reactionary science inherited from the old Moscow mathematical school." To him Luzin's mathematics were idealistic, that is, opposing Marxism's materialistic philosophy. Luzin's posi tion at the university became precar ious when he refused to join the sign ers of a propaganda letter directed against the "enemies of the people." Luzin fled the university, finding a niche at the Academy of Sciences. In addition to real functions and set theory, he turned to applied mathematics, with only moderate success. Luzin's trial in June 1936 was an integral part of Stalin's Great Terror of 1936--37. Directed against all independent thinkers-in the Party, in the intelligentsia, and in the pop ulation in general-it took a staggering number of victims. Davis [3, p. 1325] estimates that one million persons were sent to concentration camps or executed during its worst year. In most cases, the victims did not even understand the reason for their arrest. To initiate the campaign against Luzin in 1936, his ene mies laid a cunning trap, prompting him to praise mathe matical work at one of the less-than-average high schools in Moscow-praise that was then used against him. Vilifi cation at universities throughout the nation and in news papers followed, with eight full-sized articles in the leading daily, Pravda, with titles like "About the so-called Acade mician Luzin" or "Enemy in Soviet Mask" Then followed the trial at the Academy, conducted in secret. Luzin had ample reasons to believe that he was fighting for his life. Indeed, the KGB had prepared compromising materials about him. His friend Florinskii, mathematician, engineer, and orthodox priest, arrested in February 1933 together with a friend, was broken by the KGB. They con-

Analytic sets are

n ot always Borel .

VOLUM E 23, NUMBER 4, 2001

29

fessed to belonging to the KGB-invented "Party for the Re birth of Russia," with a future "government" including Luzin as foreign minister and another mathematician, the acade mician Chaplygin, as prime minister (V. Shentalinsky, [ 13], pp. 1 1 1-115). This material, with potentially deadly conse quences for Luzin, was never used. Famous mathematicians formed the interrogating com mission at the Academy's trial. Of these, Lyusternik, Shnirelman, and Gel'fond already belonged to the "initi ating group" responsible for Egorov's downfall. They were joined by Sobol'ev. Luzin's former students were repre sented by Aleksandrov, Kolmogorov, and Khinchin. This revealed a split among Luzin's students: Lavrentiev and P. S. Novikov were present, but did not say a word against Luzin, a sign of civil courage, while Menshov and Nina Bari (one of the best Soviet female mathematicians) were missing altogether. Actually, Kolmogorov said very little. Among the full members of the Academy one saw the "red professor" 0. Yu. Schmidt, later famous for his Arctic expeditions, the completely mute I. M. Vinogradov, and S. N. Bernstein, the only faithful and persistent Luzin defender. Aleksandrov, who re placed Egorov as the presi dent of the Moscow Mathe matical Society, a post he was to hold for 32 years, was the natural leader of the anti Luzin group and the most ag gressive and sarcastic interrogator. Present at most sessions of the trial, Luzin had no legal counsel. Luzin had a complex, sensitive, and highly excitable na ture. His lectures were excellent, full of ideas, hypotheses, suggestions for investigation. He charmed people at the first meeting. Inspiring adoration by many of his students, he reserved his own for his French teachers Borel and Lebesgue. Sometimes he would attribute to them his own discoveries. Aleksandrov was quite different. Having enjoyed a rich cultural upbringing, he was at home with literature, espe cially German, and theater. As rumor will have it, after his disappointment in Moscow in 1917-18, he seriously con sidered a theatrical career in the Western provinces, and he gave up the idea only because of the possibility of po litical problems under the Bolsheviks. Extremely ambi tious, he befriended two of the best Soviet mathematicians, Uryson (who died prematurely in 1924) and Kolmogorov. With Uryson, he published joint papers and founded the Moscow topological school. He was a good lecturer, a witty raconteur, but his stature as mathematician was definitely below Luzin's. A strange antipathy, even hate, separated him from his teacher. At the trial, Luzin stood accused of having plagiarized from his students, in particular, of having "borrowed" from Suslin the notion of analytic sets. Aleksandrov was deeply involved. Forty years later he declared: "For me the ques-

tion of priority in this case [of the A-operation and ana lytic sets) was never indifferent, concerning my first and (probably therefore) my dearest result" (Aleksandrov, [2], p. 235). Terminology rarely plays an essential role in priority dis cussions. This case was an exception. As described in his autobiography, Aleksandrov [2) visited Hausdorff in Bonn in 1924. In his description we read: "To Hausdorffs ques tion on how the new sets should be called, I firmly replied, Suslin sets, because he was the first mathematician prov ing that they are really new [and not just Borel) sets." By not suggesting that the defining operation is also Suslin's, Aleksandrov indirectly reserved for himself the credit for the discovery of the A-operation. In his book Mengenlehre (6), Hausdorff followed this advice only partly, calling both the sets and the operation (2) Suslin's. At the time of Luzin's trial in 1936, Aleksandrov, translating Hausdorffs book, completely changed Hausdorffs Suslin-terminology to A terminology. This led to heated controversy between Alek sandrov and Luzin at the trial. Even more interesting than the terminology are Alek sandrov's following statements. In his reminiscences [2, p. 235], he said . categorically that "Suslin suggested the name 'operation A' for the new set operation I had con structed, and the name 'A sets' for the sets which result from its application to closed sets. He stressed that he was suggesting this ter minology in my honor." We compare this with Aleksan drov's words spoken at the 1936 trial ([4], p. 90): "He [Suslin) never told me that he called them A-sets in my honor. It was Luzin who formulated the term while lectur ing at Moscow University. Incautiously, he underlined this." As a faculty member at Leningrad University in the 1930s, I heard two versions of what motivated Suslin to call his sets A-sets: (1) to honor Aleksandrov and (2) to parallel the common use of B-set for Borel set. The strongest example Luzin's accusers could cite for his alleged plagiarism, a charge that eventually could not stand up at the trial, was the following. Suslin's expres sions of deep gratitude to his teacher Luzin in the intro duction of his paper [ 15) were interpreted as signs of Luzin's plagiarism, implying that they must have been written under pressure by him. Vehemently denying this, Luzin insisted that Suslin wrote the introduction alone. To this and other similar arguments that could be neither proven nor disproven, Aleksandrov offered Luzin mock ing advice: "As a sign of our past friendship, allow me, your former student who will be grateful to you all his life, to give you in this difficult moment a really sincere [piece of] advice. You would do much better to give up hotly de fending your rightfulness in cases when [defense) is im possible and to find the necessary courage and humility to accept the accusations against you."

Term i nology rarely plays an essential role i n p ri ority

d i scussions. Th is case was an exceptio n .

30


It is very fortunate for our inquiry that cooperation be tween Luzin and Aleksandrov during 1915-16 was also discussed at the trial. According to the record ([4], p. 89, p. 159), Aleksandrov expressed profound thanks to his teacher for the proposed subject for investigation, but minimized his contribution. Luzin was bound by the un written rule that demanded from a doctoral supervisor (which he in essence was) that the teacher never divulge his part in the joint work At the trial Luzin implied that he had never done this before and was doing it only un der the pressure of accusations. We can believe his tes timony because he would have been foolish to insult Aleksandrov and many of his assembled students by mis representation. This is what Luzin ([4], pp. 160-161) said to Aleksandrov in my free translation: During 1915 you always came to my dacha with pages of incorrect attempts which I revised. In spite of my con cerns, by means of tables of sets, a proof emerged for the Borel class � 4. I asked you to do this for the gen eral case. After joint work, a transfmite proof appeared. The reduction to one table [of sets] was entirely mine. (This probably meant the second table of Aleksandrov [ 1].) Mterwards, do you know what problem arose? How can the representation table of a Borel set be recon structed? This was completely my problem [Luzin's problem was one of the formulations of our Question 2]. We both worked on it. But then you asked to be excused because of the difficulty of the problem. I still possess a postcard where you wrote this. Exactly at this point, at this second table, the work of us three [Aleksandrov, Suslin, Luzin] intermingled. This allowed me to say in my lectures that it remained for you to make a small step, and the discovery [of the operation A] would be yours. But neither you nor I made this step. "I do not deny this," replied Aleksandrov. Aleksandrov's admission proves that he was not the dis coverer of operation A Suslin was, and he gave a partial answer to Question 2: Applied to trees (1) of closed sets, this operation produces all Borel sets, but also non-Borel sets. Seven years later, in 1923, Luzin and Sierpiriski gave a complete answer to Question 2. Operation A produces all Borel sets and these only if it is restricted to trees for which all terms in the union (2) are disjoint. How did the cooperation of Luzin and Suslin develop af terwards? Luzin did not say. We can assume that he sug gested his student answer Question 2. The title of Suslin's paper (which many find inappropriate), "On a defmition of B-measurable sets without transfinite numbers," clearly in dicates such a suggestion. But it is useless to guess about the extent of their cooperation. When and why did this deep animosity between Luzin and Aleksandrov develop? Aleksandrov indicated that it be gan in 1923, when Luzin, chief editor of the Mat. Sbornik, invited contributions by his friend Uryson to the journal,

but not by Aleksandrov. (At that time Luzin was more pow erful than Aleksandrov; in 1936 the relation was reversed.) More likely, the aversion started as early as 1916, when Luzin accepted Aleksandrov's resignation from the tri umvirate too easily, and helped Suslin to prepare his pa per. Working alone on the general continuum hypothesis, Aleksandrov suffered a failure, and left for the Ukraine, re turning to Moscow and mathematics a full two years later. Another variant of the history of the Aleksandrov-Luzin relationship is even grimmer. In Leningrad many mathe maticians believed that Aleksandrov was homosexual, a criminal offense in tsarist Russia, as well as in Soviet Rus sia, although rarely prosecuted. Perhaps Luzin had of fended his sensibilities in this connection. A note accusing Luzin appeared in a public statement by Kolmogorov at Moscow University in 1936. He reminded the audience of Luzin's great service to mathematics "before his moral and political disintegration." This was echoed by Aleksandrov [2], when the author told that he "found his teacher in the highest sphere of human values, a sphere that he later aban doned." Aleksandrov quoted Goethe that "each guilt finds its revenge in life." But I must discuss also the fourth participant on this scene. Hausdorffs role in the discovery of analytic sets was never properly described in the Soviet literature. The main difference between the two 1916 proofs was between the transparent Boolean set operations of Hausdorff and the "tables of sets" of Aleksandrov, inherited from a 1905 pa per by Lebesgue. Furthermore, Hausdorff started with open sets in his construction, while Aleksandrov employed their complements-closed sets. This difference is not that im portant so far as Borel sets � are concerned. For analytic sets 91, the matter is different. It is known that the com plement C(A) of A E 91 is also analytic only if A is a Borel set; in other words, that 91 n C(91) = 'lf3. There is no real symmetry between the classes 91 and C(91), however. An alytic sets coincide with the continuous images of Borel sets (Luzin); on IR\ they coincide with the Riemann sum mability sets (see above). Therefore we compute the dual of (2), obtained by tak ing complements. For the complement of A we get

where the Bv0 = : C(Bv0Av0) form a Suslin tree. Formula (4) yields, with open Bv0 , all sets B that are complements to analytic sets, and only these. Hausdorff [5] does not have (4), but Suslin trees are there, as are unions like Uv0 Bv0 ([5], p. 436), absent from Aleksandrov's paper. It is easier to guess (4) from Hausdorffs paper than to guess (2) from Aleksandrov's. However, to get analytic sets, a complement must be taken. Russian literature after 1990 about Suslin includes a


31

A U T H O R

GEORGE LORENTZ

2750 Sierra Sunrise Terrace

404

Chico, CA 95928

us a remarkable, impartial, and just exposition of the new theory. The results of the Academy-based trial deserve a sepa rate analysis in the English-language literature. It ended mildly for Luzin. Why was his life spared, why was he not expelled from the Academy? According to the editors of the Delo [4] he was saved by the highest Party echelons, perhaps even by Stalin himself. They insisted that accusa tions against Luzin should be formulated in academic rather than political terms. Accordingly, Aleksandrov stated a cou ple of times that Luzin's behavior displayed no anti-Soviet attitudes. The outcome of the trial suggested that mathe matics was a cherished science of the Party. The Golden Years of Soviet mathematics, particularly in Moscow, had begun.


BIBLIOGRAPHY

[ 1 ] P. S. Aleksandrov, Sur Ia puissance des ensembles mesurables George G. Lorentz was born in St. Petersburg in 1 91 0 and

B, Comptes Rendus Acad. Sci. Paris

162

(1 9 1 6), 323-325.

pursued a mathematical career in the Soviet Union, moving

[2] P. S. Aleksandrov, Matematicheskaya zhizn v SSSR, stranitsy au

later to Germany, then Canada, then the United States. He

tobiografii [Mathematical life in the USSR, pages of an autobiog

built and led an illustrious team in approximation theory at the

raphy), Uspekhi Mat. Nauk, Part 1 , 34, no. 6 (1 979), 2 1 9-249.

University of Texas in Austin, from which he retired in 1 980.

[3) N. Davis, Europe, New York, Harper Perennial, 1 998.

His research has spanned several fields of mathematical analy

(4) Delo akademika Nikolaya Nikolaevicha Luzina [Case of Academi

sis, including approximation and interpolation, divergent se

cian N. N. Luzin], S. S. Demidov, B. V. Levshin, eds., St. Peters

ries, orthogonal series, and number theory; he has also writ ten on history of mathematics. Two volumes of his selected works have been published by Birkhauser in Basel.

burg, RKhG I , 1 999. [5] F. Hausdorff, Die Machtigkeit der Borelschen Mengen, Math. Ann. 77

( 1 9 1 6), 430-437.

[6) F. Hausdorff, Mengenlehre, Berlin, G6schens Lehrbucherei, 1 927.

[7] V. I. lgoshin, M. Ya. Sus/in,

1894- 19 19, Moscow, Nauka-Fizmatlit. ,

1 996. [8] L. Kantorovich and E. Livenson, Memoir on Analytic Operations

good biography (Igoshin, [7]) and an article (Tikhomirov, [16]), "The discovery of A-sets." The conclusions of both authors, reached without benefit of the extensive new source [4], resemble those of Kolmogorov [12]. The proofs sketched in Tikhomirov's article are based on three essen tially different definitions of analytic sets, and on the exis tence of universal analytic sets. In the collection Kol mogorov in Perspective ([9], p.4) A. N. Shiryaev refers to the new sets simply as "A-sets (analytic sets, introduced by Aleksandrov). " W e see that Aleksandrov came very close t o what he had accused Luzin of, that is, to borrow from Suslin the definition of operation A. Suslin found it with some en couragement from Luzin. Hausdorffs attitude was com mendable. Devoted to the readership of his books and ig noring petty concerns, in his Mengenlehre [1927] he gave

32


and Projective Sets (1), Fund. Math. 18 (1932), 2 1 4-279. (9] Kolmogorov in Perspective, Editorial Board, Am. Math. Society and London Math. Society, 2000, History of Mathematics, vol. 20. [1 0] G. G. Lorentz and K. Zeller, Series rearrangements and analytic sets, Acta Math. 1 00 (1 958), 1 49-1 69. [1 1 ] N. N. Luzin, Ler;;ons sur les ensembles analytiques, Paris, 1 930, Gauthier-Villars. [1 2] Mathematics in USSR for 15 Years, Moscow GTI, 1 932. [Russian] [1 3] V. Shentalinsky, The KGB's Literary Archive, The Harville Press, London, 1 995. [1 4] A Shields, Luzin and Egorov, Mathematical lntelligencer 9 (1 987), no. 4, 24-27. Egorov and Luzin: Part 2,

ibid. 1 1 (1 989), no. 2, 5-7.

[1 5] M. Suslin, Sur une definition des ensembles mesurables B sans nom bras transfinis, Comptes Rendus Acad. Sci. Paris 164 (1 91 7), 88-90. [1 6] V. M. Tikhomirov, Otkrytie A-mnozhestv [Discovery of A-sets], Is

tor. Mat. lssled. , fasc. 43 (1 993), 1 29-1 39.

BURKARD POLSTER, AN DREAS E. SCHROTH AN D HENDRIK VAN MALDEG HEM

Genera ized F at an d With illustrations by the author A Hexagon* Based on the gospel of

GENERALITY

as proclaimed by the

POLYGONS

ost of my readers will be familiar with the sad story of my grandfather, an honourable square and eminent mathematician of FLATLAND who was condemned to lifelong imprisonment for claiming to have been abducted to SPACELAND, a world somewhere "out there" that extends our two-dimensional FLATLAND by a third di mension. Of course nobody, not even I, his grandson (a hexa gon), believed in his story until, on the eve of the new mil lennium, I myself was abducted to GENERALIZED FLATLAND. I discovered that this world extends our flat world and the worlds of graphs and projective planes in a completely nat ural manner. As our world is populated by polygons such as triangles, quadrangles/squares, pentagons, etc., this exten sion of our world contains generalized polygons, both us sim ple ones and much more complicated ones of breathtaking abstract beauty. I also found that GENERALIZED FLATLAND co incides with the land of mathematical buildings of rank 2 as conceived by one of our foremost mathematicians J. Tits. This means that all non-trivial mathematical buildings are made up of natives of this mysterious land. Preface

I will tell you my story and, as evidence of my claims, show you drawings of my abductors, the four smallest natives of

proper GENERALIZED FLATLAND. These drawings are exten sions of beautiful renderings of closely related highly ho mogeneous graphs such as the complete graph on four ver tices, the Petersen graph, and the Coxeter graph (Fig. 1). In fact, closer inspection discloses that my abductors share many of the remarkable properties of these graphs and are even more symmetric than the graphs they extend. I hope that the overwhelming evidence I have compiled will con vince even the most sceptical among you that there is re ally life "out there" beyond FLATLAND, and that we are able, and have an obligation, to claim our rightful place in full GENERALITY.

A Painting in the Sand

It was the last day of our 2000th year. I spent this all-im portant day at the site of some recently discovered ruins in the desert of OZ. After unearthing some mysterious mathematical writings and drawings in the ruins they were excavating at the time, the archaeologists in charge had in-

"Dedicated to my dear grandfather Edwin E. Abbot (1 838-1 926), the author of the infamous Flatland-a Romance in Many Dimensions [1].

© 2001 SPRINGER-VERLAG NEW YORK, VOLUME 23, NUMBER 4 , 2001

33

Figure 1. The complete graph on 4 vertices, the Petersen graph, and the Coxeter graph.

vited me to join their expedition as mathematical adviser. I had gladly accepted their offer and on that very day started deciphering the mathematical inscriptions that covered all the walls and floors. It soon became clear to me that what had been discovered here were some of the writings of the famous mathematical prophet J. Tits, in which he claims that there is a world he refers to as GENERALIZED FLATLAND that extends our world. Of course every child knows that these writings had been condemned as heresy and de stroyed a long time ago. I was afraid to reveal my discov ery to my colleagues in fear that they might destroy what turned out to be of true mathematical beauty, even though not referring to some real world as claimed by the prophet. My colleagues had already retired to their tents while I was still trying to unravel the mysteries of a pentagonal paint ing (Fig. 2) that occupied the interior of one of the rooms. After several hours of work, I summarized in mathemati cal language what I had learned so far from the inscriptions about GENERALIZED FLATLAND and its natives. GENERALIZED FLATLAND. Remember that a (point-line) geometry consists of a nonempty set of points

The geometry Of

and a nonempty set of subsets of the point set called lines, such that every point is contained in at least two lines and every line contains at least two points. Two geometries are isomorphic if and only if there is a bijection between the point sets of the two geometries that extends to a bijection between their line sets. Every graph can be interpreted as a geometry. Here the vertices of the graph are the points, and associated with every edge is a line consisting of the two vertices contained in this edge. In particular, an ordinary n-gon is a geom etry that is isomorphic to the geometry of vertices and edges of a regular n-gon in the plane, that is, one of the na tives of FLATLAND. Just as a graph can have multiple edges, that is, two or more edges that connect the same two vertices, a geom etry can have multiple lines that cannot be distinguished by just looking at the points contained in them. Let C§ be a geometry with point set P and line set L. A geometry C§' with point set P' and line set L' is contained in C§, if the following three conditions are satisfied: (1) P' � P; (2) every line in L' is contained in a line in L; and (3) no two lines of L' are contained in one line of L.

Axioms for Generalized n-Gons of order (s, f)

(Q l ) In a generalized n-gon C§ of order (s, t) every line contains points and every point is contained in t + 1 lines. (Q2) C§ does not contain any ordinary k-gons for 2

,


+ 1

:S k < n.

(Q3) Given two points, two lines, or a point and a lin ther least one ordinary n-gon in gon �·gon). To 11oid

a crowded appeatance of the IIElAGOII, the 1�t Slbsets of the 7-gon are represe��ted by lsmal) sclid bM poilts. The ines in the otJAnAA�W

correspond to partitions of the �on into 1· and 2·element subsets; see

FJ911'e 10. fer the ines of the� in terms of the labels ol its points see figure 9. Highlghted in the diagrams are the points of geometric hyper·

plalles -flll')!le jnwGLE and llU.IIIIWfQij and green lllf.l'ltGO)O . Alta remo�ng

these geometric hyperplanes from these geometries, we are left with models

of scme of the most homogeneous graplls -the complete graph on fOil vertices

in the case of the llVANGif, the Petersen graph il the case of the 0\J.UIUNGLE, and the

disjoint union olthe Coxeta graph jbl� points and blue and green Jiles) and the

Heawood graph btllow poilts and li�) il the case of the HEXAGON. Note that every

poilt of the OIGOH fonns a geometric �lane.

A· i

• d•

1:

.

.. . .:

:.'J t

7 poi nts

2 1 flags

? l ines

28 anti-flags

© � @l@ � ��g� © � ©@ @ aeoo o---o 0

Figure 8. Labels for the points of the

0

o

o--o

o

o

0

0

o

HEXAGON.

PGCISj 2, 2). [8] for more details about this representation 2, 2).

manner. Note that the new labels correspond in a natural

over the field with two elements, for short

way to all 1-,

See [ 10) and

ing of the

2-,

and 3-element subsets of the set consist

7 vertices of the underlying 7-gon. In terms of the 9 essentially different kinds of lines of the HEXAGON; see Figure 9. "It is clear that every symmetry and duality of the TRI new labels there are

ANGLE

induces a symmetry of the

HEXAGoN.

Encoded in the

labels is an order 7 symmetry of the TRIANGLE and an order 2 symmetry that corresponds to a duality of the TRIANGLE. Using the labels, it is easy to reconstruct my shadow; see Figure

of

PGCISl

-

-

"As you have already observed, the rule that assigns a

new label to one of the original labels can also be stated in terms of the operation EB. Here of the underlying

S consists

of the vertices

7-gon, and if a label consists of two Fano

triangles A and B (sets of three vertices), then the new la bel is A EB B. "With the above remarks it should be clear to you that my H-points coincide with the points of the 5-dimensional

7."

1: "I understand all this. Except for the step where you re

projective space PG(5,

place the original labels by new labels. It seems that the

1: "I think I know what you are getting at. Your lines are

2). Furthermore, . . . "

new label associated with a label containing two Fano tri

also . . . wait, let me double-check this . . . Yes, any two H

angles is either the symmetric difference of the two trian

points on any of your H-lines E9-add up to the third H-point

gles or the complement of this difference."

on this H-line. " HEXAGON:

HEXAGoN:

"Exactly! This means that

I am a subgeometry

right at the center of this projective space, which is an im

Strength in projective spaces

"Ah, yes that is correct. In fact, the main source

portant source of power for me."

of our power can be explained using the mathematical op

1: "So there really are beings that live in spaces of a di

eration that corresponds to this 'step.' Let S be a set with

mension greater than two, just as my grandfather claimed

lsi > 1 of elements, and let Sv2 be the set of all nonempty subsets of S with fewer than ISI/2 elements. an odd number

If A, B E

S112, A

(although this dimension is quite different from the 'tangi ble' dimensions he had in mind!)."

=I= B, let D be the symmetric difference of

A and B and define A E9 B to be D if D E

S112 or S \ D oth

Hyperplanes, H eawo od graph, and Coxeter graph

erwise. We define a geometry 'fi(S) whose point set is S and

1: "How miraculously all this fits together! But I am sure

whose lines are the sets {A, B, A E9 B} where A and B are

that there is much more beauty hiding in your shadow. For

distinct elements of S. Every line in this geometry contains 3 points. Furthermore, given two points the third point on the line is always

P and Q

P E9 Q.

on a line,

This implies

example, I just noticed that every one of the H-line labels in Figure

9 contains exactly one isosceles triangle. This

seems to suggest that the H-points that correspond to these

that any two points in the geometry are contained in ex

labels form a very special set of points."

actly one line. Closer inspection reveals that the geometry

HEXAGON:

is isomorphic to the projective space of dimension

ing you to be our messenger! Your remark reminds me of

2 1 poi nt/l ine/flags

Figure 9. Labels for the lines of the

42


HEXAGON.

lSI

-

2

"We have indeed made the right choice in select

48 flag/anti-flag/anti-flags

something else we should talk about. By now you will prob ably have guessed that the kind of conversation we are hav ing is extremely dangerous. It is only possible during the first hours of a new millennium, because at this time the BUILDINGS we are part of are too busy celebrating to broad cast every word that is said to the rest of (thick) GENERAL IZED FLATLAND. To be able to communicate with us even af ter your return to FLATLAND, you have to know a little about the flat subgeometries that my different kinds of H-points and H-lines correspond to. "A geometric hyperplane H of a geometry is a set of points such that every line either contains exactly one point of H or is completely contained in H. The set of all flag H points (isosceles triangles) is a special geometric hyper plane that intersects every H-line in exactly one point (every one of the labels in Figure 9 contains exactly one such triangle). Imagine that we remove the points of this hyperplane from me and my H-lines. Then we are left with two famous graphs: the Coxeter graph, and the double of the TRIANGLE, which in FLATLAND is also known as the Rea wood graph. "The vertices of the Heawood graph are the H-points that correspond to points and lines of the TRIANGLE. The edges of this graph are induced by the H-lines of the point/line/flag type. The picture of the Heawood graph right in the mid dle of my shadow in Figure 7 corresponds to Figure 6. "The vertices of the Coxeter graph are the H-points cor responding to the anti-flags of the TRIANGLE. The edges of this graph are induced by the H-lines of the flag/anti flag/anti-flag type. This corresponds to a well-known rep resentation of the Coxeter graph; see [6]. Also, the picture of this graph in the middle of Figure 7 corresponds, via some obvious rearrangements, to the most famous repre sentation of this graph depicted in Figure 1 (three 7-gons joined together via 7 extra points). "By the way, the presence of a special hyperplane as above distinguishes me from my dual. Also, after you are back in FLATLAND I will keep these two graphs immersed in FLATLAND so that you can communicate with me via either one of them." Misfortune Strikes

At this moment the BUILDING we were hiding in started shak ing violently. "We are discovered! Dear friend, always remem ber what we have told you today, and no matter what hap pens now you should be able to find me and my brothers again and finish what we have begun. Beware of the OCTA GON in the PENTAGON, because . . . " " THUNDERING VOICE: HEXAGON, you and your brothers have committed the heinous crime of communicating with the thin ones. For this you will suffer the terrible fate of doubling. " At this moment the ceiling slammed down on my new friend and me, and we were both squashed back into FLAT LAND. When I regained consciousness it was morning, and HEXAGON:

I found myself in the very room where all this had started. I automatically assumed that the night's adventure had been a dream induced by what I had read on the walls. But then I discovered that all the writings had vanished and that none of my companions was anywhere to be seen. I also found, to my utter amazement, that my gonality had been raised to 12-1 had been doubled. Although still somewhat shaken, I immediately started looking for the doubles of the POLYGONS-to no avail. I realized that, using my doubled IQ and the unprocessed notes in my notebook, I first had to deduce as much as possible about the POLYGONS and their doubles; then, to convince you my fellow flatlanders of their existence, locate their whereabouts in FLATLAND, and with their help claim our rightful place in full GENERALITY. The QUADRANGLE and the DIGON

It was a long journey back home. I spent most of the time organizing my notes and developing a mathematical theory of GENERALIZED FLATLAND. Following the procedures the HEXAGON had introduced me to, it was easy to show that the QUADRANGLE has 15 points and 15 lines, that its diameter is 4, that D5 = 1, Di = 3, Ifj = 6, Dti = 12, D4 = 8 for all vertices of the QUADRANGLE, and that these numbers suffice to recognize the QUADRANGLE among geometries. I also found a geometric construction of the QUADRANGLE as a derived geometry at a point of the HEXAGON; see [3]. However, this construction is rather com plicated, and executing it within the shadow of the HEXA GON yields a model of the QUADRANGLE with only very few symmetries. After two sleepless days and nights, I finally succeeded in reconstructing the shadow that I first saw in the ruins. The Shadow of the QUADRANGLE revisited Let S be the set of vertices of a regular pentagon. The points of the shadow are all elements of 8112 , that is, all 1- and 2-element subsets of S. The lines are the partitions of S into two 2-element subsets and one 1-element subset of S. Then there are essentially 3 different kinds of points and 3 different kinds of lines, as illustrated by the labels in Figure 10. Of course this representation parallels the representation of the HEXAGON as a subgeometry of the projective space PG(5, 2) and identifies the QUADRANGLE as a subgeometry right in the middle PG(3, 2). Using the labels, it is possible to reconstruct the shadow of the QUADRANGLE as in Figure 7.

Just like the the QUADRANGLE also contains geometric hyper planes that intersect every line in exactly one point. One is visible right in the centre of its shadow. It consists of the five 1-point subsets of S. If we remove the points of this hyperplane from the QUADRANGLE and its lines, we are left with the famous Petersen graph. Also, the picture of this graph in the diagram of the QUADRANGLE in Figure 7 corresponds to the most famous representation of this graph depicted in Figure 1 (two 5-gons joined together). I Geo metric hyperplane and Petersen graph

HEXAGON,

VOLUME 23, NUMBER 4 , 2001

43

� �

Figure 10. The points and lines of the

QUADRANGLE.

assume that the QUADRANGLE planned to stay in touch with us in this form. For completeness' sake I remark that the lines of the TRI ANGLE are geometric hyperplanes. After deleting one of these hyperplanes from the TRIANGLE, we are left with the complete graph on four vertices. As you are probably aware, this graph, the Petersen graph, and the Coxeter graph are almost as homogeneous as the POLYGONS they are contained in; see [2]. The derived geometry and from DIGON to QUADRANGLE You are asking me where in all this the DIGON fits in? Although I never had the honor of meeting the DIGON, I found it very easy to reconstruct its shadow (see Fig. 7). Note that it contains 3 points and 3 lines, and that every line contains all the points. Your first reaction may be similar to mine when the QUADRANGLE first introduced me to generalized digons: "What's the big deal?" Well, it turns out that there is a labelling of the QUADRANGLE in terms of the DIGON that is the direct equivalent of the labelling of the HEXAGON in terms of the TRIANGLE: The points of the QUADRANGLE are the points, lines, and flags of the DIGON. There are two kinds of lines. The lines of the first kind are of the form (p, L, (p, L} }, where {p, Lj is a flag of the DIGON. The lines of the second kind are of the form { {p, Lj, {q, Ml, {r, N}} such that {p, q, rj and {L, M, Nj are the point and line sets of the DIGON.

The Doubles of the POLYGONS

It seems obvious to me that the POLYGONS intended to be present in FLATLAND in the form of some special graphs. Ac cording to their original plan they would be surveying proper GENERALIZED FLATLAND by using only the points of one of their special geometric hyperplanes, with the rest of their bodies immersed in FLATLAND (in this form they are almost invisible). If this is what they are doing, then to get in touch with them we have to locate the graphs in Figure 1 and Fig ure 6. Of course it is also possible that even surveying just using a geometric hyperplane is too risky at the moment and they are existing only as their doubles and are fully im mersed in FLATLAND . My investigations had confirmed my belief that the POLY GONS had revealed their most symmetric shadows and sub-

Figure 1 1 . A special path in the

44


QUADRANGLE.

geometries to me. I therefore proceeded to reconstruct the most symmetric representations of their doubles. I had already encountered an attractive picture of the double of the TRIANGLE in Figure 6. Also, it turned out that the double of the DIGON is the complete bipartite graph on 6 vertices in Figure 3. Of course this meant that, without my realising it at the time, the DIGON had been present in this form throughout my conversations with his brothers right next to their shadows. To construct the best picture of the double of the QUAD RANGLE, I considered the path in this geometry depicted in Figure 1 1 . Since this is a path, two of its adjacent vertices correspond to a flag in the QUADRANGLE. Furthermore, this path contains the different kinds of points and lines in Fig ure 10 exactly once, except for its beginning and its end, which are two points of the same kind. If we fit together the 5 images of this path under rotations of the 5-gon un derlying the labels, we arrive at a path that contains every point and line of the QUADRANGLE exactly once and is in variant under the rotations. This enables us to draw a pic ture of the double such that the vertices of the graph are the vertices of a 30-gon, two adjacent vertices of the 30-gon are connected by an edge, and rotations through 360/5 de grees around the center of the 30-gon leave the double in variant. Figure 12 is a picture of the double that has been constructed in this way. This also shows that the QUAD RANGLE contains 15-gons like the one I saw in the ruins and that it is self-dual. Note that the reflection through the ver tical symmetry axis of the diagram corresponds to a dual ity of the QUADRANGLE. Figure 13 shows a similar path in the HEXAGON which can be used to model the double of this geometry on a regular 126-gon such that two adjacent vertices of this polygon are connected by an edge, and rotations through 360/7 degrees around the center of the polygon leave the double invari ant. See [9, Section 13.5] for a picture of the double that has been constructed in this way. Where to From Here?

When I finally arrived back in my hometown, I discovered that in my absence I had been accused of high treason and the police were looking for me everywhere. All this re-

Figure 1 2. The double of the

QUADRANGLE,

a generalized octagon.

minded me so much of what had happened to my grandfa ther. Of course I was only a boy when he first told me about his abduction, and at that time his story sounded like the ramblings of a madman to me. But now that I had been ab ducted myself and reconsidered what he had told me with my doubled intellect, it all made perfect mathematical sense. So, why had he been locked away for something that

Figure 13. A special path in the

our incredibly intelligent multigonal rulers should have rec ognized as the truth? And why were the authorities after me all of a sudden? I needed time to think. Since the po lice were looking for a hexagon I did not have to fear too much, of course. But the HEXAGON had warned me to beware of the "ocTA " GON in the PENTAGON. What had he meant by this? A (gen-

HEXAGON.


45

eralized) octagon in a (generalized) pentagon? There must be infinitely many such combinations! On the other hand, the way he had pronounced PENTAGON and OCTAGON was very similar to the way he pronounced the names of his brothers. Did this suggest that I had to look for the smallest thick gen eralized pentagons and octagons and that these were per haps somehow related to the POLYGONS? I returned to my studies, and after a couple of weeks of hard work I uncov ered some more fundamental properties of generalized poly gons that suggested an answer to my problem. All generalized n-gons we have to worry about are fi nite, that is, both their point and line sets are finite sets. Remember that by Axiom Q1 a generalized n-gon C§ is of order (s, t), s, t ;::;: 1, if every line contains s + 1 points and every point is contained in t + 1 lines. If s = t, we also say that C§ is of order s. This means that the POLYGONS are the generalized polygons of order 2. Also, we ordinary n-gons are, up to isomorphism, the unique generalized n-gons of order 1. A generalized polygon is slim if either it or its dual is of order (2, m) for some m > 2. If C§ is not an ordinary n-gon, then, by a celebrated result of Feit and Higman [7] (contemporaries of the prophet J. Tits), n = 3, 4, 6, 8, or 12, and, if n = 12, then C§ is slim. The smallest slim generalized n-gons can be shown to be unique up to isomorphisms and duality. These geome tries are the generalized 2-, 4-, 6-, 8-, and 12-gons of order (1, 2) and their duals. The first (trivial) geometry is the graph consisting of 2 vertices that are connected by 3 edges (this is the DIGON minus one of its points, that is, minus one of its geometric hyperplanes). The remaining four geome tries are the doubles of the POLYGONS. This means that all smallest non-trivial generalized polygons are related to the

mathematicians are writing and mathematicians are exactly the audience able to appreciate this report for what it is, I am submitting this account to a popular international math ematical journal, the perfect forum for subversive mathe matical writings. For a more detailed exposition of the mathematical the ory of generalized polygons and the all-encompassing the ory of mathematical buildings, see the recently discovered manuscripts [4] , [ 12], [ 14], [16], and [ 17]. See [5], [9], [10], [11], and [ 13] for further information about the POLYGONS. Enough said, my dear fellow flatlanders. Go forth and seek out the POLYGONS and then onwards to full GENERALITY! REFERENCES

[1 ] Abbot E.A. Flatland-A Romance in Many Dimensions, with illus trations by the author A Square, 2nd Edition originally published in 1 884 is available for free download from many literature archives and private websites on the internet. [2] Biggs, N. Three Remarkable Graphs, Can. J. Math. 25 (1 973), 391 -41 1 . [3] Bloemen, I. and Van Maldeghem, H. Generalized hexagons as amalgamations of generalized quadrangles. Eur. J. Combin. 1 4 (1 993), 593-604. [4] Brown, K.S. Buildings. Springer-Verlag, New York-Berlin, 1 989. [5] Cohen, A.M. and Tits, J. On generalized hexagons and a near oc tagon whose lines have three points. Eur. J. Combin. 6 (1 985), 1 3-27. [6] Coxeter, H . S . M . My Graph, Proc. London Math. Soc. 46 (1 983), 1 1 7-1 36 . [7] Feit, W. and Higman, G. The nonexistence o f certain generalized polygons. J. Algebra 1 (1 964), 1 1 4- 1 3 1 . [8] Pickert, G. Von der Desargues-Konfiguration zum 5-dirnensionalen

POLYGONS.

projektiven Raum mit 63 Punkten. Math. Semesterber. 29 (1 982),

So, obviously, there are no non-ordinary generalized pen tagons. Hence the PENTAGON must refer to something em bedded in FLATLAND. Of course the shape of most of our build ings here in FLATLAND is that of a pentagon and the building that houses the best-kept secrets of our government is THE PENTAGON. Could that be it? Was the HEXAGON trying to warn me of my own government? All of a sudden everything seemed to make sense. Clearly, the OCTAGON was a thick gen eralized octagon that had immersed one of its multigons into FLATLAND and under the pretence of being a circle was rul ing our land. Further study revealed that this ocTAGON is most probably a generalized octagon of order (2, 4) having 1755 points and 2925 lines. So far I have been able to show the existence of only one such octagon. As I suspected it is a distant relative of the POLYGONS: Its derived geometry is the unique generalized quadrangle of order (2, 4) which in tum contains the QUADRANGLE. I believe that this generalized oc tagon is unique but have not yet been able to prove it. Following this discovery I joined the mathematical un derground. Since governments are not interested in what

5 1 -67.

46


[9] Polster, B. A Geometrical Picture Book, Universitext Series, Springer-Verlag, N .Y. , 1 998. [1 0] Polster, B. Centering small generalized polygons- projective pot tery at work, submitted. [1 1 ] Polster, B. and Van Maldeghem, H. Some Constructions of small generalized polygons, to appear in J. Combin. Theor. Ser. A. [1 2] Ronan, M . Lectures on Buildings. Perspectives in Mathematics, 7. Academic Press, Boston, 1 989. [1 3] Schroth, A. E. How to Draw a Hexagon, Discrete Math. 1 99 (1 999), 6 1 -7 1 . [1 4] Thas, J.A. Generalized polygons. in: Handbook of Incidence

Geometry, pp. 383-431 , North-Holland, Amsterdam, 1 995. [1 5] Tits, J. Sur Ia tialite et certains groupes qui s'en deduisent, lnst. Hautes Etudes Sci. Pub/. Math. 2 (1 959), 1 3-60. [1 6] Tits, J. Buildings of Spherical Type and Finite BN-Pairs. Lecture Notes in Mathematics 386. Springer-Verlag, Berlin-New York, 1 974. [1 7] Van Maldeghem, H . Generalized Polygons. Birkhii.user, Basel, 1 998.

I

A U T H O R S

BURKARD POLSTER

ANDREAS E. SCHROTH

HENDRIK VAN MALDEGHEM

Department of Mathematics and Statistics

lnstitut fUr Analysis

Department of Mathematics

P.O. Box 28M

TU Brau nschweig

University of Gent

D-381 06 Brau nschweig

9000 Gent

Monash

University, Victoria 3800 Australia

Germany

Belgium

e-mail: Burkard. [email protected]

e-mail: a.sch roth@tu -bs. de


http://www.maths.monash.edu.aul-bpolster

http://fb 1 .math. nat . tu - bs .de/-top/aschroth

http://cage.rug.ac.be/-hvm

Burkard Polster joined the mathematical

Andreas E. Schroth was forced into the

Hendrik

underground while studying arcane territo

mathematical underground because his

to life i n the mathematical underground by

van Maldeghem was condemned

ries of finite and topological geometry. He

work on the connection between circle

his addiction to numbers. Some of the

has been on the run ever since, hastily

planes

numbers by which he lives:

completing his doctorate and working at

verged dangerously close to circle-squar

6: his favorite number. His work on gen

eight universities on three continents over

ing. Cycling across the Indian subconti

eralized hexagons earned him the 1 9g9

and

generalized

quadrangles

the last sixteen years. To maintain razor

nent, he developed a persistent attach

Hall Medal of the Institute of Combinatorics

sharpness for this hectic existence, he

ment to vegetarian Indian food (which he

and Applications.

practices daily: juggling, sculpting soap

both cooks and eats) and to bollywood

bubbles,

and creating ambigrams. Some

of these ambigrams have graced The Mathematical lntelligencer.

movies (which he only watches).

40000: the number of kilometers he ran before the age of 38-and the approximate circumference of the earth. 4/4:

the usual meter of the folk-rock

band Lezzamie, in which he plays an elec tronic drum.


47

li,i$?.ff'l . i§,fih£ili.II!QBM

The Magic Square on Sagrada Fam il ia Pieter Maritz

D i rk H uylebro u c k , E d itor

B

arcelona is probably best known for its architecture. There are many fascinating structures reflecting the art nouveau movement, known in Catalonia as Modernisme, with the city's most famous architect, Antoni Gaudf, represented by some ten differ ent works. Antoni Placid Guillem Gaudf i Cor net was born June 25, 1852, in the province of Tarragona [1]. At age eleven he entered the Col.legi de les Es coles Pies in Reus, located in the an cient convent of Sant Francese. In 1868 Gaudf moved to Barcelona to study ar chitecture. He fulfilled his military ser vice requirement during the years 1874-1877. His first large project was workers' housing in a factory, the Co-

I

most famous work, the finest example of his visionary genius, and a world wide symbol of Barcelona and of Cat alonia. This neo-Gothic project was ini tially managed, in 1882, by Francese de Paula del Villar i Lozano, Gaudf's former professor, who volunteered to carry out the ideas of Josep Maria Bocabella, chair of the Associaci6 Espiritual de Devots de Sant Josep. Martorell was part of the Temple Council. He dis agreed with del Villar about the mate rials that should be used to make the pillars, and, when they couldn't reach agreement, del Villar stepped down. Bocabella offered the position to Mar torell, who, because of the situation, did not accept but proposed his young assistant, Gaudf, who immediately ac-

A worldwide sym bol of Barcelona and of Catalo n ia .

D oes your hometown have any mathematical tourist attractions such as statues, plaques, graves, the caje where the famous conjecture was made, the desk where the famous initials are scratched, birthplaces, houses, or memorials? Have you encountered a mathematical sight on your travels? .(f so, we invite you to submit to this column a picture, a description of its mathematical significance, and either a map or directions so that others may follow in your tracks.

operativa Mataronense (Matar6 Coop erative). The project was intended to improve the workers' quality of life, but Gaudf's project was ahead of its time, and only one section of the factory and a kiosk were built. Gaudf was disap pointed, but the presentation of his project at the Paris World Fair in 1878 marked the beginning of his fame. There he also presented a showcase for pret-a-porter gloves from the shop of Esteban Comella, thanks to whom he met the man who would become one of his best friends and patrons, Eusebi Giiell. After the Paris World Fair, Gaudf decorated the Gibert pharmacy in Barcelona and collaborated with the architect Martorell on various jobs. Sagrada Familia

Please send all submissions to

Gaudf's relationship with Martorell al lowed him to take over management of

Mathematical Tourist Editor,

El Temple Expiatori de la Sagrada Familia ("Expiatory Temple of the

8400 Oostende, Belgium

Holy Family") near Avinguda Diagonal in Barcelona. This became Gaudf's

Dirk Huylebrouck, Aartshertogstraat 42, e-mail: [email protected]

.

.

cepted. In 1883, Gaudf officially took control of the project. Gaudf wanted to create a "20th cen tury cathedral," a synthesis of all his ar chitectural knowledge with a complex system of symbolism and a visual ex plication of the mysteries of faith [2]. There would be three fa A C B.

which is the same,

A+ Ac

ticle, "Sur l'axiomatique des espaces de Hausdorff. " In his article "Les ensembles fermes

Frechet had raised the problem of

et les fondements de la topologie"

characterising a Hausdorff space using

[ 1 94la],

Monteiro

showed

that the

collaborators at the Centro de Estudos

the derivation operation as a primitive

most general spaces of type ( v) whose

Matematicos de Lisboa) started by

notion. Monteiro solved this problem

topology is uniquely determined by

characterising topological spaces by

in his work [ 1940c] "Caracterisation

knowledge of the family of closed sets

means of primitive notions such as

des espaces de Hausdorff au moyen de

are precisely those where the closure

frontier, closure, and derivation opera

!'operation de derivation." Frechet had

operator of each proper subset

tions. In these first research papers, the

said that it would be of use to have two

isfies the condition

names

of

Birkhoff,

Frechet,

A

=

A.

A,

sat-

Kura

definitions for normal and completely

towski, Hausdorff, and Sierpinski were

normal spaces, one based directly on

Antonio Almeida Costa and

often cited.

the choice of neighbourhoods and the

Abstract Algebra

other on the operation of derivation,

Antonio Almeida Costa was born in

The article, "Sur l'axiomatique des

(v)"

[ 1940a] contains Mon

and in his paper Monteiro carried this

humble surroundings as the illegiti

teiro's first results on the study of the

programme through. H. Ribeiro then

mate son of an unmarried dressmaker,

foundations of abstract topology. This

went on to use Monteiro's idea to char

Maria de Jesus Costa, in Santa Maria

was written in collaboration with Hugo

acterise, by means of the derivation op

da Vila, a municipality in Celorico da

Ribeiro and contains some results on

eration,

Beira, on 25 May 1903, and died in Lis

neighbourhood spaces (spaces of type

pletely nonnal

e

bon on 24 August 1978. At the age of

(v)), a notion presented by M. Frechet

Silva also contributed to the axiomati

nine, he went to the grammar school in

in his article "Sur Ia notion de voisinage

sation of Hausdorff spaces, presenting

Guarda and there completed his sec

dans les ensembles abstraits," in 1917).

a characterisation of Hausdorff spaces

ondary-school studies in July 1919.

espaces

A space X is a space of type (v) if the derived set

A'

of a set

A

is the set of

the points x in X such that every neigh

regular,

normal, spaces.

and

com

Sebastiao

by means of the primitive notions of frontier, edge, and boundary in his

ar-

In 1920 he completed, with distinc tion, some studies in chemistry and

bourhood of x has at least one element belong to the set A

- x.

It seems to be Monteiro and Ribeiro who for the first time characterised the spaces (v) using primitive notions dif ferent from the usual notions of de rivation or neighbourhood. Partial re sults

in

this

direction

had

been

obtained by Kuratowski (in his work on the closure notion, 1933) and by Zarycki (using the concepts of frontier, interior, exterior and edge, 1927), but they only considered particular neigh bourhood spaces, the so-called acces sible spaces of Frechet. (At the time they began their research, Monteiro and Ribeiro did not know about the work of Zarycki.) Hugo Ribeiro had al ready presented an axiomatisation of Frechet's topological spaces using the primitive notions of closure, interior, edge, frontier, border, and exterior, in his first research paper, "Sur l'axioma tique des espaces topologiques de M. Frechet." Monteiro and Ribeiro proved that it was sufficient to add a third ax iom to the axiom system for Frechet's topological spaces to obtain an axiom system for spaces of type (v). So, for example, to characterise a space of type (v) using the closure operation

Figure 6. Ant6nio Almeida Costa.

VOLUME 23. NUMBER 4. 2001

61

mathematics at the University of Lis bon and then went on to the degree in Mathematical Sciences, in the Faculty of Sciences at the University of Oporto, in October 1924. He distinguished him self there by winning the Gomes Teix eira and Gomes Ribeiro prizes for math ematics, and graduated as the highest ranked student in the Faculty of Sci ences. In due course he became an aux iliary professor of applied mathematics at his alma mater. In November 1933, he was made the first Official of the Ad ministrative Services of the Santa Casa da Misericordia, the charitable institu tion where he had once worked as an office boy and which had provided fi nancial support for his studies. As a re sult, he was invited to become a mem ber of the Municipal Council of Oporto. There he remained, directing the Edu cation Section, from May 1936 to Sep tember 1937, when he left to take part in a foreign exchange to Berlin. He had applied to the Junta de Ed uca 0; } ,

'

. . . ' q

And this representation of s(n) gives rise naturally to the generalized hypergeometric function (the authors of "A = B" more usually call this a hypergeometric sum) with p numerator parameters and q denominator parameters:

Hypergeometric series with p = 2, q 1 (though they were not called that) were studied by Wallace, Newton ( - 1664) and Stirling (1 730) in connection with the rectifica tion of certain algebraic curves. Euler in 1778 discussed the general series of this type, though again without using the term hypergeometric. Gauss thoroughly investigated this se ries in a large number of published and unpublished works, beginning in 1805. Today the corresponding function is called Gauss's hypergeometric function, or simply the hypergeo metric function, though it was in only relatively recent times that Kununer (1836) applied the term hypergeometric to Gauss's series. Pochhammer (1890) and Barnes (1907) de veloped the notation for the general function. The InteUi gencer [vol. 7, no. 2, 1985] contains a nice survey article by W. K. BUhler on the history of hypergeometric functions. There are four cases: =

(i) If one of the a1 is a negative integer or zero, the series always makes sense, for it terminates. The result is a polynomial in z. Failing (i), then (ii) If q 2: p, the series converges uniformly on compact subsets to an entire function of z (of finite order q + 1 - p); (iii) If p = q + 1, the series converges and thus defines an analytic function of z on compact subsets of lzl < 1. The function may be analytically continued into the complex plane cut along [ 1 , oo]. (iv) If p > q + 1, the series diverges for all z =!= 0. However, the series may still be computationally very useful, for instance, as a member of an umbral calculus of formal power series about 0. As

an example, the series of the squares of the binomial z1��k : coefficients is such a sum, since � = ( - 1) k c

()

()

� n2

k�O

k

=

lFl

( -n -n ) ,

1

'

; 1 .

Similarly, sums of all integer powers of the binomial coef ficients are hypergeometric. Another apparently frivolous example is the situation where a drunkard is climbing around in 3-space, occupying unit lattice points. (Maybe the bar is adjacent to a playground that has an enormous jungle gym installed on it.) The drunk ard starts out at the origin (the bar) and makes consecutively any one of the six movements (:± 1, 0, 0) (0, ± 1, 0), (0, 0, :± 1) with equal probability. If an denotes the number of ways of going from the origin back to the origin in 2n steps, then

an

4n (112)n

=

"' :J-L. 2 -

n.'

( -n, -, n, 1 1

)

112 . 4 , .

X

I

n �o

6�':,

=

1.51638 . . . .

Thus u = .34053 . . . . It is known that if the drunkard moves in 1-space or 2-space, the probability of return to origin is 1. I sometimes get snickers when I mention in a talk that the difference of the 3-space behavior from the lower-di mensional behavior is the basis for all life on earth, but it's true!3 The constant .34053 . . . is called P6lya's constant. P6lya investigated the situation in 1921. Another example comes from numerical analysis and ap proximation theory, and the arcane topic of Pade approx imants. The Legendre polynomials Pn (x) satisfy the recurrence formula

(n + Po

=

1)Pn + 1 1,

PI

=

=

(2n + 2x

-

1)(2x - 1)Pn - npn - 1 , n = 1, 2, 3, . . . , 1.

Obviously, Pn(X) is a polynomial in the variable x o f exact degree n. It can be shown that the Legendre polynomials are an orthogonal set on the interval [0, 1 ] , i.e., that

f

Pn(X)Pm(X)dx

=

m,n = 0, 1,2,

hn =F 0,

8m,nhn,

. . . .

This orthogonality formula has many consequences, one of which I will explore. We defme a set of functions, p�(x), by the formula

p�(x)

=

1

1 Pn(X) - Pn(t)

0

X-

t

dt, n

=

0, 1, 2, . . .

=

p�(x)

Pn(X)

=

=

(Note the above hypergeometric series terminates.) The probability of return to the origin is u = (m - 1)/m, where

m=

If we write the numerator out as a series of differences of powers xi - ti, j = 0, 1, 2, . . . , n, then the denominator divides evenly into each term, and we see that p�(x) is a polynomial in x of exact degree n - 1 . It is called the as sociated Legendre polynomial. These polynomials, like the Legendre polynomials, have been studied by dozens of mathematicians. It is easy to verify that p�(x) satisfies the same recurrence as Pn(x), except we start with the initial values Po = 0, Pi 2. Now divide each term in the above equation by Pn (x) and do a little rewriting to get

En (X)

=

1 X-

1o

1 --

t

dt + En(X),

- In (1 - 1/x)

+ En(X),

1 Pn(X)

t

--=-!.___

1 Pn(t)

0

X-

dt.

The rational approximation p�(x)/pn(x) when developed in powers of 1/x agrees with the series for -ln(1 - 1/x) up to powers of order 2n + 1, and it can be shown that the ap proximants p�(x)lpn(X) converge to the function -ln(l 1/x) as n � oo uniformly on compact subsets of C - [0, 1 ] . Simple formulas for the Legendre polynomials are known; e.g., Pn(X)

=

( - l)n zF1

( -n n + ) '1

1

;X ·

It would be nice to have a simple formula for the asso ciated polynomials, p�(x). At the time, none was known. The story of p�(x) will resume later. The book "A = B" discusses five computer algorithms for analyzing hypergeometric sums. All of these algorithms are downloadable from the web page and require variously Mathematica or Maple. The book is very didactically ori ented, and contains a plethora of beautiful and challenging exercises. I had to tear myself away from working them to write this article. I will describe three of the algorithms:

Sister Celine's general algorithm: Suppose we have the sum

f(n)

=

I F(n, k) k

where F(n, k) is doubly hypergeometric. Sister Celine's al gorithm4 provides a method for finding a recurrence for the sum f(n). It does this by first finding a double recur rence in n and k for F and then summing this recurrence over k. Herbert Wilf and Doron Zeilberger proved in 1992

31n living organisms, the underlying explanation for many vital biochemical reactions is that the path of an enzyme molecule may be modeled by a drunkard's walk. The walk suddenly becomes constrained, i.e., drops from three to two dimensions, when the enzyme targets a cell surface. The turnover numbers of membrane-bound enzyme systems are enhanced if their substrates undergo two-dimensional diffusion along membrane surfaces. The consequent activation of the enzyme is what allows enzyme systems in living bodies to work; hence life. 4The authors provide a nice biography of Sister Celine (Fasenmeyer), who was born in Crown, Pennsylvania, October 4, 1 906. She received her Ph.D. under the di rection of Earl Rainville at the University of Michigan in 1 946, and discussed a specialized version of the algorithm in her thesis.


75

that the method will always work, and they give the proof here. The algorithm given by Wilf and Zeilberger is a very deep generalization of the one discussed by Sister Celine herself, and an inspiration to subsequent algorithmic iden tity verification. The Maple algorithm for the method is contained in the package EKHAD.5 For instance, when the method is applied to the sums of squares of the binomial coefficients,

RESULT

II: Let Sn be the sum of the first k

+

1 factorials:

Then Sn is not a hypergeometric term (modulo a constant.) RESULT

III:

Let

the recurrence delivered is

f(n)

=

-

2(2n n

1)

f(n -

1).

Iterating this recurrence gives the explicit formula men tioned previously, 2nln .

)

(

Gasper's algorithm: Let tk be a hypergeometric term. Gosper's algorithm an swers the question: can the indefinite summation of tk be expressed as a hypergeometric term plus a constant? In other words, does there exist a hypergeometric term Zn such that n

Zn

=I

k�O

1)! - 1 .

2 How like the problems o f integrating e->'2 and xex the last two examples are! The software implementing Gosper's algorithm is called GOSPSUM. It requires Mathematica.

Zeilberger's algorithm: This algorithm, sometimes called "the method of creative telescoping, " was proposed by Zeilberger in 1990, 1991; it accomplishes for definite sums what Gosper's algorithm did for indefinite sums. Let

f(n)

n

=

k�

Sn

=

(4k +

2-

1)

k! (Zk

+ 1)! "

n! (2n + 1)!

J(n)

k

= I n O:o;k=s

- (

n_ n - k zk _ n k 2k /3

)

.

The problem came from the American Mathematical Where it came from before that is probably un knowable. Zeilberger's algorithm shows thatJt:n) satisfies

Monthly.

+

1)(N -

2)j(n)

=

0.

(I follow the authors in writing N for the shift operator; f(n + 1). Old-timers called it E.) But this is a re currence with constant coefficients, and thus can be solved by exponentials, analogously to the differential equation with constant coefficients.

Nf(n)

5The temerity of this creature Ekhad! Naming a computer algorithm after oneself-well,


= I F(n, k)

where F is doubly hypergeometric. The problem, as in Sis ter Celine's algorithm, is to determine the recurrence sat isfied by f(n). However, the Zeilberger algorithm is much, much faster. I won't even attempt to convey the radically different and often startling reasoning that underlies the al gorithm; the reader will have to go to the book for that. I want to mention, though, one dramatic application out of many. The problem is to evaluate the sum

(N2

Sn

76

(n +

I:

Let

Then

=

tk + c?

The reader may observe that this is the fmite difference analog of the problem of integration in closed form. Bill Gosper published his method in 1975. Gosper, whom many believe to possess one of the most original and cre ative intelligences operating at the interface of mathemat ics and computer science, has never sought traditional venues for the dissemination of his results. Many of them exist only as conference proceedings and reports. Gosper's method involves an inhumanly clever ploy, and I won't reveal it here. Its appearance in the text is signaled by the indigitation, "And now a miracle happens." Now, I distrust miracles; if good miracles can happen, so can bad miracles� If I can win the Pennsylvania lottery, I can equally easily be struck down by some tropical disease shared by only five other people. Nevertheless, I want to list some re sults of Gosper's algorithm. RESULT

Then Sn

=

I wouldn't do it.

Determining the constants by using the values off(O), J( 1), f(2) is straightforward, and we find

f(n)

= 2n - J + cos-. 2

nw

The algorithm is contained in the package EKHAD. These descriptions furnish only an intriguing glance into a book which by now has become justly famous, and oc cupies a position at the nexus of computer science, algo rithmic theory, special functions, combinatorics. The algo rithms described in the book gained for Zeilberger and Wilf the prestigious 1998 Leroy P. Steele Award. Determined to probe the strength of the algorithms therein, I sent Herb Wilf an e-mail message describing the unsatisfactory state of affairs of the associated Legendre polynomials (*). I got back a TeX document containing the formula

P�+ ! (x) = where the

( - 1)n (n +

1)

{

k

I =

0

+ 2)k (Pn - Pk- l)x (k!)2 (n + 1 k)

(-n)k(n

k

-

Pn are the harmonic numbers Pn _

n

I

1

r=o r+ l

0,

, r 2= 0,

--

r

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

No title

Recommend Documents