The Mathematical Intelligencer encourages comments about the material in this issue. Letters to the editor should be sent to the editor-in-chief, Chandler Davis.
H o w to Write about Teichmiiller The Spring 1995 issue of The Mathematical Intelligencer contains two very different contributions, by M. R. Chowdhury (pp. 12-14) and B. Booss-Bavnbek (pp. 1520), which pick up our article on the life and work of Oswald Teichmfiller in the Jahresbericht der Deutschen Mathematiker-Vereinigung 94 (1992), 1-39. Chowdhury's text, with which we have only minute differences, is written as a service to the reader of the Mathematical Intelligencer, providing a partial translation of and commentary on Teichmiiller's infamous letter to Edmund Landau of November 1933, which had been first published in the appendix to our article on Teichmfiller. In contrast to this, Booss-Bavnbek's article appears in the "Opinion" column of The Mathematical Intelligencer and aims at fundamental criticism. This criticism is directed partly against the general editorial policy of the journal Jahresbericht D M V in matters of obituaries of German mathematicians. We also feel quite uneasy about some of the obituaries that have appeared in the Jahresbericht DMV. For instance, we consider Leichtweil~'s biographical remarks on Karl Strubecker, quoted by Booss-Bavnbek, generally superficial and occasionally offensive. One of our motivations for publishing the Teichmfiller article in the Jahresbericht D M V was in fact to help improve the quality of these obituaries. Bernhelm Booss-Bavnbek expresses an emphatic normative position on how to deal with the difficult and painful history of the Nazi period. We do not want to pass over our basic differences with him in silence. Booss-Bavnbek seems to believe that the historian ought
to be some sort of moral teacher; the more rigidly the teacher declares that certain protagonists of his story were bad guys, the more effectively he educates the readers for a better future. We hold a more sceptical, but certainly not cynical, view. Moral condemnation seems to us to be but a frail protection against a hideous past. We believe instead in close and detailed confrontation with the historical material. Of course we do place ourselves in the universe of values when we approach and present a historical subject. But we try to do so in a tempered, controllable, and criticizable manner. This was openly stated in the introduction to our article on Teichmfiller. Our restriction to documented evidence 1 seemed to 1For instance, it is virtually certain that it was Ernst Witt, not Oswald Teichmiiller, who participated (wearing SA uniform) in Emmy Noether's seminar on Hasse's notes on Class Field Theory, which she held privately at her home in the summer of 1933 because she had already been put on leave by the ministry. Thus footnote 3 in BoossBavnbek's article almost certainly relates a flawed piece of oral history. See Clark Kimberling, "Emmy Noether and her Influence," in: Emmy Noether, A Tribute to her Life and Work, James W. Brewer & Martha K. Smith (ed.), New York, Basel (M. Dekker), 1981, pp. 3-61; here: p. 12, and footnote 13, p. 47. In an early first draft of N. Schappacher, "Das Mathematische Institut der Universitat G6ttingen 1929-1950" in: Becker, Dahms, Wegeler (Hrsg.), Die Universitf~'t G6ttingen unter dem Nationalsozialismus, Mfinchen (K.G. Saur), 1987, 345-373, Schappacher also conjectured that Teichmfiller was the one. This was corrected in a letter by Fenchel; but the mistake has unfortunately survived in: C. Tollmien, "'Sind wir doch der Meinung, dal~ ein weiblicher Kopf nur ganz ausnahmsweise in der Mathematik sch6pferisch t~tig sein k a n n . . . ' - - E m m y Noether 1882-1935. Zugleich ein Beitrag zur Geschichte der Habilitation von Frauen an der Universitat G6ttingen." G6ttinger Jahrbuch 38 (1990), 153-219; here footnote 188.
THE MATHEMATICALINTELLIGENCERVOL.18, NO. 1 9 1996Springer-VerlagNew York 5
us natural when writing about a man whose contradictory personality and whose despicable political ideas and actions have given rise to many anecdotes whose accuracy is impossible to ascertain. However, since we dwelt quite a bit in the article on the previously unp u b l i s h e d - l e a d i n g r61e of Teichm/iller in the Landau boycott, it is incomprehensible to us how any careful reader could conclude that we were attempting to rehabilitate Teichmiiller's politics. Our refusal to make Teichmfiller into a "negative hero"--which seems to have caused the most serious misunderstandings of our article--was simply motivated by a desire to show that Teichmiiller was not a lonely "hero" who single-handedly forced Landau's resignation by organizing the student boycott. In fact, for the boycott to have been successful, the new ministerial policy (in the fall of 1933) was needed, which made it possible to get rid of Jewish professors disregarding the exceptions stipulated in the racist clauses of the law of April 7, 1933. Only a gross misreading of the text could make it possible to interpret our formulation as saying that we wanted to excuse Teichmiiller for his action, and put all the blame on the state administration, as Booss-Bavnbek suggests (pp. 16-17). Had such a reading occurred to us, even in our worst nightmares, our sentence would have probably come out different, and better! Clearly, the biographical section 1. of the article on Teichmiiller, written by Scholz, is not a full-fledged biography which lives up to the principles laid out nicely in the quotation from S6derqvist towards the end of Booss's article. To go further in this direction would have called for a closer investigation of the genesis of mentalities in Weimar Germany (and earlier) which prepared large parts of the German population to support or actively participate in Nazi politics, whereas others kept a sceptical distance, practised disobedience, or, in a few cases, took up active resistance. Such a major work could not be our goal in that article. We are also perfectly aware that sections 2--6 of the article on Teichm/iller, surveying most of Teichmiiller's papers, do not represent a satisfactory history of the mathematical questions that Teichm/,iller addressed. To give more would have required an effort, and an expertise, which seemed beyond our capabilities. In this sense Booss-Bavnbek's remarks (p. 16) concerning the contributions of Gr6tzsch, Lavrent'ev, and Schiffer to the early development of "Teichmfller theory" are welcome. But to insinuate that our failure to mention these mathematicians might have something to do with their being (quoting Booss-Bavnbek) an "anti-Nazi," a "Soviet mathematician," and a "Jew forced to emigrate," verges on slander. We are acutely aware of the limitations of our publication. At the same time, we see no merit in the wholesale condemnation of our partial achievements on the basis of normative ideals. Historical understanding is 6 THEMATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
reached through discourse and controversies--preferably, controversies more to the point than this one. Norbert Schappacher Mathdmatique et Informatique Universitd Louis Pasteur 7, rue Rend Descartes 67084 Strasbourg CEDEX France Erhard Scholz FB7: Mathematik GHS Wuppertal Gaussstrasse 20 5600 Wuppertal 1 Germany
Bernhelm Booss-Bavnbek Replies This exchange has been helpful and enlightening. It is fortunate it could be conducted in The Mathematical Intelligencer; I would have preferred that also the pages of the Jahresbericht would be open to such dialogue. Did N. Schappacher and E. Scholz express themselves in favor of this? Institut for matematik og fysik RUC/IMFUFA Postboks 260 DK-4000 Roskilde, Denmark
I1 Circolo Matematico di Palermo In The Mathematical Intelligencer, vol. 17, no. 2, 1995, in the Letters to the Editor there is a list of the oldest mathematical societies of the world, but it fails to include one of the most famous: II Circolo Matematico di Palermo, founded in Palermo on March 2th, 1884 by the Italian mathematician G.B. Guccia (1855-1914). II Circolo Matematico di Palermo was internationally known, and among its fellows were all the greatest mathematicians of the world.
References Brigaglia, A. & Masotto, G. 1982, II Circolo Matematico di Palermo, Dedalo, Bari. Cardamone, L. 1962, Le scuole matematiche in Sicilia dopo l'Unit~, published in La Sicilia e l'Unit?~ d'Italia, Feltrinelli, Milano, pp. 271-300. Lombardo Radice, L. & Bartolozzi, F. 1977, Matematici siciliani dell'ultimo secolo, published in La presenza della Sicilia nella cultura degli ultimi cento anni, Palumbo, Palermo, pp. 1107-1120. Nastasi, P. 1990, Matematica e matematici in Sicilia, published in Scienza in Sicilia, a special number of "Cronache Parlamentari Siciliane', N.S., a. VII, supplem, al n. VI, pp. 59-79. Atdo Scimone GRIM (Research Group for Mathematical Teaching) Department of Mathematics via Archirafi 34 90123 Palermo, Italy.
More on M a g n u s I much enjoyed Abe Shenitzer's "In Memory of My Friend Wilhelm Magnus" in the Spring 1995 issue of The Mathematical Intelligencer. I, too, have experienced Magnus as one of the most considerate and polite persons I ever met. I got to know Magnus in my first semester at G6ttingen in 1947 when he taught Differential- und Integralrechnung I to beginning students. At one point during a lengthy proof he had used the same letter t twice for two different variables. When I called this out to him, he thanked me profusely, bowing several times in my general direction (he couldn't identify the source---there were some 200 students in the audience). Then Magnus started to crank down the blackboard, marking all earlier t's with primes. For some high t's that he couldn't reach, he jumped up in the air to apply the proper primes--on the fly, so to speak. To catch his breath, he interrupted the priming once or twice, turning around, bowing, and thanking his unseen benefactor again. After another 10 minutes into the proof the two t's, the primed and the unprimed, occurred in the same equation, whereupon Magnus stopped and turned to the audience one more time, exclaiming, "Es ware eine Katastrophe geworden. Nochmals vielen Dank, vielen Dank!" At the end of the semester, beginning students had to undergo a Fortsetzungsexamen, which determined the right to continue their studies. Magnus's colleague, Arnold Schmidt (known as "Hilbert's last assistant"), who taught Analytische Geometrie und lineare Algebra, announced the results with the words, "Das Ergebnis des Fortsetzungsexamens war niederschmetternd"--The result of the examination was crushing (whereupon the lectern on which he was leaning actually collapsed). But in spite of this ominous announcement, most students actually received passing grades. How did Magnus handle this onerous task for his course? He read about 200 individual names, one after the other, always adding, "Sie haben bestanden"--you passed. Then, at the end of this long recitation, he invited the (few) students whose names he had not called out to come to his office, where he broke the sad news in complete privacy. When I visited Magnus at New York University in 1955, now myself an immigrant to the U.S., the first thing he did was to apologize--apologize for the modest Hundehfitte (kennel) in which he had to receive me (modest compared to his lavish director's office at the Mathematisches Institut in G6ttingen). Magnus loved America but he was also a keen observer of its idiosyncrasies. When he learned that (at age 28) I was still a bachelor, he ventured the prediction that I would either marry within a year or return to E u r o p e "This is not a country for bachelors." Magnus was right--I married, in February 1956, a Bulgarian who
worked for Radio Free Europe and whom I had met three days after m y arrival in America. Some of our joint hikes in the "mountains" north of New York City were along routes Magnus had recommended to us.
Manfred Schroeder Drittes Physikalisches Institut Universitil"t Go'ttingen D-37073 Go'ttingen, Germany Visit to Hua Loo-Keng Thanks to Shuzhong Zhang and Caspar Schwiegman for their excellent article on Hua Loo-Keng in The Mathematical Intelligencer, vol. 16, no. 3. He was a friend of mine, and I published several of his books. I first met him in June 1974, during the "Cultural Revolution." When I asked to see the great mathematician, my host in Beijing said he was "out of town," and only after strong insistence by me did the-y bring Hua Loo-Keng to my hotel. Later, after receiving an honorary degree at Nancy, Hua Loo-Keng visited me in Heidelberg, and'told me of his life while banished to the countryside. He joked that as a mathematician he could at least do some work in the latrine, writing on toilet paper. The main feature of his character was deep love for his country.
Heinz GStze Springer-Verlag Tiergartenstrasse 17 D-69121 Heidelberg Germany
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996 7
The Opinion column offers mathematicians the opportunity to write about any issue of interest to the international mathematical community. Disagreement and controversy are welcome. The views and opinions expressed here, however, are exclusively those of the author,
and neither the publisher nor the editor-in-chief endorses or accepts responsibility for them. An Opinion should be submitted to the editor-in-chief, Chandler Davis.
The Case Against Computers in K-13 Math Education (Kindergarten through Calculus) Neal Koblitz
In Peru, as in many Third World countries, the system of public education is in crisis. Teachers' pay--traditionally low--is falling rapidly because of inflation. The schools are dilapidated, and there is no money for basic supplies. The government has not been responsive to the teachers' protests. Under pressure from the International Monetary Fund, 1 it insists that the state sector--includes public schools--must be cut back. Yet President Fujimori has said that he wants to get computers into the schools as soon as possible. The government's priority is to "modernize" the economy and the educational system, and computerized learning is supposedly one way to do this. For teachers who are trying to cope with financial hardship and abysmal working conditions, what could be more demoralizing than the message that machines come before people? To the Peruvian teachers, Fujimori's advocacy of computers adds insult to injury. On the other hand, not everyone loses. The U.S. computer industry has an interest in creating new markets in the Third World. Thanks to such strategies as interconnected products and planned obsolescence, greater and greater payments will flow to the north as countries like Peru become dependent on U.S. technology in more areas of national life. Throughout the Third World, for about a decade pressure has been mounting to import computer learning from the wealthy countries. In February 1986, a major conference called "Informatics and the Teaching of Mathematics in Developing Countries" was held in
Tunisia. Participants were predominantly from Northern Africa, but many came from the universities and educational establishments of other regions of the Third World. All of the mathematicians and math educators sang the praises of computers and pled for the rapid introduction of computers into their school systems. Not
1The I n t e r n a t i o n a l M i s e r y F u n d , a s E g y p t i a n P r e s i d e n t M u b a r a k h a s said. THE MATHEMATICAL INTELLIGENCERVOL 18, NO. 1 9 1996 Springer-Verlag New York
9
a single note of skepticism was raised; not a single question was explored in depth. One could have asked, for example: Are computers truly what schools in Africa need? In practice, would computers be introduced on a general level, or only in a few of the elite schools for the wealthy? Could resources be better spent in other ways--to raise teachers' salaries, purchase classroom supplies, expand libraries? It seems bizarre that the Tunisia conference adopted as an axiom the notion that the introduction of computers should be a priority for elementary education in Africa. It should be noted that much of the support for this conference came from the large French computer companies. Thus, even though the meeting was of little value for people seriously interested in educational issues, the funding companies must have viewed it as a great success, holding open the possibility of lucrative new markets, particularly in the former French colonies of Northern Africa.
Some Caveats Despite my skepticism about computers in education, I do not take an extreme position--I do not want to throw out the baby with the bathwater. There are some appropriate uses for computers in math education. Thus, I will not argue 1. that it is unwise to use calculators in class. (I have m y calculus students use them.) 2. that computer-based math courses are always a bad idea. (My university has an optional computer lab course, taught in conjunction with third-quarter calculus, that seems to be functioning well.) 3. that other technology in the classroom should be avoided. (Nice films on area, volume, and so on have been around for decades.) 4. that special learning programs based on calculators or computers are never successful. (Almost any teaching method can work well under suitable conditions and with a dedicated and enthusiastic teacher.) I will argue, however, that there has been too much hype about technology in math education, and it is time to consider the downside. In my opinion, computers should not be a major component in math education reform. The downside can be divided into several broad areas: 1. 2. 3. 4.
drain on resources (money, time, energy) bad pedagogy anti-intellectual appeal corruption of educators
I shall discuss each of these objections in turn, after a few preliminary remarks. 10
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
Popular Culture Any discussion of math education reform should take into account the cultural environment in which we live. American popular culture, which has come to dominate the entertainment industry and the mass media in many parts of the world, can easily cause distortions of the movement for reform. Youngsters who are immersed in this popular culture are accustomed to large doses of passive, visual entertainment. They tend to develop a short attention span and expect immediate gratification. They are usually ill equipped to study mathematics, because they lack patience, self-discipline, the ability to concentrate for long periods, and reading comprehension and writing skills. In the United States, attempts to popularize mathematics tend to be influenced by this ambience. The public television program "Square One" is an example of a well-intentioned effort that went astray because of too much gimmickry and too little attention to content. The politics of funding adds to the pressure on reformers to water down the content. Funding agencies are impressed if grantees can demonstrate short-term success. This may explain why one often comes across pilot projects in American schools where the math content is not appropriate for the target age group--it is too trivial. Of course, the students find it easy, and the project organizers can report great success, thereby satisfying the funding agencies. Such an approach to curricular development contributes to the "dumbing down" of the curriculum. The most important example of gimmickry in math education reform is computermania.
A Drain on Resources We already saw an extreme example of misallocated resources in Peru. But even in a "wealthy" country one has to be concerned about this problem. Many educators in North America share the feeling of betrayal of the teacher who said, They can give us the axe, but they can spend thousands on computers. We have to fire our music coordinator, we have to fire our music teachers, we have shitty libraries. --Lynn, a Canadian schoolteacher, quoted in [14], p. 41 At the University of Washington, we also have resource limitations. After considering various ways to reform the calculus course, we selected a low-tech approach using some applications-oriented lecture notes that I had written. Our calculus reform was implemented relatively quickly, painlessly, and inexpensively, largely because it was not based on computers or graphics calculators. Cost is an issue not only for schools but also for individual students. Ironically, it is sometimes the colleges
Ann Hibner Koblitz discusses a graph theory example with seventh grade girls in Harare, Zimbabwe.
with the highest proportion of working-class students that become most enamored of expensive new gadgetry for teaching mathematics. At a reception for students planning to transfer to m y university from community colleges, the students I talked with were complaining about having to spend more than $80 for a graphics calculator, in addition to $60 for a textbook. (By contrast, the required material for m y 20-week calculus sequence costs a total of about $14.) Another resource issue is that techniques that work in an experimental program with an enthusiastic instructor will not necessarily work under less ideal conditions. One of the lessons of the 1960s "new math" fiasco is that we must look beyond the pilot programs and demonstration classes and selected testimonials, and ask ourselves what is going to happen in a typical classroom with a typical teacher. Every week, in conjunction with a course I teach for math education majors, I spend a morning visiting just such a school. My students and I work with regular sixth-grade classes at Washington Middle School (WMS), an inner-city school in Seattle. The teacher has 5 math classes every day, with a total of over 100 students, many of whom have .severe personal as well as learning problems. When I started visiting her classes in 1992, she had several computers in the back of the room. But they were just gathering dust, and so have been removed. What, if not computers, does an overworked teacher like her need in order to be more effective? She herself told me that she very much regrets not having any time in the
day to talk as colleagues with the other teachers--about pupils they have in common, and about teaching methods and materials. A fundamental problem in education is the failure to treat teachers like professionals. Teachers at schools like WMS need opportunities to upgrade their qualifications, to learn about different teaching materials, and to interact with other teachers as colleagues. This requires release time, light teaching years, or Sabbaticals. If money used to buy computers and software were instead devoted to improving conditions for teachers, it would be money well spent. If pay and working conditions improved for teachers, then maybe more of our best students would go into teaching. All of the fuss about computers serves to divert attention away from the central human needs of the school system--better conditions for teachers and better teacher training.
Bad P e d a g o g y
In response to the grandiose claims of such computers-in-the-schools gurus as Seymour Papert, many experts in child development have pointed out that those claims rest on a flimsy pedagogical foundation, especially where young children (K-5) are concerned. The point is that children benefit most from material that stimulates them to exercise their imagination. For example, simple, unstructured play material like clay, sand, blocks, rag dolls, and finger-painting sets are more THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
11
w h o l e s o m e entertainment than TV (even "educational" TV) and Power Ranger toys. Harriet K. Cuffaro of the Bank Street College of Education comments on computer painting versus ordinary painting: 9 in "painting" via the computer, the experience is reduced and limited by eliminating the fluid, liquid nature of paint9 In this painting there are n o . . . opportunities to become involved in the process of learning how to create shades of colors; gauging the amount of paint to be mixed; experimenting with and discovering the effects of overlaying colors; understanding the relationship of brush, paint, and paper, the effects achieved by rotating the brush and varying pressure, or how one gains control of or incorporates those unexpected, unintended drips . . . . There is an absence of texture, of smell, a lessening of qualitative associations with the experience of painting ... computer graphics have a "stamped-out," standardized, "painting-by-numbers" quality. Though the design or arrangement of colors, lines, and forms will vary with each child, there is a quality of sameness in appearance ... individuality is flattened by the parameters of the program9 [15, p. 25] M o r e generally, according Columbia Teachers College:
to
Douglas
Sloan
of
For the healthy development of growing children especially, the importance of an environment rich in sensory experience color, sound, smell, movement, texture, a direct acquaintance with nature, and so forth--cannot be too strongly emphasized . . . . At what points and in what ways will the computer in education only further impoverish and stunt the sensory experience so necessary to the health and full rationality of the human individual and society?.. 9 What is the effect of the flat, two-dimensional, visual, and externally supplied image, and of the lifeless though florid colors of the viewing screen, on the development of the young child's own inner capacity to bring to birth living, mobile, creative images of his[/her] own? [15, pp. 5 and 8] Some have also questioned the effect of c o m p u t e r s on the teacher-student interaction 9 Larry Cuban, w h o has m a d e a detailed s t u d y of the history of attempts since 1920 to introduce technology into American schools, writes: In a culture in love with swift change and big profit margins, yet reluctant to contain powerful social mechanisms that strongly influence children (e.g., television), no other public institution [besides schools[ offers these basic but taken-for-granted occasions for continuous, measured intellectual and emotional growth of children . . . . The complex relationships between teachers and students become uncertain in the face of microcomputers.., introducing to each classroom enough computers to tutor and drill children can dry up that emotional life, resulting in withered and uncertain relationships9 [2, pp. 89 and 91] Several educators have criticized the public's unquestioning acceptance of so-called " c o m p u t e r literacy" as a goal for education 9 C o m p u t e r scientist Joseph W e i z e n b a u m has said, "There is, as far as I know, no m o r e evidence [that] p r o g r a m m i n g is good for the mind 12
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
The author checks the solution of two children in Cuzco, Peru.
than [that] Latin is, as is sometimes claimed" (quoted in [2, p. 94]). According to John M. Broughton of Columbia Teachers College, Inherent in that term ['computer literacy'] is the promise of generalizability comparable with [that] of reading and writing. However, there is no evidence that programming skills transfer to other areas of psychological development, even cognitive ones. In fact, a recent comprehensive review of the literature by Pea and Kurland suggests that virtually all the claims made about the beneficial educational effects of learning to program are not only inflated, but probably incorrect. Moreover, Pea and Kurland reveal that there is not even support for the . 9 notion that learning to program aids children's mathematical thinking9 Their own research study on transfer revealed that Logo programming experiences had no effect on the planning skills that are deemed central to problem-solving skills9 The tradition of grossly inflated claims identified in the artificial intelligence literature ... appears to have carried over into the ... area of electronic learning. [15, p. 109] The inability to develop good translation software has been one of the most embarrassing failures of Artificial Intelligence 9 If the best computers in the world are unable to translate from French into English, then they certainly cannot help m y calculus students do what is the main point of the course: translating w o r d problems into mathematics. C o m p u t e r s in this course w o u l d only be a distraction and a waste of time and resources. If the focus in beginning calculus is p u t w h e r e I believe it bel o n g s - - o n analyzing real-world problems and choosing the appropriate mathematical t e c h n i q u e s - - t h e n the course cannot be centered a r o u n d computers or graphics calculators. At m y university we have an optional one-credit computer lab for students in their third quarter of calculus 9 At this point it makes sense to offer the technology, because (1) the emphasis shifts from w o r d - p r o b l e m applications to geometric applications of calculus in three
The author presents math enrichment topics in an after-school program in a poor barrio in San Juan, Puerto Rico.
dimensions, where computer graphics serve a useful purpose and (2) students at that level are sophisticated enough to benefit from computers. At all levels of schooling we have to ask: Do the students learn to punch buttons, or do they learn mathematics? One day at Washington Middle School, we had the sixth graders play a math game that involves dividing by 7 and rounding off to the nearest integer. When they had to find 60 - 7, they punched it correctly into their calculators, which displayed 8.5714 . . . . But most of them could not read or interpret the answer: they did not understand the significance of the decimal point. Similar dangers exist among older students. For this reason, in the calculus final exams at my university we usually ask for exact (not decimal) answers. For example, sin(60 ~ = 1//2 V 3 , not 0.866; the circumference of a circle is 2~-r, not 6.283r. Anti-Intellectual Appeal
Computers reinforce the fascination with gadgetry, as opposed to intellect, that is endemic in American popular culture. As pointed out by Brian Simpson, former education advisor to IBM in the United Kingdom, Technological solutions to educational problems often have a seductive appeal, promising to make education easier and more enjoyable than ever before. In the past, extravagant claims have been made for teaching machines, educational television, language laboratr)ries, and even such improbable, esoteric phenomena as sleep learning and learning under music-induced hypnosis. [15, p. 84] There is a long history of people wanting and expecting technology to transform education. It has been over 70 years since the following prediction was made by a famous American:
I believe that the motion picture is destined to revolutionize our educational system and that in a few years it will supplant largely, if not entirely, the use of fextbooks. --Thomas Edison, 1922 (quoted in Ref. 2, p. 9) Perhaps the most frequent argument for computers in the schools--and also the most illogical--is the inevitability argument: "Calculators/computers are going to be everywhere, so we might as well incorporate them into math class. One can't fight against the tide." Using the same reasoning, one could say that, since automobiles are everywhere in our society and play a crucial role in all of our lives, therefore cars should be used as much as possible in education, and driver education should be regarded as a centrally important subject in school, much more so than such relatively useless subjects as music and art. The inevitability argument for computers in the schools is exactly the same sort of antiintellectual non sequitur. Computers are usually used in the classroom in a way that fosters a Golly-Gee-Whiz attitude that sees science as a magical black box, rather than as an area of critical thinking. Much computer use is "teaching by demo" with the student as spectator. There is then little difference between so-called electronic learning and simply watching television. Most software is based on immediate gratification and does not encourage sustained mental effort. While physically playing an active role, the pupil is intellectually passive, and has little opportunity to be creative; that is, the pupil is programmed to follow a path already laid out in detail by others. Even when software designers try to get the children into a creative mode, in many cases the same could be better accomplished without the technology. Educators tend to put the cart before the horse: instead of asking whether or not technology can support the curriculum, THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
13
they try to find ways to squeeze the curriculum into a mold so that computers and calculators can be used. Like a quack cure in medicine, perhaps the most harmful effect of the computer craze is that it diverts people from other, more solidly grounded approaches to treating the ailments of mathematics education. Might not the Golly-Gee-Whiz-Look-What-ComputersCan-Do school of mathematical pedagogy eventually come to be regarded as a disaster of the same magnitude as the "new math" rage of the 1960s?
Computer Science Is Not the Same As Computers One can strongly advocate increased teaching of computer science (and related areas, such as discrete math) while opposing the use of computers. Computer scientist Michael Fellows, who is an active campaigner for computer science in grade school (see [ 3]), has said: "Most schools would probably be better off if they threw their computers into the dumpster." Fellows uses the term "Cargo Cult" to refer to the fetishization of computers by the media and educational establishment.
Definition of Classical Cargo Cult: An isolated civilization comes into initial contact with European technology. Ignorant of modern science, they interpret the benefits of technology in terms of their familiar world and their familiar mode of operation. They pray and perform sacrifices, or do whatever they think might be necessary to induce the deities to bring them the Cargo. Modem Cargo Cult- In the United States and many other countries, most of the general public is prescientific, in the sense of having no rational understanding of the intellectual processes that go into scientific advances or their application to the real world. On the other hand, like the classical Cargo Cultists, they realize that technology is associated with economic well-being, and that something must be done so that youngsters will later be able to reap the benefits of the "computer age." The natural response, then, is to fetishize computers and fit them into the familiar world of traditional mindless school math. The public needs to understand that math and computer science are not about computers, in much the same w a y that cooking is not about stoves, and chemistry is not about glassware; that is, COMPUTER SCIENCE q~ COMPUTERS The meaning of this inequality is: What children need in order to become mathematically literate citizens in the computer age is not early exposure to manipulating a keyboard, but rather wide-ranging experience working in a creative and exciting way with algorithms, problem-solving techniques and logical modes of thought. 14
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
Money Corrupts Much of the effort to introduce technology in the classroom is profit-driven. As Douglas Sloan says, It does not take a flaming Bolshevik, nor even a benighted neo-Luddite, to wonder whether all those computer companies, and their related textbook publishers, that are mounting media campaigns for computer literacy and supplying hundreds of thousands of computers to schools and colleges really have the interests of children and young people as their primary concern. [15, p. 3] In the words of Joseph Menosky, an American writer and former science editor at National Public Radio, Certainly those who have a great deal to gain from a universal acceptance of computer literacy--microcomputer firms selling hardware, textbook companies selling educational software, organizations selling worker and teacher retraining courses, and writers and publishers selling books and instructional guides--have done a brilliant, if morally indefensible, job of commercial promotion. [15, p. 77] Corporate domination of math education reform has corrosive effects. The profit motive has an excessive influence; the intrinsic value of a pedagogical idea is not considered to be as important as its saleability. Educational ideas that are not based on expensive gadgetry or new textbooks are not likely to be supported strongly. There is an excess of faddism and hype. It is regrettable that computers have been so aggressively marketed to teachers and school systems. In speaking to parents, teachers, and school boards, many company sales representatives take the hard-sell approach: "If you don't buy our latest products, you will be neglecting to prepare your children for the 21st century." Foundations and government agencies, such as the National Science Foundation (NSF) in the United States, compound the problem, because they share the biases of the commercial interests. Money from corporate and foundation sources seduces educators, whose objectivity and independent judgment become compromised. Grants can corrupt. The technology-in-education movement has some of the characteristics of a religious evangelical campaign, fueled by corporate and foundation money. One sees the same reliance on testimonials. A technology enthusiast might proclaim, "How graphics calculators have changed my life!"--just as a born-again Christian talks about rediscovering Jesus. Like their religious counterparts, technology boosters tend to adopt a Manichaean view, dividing educators into two camps: those who have seen the light, and the fractious infidels. I recently had a personal encounter with this closedmindedness. About 2 years ago, the NSF asked me to help evaluate calculus reform proposals. But when they learned that I am skeptical about computers and graphics calculators in calculus, they changed their minds and
Asian students using abacuses. (Photo used by permission of Superstock.)
decided not to send me any proposals to review. They did not want any input from a nonbeliever.
Low-Tech Is Better To end on a positive note, the good news is that the vast majority of enrichment topics do not require high technology, but only pencil and paper and inexpensive manipulatives, such as geoboards and abacuses? All material that my math education students and I present to the Washington Middle School youngsters is low-tech. Of the four examples given below, the first two have worked well with the WMS sixth graders. The cryptography exercise is easy. The answer is "iHasta la vista!"-a phrase that non-Spanish-speaking kids usually know from the movies. The second exercise is more challenging, since we ask the children to solve it by first constructing a mathematical model: a graph in which two vertices are joined by an edge whenever the corresponding girls can room together. The third example is accessible to slightly older children. The fourth problem is taken from a calculus final exam at the University of Washington. A calculator might be useful (but is not necessary) for the third problem, and should certainly be-used in part (c) of the fourth problem. 2Accordingto the Wall StreetJournal (November22, 1994),the abacus is making a comeback in Asian schools, replacing calculators. The sound and tactile sense give children a feel for the algorithmic dynamics-kids say things like "to subtract 7 you add 3 and then subtract 10"--whereasthe use of calculatorsmade the childrenfeelalienated from the arithmetic.
Examples Example 1: Cryptography. You intercept the coded message EXPQXIXSFPQX, which you know was formed by a shift in the 26-letter alphabet (Caesar encryption). The message is in Spanish. Using the fact that "a" is the most frequently occurring letter in Spanish, break the code and read the message. Example 2: Mathematical Modeling. Ten girls from different parts of the world are coming to spend 4 weeks at International Summer Camp. The girls will live two to a room. As Head Counselor, you have the job of selecting roommates. Each girl must room with a girl with whom she has a common language. The list of campers, together with their countries of origin and the languages they speak, is given below. Try to find a perfect matching. Conchita (Guatemala)--Spanish Eliane (Brazil)--Portuguese, Spanish Farkhonda (Egypt)--Arabic, English Hilweh (Jordan)--Arabic, French Keiko (Japan)--Japanese, French Mei Li (Macao)--Cantonese, Portuguese Pilar (Spain)--Spanish, French Selani (Zimbabwe)--Shona, Ndebele, English Sisai (Botswana)--Shona, English Zeineb (Tunisia)--Arabic, French
Example 3: Statistics. In a certain Third World country, in 1985 the annual salaries of THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
15
7,000,000 people were approximately $800 2,000,000 people were approximately $1200 900,000 people were approximately $2000 100,000 people were approximately $20000 After several years of a new economic policy called "Structural Adjustment Program" (SAP), a statistical study in 1995 found that 10,000,000 people earn approximately $600 per year 1,800,000 people earn approximately $1000 per year 550,000 people earn approximately $6000 per year 150,000 people earn approximately $40000 per year
time was heavily French-influenced, imported the new pedagogy despite the strenuous opposition of the leading Peruvian mathematicians (particularly Professors C. Carranza and M. Helfgott). Of course, the introduction of Bourbaki-style mathematics in the schools was a disaster in Peru, as elsewhere. The Peruvian mathematicians had been right, and the French had been wrong. Will history repeat itself? Will countries around the world again import a poorly thought out and unworkable pedagogy from the United States and France? This time the cost will be higher. The "new math," for all its foolishness, at least was relatively inexpensive.
References Find the mean, median, and mode of the annual incomes in 1985 and in 1995. (a) Which average would be used by someone who wanted to argue that the SAP had been a success? (b) Which average would be used by someone who wanted to argue that the SAP had been a failure?
Example 4: Calculus. At time t = 0 you borrow $5000 to buy a car. You pay back the loan with monthly payments of $50. The annual interest rate is 8% comp o u nd ed continuously. Make the simplifying assumption that the payment of $50/month is also made continuously, rather than in lump sums once a month. (a) Write a differential equation for the amount y(t) you still owe after t years. (b) Find a formula for y(t) by solving this differential equation. Please show all steps clearly. (c) After how many years will you have paid off the debt? Give a decimal answer. (d) Make up a problem about a unicorn farm* which leads to the identical mathematics as the above problem about loans (i.e., the same differential equation and the same initial value of the unknow n function). Your problem should involve bi r t h / deat h rates and something about how many unicorns are being sold as pets. Use complete English sentences. What question about the unicorn farm is analogous to part (c) above? *A unicorn is a mythical horselike animal which does not exist outside of ancient legends and modem calculus exams.
Epilogue: First Time it's Tragedy, Second Time it's Farce At the beginning of the article I mentioned that President Fujimori of Peru has become a fervent advocate of computers in the schools. It is interesting to note that in the late 1960s, Peruvian education officials came under influences that were in some ways a foretaste of the computer rage. The "new math" was then the reigning paradigm in math education in the wealthy countries. The Peruvian education ministry, which at that 16
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
1. Consortium for Mathematics and Its Applications, For All
Practical Purposes: Introduction to Contemporary Mathematics, New York: W. H. Freeman (1988). 2. L. Cuban, Teachers and Machines: The Classroom Use of Technology Since 1920, New York: Teachers College Press (1986). 3. M. R. Fellows, Computer science and mathematics in the elementary schools, in N. D. Fisher, H. B. Keynes, and P. D. Wagreich, editors, Mathematicians and Education Reform 1990-1991, Providence, RI: American Mathematical Society (1993), pp. 143-163. 4. M. R. Fellows, A. H. Koblitz, and N. Koblitz, Cultural aspects of mathematics education reform, Notices of the American Mathematical Society, 41 (1994), 5-9. 5. M. R. Fellows and N. Koblitz, Math Enrichment Topics for Middle School Teachers, Seattle, WA: Math Liberation Front (1994). 6. N. Koblitz, Math 124/125 (Calculus Lecture Notes), University of Washington Mathematics Department (1995). 7. N. Koblitz, The profit motive: the bane of mathematics education, Humanistic Mathematics Network Journal, No. 7 (1992), 89-92. 8. Mathematical Sciences Education Board and National Research Council, Measuring Up: Prototypes for Mathematics Assessment, Washington, DC: National Academy Press (1993). 9. National Council of Teachers of Mathematics, Curriculum and Evaluation Standards for School Mathematics, Reston, VA: NCTM (1989). 10. National Council of Teachers of Mathematics, Professional Standards for Teaching Mathematics, Reston, VA: NCTM (1991). 11. S. Papert, Mindstorms: Children, Computers, and Powerful Ideas, New York: Basic Books (1980). 12. R. D. Pea and D. M. Kurland, On the cognitive effects of learning computer programming, New Ideas in Psychology 2 (1984), 137-168. 13. R. D. Pea and D. M. Kurland, Logo Programming and the Development of Planning Skills, Technical Report No. 16, New York: Bank Street College of Education (1983). 14. K. Riel, Factors That Influence Teachers" Use and Non-Use of Computers, M.A. in Education thesis, University of Victoria (1994). 15. D. Sloan (ed.), The Computer in Education: A Critical Perspective, New York: Teachers College Press (1985). 16. C. Stoll, Silicon Snake Oil: Second Thoughts on the Information Highway, New York: Doubleday (1995). Department of Mathematics University of Washington Seattle, WA 98195, USA
Some Kinds of Computers for Some Kinds of Learning: A Reply to Koblitz Ed Dubinsky and Richard Noss
Introduction It is surely time to examine the computer's role in mathematical teaching and learning. The initial flush of enthusiasm over "new technologies" is beginning to pass, and we should now begin to think carefully about the ways in which computers may help students. A dispassionate, well-informed examination of costs, benefits, and difficulties is needed. Unfortunately, Neal Koblitz has failed to make such a start. Moreover, by confounding different aspects, he has ironically ceded decision power to the computer and software giants which he and we--deplore. Koblitz begins by casting doubt on the need of the third world for computers in schools. The question is up to them to decide for themselves, but he is free to advise them. We read on, awaiting his analysis. The view of computers in education which he offers is, alas, so narrow as to border on distortion. Following Koblitz's organization, we will begin with the economics of educational commodities, which come from the industrialized countries and are sold wherever the market can be found--in the first, second, or nth world. Then we will respond to his discussion of the use of computers in education (in any ith world), and argue for a broader view, encompassing phenomena of which Koblitz seems unaware and leading perhaps to very different conclusions. We believe a computer environment to help students learn mathematics can be designed in ways very different from those which Koblitz (quite properly, for the most part) rejects; we believe these methods can be extremely effective.
medicine. The pharmaceutical giants certainly" put profits ahead of third-world welfare: artificial milk products are killing babies throughout the "undeveloped" world. Do we conclude that the introduction o,f new medicines should be stopped? Hardly. Rather we distinguish between therapeutic alternatives, evaluate them on the basis of research, and try to see how cures and prevention of illness can be integrated into people's lifestyle. Just so for computers. They are used for both missile guidance and microsurgery. It is the culture that determines how the technology enters, not the technology itself. As one of us has written elsewhere, "Pieces of knowledge are appropriated (or not) depending upon pupils' own agendas, how they feel abouf their partici-
Use of Computers in Third-World Education Koblitz suggests that computers add "insult to injury" in the third world, and their spread is aimed primarily at expanding the market for the first world manufacturing industry. True enough. True for the first world as well: for example, the intl:oduction of computers into UK schools in the 1980s was handled by a government department whose brief was to expand the UK's computer industry, not to help education. True for other industries, too: everything Koblitz says about technology holds for the billion-dollar textbook publishing industry (including sales of left-over stock to the third world). The conclusions are not so clear. Take the example of THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1 9 1996 Springer-Verlag New York
17
pation, teacher intervention, and above all, the setting in which the [computer] activities are undertaken." (Hoyles and Noss, 1992).
for the potential of an environment that includes all of these and also appropriately used computers.
The Charge of Anti-Intellectualism Resources and Cultures Koblitz attacks computers as the epitome of "popular culture," which is based on passive "entertainment" and which, in the educational domain, entails a "dumbing down" of content. Of course. Modern economic and social life is based on the commodification of essential physical, emotional, and intellectual life: on casting people as consumers rather than as whole human beings (see Noss, 1994). This is the central role assigned in modern capitalist economies to all technology. But here again Koblitz has a simplistic view. Here again it is not just the technology, but how it is used. Koblitz is right to observe that technology can be used to lull a child into a passive inability to "concentrate for long periods," but the same technology can hold a child engaged for hours; the machines that offer immediate gratification for minimal reading and writing skills can also afford young children ways to craft complex and meaningful pieces of writing, and introduce them to the skills of editing and redrafting; the same computer that runs software in which mathematics problems are solved simply by pushing the right buttons can also support systems that students use to construct mathematical tools on the computer and simultaneously construct mathematical concepts in their minds. We can no more blame "computers" for a passive popular culture than we can blame "medicine" for encouraging Indian mothers to stop breast-feeding. Our big disagreement with Koblitz is that he bases his argument on one way of using computers--admittedly a widespread w a y - - a n d ignores other uses to which his complaints do not apply. We are also not much enamored of Koblitz's alternative, which is, apparently, that a mathematician gets up a set of notes for a course and if the notes are good enough, calculus will be fixed. Where have we heard that before? He would have us do things in the same old way, but better. There are already many talented and dedicated practitioners of traditional pedagogies, and still, even in the industrialized countries, children are neither learning nor liking mathematics. Koblitz asserts that computers are a "drain on resources." We are certainly opposed to any attempt (and such attempts are underway) to replace teachers by computers. Computers alone cannot make good teachers, and we are not encouraged by attempts to get computers to do the same things that teachers do (see Noss, 1995, for evidence of this assertion in undergraduate mathematics teaching). We agree that computers are a drain on resources if used as an excuse to fire teachers. Teachers are a higher priority, as are books, desks, roofs over classrooms, warmth, and running water. We argue 18
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
Koblitz argues that computers foster anti-intellectualism. This overlaps his earlier argument concerning passivity. "Most software is based on immediate gratification, and does not encourage sustained mental effort," he says. Software of that sort we reject also. But a small (yet substantial) fraction does not supplant mental activity; indeed it does not do very much at all. The wordprocessors with which we are writing, or the high-level programming languages in which we express mathematical ideas do not "do" or "encourage" or "gratify" in any way. Each is a medium of expression, like a piano or a pen. Whether creativity occurs depends on social responses of people doing things with the cultural tools they have at their disposal. If a particular musical composition is boring, don't blame the violin. In the case of learning mathematics, we could not agree more with Koblitz that "teaching by demo" is bad pedagogy. But we cannot avoid the observation that amongst the worst examples of this pedagogical strategy are having students listen to lectures, follow worked-out problems in a text, or read notes written by a mathematician who is not paying attention to how learning actually occurs.
Computers and Pedagogy Now we come to the meat of Koblitz's paper. He claims that computers generate "bad pedagogy." He attacks the "grandiose claims" of, among others, Seymour Papert, drawing evidence from several references. Some of these (e.g., Cuffaro, Sloan, Cuban) are ten-year-old expressions of opinion, from an interesting but bygone era when it was fashionable to discuss whether computers were as good as paintbrushes for artistic expression, or whether computer sounds were as pleasant as those produced by a Stradivarius. Such discussions had appeal to some, but "evidence" they are not, and it is somewhat disingenuous of Koblitz to cite them as such. We have two points regarding computers and pedagogy. One is that, as we have already indicated, there is a multitude of ways in which computers might be used in education. These include "microworlds" where mathematical phenomena are an integral part of the learner's environment; computer algebra systems that students use to explore ideas and solve sophisticated problems; mathematical programming languages used to construct mathematical concepts on the computer; graphing facilities that allow the learner to see complex phenomena; multiple-representation software with which a student can see the effect that changing a feature of one representation has on the others; playful and experimental "games" in which learners try to accom-
plish entertaining tasks; and spreadsheets in which the structure of the software provides a powerful w a y to model some mathematical situations. We do not endorse everything done with all of these pedagogical tools; we are not agnostic as to their relative merits (nor do we suggest they are well-ordered). We do claim that there is a lot more to this multitude of software than appears in Koblitz's simplistic report-and much of it can be used to make a significant difference in how much is learned by how many people. The second point we wish to make is this. If Koblitz rejects "grandiose"--unjustified--claims, well, of course. But there is a growing literature describing effective strategies and providing evidence of their effectiveness. All we can do here is mention some examples. For effects of having students write programs to learn mathematical induction, see Dubinsky, 1989; to learn predicate calculus, see Dubinsky, in press; to learn functions, see Breidenbach, et al, 1992. The contributions in Hoyles and Noss (1992) testify to the breadth of the encounter of computer programming in Logo with mathematical topics including three-dimensional geometry, group theory, and dynamical systems. For an overview of experiments showing both success and failure of ways of using calculators to learn various mathematical concepts, see Dunham, 1993; for spreadsheets, see Smith, 1992. The diverse contributions in diSessa, Hoyles, and Noss (1995) amply illustrate the wide range of mathematical ideas which the computer (e.g., programming languages such as Logo and Boxer, dynamic geometry tools such as Cabri Geometry, and the trusty spreadsheet) can open to learners and teachers alike. We feel that this literature is quite different from Koblitz's choice of the Holy Grail of research, the paper which has become ubiquitous in arguments that computers have no (or detrimental) effect on mathematical learning. That paper is by Roy Pea and Midian Kurland, two researchers from Bank Street College who undertook a series of studies of the computer programming language Logo and its effect on learning mathematics. We are sorry to see Koblitz join the ranks of writers who turn to Pea and Kurland as evidence without comment. For the benefit of readers unfamiliar with Pea and Kurland's research, we outline here their "findings." Pea and Kurland mounted a series of studies in the early 1980s (a typical and oft-quoted example is Pea and Kurland, 1984). Their aim was to explore the "effects" of young children's learning to program a computer in the programming language Logo. As psychologists, not mathematicians, they were interested in "transfer" of "human cognition." In a typical experiment, they took (not randomly) a sample of 24 children from a private school, half of whom learned Logo, and half of whom did not (we are not told what these children did instead). At the end of a four-month period, the children were asked to devise a plan to carry out six classroom chores on a transparent plexiglass map of a fictitious classroom.
Success on this task (as measured by a fairly arcane set of criteria) was found to be not significantly different between the Logo and "control" groups. N o w many charges might be laid against this experiment, concerning misplaced objectives, faulty statistics, and inadequate methodology (see Hoyles and Noss, in press). We will restrict ourselves here to the point most relevant for Intelligencer readers: the "finding" tells us nothing about the learning of mathematics. It does not show that "all the claims made about the beneficial educational effects of learning to program are not only inflated, but probably incorrect." Neither does it show that "there is not even support for t h e . . , notion that learning to program aids mathematical thanking." It is simply an irrelevance. Worse, an irrelevance based on a finding of "no significant difference." If it is supposed to be a counterexample, what is it a counterexample to? Most telling is the failure of Pea and Kurland to document what the children did with Logo. Were they given specific activities? If so, what kind? Did the activities relate to the idea of planning which would subsequently be tested? We simply do not know. Pea and Kurland showed that young children who had been given a copy of Logo were no better at solving a cute puzzle involving pushing in chairs and cleaning tables than those who had not. So what? The contrast between Pea and Kurland's actual research and the way it is invoked by Koblitz points to the bottom line on the literature in this field. There are many reports, some of which may provide evidence of something; but if the subject we are addressing is how to make the learning of mathematics more effective and enjoyable, we must weigh the studies which bear on the subject, and leave aside the studies which do not.
What Are We Trying to Achieve? We find this curious sentence in Koblitz's article: "If the best computers in the world are unable to translate from French into English, then they certainly cannot help my calculus students do what is the main point of the course: translating word problems into mathematics." We share Koblitz's goal that students should learn to translate word problems, although we would rather think of it as using mathematical structures to model situations described in word problems. His argument here seems misdirected. It is superficial to insist that the computer be capable of doing what we wish the student ultimately to accomplish. The computer may play a role different from the human's, as a piano may accompany an oratorio. But there is a deeper issue. Does Koblitz really believe that the main point of learning calculus is translation of word problems in the w a y that he could imagine a computer doing? To us, the point is to help students learn to respond to a situation by mathematizing it, by constructing functions and relationships which can make THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
19
sense of it, and by using tools such as calculus to answer questions about it. This is a very different matter from mindless translation of the sort that computers might (but, in general, cannot) do. It may be that Koblitz is in agreement with us on this substantive point, but he blames the computer for a failure to think through an appropriate pedagogy. Let's agree that translation (in the sense of modeling) is important: why can't we use appropriate computational tools to help us, and reject those that are inappropriate? Certainly the work we have referred to suggests that appropriate uses do exist. Turning to computers themselves, Koblitz offers the following banal observation: "What children need in order to become mathematically literate citizens in the computer age is not early exposure to manipulating a keyboard, but rather wide-ranging experience working in a creative and exciting way with algorithms, problem-solving techniques, and logical modes of thought." If anybody thinks otherwise, it must be an advertising agent of a computer company, whom we join Koblitz in despising. But why is there that word "rather"? Is exciting work for children possible in some technologies but not in others? Are pens and paper acceptable? Presumably. Light pens and word processors, presumably not. Dynamic geometry tools and programming languages, definitely not. We ask Koblitz this question: Where does he draw the divide? Does the inclusion of a chip rule out the tool? And if so, why should the tools used by the most creative professionals (including creators and users of mathematics) be closed to children?
Cash, Computers, and Choices Koblitz says that "money corrupts." He reiterates the negative effect of corporate profit on education. We concur. He ridicules "believers" who say, "Graphics calculators have changed m y life." We take his point. Our (horror: electronic) thesaurus says, "Belief: creed, conviction, faith, tenet, doctrine." Koblitz is objecting to those who make high-tech a religious prejudice. It is a shame that he reduces his objection to the same level. If Koblitz is really concerned about the corruptive effect of cash, then he should go after the economic system in which the only way that educational materials can be produced and widely disseminated is through someone's making a profit. In such a system, it is natural that profit-makers want more profit, and there arises a conflict between what makes sense educationally and what will sell. Many calculus reformers in the US are just now discovering how difficult this makes it to introduce innovations into their materials. This is a very big problem, but Koblitz is wrong in suggesting that it has to do with computers. It goes much deeper. In his understandable effort to resolve the dilemma, Koblitz concludes that "low tech is better." He gives four examples, one of which is a "real world" calculus prob20 THEMATHEMATICALINTELLIGENCERVOL.18,NO.1,1996
lem based on continuously compounded interest payable on a car loan. Apparently the technology of the internal combustion engine, the business practice of banks, and the role of the car in consumerism are on the right side of Koblitz's technological divide. Whose "real world" does Koblitz have in mind? And can he cite a single study that shows that using his kind of example has any positive effect on student learning? We wouldn't even dare to ask for such information about the effectiveness of the "low-tech approach using some applications-oriented lecture notes that [Koblitz] had written." Koblitz must know that such an approach has been tried by a multitude of mathematics teachers. Some, after looking at the results, have concluded that much more serious thinking about learning is needed. Some of us, mathematicians and educators, feel that we are not in a position to be complacent, and that we cannot afford to reject out of hand new technologies that might--just might--help us in our task.
References Breidenbach, D., Dubinsky, E., Hawks, J., & Nichols, D. (1992), Development of the process conception of function, Educational Studies in Mathematics 23, 247-285. diSessa, A., Hoyles, C., & Noss, R., eds. (1995), Computers and Exploratory Learning. Berlin: Springer-Verlag. Dubinsky, E. (1989), Teaching mathematical induction II, The Journal of Mathematical Behavior 8, 285-304. Dubinsky, E. (in press), On learning quantification, Journal for Computers in Mathematics and Science Education. Dunham, P. (1993), Does using calculators work? The jury is almost in, UME Trends 5.2, 8-9. Hoyles, C. & Noss, R., eds. (1992), Learning Mathematics and Logo. Cambridge MA: MIT Press. Hoyles, C. & Noss, R. (1992), A Pedagogy for mathematical microworlds, Educational Studies in Mathematics 23, 31-57. Hoyles, C. & Noss, R. (in press), Windows on Mathematical Meanings. Dordrecht: Kluwer. Noss, R. (1994), Structure and ideology in the mathematics curriculum, For the Learning of Mathematics 14, 1. Noss, R. (1995), Reading the Sines. Unpublished report of the evaluation of the Teaching and Learning with Technology Project #15, Institute of Education, London. Pea, R. & Kurland, M. (1984), On the cognitive effects of learning computer programming, New Ideas in Psychology 2, 137-168. Smith, R. (1992), Spreadsheets at Joint Meetings in Baltimore, UME Trends 4.2, 1-3. Ed Dubinsky Department of Mathematics Purdue University West Lafayette, IN 47907-1968 Richard Noss Department of Mathematical Sciences University of London, Institute of Education 20 Bedford Way London WCIH OAL England
Hilbert's Problems and Their Sequels Jean-Michel Kantor II n'y a pas de probl~mes rdsolus, il n'y a que des probl~mes plus ou moins rdsolus. 1 Henri Poincar6
"To speak of the future of mathematics is an exercise in fantasy which can not be discouraged too strongly; it is really a p u r e absurdity . . . . [The state of mathematics in the year 2000] will be just fine if the atomic physicists or some peace conferences don't s u d d e n l y interrupt the course of progress" (R. Godement, 1948, Gen. Ref. [LeL]2). T o d a y the historical d e v e l o p m e n t of the subject can no longer be treated more than partially, and even then it takes a team such as was assembled at the conference in the United States in 1974 on the problems David Hilbert had posed in 1900. Poincar6 and Hilbert were no d o u b t the last mathematicians to have an overall conception of their science; it is no longer within the scope of one mathematician to handle the mathematical legacy of Hilbert. This m a y serve to excuse in part such omissions and errors as m a y be found in the following, which are m y responsibility alone, despite the kind help of V.I. Arnold, M. Berger, M. Berry, H. Br6zis, P. Cartier, J.-L. Colliot-Th616ne, J. Dixmier, M. H i n d r y , J.-P. Kahane, V. Kharlamov, B. Malgrange, Y. Matijasevich, B. Mazur, J. Milnor, C. Sabbagh, J.-P. Serre, G. Tenenbaum, and M. Waldschmidt. M y w a r m thanks to them, and to the staff of the interuniversity science library of Jussieu, and to Alberto Arabia.
Who would not lift the veil that hides the future from us, to have a look at the progress of our Science and the secrets of its further development in future centuries? [H, p. 58] H e lectured on 10 of the 23 problems later published in the Proceedings of the Congress: On foundations--P.r.obtems 1, 2, and 6. Four problems on arithmetic and a l g e b r a - - P r o b l e m 7, Irrationality and transcendence; P r o b l e m 8, The Riemann hypothesis; P r o b l e m 13, Superposition of two functions; and P r o b l e m 16, Ovals and limit cycles. Three m o r e on theory of f u n c t i o n s - - P r o b l e m 19, Calculus of variations; P r o b l e m 21, Fuchsian differential equations and m o n o d r o m y ; a n d Problem 22, Uniformization. To Hilbert, the choice of problems was supposed to show the d e e p unity of mathematics and support his declaration of faith in a glorious future (in mathematics there is no " i g n o r a b i m u s ' - - D u b o i s - R e y m o n d ) . The
The Context 3
Poincar6 m a r k e d the first International Congress of Mathematicians in 1897 b y a lecture on the relations between mathematics and physics. When Hilbert was invited to speak to the next International Congress (Paris, 6-10 August 1900), he hesitated between an answer to Poincar6 and a list of problems to stimulate the research of the new century. He asked Minkowski, "Might I point to probable directions of the mathematics of the new century . . . a look to the future?" Minkowski was enthusiastic. At Paris, Hilbert had a rapt audience:
Jean-Michel Kantor
Jean-Michel Kantor of the Universit~ de Paris VII has been
"From Plato to Queneau" on mathematics for students in
1,,There are no solved problems, there are only more-or-less solved problems." 2Referencesare indicated by authors' initials. Some are general (Gen. Ref.) and some pertain to particular problems. 3For the entire section, see Gen. Refs. [HI and [MI.
the French adaptation of the beautiful book Rasskazy o matematikakh i fizikakh by Simon G i n ~ , under the title
Pendules, horloges et mdcanique cdleste (Diderot Editeurs, Paris, I995). We await future installments.
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1 9 1996 Springer-Verlag New York
21
Hilbert problems went on to have the success we all know; their history is rich in instructive anecdotes, but also in blind alleys. There are also some surprises waiting in their present status. Let me try to bring the problems up to date; I emphasize the progress made between 1975 and 1992, because a serious work was produced by the American Mathematical Society seminar in 1974. For an introduction to the situation in number theory (before the "Wiles bomb"), see [Ma].
T h e Present Status Problems 1 and 2: The continuum hypothesis; consistency of arithmetic. Associated with the first two problems are the names of Kurt G6del and Paul Cohen. The continuum hypothesis, the subject of Problem 1, may be stated as Every uncountable subset of R has the same cardinality as R,
of exhaustion, whereasfor plane figures the calculation of the area of any polygon is reducible by decomposition ("scissors geometry") to that of the square. To show that such a reduction is impossible for volumes. Actually the problem was already solved by Bricard (1896) and Hilbert's student Dehn (1900), based on the notion of invariant; two solids which can be cut up into congruent pieces necessarily have the same Dehn invariant. Call "scissors congruence invariant" an element D(P) (of a group) associated to every polytope P in such a way that D(P N P') + D(P 0 P') = D(P) + D(P') if P, P', and P U P' are polytopes. Further, D(P) = 0 if the polytope is planar; and for g a motion of space, D(g(P)) = D(P). The Dehn invariant is defined as follows. The group of angles between lines in the plane is identified with R/2"rrZ, and one sets D(P) = Z
ILil Q
8i, D(P) E R C)z(R/2rtZ),
i or
2~~ = R1. The consistency of arithmetic was treated by G6del, who showed (1931) the incompleteness and the undecidability of arithmetic. G6del showed in 1938 that if set theory (according to the Zermelo-Frankel axioms) is consistent, then it remains consistent if augmented by adjoining the axiom of choice a n d / o r the continuum hypothesis. Finally, in 1963 Paul Cohen showed that if the Zermelo-Frankel axioms are consistent, then the negation of the axiom of choice or even the negation of the continuum hypothesis4 can be adjoined and the theory will remain consistent! G6del's impact was felt especially outside of mathematical circles: it did not change the way of working of mathematicians--with the obvious exception of logic i a n s - a n d it gave rise to a huge trove of citations eligible for the next edition of the Dictionary of Stupidity. 5 J.Y. Girard sums up G6del's discovery by this image: There are things in mathematical thought which are not part of the mechanism. [G] Problem 3: Congruence of polyhedra. "On the equality of the volumes of two tetrahedra with equal base and altitude." Two pyramids with triangular base and the same altitude are in the same ratio as their bases. The proof (Euclid) uses the method
where Li is an edge and 3/the dihedral angle at Li. Then Dehn shows that the cube and the regular tetrahedron of the same volume do not have the same Dehn invariant. The problem was considered as solved. However, Sydler in 1965 showed that two polytopes are equivalent if and only if they have the same volume and the same Dehn invariant (see [B], [C], and [S]). On the other hand, no analogous results are known in higher dimensions, or in non-euclidean geometry in three dimensions or more. More recently, other invariants, the Hadwiger invariants, were introduced in arbitrary dimensions, with respect to the group of translations, and it was shown that for two polytopes to be equivalent when only translations are allowed, it is necessary and sufficient that they have the same Hadwiger invariants. We refer to [C] for the connection to the calculation of Eilenberg-Mac Lane homology groups and for current developments. Problem 5: Topological groups and Lie groups. Is a locally euclidean group a Lie group? This problem was in fashion in the 1950s. It was solved in 1953 by Gleason and by Montgomery and Zippin. The following question still remains open: Given a locally compact topological group acting faithfully on a topological manifold, is it a Lie group? For example (actually, equivalently), can the p-adic integers act faithfully on a compact topological manifold? We observe that the Montgomery-Zippin theorem plays a crucial role in a recent important work of M. Gromov [G].
4The generalized continuum hypothesis implies the axiom of choice. 5Example: R. Debray (Le Scribe, 1980): "From the day that G6del s h o w e d that there is no consistency of Peano arithmetic formalizable within that theory (1931), political scientists were in a position to understand w h y Lenin had to be m u m m i f i e d and d i s p l a y e d . . , in a mausoleum." 2 2 THE MATHEMATICALINTELLIGENCERVOL.18, NO. 1, 1996
Problem 6" Axiomatization of physics. There are two main reasons w h y all relation has disappeared between this problem as Hilbert stated it and as it would be stated today:
1. The revolution in modern physics: relativity, general relativity, quantum physics. 2. The introduction of new mathematical tools, first of all Hilbert space. Hilbert was starting work on it in winter 1900, in the following spirit: It appears to me of outstanding interest to undertake an investigation of the convergence conditions which serve for the erection of a given analytic discipline so that we can set up a system of the simplest fundamental facts which require for their proofs a specific convergence condition. Then by the use of that one convergence condition alone--without the addition of any other convergence condition whatsoever--the totality of the theorems of the particular discipline can be established. (Gen. Ref. [R, p. 85])
Stimulated by his reading of Fredholm's work, Hilbert introduced the space which bears his name. One notes with astonishment that he makes no mention of this research in the problems stated at the Paris Congress. The notion of Hilbert space completely transformed the mathematical formalism in use in theoretical physics. Skipping forward a century, if we try to indicate the most active directions of the present, these seem to stand out: General relativity and global differential geometry. Construction of manifolds which could be effective cosmological models (S. Hawking, R. Penrose). Mathematics of quantum field theory; gauge theory; introduction of quantum groups in the formalism of quantum physics. Problem 7: Irrationality and transcendence of od3 for c~ algebraic and j8 algebraic irrational (for example, 2x/2). The problem was solved by Gel'fond (1935) and Th. Schneider. Euler's constant remains as mysterious as ever. Let us note some recent progress in this area: A. Baker (1966, Fields Medal in 1970): If the ai are nonzero algebraic numbers whose logarithms are linearly independent over Q, then
1, log al . . . . . log an are linearly independent over the field of algebraic numbers. In the last 20 years tools have been brought into play from various mathematical domains. The theory of functions of several complex variables enabled Enrico Bombieri (Fields Medal 1974) to resolve a conjecture of Nagata and enabled W.D. Brownawell in 1985 to give the first effective version of the Hilbert Nullstellensatz. The latter theorem is useful in problems of algebraic independence; the method initiated by A.O. Gel'fond in 1949 and developed by G.V. Chudnovsky in the 1970s enabled him to prove the transcendence of numbers like F(1/4) or F(1/3) (where F is the Euler gamma function). Commutative algebra, with Chow forms, was introduced in this context with great success by Yu.V.
Nesterenko. Results on transcendence involving the usual exponential function or elliptic functions (Th. Schneider) were extended to (commutative) algebraic groups by S. Lang in the 1960s; this theme was abundantly developed especially by D.W. Masser, first alone and then in collaboration with G. W/istholz, in connection with the work of Faltings on the Mordell conjecture. This result of G. Faltings implies that, for a fixed p -> 3, the equation xP + yP = zP
has a finite number of solutions without a common factor. What remained of the Fermat problem was to show that this number is zero. K. Ribet showed that the Fermat conjecture is implied by a deep conjecture (Taniyama-Weil) related to number theory and also to algebraic geometry and group theory. This got the Fermat problem out of its isolation.6 Problem 8: The Riemann hypothesis. Surely the most celebrated problem in the history of mathematics; according to Hilbert, the most important problem of mathematics, and he even Went so far as to say in conversation that it was the most important problem for humanity! (loc. cit.) It is now known that more than 40% of the zeros are on the fateful line (Gen. Ref. [C] 1989), and the fluctuations among the zeros have been studied. It seems, as was suggested very early, that the distribution of zeros resembles that of the eigenvalues of a self-adjoint operator (Hilbert and P61ya, about 1915). About 1973 Dyson suggested that the operator could be a random hermitian operator. For several decades, progress on the hypothesis has been essentially technical, and the most powerful computers have been tried on it. Some of the achievements have been real tours de force ([B-I], cf. [G-K]). On the other hand, there have been attempts, following a suggestion by Hilbert, to generalize the Riemann hypothesis by replacing the field of rationals by a field of functions on a projective curve defined over Fq. This was a motivation of the founders of modern algebraic
6Having already cited Gen. Ref. [Ma] for number theory through the 1980s, let me draw on the introduction to the 1993 English edition for the recent developments: " . . . the famous problem 'Fermat's Last Theorem,' together with the Taniyama-Weil conjecture for semi-stable elliptic curves, seem to have been completely proved by A. Wiles. Wiles used various sophisticated techniques and ideas due to himself and a number of other mathematicians (K. Ribet, G. Frey, Y. Hellegouarch, J.-M. Fontaine, H. Hida, J.-P. Serre, J. Tunnel . . . . ). This genuinely historic event concludes a whole epoch in number theory, and opens at the same time a new period which could be closely involved with implementing the general Langlands program . . . . One of the characteristic features of the new methods and ideas is the intensive use of p-adic L-functions and Galois representations. Another striking example of this feature is K. Rubin's construction of rational points on elliptic curves using special values of p-adic L-functions and their derivatives." THE MATHEMATICALINTELLIGENCERVOL.18, NO. 1, 1996 23
geometry, A. Weil and O. Zariski. The "Weil conjectures," stated in 1949, which extended the earlier conjectures to higher-dimensional varieties, were proved in 1973 by P. Deligne (Fields Medal 1978) (Gen. Ref. [K]). Problem 10:
"On the possibility of solving a diophantine
equation." The problem is to give an algorithm which tests the solubility of diophantine equations (polynomial equations with integer coefficients whose integer solutions are sought). As the statement indicates, the problem is at the border between logic--recursion theory--and number theory. It was settled negatively by Yu. Matijasevich in 1970. This negative result was obtained by deepening our knowledge of recursive sets (recursively enumerable or "listable') and diophantine sets; that is, sets of integer parameters ai for which
P(al, a2 . . . . .
an,
Zl, Z2.....
Z m) = 0
has solutions (z) in integers. Matijasevich's theorem asserts that these two families of sets of integers are the same. Astonishing application: there exists an effectively computable polynomial (in 10 integer variables) with integer coefficients whose positive values are precisely the set of prime numbers! Other problems of the same sort remain unsolved; for example, does there exist a system fiX1, X 2. . . . .
Xn,
t) -- 0
numerous mathematicians around the world are laboring (whereas Langlands himself has turned to percolation!). See Gen. Ref. [Ma]. In the abelian case, one would like to describe explicitly the abelian extensions of a number field K which is a finite extension of Q, in terms of the values of certain special functions (exponential, elliptic) and the action of the Galois group. For Q, the Kronecker-Weber theorem gives an explicit description of the abelian extensions by means of the action of the Galois group on the roots of unity. The work of Shimura and Taniyama [S-T] concerning abelian varieties with complex multiplication provides an almost complete answer for "CM fields" (totally imaginary quadratic extensions of a totally real field).
"Impossibilityof solving the general equation of degree 7 in terms solely of functions of two arguments": Problem 13:
superposition of functions. The equation of third degree can be reduced by translation to
X3+pX+q=O, which has the solution (Scipione del Ferro, 16th century)
X= ( - q
+
(1)
of equations with integer coefficients which has rational solutions (x~. . . . . Xn) if and only if the parameter t is an integer? Otherwise stated, does there exist a morphism of varieties over Q V --> Affine line such that the fiber at a is nonempty if and only if a is an integer? Problem 11: Classification of quadratic forms with coefficients in rings of algebraic integers.
( _ q _ ~ A p 3 + a Y q 2 ) 1/3 2
4(27)
to
P(X) = 0
Problem 12: This concerns generalizationsof Gauss's quadratic reciprocitylaw, the law from which has flowed the modern algebraic theory of numbers. Consider the equation (in integers) x2 + 1 ~ 0 (mod p) This equation has solutions if and only if the prime p is congruent to 1 modulo 4. This rule is at the origin both of group theory and of modern number theory. To sum up the ramifications of this problem, one would cite class field theory (Hilbert, and around 1930 Furtwangler, Takagi, E. Artin); the introduction of methods of cohomology of groups; the study of L-series; and the vast "Langlands program" aiming to extend the quadratic reciprocity law to the non-abelian case, on which THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
"
The equation of fourth degree can be solved by superposition (composition) of addition, multiplication, square roots, cube roots, and fourth roots. To try to solve algebraic equations of higher degree (a vain hope according to N. Abel and E. Galois), the idea of Tschirnhaus (1683) was to adjoin a new equation:
See Gen. Ref. [K].
24
~/4Pg+a7q2) 1/34(27)
one adjoins Y = Q(X), where Q is a polynomial of degree strictly less than that of P, chosen expediently. In this w a y one can show that the roots of an equation of degree 5 can be expressed via the usual arithmetic operations in terms of radicals and of the solution ~b(x) of the quintic equation
X5+xX+ I
=0,
depending on the parameter x. Similarly for the equation of degree 6, the roots are expressible in the same way if we include also a function O(x,y), a solution of a 6th-degree equation depending on two parameters x and y.
For degree 7 we would have to include also a function or(x, y, z), solution of the equation X 7 + xX 3 + yX 2 + z X + 1 = O. Hence the natural question: can or(x, y, z) be expressed by superposition of algebraic functions of two variables? V.I. Arnold asked whether cr can be represented using as "irreducible branches" of the superposition mappings topologically equivalent to algebraic functions of two complex variables. Despite results of V. Lin, this question, along with the original question of D. Hilbert, remains unanswered. Let us note one more strange result in this circle of ideas: The solution of X5 + xX2 + y X + I = O cannot be expressed using algebraic functions of one variable, addition and multiplication (division is excluded!). (A. Khovanskii, see [R]; historical articles UPK] and [D].) A.N. Kolmogorov's important notions of complexity (created around 1955) were the subject of interesting comments by Yuri I. Manin at the International Congress at Kyoto (Gen. Ref. [C]). Hilbert had had another motivation for this problem: nomography, the method of solving equations by drawing a one-parameter family of curves. This problem, arising in the methods of computation of Hilbert's time, inspired the development of Kolmogorov's notion of e-entropy; among its applications is its crucial role in theories of approximation now used in computer science. Let us briefly recall the history. Contrary to the expectations of Hilbert and of contemporary mathematicians, in 1957 V.I. Arnold (then a student of Kolmogorov) showed that any continuous function f of three variables can be written
fix, y, z) = Z fi(4i(x, y), z)
i
with functions fi and ~bi continuous. A few weeks later Kolmogorov showed that any continuous function f of n variables can be written in the form 2n+1
P r o b l e m 15: "Rigorous foundation of the enumerative geometry of Schubert." Schubert calculus. Schubert's study of the number of points of intersection of varieties--for example, of the number of lines meeting four given lines in 3-space was taken up by the Italian school (Severi). The nonrigorous arguments--'principle of conservation of numbers," that is, heuristic arguments of continuity--grew into the modern theory of multiplicities of Pierre Samuel and Alexander Grothendieck (Fields Medal 1966). A topological point of view was taken by Ren6 Thorn (Fields Medal 1958): Thom-Bordman polynomials, enumerative theory of singularities. Problem 15 can not yet be regarded as solved, despite some recent progress (Demazure, Fulton, Kleiman, R. MacPherson). P r o b l e m 16: "Problems of the topology of curves and of algebraic surfaces." This is the only problem directly about topology. It falls into two distinctparts. A: Consider a nonsingular curve of degree m in p2(R). From a topological point of view such a curve is composed of ovals of which there are at most l(m - 1) (m 2) + 1, and this bound is the best possible. Curves attaining the maximum are called M-curves. The problem is to study what configurations of the ovals are possible. The problem is nontrivial beginning with m = 6, the case raised by Hilbert; this was solved by Gudkov in the 1970s. (See Fig. 1.) In 1933 Petrovskii proved inequalities concerning the topological invariants of
B = {xlf(x) >- 0}, where f determines a curve of even degree; general inequalities were proved by Petrovskii and Oleinik (1949-1951), mainly concerning surfaces in R 3, and similar results were obtained by Thorn and Milnor (1964), see [R]. In 1971 V. Arnold proved congruence relations
f
f(Xl, X2..... Xn) = i~=1fi~~j 49j(Xj)), where the 4~j are continuous, monotone, and independent of f. In the opposite direction are the results of Vitushkin (1954). When we deal with superpositions of formal series, or of analytic functions, or even of infinitely differentiable functions, we can show by an elementary enumeration technique f(~llowed by a Baire argument that, for example, almost every entire function has at an arbitrary point of C 3 a germ which is not expressible by superposition of series in two variables. So there are many more entire functions of three variables than of two. The result of Vitushkin is that for functions of smoothness c~ (with ~ larger than 1, not necessarily an integer), representability depends on n/oe. THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
25
ooo
ooo 000 !Hamack (1878)
OO
0 iHilbert (1894)
Gudkov (1973)
Figure 1. Possible arrangements for m = 6.
connecting the number P of even ovals (contained in an even number of ovals) and the number I of odd ovals. For a curve of even degree m, P - I is congruent to (m/2) 2 modulo 8. For degree 8 the problem is still not solved. An account of the work, from O. Oleinik and I. Petrovskii (1949) to O. Viro (1979), is in [R]. The Leningrad school (Kharlamov, Gudkov, Rokhlin, Viro) turned up subtle relations between a nonsingular complex algebraic variety and its real part [V]. The current work is very technical. The results have applications in analysis (partial differential equations, completely integrable systems). B: Problem of finiteness of limit cycles. Given a vector vield V in the plane, a cycle is a periodic trajectory, and a limit cycle is a cycle on which nonperiodic trajectories accumulate. When V is analytic, a limit cycle is isolated in the set of cycles. But it is possible for cycles to accumulate at a point or more generally on a "polycycle" (concept due to H. Poincar6) formed of pieces of integral curves ending at singular points of V. Hence the natural question posed by Hilbert: Given a vector field V = (X, Y), X and Y being polynomials of degree < n, find the maximum possible number of limit cycles. To begin with, one would want to try to prove Conjecture: A polynomial vector field in the plane has only finitely many limit cycles or polycycles.
In 1923, Dulac proved this conjecture, but his proof was insufficient, as was recognized in the 1970s by Yuli II'yashenko. However, some of Dulac's ideas were revived by II'yashenko and Ecalle, in different directions. II'yashenko (1984) proved that the finiteness conjecture holds on the complement of a proper algebraic subset of the vector space of vector polynomials of degree n. Actually, the proper setting is that of singular foliations F (defined locally by analytic vector fields with isolated singular points). The return-map is defined in the neighborhood of a polycycle like C (Fig. 2). There exists an analytic curve T:[0, 1]--~M with T(0) E C such that T is transversal to F and the leaf through T(t)--for t sufficiently small--cuts T again for the first time at T(f(t)). The mapping f leaves 0 fixed, preserves orientation, and is analytic on a punctured neighborhood of 0, but need not be analytic at 0. Thus it may have a sequence of isolated fixed points (tn) which 26 THEMATHEMATICALINTELLIGENCERVOL.18,NO.1,1996
converge to 0. C is then the limit of the limit cycles through the T(tn). It is to be shown that a polycycle cannot be the limit of limit cycles, which is to say that if f is not the identity, then 0 is an isolated fixed point. This would imply the finiteness by the Poincar6-Bendixson theorem. II'yashenko's proof of the finiteness conjecture and Ecalle's are given in [I] and [E]; Ecalle pursues his investigation, which uses his theory of generalized asymptotic developments (and resurgence) to study the return-map f. Let me mention finally that there is a "linearized" version of Problem B [A-I]. Problem 17: Sums of squares:
"Representation of definite
forms by sums of squares." Here "definite" means that the forms (homogeneous polynomials) in n variables with real coefficients have non-negative values for all real values of the variables.
Can they always be represented as quotients of sums of squares of forms? This has been a motivating problem in real algebraic geometry. T h e o r e m of Emil Artin (1927): Let X be an irreducible
algebraic variety over R or Q, let f be a rational function on X, and assume f is positive at the real points of X where it is defined. Then f is a sum of squares in the function field of X (functions with coefficients in Q, if X is defined over Q). T h e o r e m of Pfister (circa 1970): Let X be an irre-
ducible algebraic variety over R of dimension d, and assume f in the function field of X is positive (in the sense above). Then f can be written as the sum of at most 2a squares. Hilbert had already studied the conditions under which a homogeneous polynomial of degree m in n variables can be written as a sum of squares of homogeneous polynomials. Recently, this has been studied for analytic functions, for differentiable functions, and for Nash functions; see [B]. Real algebraic geometry is attracting renewed interest, by virtue of its relations with
Figure 2.
Polycycle (of a vector field) and return-map
logic (elimination principle of Tarski and Seidenberg, model theory) and with industrial applications (robotics), but it is intrinsically more complicated than complex analytic geometry. Problem 18: "To rebuild space with congruent polyhedra." Hilbert's problem is divided into three distinct parts. A: "In Euclidean space of n dimensions, show that there
are only finitely many different kinds of groups of displacements with a (compact) fundamental domain." In other words, one looks for discrete subgroups with compact quotient of the group E(n) of isometries of R n. The result was proved by Bieberbach in 1910. The classification of these groups is important in crystallography and it generalizes to questions about lattices in Lie groups. B: "Tiling of space by a single polyhedron which is not a fundamental domain as in A." J. Milnor's remarks in Gen. Ref. [M] still apply: This is a very lively topic today [G-S1]. Important developments have been the theory of Penrose tilings [P] and the closely related physical problem of anomalous crystal structure (see, for example, [J]), as well as Thurston's theory of self-similar fractal tilings [T]. C: "Packing of spheres. How should spheres of the same radius be arranged in space (of any dimensionality) so as to achieve the greatest density of packing?" Hilbert's text gives the impression that he did not anticipate the success and the developments this problem would have. The hexagonal packing in the plane is the densest (proof by Thue in 1882, completed by Fejes in 1940). In space, the problem is still not solved. There is very recent progress by Hales. For spheres whose centers lie on a lattice, the problem is solved in up to eight dimensions. The subject has various ramifications: application to the geometry of numbers, deep relations between coding theory and sphere-packing theory, the very rich geometry of the densest known lattices. Before entering on analysis, Hilbert poses, in general, the question of choosing the class of functions with which to work: differentiable, analytic... ? There are other, intermediate classes which in my view have not been looked at sufficiently--for example, classes of functions defined by inequalities on their derivatives (Gevrey classes) or functions which are solutions of certain types of differential equations (differentiably algebraic functions). Problems 19, 20, and 23: Partial differential equations: calculus of variations and the Dirichlet problem. The developments in calculus of variations are so diverse (finite-element method, control theory) that it is impossible to survey them here. They play a central role in the study of nonlinear phenomena (Plateau problem, equations of minimal surfaces) and, in the form of optimal control, in industrial applications of mathematics. Taking for granted the exposition by James Serrin (Gen.
Ref. [M]), I may indicate for Problem 20 (solvability of boundary-value problems) developments involving elliptic systems, both linear with nonregular coefficients and nonlinear: study of harmonic mappings; questions of regularity motivated by global geometry, as in the work of Schoen and Uhlenbeck (cf. Gen. Ref. [C], Karen Uhlenbeck's article); mechanics (elasticity), or more recently the study of liquid crystals (Gen. Ref. [I], 1988 and 1990). The study of extensions of the calculus of variations (Problem 23) has had a very wide development in recent years. Take for example the work of Bahri on the existence of periodic solutions of the three-body problem. Problem 21: Monodromy of Fuchsian equations. "Does
there always exist a linear differentia~ equation of the Fuchs class having given critical points and a given monodromy group?" One might try to prove the existence on the Riemann sphere of the following-objects with given singular points and given monodromy groups: A: Fuchsian equations (the Fuchs class): a linear first-order differential equation whose monodromy matrix has only simple poles at the given singular points; B: one requires all the solutions to have regular singular points, that is, with moderate growth in angular sectors (this is the problem treated by Deligne [D]); C" Fuchsian systems. "Fuchsian" implies singular regular but not conversely. Plemelj (1930) claimed a solution to A. However, there was an error in his proof (see Gen. Ref. [E], vol. 1), and counterexamples have been found (see exposition in [B]). Problem 22: Uniformization of analytic curves. This is a fine example of a crossroads problem: topology, complex function theory, group theory, partial differential equations . . . . . Recent developments concern the extension to higher dimensions (Griffiths). Some Reflections
One must not overlook little problems although they seem like dead ends without much future: there are many such in Hilbert's list, and they gave rise to important developments (particularly in geometry). The phenomenon keeps occurring since then: the Henon attractor; non-periodic tilings of Penrose, used and studied by physicists, by Connes ...
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
27
If we look at the whole list of Hilbert problems and the works that have grown out of them, it is clear that they connect to an important part of this century's mathematics. They stimulated major efforts of numerous mathematicians: who in this century has not dreamed of solving a question posed by Hilbert? However, let us look at the topics locally, undistracted by the impressive figure of their author. Even aside from the absence of certain important questions (functional analysis, measure theory) and the minor part given to others (topolo g y - n o doubt Poincar6 has something to do with this), some mathematicians find the interest excited by Hilbert for certain problems to have been disproportionate. J. Dieudonn6 [LeL] judges a good portion of these problems "isolated," but this reproach can be invalidated, as we see, for example, from the recent work that "uncloisters" Fermat's conjecture. Some problems not presented at Paris (Problems 3, 5, and 15) and their sequels also took on luster from the great mathematician.
Problems, Program? As his correspondence with Minkowski shows, Hilbert must have harbored the dream of writing the program of action for the mathematicians of the 20th century--at least the first stages. This was in line with his structured, centralized vision of mathematics: G6ttingen at the center, number theory at the heart of mathematical science . . . . Didn't Ostrowski even find a curious resemblance to another great advocate of programs of action, V.I. Lenin? Rather than a program, which would inevitably be lacking in means of implementation, it was more a matter of a project, a projection toward the future, bringing out ideas dear to Hilbert: mathematicians' power to resolve crises (there are no problems without solution--contrast the dialectical point of view of Poincar6); the deep unity of mathematics, around number theory; mathematics as independent of the physical sciences. The other subject that Hilbert had thought of treating in his Paris lecture, the relationship between pure mathematics and physics, he would take up in 1930, after his retirement, when he was made honorary citizen of the great city of Koenigsberg-Kaliningrad, the city of Kant and Jacobi. In his talk, which he summarized in an interview on the local radio, the defense of pure mathematics is linked, by w a y of a reading of Kant, to his profound conviction --which he asserted as if to prove of the omnipotence of mathematics:
We must know. We will know. And on these words one hears a great burst of laughter from Hilbert. What could it mean? An intuition of what was happening? Two months later the Monatshefte fflr Mathematik received an article from the 25-year-old mathematician Kurt G 6 d e l . . . 28
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
Yesterday, Today, Tomorrow With the year 2000 coming up, it is tempting--but dangerous--to imagine an analogous effort. Indeed, other mathematicians since Hilbert have dared to envisage partial programs (A. Weil for one [LeL], or the team assembled by F. Browder [M], or finally the "sketch of a program" of A. Grothendieck [G]). Lucky chance, or sign of the end of an era? Klein's Encyclopedia of Mathematical Sciences marked the end of the century; Hilbert's lecture marked the start of a formidable development (say 1900-1970). The appearance today of the Encyclopaedia of Mathematical Sciences [E] could fill a role analogous to the former. But how could we imagine it followed by a "new Hilbert program"? In the last 20 years the progress of mathematics has been considerably transformed. Whereas mathematics in the first part of the century, up to the sixties, was motivated essentially by an internal logic, and thus came under the sponsorship of Hilbert (and his disciple Bourbaki), today "external needs"--in a global sense--are the principal motor driving mathematics as a science. Poincar6's revenge. The elaboration of electronic processing of information allows a "positive feedback" between theory and applications. This is indeed what the editors of the Encyclopaedia of Mathematical Sciences say (Vol. 1), speaking in their introduction of "the industrialization of Mathematics." Let us take a few examples: the motivating role of computer science in research in logic [I]; problems of coding, cryptography, and signal theory in number theory; "experimental" research in this area using the most powerful computers; rapprochement of theoretical physics and mathematics (Fields medals of 1990); topology and formal calculus (F. Sergeraert [I, 1990]); nonlinear systems (chaos, Korteweg-de Vries equation . . . . ); minimal surfaces of constant mean curvature (applications to interfaces of polymers). This new nature of the evolution of mathematics (which merits a deeper study comparing this era to others) is summed up by Jacques-Louis Lions in reference to optimal control theory ILl: "Herm6s, Hilbert program (Problem 23) in practice" (the Kalman filter already has an important role in controlling the Airbus). This transformation naturally carries with it sociological consequences, bringing the ways of the mathematical tribe closer to those of other subcultures: the competition for contracts, the growing weight of military contracts together with a romantic tendency in the practice of research (one dreams, "but perhaps one ought not to dream too much" [F]). In another direction, the relation between philosophy and mathematics, I would refer to [K2] but especially to
[c'].
ILl
J.-L. Lions, L'Ordinateur, nouveau D6dale, Daedalon Gold Medal Lecture, 1991. MathematicalDevelopments Arising from Hilbert Problems (F. Browder, ed.), Providence, RI: American Mathematical Society (1976). Yu. I. Manin and A. A. Panchishkin, Number Theory I: Introduction to Number Theory, in Encyclopaedia of Mathematical Sciences, Heidelberg: Springer-Verlag (1993). H. PoincarG L'avenir des math6matiques, International Congress of Mathematicians, Rome, 1908. C. Reid, Hilbert, New York: Springer-Verlag (1970).
The "we must k n o w " of Hilbert should t o d a y be acc o m p a n i e d by a "we w a n t to act." One could imagine beginning the " p r o g r a m of twenty-first-century mathematics" with a study, d o n e jointly by mathematicians and other scientists, which w o u l d list the i m p o r t a n t scientific problems deserving of mathematicians' efforts-n e w or r e n e w e d - - ( F e y n m a n integrals, turbulence, complex systems not yet studied from this point of view, theoretical biology, cognitive sciences . . . ) and w o u l d extract choices corresponding to a real social d e m a n d . Might a p r o g r a m of this sort, putting the p o w e r of mathematics to work, m a k e David Hilbert l a u g h - - b u t with delight?
References Specific to the Problems
General References
[B]
[C] [C'] [D]
[E] [F] [G] [H]
[I]
International Congress of Mathematicians, Kyoto, 1990. Proceedings, Mathematical Society of Japan/SpringerVerlag, Tokyo, 1992. P. Cartier, La pratique et les pratiques--des math6matiques, in Encyclopddie philosophique universelle, vol. 1, 1991. Ein Jahrhundert Mathematik 1890-1990, Vieweg & S., 1990. Review in Mathematical Intelligencer 13(1991), no. 4, 70-74. Encyclopaedia of Mathematical Sciences, Heidelberg: Springer-Verlag (1990) (Russian original Moscow, 1985.) J. M. Fontaine, Valeurs sp6ciales des fonctions L des motifs, Sdmin. Bourbaki (1992), Expos6 751. Astdrisque 206, (1992). A. Grothendieck, Esquisse d'un programme, Dossier de candidature au C.N.R.S., 1985. D. Hilbert, Sur les problbmes futurs des math6matiques, in Proceedings of the Second International Congress of Mathematicians, Paris: Gauthier-Villars (1902), pp. 58-114. Images des math6matiques, Annual Supplement to Courrier du CNRS, CNRS, Paris. 1985 J. F. Boutot and L. Moret-Bailly, Equations diophantiennes: la conjecture de Mordell. Images des nombres transcendants, d'apr6s P. Philippon. 1988 M. Parigot, Preuves et programmes; les math6matiques comme langage de programmation. J. M. Ghidaglia and J. C. Saut, Equations de Navier-Stokes, turbulence et dimension des attracteurs. 1990 F. Sergeraert, Infini et effectivit6: le point de vue fonctionnel. F. Bethuel, H. Brezis, J. M. Coron, and F. Helein, Probl6mes math6matiques des cristaux liquides.
J.-M. Kantor, Hilbert (probl6mes de), in Encyclopaedia Universalis (1989), vol. 11. [K2] J.-M. Kantor, L'intuition en 6quations, Report on the Kyoto Congress, Le Monde (August 1990). [LeL] A. Blanchard, Les grands courants de la pensde mathdmatique (F. LeLionnais, ed.), (1948), new edition 1962, Paris; in particular the articles by J. Dieudonn6, R. Godement, A. Weil.
[M] [Ma]
[P] JR]
Problems 1 and 2
[G]
J. Y. Bouleau and A. Louveau Girard, Cinq confdrences sur l'inddcidabilitd, Paris: Presses de l'Ecole nationale des Ponts et Chauss6es (1982). E. Nagel, J. Newman, K. G6del, and J.Y. Girard, Le thdor~me de G6del, Paris: Seuil (1989). Problem 3
[A] [B] [C] [S]
Agr6gation de math6matiques, probl6mes de math6matiques g6n6rales, Revue de mathdmatiques spdciales, 1985-1986, Paris, pp. 139-150. Vuib~rt. V. Boltianski, Hilbert's Third Problem, New York: Wiley (1978). P. Cartier, D6composition des poly6dres: le point sur le troisi6me probl~me de Hilbert, Sdmin. Bourbaki (1984-1985), No. 646. G. Sah, Hilbert's Third Problem: Scissors Congruence, London: Pitman (1979). Problem 5
[A] [G] [S]
J. Acz61, The state of the second part of Hilbert's fifth problem, Bull. Am. Math. Soc. 20 (1982). M. Gromov, Groups with polynomial growth and expanding maps, Publ I.H.E.S., no. 53. I. Sillmann, Every proper smooth action of a Lie group is equivalent to a real analytic action: a contribution to Hilbert's fifth problem, M.P.I. Mathematik, Bonn, 1993. Problem 6
[B]
P. Bohl's Fourth Thesis and Hilbert's Sixth Problem, Moscow: Nauka (1986) [in Russian].
[B-M]
A. Baker and D. W. Masser (eds.), Transcendence Theory, Advances and Applications, New York: Academic Press (1977). D. Bertrand and M. Waldschmidt (eds.), Approximations diophantiennes et nombres transcendants, Basel: Birkh~iuser (1983). A. Baker (ed.), New Advances in Transcendence Theory, Cambridge: Cambridge University Press (1988). P. Philippen (ed.), Approximations diophantiennes et nombres transcendants, de Gruyter (1992).
Problem 7
[B-W] [B] [p]
[KI
Problems 8 and 9
[B] [B-I] [G-K]
M.V. Berry, Semiclassical formula for the number variance of the Riemann zeroes, Nonlinearity 1 (1988), 399-407. E. Bombieri and H. Iwaniec, On the order of ~(12+ it), Ann. Scuola Norm. Sup. Pisa 13 (1986), 449-472. Graham and Kolesnik, Van der Corput's Method of THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
29
Exponential Sums, Heidelberg: Springer-Verlag (1991). A. Ivic, The Riemann Zeta Function, New York: Wiley (1985). E.C. Titchmarsh, The Riemann Zeta Function, Oxford: Oxford University Press (1986).
[II [T]
Problem 10
[M] [M1]
Yu. Matiyasevich, The Tenth Problem of Hilbert, Moscow: Nauka (1993) [in Russian]. Yu. Matijasevich, My collaboration with Julia Robinson, Mathematical Intelligencer 14 (1992), no. 4, 38-45.
[R] [V] [Y]
Problem 17
[BS]
Problem 12
[K-S] K. Kato and S. Saito, Global class field theory of arithmetical schemes, Contemp. Math. 55(Part I) (1986), 5-331. IS-T] G. Shimura and Y. Taniyama, Complex multiplication of algebraic varieties and its application to number theory, Publ. Math. Soc. Japan 6 (1961). Problem 13
[D]
J. Dixmier, Histoire du treizi6me probl6me de Hilbert, Sdminaire d'histoire des mathdmatiques de l'Institut Henri Poincard, 1991-1992. [JPK] J.-P. Kahane, Le treizi6me probl6me de Hilbert: un carrefour de l'analyse, de l'alg6bre et de la g6om6trie, Cahiers Sdmin. Hist. Math S&. 1, 3 (1982). [R] J.J. Risler, Complexit6 et g6om6trie r6elle, d'apr6s A. Khovanskii, S6min. Bourbaki (1984-1985), No. 637. V.Y. Lin, Superposition of algebraic functions, Funct. [L] Anal. ego Prim. 10 (1976), 32-38. Problem 15
A. Lascoux, Anneaux de Grothendieck de la vari6t6 des drapeaux, in The Grothendieck Festschrifl, Vol. III, Basel: Birkh/iuser (1993), pp. 1-34. P. Samuel, Sur l'histoire du quinzi6me probl6me de Hilbert, Gazette des Mathdmaticiens (1975), (Oct. 1974), 22-32
[L]
IS]
[K]
Problem 18, Parts A and B
Problem 18, Part C
[C-S]
[O]
[B] [E]
[I] [K] [M] 30
V. Arnold and Yu. II'yashenko, Ordinary differential equations, in Dynamical Systems--l, Encyclopaedia of Mathematical Sciences, Heidelberg: Springer-Verlag (1988). J. Bochnak, M. Coste, and M.F. Roy, G6om6trie alg6brique r6elle, Heidelberg: Springer-Verlag (1986). J. Ecalle, Introduction aux fonctions analysables et preuve constructive de la conjecture de Dulac, Paris: Hermann (1992); and Six lectures on trousseries, analysable functions, and the constructive proof of Dulac's conjecture, in Bifurcations and Periodic Orbits of Vector Fields (D. Schlominck, ed.), Kluwer Acad. Publ. (1993), pp. 75-184. Yu. II'yashenko, Finiteness Theorems for Limit Cycles, American Mathematical Society Providence, Rh (1991). V. Kharlamov and I. Itenberg, Towards the maximal number of components of a non-singular surface of degree 5 in RP3, preprint (1993). R. Moussu, Le probl6me de la finitude du nombre de cycles, d'apr6s R. Bamon et Y. S. Ii'yashenko, Sdmin.
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
H. Benis-Sinaceur, De D. Hilbert a E. Artin: Les diff6rents aspects du dix-septiOme probl6me de Hilbert et les filiations conceptuelles de la th6orie des corps r0els clos, Arch. Hist. Exact Sci. 29(3) (1984), 267-286. A. Khovanskii, Fewnomials, Providence, RI: American Mathematical Society (1991).
[D-G] L. Danzer, B. Griinbaum, and G. C. Shephard, Does every type of polyhedron tile three-space? Topol Struct. 8 (1983). [D-K] M. Duneau and A. Katz, Cristaux apdriodiques et groupe de l'icosa~dre, Palaiseau, France: Publications du Centre de Physique th6orique, Ecole polytechnique, [G-S] B. Griinbaum and G. C. Shephard, Tiling with congruent tiles, Bull. Am. Math. Soc. 3 (1980), 951-973. [G-S1] B. Griinbaum and G. C. Shephard, Tilings and Patterns, New York: W.H. Freeman (1987). M. Jaric, Introduction to the Mathematics of Quasicrystals, [J] New York: Academic Press (1989). R. Penrose, Pentaplexity, Eureka 39 (1978), 16-22; [P] Pentaplexity: a class of non-periodic tilings of the plane, Mathematical Intelligencer 2 (1979), no. 1, 32-37. [T] W. Thurston, Groups, Tilings, and Finite State Automata, Providence, RI: American Mathematical Society (1989); Research Report GCG-1, Geometry Supercomputer Project, University of Minnesota, Minneapolis (1989).
Problem 16
[A-I]
Bourbaki (1985-1986), No. 655. Astdrisque, 145-146 (1987). J.J. Risler, Les nombres de Betti des ensembles alg6briques rods, une mise au point, Gazette des Mat hdmaticiens (1992), O. Viro, Progress in the topology of real algebraic manifolds, in Proceedings of the International Congress of Mathematicians, Warsaw, 1983, pp. 595-611. J.-C. Yoccoz, Non-accumulation des cycles limites, Sdmin. Bourbaki (1987-1988), No. 690.
[S]
J.H. Conway and N. J. Sloane, Sphere Packings, Lattices and Groups, New York: Springer-Verlag (1988). J. Oesterl6, Empilement de sph6res, Sdmin. Bourbaki (1989-1990), No. 727; "Les sph6res de Kepler," video (series "Mosa'ique math6matique"), Paris: Productions "Les films d'ici." F. Sigrist, Sphere packing, Mathematical Intelligencer 5 (1983), 34-38. Problems 19, 20, and 23
[V]
C. Viterbo, Orbites p6riodiques dans le probl6me des trois corps, Sdmin. Bourbaki (1992-1993), No. 774.
[B]
A. Beauville, Equations diff6rentielles a points singuliers r6guliers d'apr6s Bolybrukh, Sdmin. Bourbaki (1992-1993), no. 765. Astdrisque, 216 (1993). P. Deligne, Equations diffdrentielles ?l points singuliers rdguliers, Heidelberg: Springer-Verlag (1970), p. 136.
Problem 21
[D]
29, rue de Lacdp~de 75005 Paris, France
David Gale* This column is interested in publishing mathematical material which satisfies the following criteria, among others: 1. It should not require technical expertise in any specialized area of mathematics. 2. The topics treated should when possible be comprehen-
sible not only to professional mathematicians but also to reasonably knowledgeable and interested nonmathematicians. We welcome, encourage and frequently publish contributions from readers. Contributors who wish an acknowledgement of submission should enclose a self-addressed postcard.
Triangles and Proofs For the third time in as many years this column will be concerned with the subject of triangles, but with a difference. The earlier columns were concerned with the impact of computers on triangle geometry, whereas here we will go back to basics, taking a new look at some very classical topics. "The point of a generalization is not just to include more cases, but also to get rid of unnecessary hypotheses, and sometimes this can lead to simpler proofs." I recall being told something like this as a graduate student by one of my first mentors, Norman Steenrod. In fact there are two kinds of generalizations. One can either weaken the hypothesis or strengthen the conclusion. The examples to be treated below illustrate both of these approaches; in both cases, finding the right generalization reduces what appears to be a complicated problem to essentially a mechanical verification, a matter of just "turning the crank." For our first example we are grateful once again to Donald Newman.
points. These form the small triangle inside the starting triangle. This small interior triangle is far'from being arbitrary, however, Morley's great discovery (1899) being that it is always equilateral! When I read, or rather tried to read, Morley's proof of this startling theorem, I found it absolutely impenetrable. I told myself that maybe in future years I would return and then understand it. I never succeeded in that, and even when I read the much simpler proof based on trigonometry, or the fairly simple geometric proof due to Navansiengar, there was still too much complexity and lack of motivation. (A series of lucky breaks!) Were we to give up, forever, understanding the Morley Miracle? Or are we failing because we are asking too little? After all, Morley's theorem states that in Figure 1, the inner triangle always will be equilateral. The reason that all the proofs seem to be so difficult and unmotivated is probably because Morley's theorem is really only half the story. The full picture is in Figure 1 and this tells the whole story and indeed proves itself! (This
The Morley Miracle D. J. Newman One of the sad things about the current philosophy of mathematical education is the avoidance of plane geometry. Today's generation, and perhaps their parents as well have not heard of marvels like the 9-point circle, Descartes's theorem, Ceva'-s Theorem, or the marvel of marvels, the Morley triangle. As shown in Figure 1, one takes an arbitrary triangle and trisects its angles, obtaining three intersection
*Column editor's address: Department of Mathematics, University of California, Berkeley, CA 94720 USA.
Figure 1
THE MATHEMATICALINTELLIGENCERVOL.18, NO. 1 9 1996Springer-VerlagNew York 31
References 1. Frank Morley, Extensions of Clifford's theorem, Amer. J. Math. 51 (1929), 465-472. 2. J.M. Child, A Proof of Morley's Theorem, Math. Gaz. (1922), 171. 3. M.T. Navansiengar, Educ. Times, New Series 15 (1909), 47. 4. H.S.M. Coxeter, Introduction to Geometry, Toronto: John Wiley and Sons (1969), pp. 23-25.
A n a t o m y and Evolution of a T h e o r e m on Triangles (by the C o l u m n Editor)
Figure 2
Figure 3 happens often in induction proofs: The fuller statement is easier to prove than the restricted one.) So we turn to the "cheating" strategy as used in [4], namely, we start with the equilateral triangle and build out. The result is Figure 2. Here we have normalized matters by choosing the equilateral triangle to have side 1. Note that from symmetry it will be sufficient to prove that the indicated angle is A/3. At this point we could turn the proof over to a high school trigonometry student who has learned to "solve triangles." Observe that all the side lengths in Figure 2 are determined by angle-side-angle (ASA) of the three constructed triangles. Thus, the law of sines applied to triangle AB'C' gives AC' sin(C + ~r)/3
--
1 sin(A/3)
As every school boy used to know, the medians of a triangle meet in a point (the centroid), as do the altitudes (the orthocenter). (These days you'd be lucky to find a school boy who even knows what a median is.) A less familiar example of concurrence is the Fermat point. This is the point (of an acute triangle) which minimizes the sum of the distances from the three vertices. To find it, one constructs three equilateral triangles having as bases the three sides of the given triangle. Now connect the far vertex of each of these triangles to the opposite vertex of the given triangle. The intersection of these three lines is the desired point. Much less well known is the following theorem or pair of theorems related to a construction sometimes attributed to Napoleon. In the figure below, choose A', B', and C' to be the centers rather than the far vertices of the three triangles. Once again the lines AA', BB', and CC' turn out to be concurrent. (The so-called Napoleon's Theorem says that in this case the points A', B', and C' are themselves vertices of an equilateral triangle!) Further, the three equilateral triangles can be taken either outside or inside of the original triangle.
l
SO
AC'=
sin(C + ~r)/3 sin(A/3)
BC'=
sin(C + ~-)/3 sin(B/3)
Similarly,
Also /_AC'B = 2~r
A+~3
B+vr 3
~r 3
C+27r 3 '
so for triangle AC'B we have Figure 3. For this triangle we know two sides and the included angle, (SAS), so the remaining angles are determined, and they must be A / 3 and B/3, which one verifies by once again using the law of sines. QED 32
THE MATHEMATICAL
INTELLIGENCER
V O L . 18, N O . 1, 1996
The Fermat Point
First G e n e r a l i z a t i o n It turns out that all of these theorems are special cases of an infinite one-parameter family of theorems, which was first discovered over 100 years ago, although apparently it keeps being rediscovered. The theorem, attributed to Ludwig Kiepert, replaces the equilateral triangles of Fermat and Napoleon by any set of three similar isoceles triangles. If the base angle of these triangles is c~, then the Fermat point corresponds to the case ~ = ~r/3, Napoleon to cr = ~-/6 (plus or minus); the centroid and orthocenter are the limiting cases ~ = 0 and o~= Ir/2, respectively (check it out). The locus of the points of concurrence as o~runs from - I r / 2 to Ir/2 is a rectangular hyperbola known as Kiepert's hyperbola. For a recent exposition, see Mathematics Magazine, June 1994, pp. 188-205. Exercise (surprising but easy): Show that the Kiepert hyperbola also passes through the three vertices A, B, and C; thus it is the unique conic passing through the five points consisting of the centroid, the orthocenter, and the three vertices. (Do it. Get in on the fun. I dare you!) Second Generalization It turns out that the one-parameter family above is a special case of a triply infinite family of theorems. Referring to Figure 1 below, we have THEOREM 1. Given a triangle ABC, and points A', B', and C', such that o~ =/_BAC' = LB'AC, ]3 =/_ABC' = /_A'BC, and 3' =/_A'CB = LACB'. Then AA', BB' and CC' are concurrent. Thus, the triangles on the sides of ABC need not be isoceles. The key condition is that the base angles of the three triangles be equal in pairs, as shown in Figure 1. Note that because o~, ]3, and ~/are arbitrary we have a three-parameter family of theorems; Kiepert's result is the special case where ~ = ]3 = 7. The result is so simple and
natural in its statement that one suspects it must have been noted long ago, but the historical trail seems to be murky. A recent reference is a "Classroom Note" by D. Kirby (Am. Math. Monthly, January 1980, pp. 45-47). Third G e n e r a l i z a t i o n The rest of the story is based on observations communicated by Clifford Gardner. First some terminology: The lines a and a' in Figure I are called isogonal lines with respect to vertex A, meaning that they are mirror images of each other with respect to the angle bisector at A. Analogously, instead of reflecting in the angle bisector, one can "reflect" in the medians of the triangle. More precisely, two lines through a vertex are called isotomic if they meet the opposite side in points equidistant from its midpoint. It turns out that the analogue of Theorem 1 also holds if the lines through the vertices are isotomic rather than isogonal. In fact the result is true for any triple of concurrent lines through the vertices A, B, and C in the following form. THEOREM 2. Let p, q, and r be concurrent lines through the vertices A, B, and C, respectively, of triangle ABC. Let PA be the pencil of lines at A and let TA be the (unique) projective mapping on PA which (1) interchanges lines AB and AC, (2) leaves p fixed. Define PB, Pc, and TB, Tc similarly. For any line a in PA let a ' = T A(a). Similarly, for b in PB and c in Pc, let b ' = TB(b), c' = Tc(c). Let C' = a' n b, B' = b' n c, and A ' = c' n a. Then AA', BB', and CC' are concurrent. Note that we now have a five-parameter infinity of theorems, for the coordinates of the point P can be chosen arbitrarily. Actually, Theorem 2 is an immediate consequence of
B:/
8
B'
A'
A a'
Figure 1
--~C,
b
Figure 2 THE MATHEMATICALINTELLIGENCERVOL. 18, NO. 1, 1996 33
Figure 3 Theorem 1, using the well-known fact that a projectivity of the plane can be defined arbitrarily on a n y four independent points. Thus, in particular, Figure 1 becomes Figure 2 by leaving vertices A, B, and C fixed and taking the in-center into any point P. The usefulness of the generalization, therefore, is not to include more cases but rather to simplify the proof. Namely, we make one more projective transformation which carries C to the origin, A to the point at infinity on the y axis, B to the point at infinity on the x axis, and P to the point (1, 1) as shown in Figure 3. Then lines p, q, and r become the lines x = 1, y = 1, and x = y, respectively, and we have C'=a'nb=(1/a,b), A'=b'nc=(1/bc, B' = c' n a = (a, a/c),
PA is the set of all vertical lines, PB is the set of all horizontal lines, Pc is the set of all lines through the origin.
thus, AA' has equation x = 1/bc, BB' has equation y = a/c, and CC' has equation y = (ab)x. Then AA' n BB' = (1/bc, a/c) which lies on the line CC'.
N o w recall that any one-dimensional projective transformation is of the form y = (ax § b)/(cx § d), so TA m a p s the lines x = a to the lines x = 1/a: this is the unique projectivity which interchanges the y axis and the line at infinity, leaving the line x = 1 fixed. We will denote the vertical line whose equation is x = a simply by a, the horizontal line y = b by b, and the line y = cx through the origin by c. Then with similar definitions of TB and Tc we have
a'=TA(a)=l, a
b' = T B ( b ) = 1 b
c'
Tc(c)
1 c
It is n o w a matter of simple high school analytic geometry to calculate the coordinates of the points A', B', and C' and the equations of the lines AA', BB', and CC' and verify that they are concurrent. Namely, 34
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
1/b),
Fourth (and Last) Generalization Observe that Theorem 2 is a theorem of projective geometry; this implies that Theorem 1 is "absolute," meaning that it holds in elliptic and hyperbolic as well as Euclidean geometry (in the hyperbolic case one makes the hypothesis that the points A', B', and C' exist). Note that this is not obvious even for the simplest special case, namely, the concurrence of the medians, which was the starting point of this exposition. The usual Euclidean proof makes strong use of the parallel postulate, as one needs the line connecting the midpoints of two sides of a triangle to be parallel to the third side. It can be shown using other methods that the median theorem also holds in the non-Euclidean cases. By contrast, theorem 2 covers all three cases directly.
Symmetries of Fractals Xiang Sheng and Michael J. Spurr
Introduction
Symmetries
The Mandelbrot set, M(2), pictured in Figure 1, is a remarkable subset of the complex plane C having an extremely complicated b o u n d a r y . This b o u n d a r y appears to enclose arbitrarily small miniature "copies" of the original set, making it v e r y jagged. This behavior is typical of a fractal, and, in fact, the H a u s d o r f f dimension of the b o u n d a r y of M(2) (which we take to be a measure of jaggedness) is 2 (see [5]). One also observes that M(2) has a reflective s y m m e try about the real axis. That is, if a complex n u m b e r c = x + iy is in M(2), then its complex conjugate conj(c) = = x - iy is also in M(2). We will be interested in finding all such symmetries (reflective or otherwise) of sets which are analogues of M(2), namely, the Generalized Mandelbrot sets M ( k ) which are pictured in Figures 1 and 2 for k = 2, 3, 4, and 5. First let us define M(2). Given c E C, we take iterations of the function Pc(z) := z 2 + c to obtain P~ = z,
Re-examine the pictures of M ( k ) in Figures I and 2. One sees several lines of apparent reflective s y m m e t r y through the center of each snowflake-like Generalized Mandelbrot set. There also appear to be'rotational symmetries: if M ( k ) is rotated through an angle of 2~r/(k - 1) it appears to exactly cover the original M ( k ) . These symm e t r y motions of the complex plane (reflections and rotations) lie in two groups of interest to us, namely, G, the g r o u p of rotations and reflections of C which preserve 0,
Pc(z)
= z 2 -ff c , Pc2(Z) = ( z 2 q- c ) 2 + c ,
PB(z)
= ( ( z 2 q- c ) 2 q- c ) 2
+ c, and so on: Pn(z) := Pc(pnc -1 (z)). The sequence {Pn(z)}n=0 is called the orbit of z (under iterations of Pc). The behavior of the orbit of 0, n
oo
{Pc(0)}n=0 = {0, c, c2 + c, (c2 + c)2 + c, ((c2 + c)2 + c)2 + c. . . . }, determines whether or not c belongs to the Mandelbrot set M(2), which is defined b y M(2) := {c E C I the orbit {pn(0)} remains bounded}. One defines the generalizations M ( k ) of the Mandelbrot set M(2) similarly. For each integer k ~ 2, let Pc,k(Z) = z k + c. The orbit of 0 u n d e r iterations of Pc,k is given by {pcn,k(0)}~=0 = {O,c,c k + C, (Ck + C)k + C. . . . }. The Generalized Mandelbrot set M ( k ) is M ( k ) := {c E C I the orbit {P~k(0)} remains bounded}. THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1 9 1996 Springer-Verlag New York 3 5
Figure 1: M(2) is the Mandelbrot set for z 2 + c. M(3) is the Mandelbrot set for z 3 + c. FL(2) is represented in white as the cardioid main body of M(2). CL(2) consists of the point (1/4, 0) at the pinch in the cardioid. FL(3) is represented in white as the main b o d y of M(3) and resembles an open "figure 8." CL(3) consists of the two points at the pinches in the "figure 8."
G := {~b I ~b = e iO or ~b = e iO. conj for some 0 E R}, and R R , the group which includes reflections (about any line in C) and rotations (about any point in C), R R := {61 ~b = th + t, where th E G and t ~ C}.
A n y motion of the plane C lies in RR. For instance, so q~ rotates z about the point 0 t h r o u g h an angle of 0 radians. Similarly, e iO. conj(z) = ei~ is a reflection about the line L = {rei0/2 I r E R}. If qb(z) = eiOz is a rotation in G with e iO ~ 1, then ~(z) = ei~ t in R R is a rotation about the point - t / ( e i~ - 1) through an angle of 0 radians; otherwise for e iO = 1, t~(z) = z + t is a translation. If th(z) = ei~ and if ~b(t) + t = 0, then t~(z) = ei~ + t is a reflection through the line { t / 2 + i A t / 2 I A E R} if t ~ 0; if t = 0, then ~b is just the reflection & If ~ z ) = z + t or if ~ z ) = ei~ + t with ~b(t) + t v~ 0, then iterations of ~btend to infinity. Finally, w e mention G is a subgroup of R R , characterized byt = 0. qb -- e i~ takes z to ei~
36
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
To discuss the s y m m e t r y motions of M(k), we define two types of symmetries of an arbitrary set S in C. We let the G-symmetries of S, Gsym(S), be those motions in G which preserve S: Gsym(S) := {~b ~ G I th(s) E S r
s ~ S}.
Gsym(S) is a subgroup of G. More generally, the set of symmetries of a set S, Sym(S), is the set of motions in R R which preserve S: Sym(S) := {~b~ R R I ~ s ) E S r
s ~ S}
and Sym(S) is a subgroup of R R . The s y m m e t r y motions of M ( k ) a p p e a r to be precisely the same as the s y m m e t r y motions of a regular polygon having k - 1 sides and the center at the origin. These are rotations about 0 t h r o u g h any multiple of 2 r c / ( k - 1), and reflections about the lines through 0 forming an angle with the positive real axis which is an
Figure 2: M(4) is the Mandelbrot set for z 4 + c. M(5) is the Mandelbrot set for z 5 + c. FL(4) is represented in white as the main body of M(4). FL(4) resembles a 3-leaf clover or a flower with 3 petals. CL(4) consists of the 3 points at the pinches. FL(5) is represented in white as the main body of M(5). FL(5) resembles a 4-leaf clover or a flower with 4 petals. CL(5) consists of the 4 points at the pinches.
integer multiple of vr/(k - 1). For instance w h e n k = 5, then k - 1 = 4, and our regular polygon is taken to be a square as pictured in Figure 3. It appears that M(5) exhibits exactly the same s y m m e t r y properties as the square. The symmetries of a regular polygon with k - 1 sides described above form a subgroup of G called the dihedral group Dk-1, defined as { ~ = e2 ~ m / ( k - 1 ) o r
q5 =
e2 ~ m / ( k
1). conj I m
E
Symmetries
Form Dihedral
Gsym(M(k)) = Dk 1. T H E O R E M 2. For k ~ 3, the group of symmetries of the Mandelbrot set M(k) is the dihedral group Dk-l: Sym(M(k)) = Dk 1.
Z}.
In the Spring 1992 Mathematical Intelligencer Alexander, Giblin, and N e w t o n [1] s h o w e d that every motion in Dk-1 is a s y m m e t r y of M(k), just as one is intuitively led to believe b y examining the pictures in Figures 1 and 2. T h e y also conjectured that e v e r y s y m m e t r y of M(k) must belong to the dihedral g r o u p Dk-1. We will n o w show w h y this is indeed true. Mandelbrot
Mandelbrot set M(k) is the dihedral group Dk-l:
Groups
The main results on symmetries of M(k) are as follows: T H E O R E M 1. For k ~- 2, the group of G-symmetries of the
To our knowledge, Theorem 1 first a p p e a r e d in [3]. We sketch the ideas behind Theorems 1 and 2, but we refer the reader to [4] for the full proof of these theorems (and others mentioned). A key idea will involve finding points on the b o u n d a r y OM(k) of M(k) which lie closest to 0. (The partial solution by Alexander, et al. relies on points furthest from 0.) The fixed points q of Pc,k(Z) = z k + c which have derivative P~,k(q) of n o r m less than 1 are called attracting fixed points of Pc,k(Z). We wish to describe all c which have attracting fixed points. Using the fixed-point condition, we see that
qk + C = q, THE MATHEMATICALINTELLIGENCERVOL. 18, NO. 1, 1996 3 7
To indicate the d e p e n d e n c e of c on r and a w e denote it b y c = Cr,,,. We then define FL(k) b y
FL(k) : = {Cr, a [ a E [0, 2 ~ k - 1)[ and r < 1}. FL(k) is r e p r e s e n t e d in Figures 1 a n d 2 as the white flower-like m a i n b o d y of M(k). As an example, FL(2) is the cardioid m a i n b o d y of the M a n d e l b r o t set M(2). For higher values of k, FL(k) begins to resemble a flower: FL(3) looks like an o p e n "figure 8," FL(4) like a 3-leaf clover, a n d FL(5) like a 4-leaf clover. There are several reasons w h y w e are interested in FL(k). First, one can s h o w FL(k) lies inside M(k) (see [4]). Second, it is an o p e n set [because the analytic m a p p i n g f(q) = q _ qk takes the disk of radius (1/k) 1/(k-l) to FL(k)]. But m o s t importantly, w e h a v e s h o w n [4] that the b o u n d a r y OFL(k) = {c1,~ [ a E [0, 2 ~ k - 1)]} of FL(k) has precisely the s a m e set of points closest to 0 as does OM(k), the b o u n d a r y of M(k)! This follows after showing Cl,0 is in OM(k) a n d observing that
[Cl,c~[2 =
( k ) 1/(k-1) ei~
(1-keia)
2
= (k)2/(k-l) I1 -- 2k COS(~ q- ~21
~ (k)a/(k-1) (1 - k)2 = [cl,ol2,
Figure 3: The square w i t h sides parallel to the axes has symmetry motions in the dihedral group D4. M(5) has these same s y m m e t r y motions.
with equality holding precisely w h e n a = 2ran for m in Z. Thus, the set of closest b o u n d a r y points CL(k) con(1- 1/k)= sists of the points (1/k) 1/(k-1) r ei2cnn/(k-1)Cl,0 for m in Z. Refer to Figures 1 and 2 to see CL(k) as the set of "pinch points" in OFL(k). N o w if ~b = eiO or 6 = ei~ conj is a G - s y m m e t r y of M(k), then q~ m u s t also be a G - s y m m e t r y of the b o u n d ary 3M(k). Because ~bpreserves distance to 0, b takes the set of closest points CL(k) to itself. This implies that 6(Cl,0) = ei~ = ei2mn/(k-1)Cl,0 . W e conclude that ~b = e i2mn/(k-1) or q~ = ei2~-m/(k 1) . conj, a n d that a n y G-symm e t r y of M(k) m u s t lie in the dihedral g r o u p Dk-1. Let 6 = ~b + t be a m o t i o n in Sym(M(k)). We will s h o w that 0 lies in Gsym(M(k)) = Dk-1 b y s h o w i n g that t = 0. If 6 = ~b + t with 6 = ei~ conj, t h e n q~(t) + t = 0,
a n d using the condition that the derivative P',k(q) h a v e n o r m less than 1, w e h a v e
U s i n g both of these equations, w e can solve for c:
for if ~b(t) + t ~ 0, then ~ ( z ) a p p r o a c h e s infinity a n d ~b cannot preserve the b o u n d e d set M(k). N o w ~bo ei2mn/(k-1) = eiOe-i2mn/(k-1) . conj q- t also belongs to Sym(M(k)). W e conclude that
c = q - qk = q(1 -- qk-1)
ei~ + t = 0
kqk-1 = rei,~ with r < 1.
(1 for a ~ [0, 2r
- 1)] a n d r < 1.
38 THEMATHEMATICALINTELLIGENCERVOL.18,NO.1, 1996
and
eiOe-i2,mn/(k 1)~-q_ t = 0.
M(g) := {c E C I for e v e r y critical point r of g,
Therefore
el~
(t)
- e -i2mn/(k-1))
the orbit {fn(r)} of r r e m a i n s bounded}.
= O.
W h e n k m 3, (t) can o n l y be satisfied w h e n t = 0 = t. So ~b = ~b + 0 belongs to Gsym(M(k)). If ~b = q~ + t with c~ = eiO, then ~bo conj = e iO. conj + t also belongs to Sym(M(k)) a n d w e get t = 0 as above. So e v e r y s y m m e t r y of M(k) is a G - s y m m e t r y of M(k) a n d so lies in Dk 1, giving Sym(M(k)) = Dk-l! N o t e that w h e n k = 2, (1 -- e -i2mn/(k-1)) = 0 in (t) and w e cannot conclude that t=0.
N o t e that w h e n g(z) = z k, then g'(z) = k z k - 1 = 0 has only the solution 0 = r as the critical point, and the above definition of M(z k) reduces to the original definition of
M(k). W h a t are the s y m m e t r i e s of M(g) for a general polynomial g? This is a m o r e difficult a n a l o g u e of the original question p o s e d in [1] asking for the s y m m e t r i e s of M(k). The a n s w e r is p e r h a p s m o r e complicated! If N
g(z) = Z
Generalizations
any "Znj
j=l J
If one studies iterations of the p o l y n o m i a l az k + c, one also obtains a generalized M a n d e l b r o t set M(az k) := {c E C I the orbit of 0 u n d e r iterations of az k + c r e m a i n s bounded}. The s y m m e t r i e s of M(az k) f o r m a "twisted" dihedral g r o u p D(k - 1, a) as described below. T H E O R E M 3. Given a = ReiL then for k ~ 2,
Gsym(M(azk)) = D(k - 1, a), where
D(k - 1,
e [2~m-2iy]/(k-1)
a)
:=
{~b = e2wim/(k-1)
is a p o l y n o m i a l w i t h any E C \ { 0 } a n d g ( 0 ) = 0, then w e have s h o w n that G s y m ( g ) = A N I D ( n j - 1, anj)= n ~/=1 Gsym(anj znj) (see [4]). There are s o m e technicalities in defining D(0, a) but for our p u r p o s e s it will suffice to take D(0, a) to be G for a real a n d positive. As an example, let g(z) = 2z 3 + 3z 2 + 2z. Then Gsym(2z 3 + 3z 2 + 2z) = D(3 - 1, 2) N D(2 - 1, 3) fir D(1 - 1, 2) = D 3 _ 1 n D 2 - 1 n G = D 1 . W e concluded that Gsym(2z 3 + 3z 2 + 2z) = {z, 7}.
or
ch =
9 c o n j I m = 0, 1, 2, . . . , k - 2}. Also for
k>_3, Sym(M(azk)) = D(k - 1, a). This result can be d e d u c e d f r o m T h e o r e m s I a n d 2 b y m a k i n g the change of variables z = (1/a)l/(k-1)u. A key algebraic p r o p e r t y , that of c o m m u t a t i v i t y of m o t i o n s with a p o l y n o m i a l , drives the inductive argum e n t s which s h o w m a n y of the g r o u p c o n t a i n m e n t s w e h a v e mentioned. The algebraic s y m m e t r i e s of a polynomial g(z) with g(0) = 0 are those motions w h i c h comm u t e with g(z):
So conjugation and the identity are the only G - s y m m e tries of 2z 3 + 3z 2 + 2z. We show, h o w e v e r , that Sym(2z 3 + 3z 2 + 2z) is strictly larger t h a n Gsym(2z 3 + 3z 2 + 2z). Certainly, Sym(2z 3 + 3z 2 + 2z) contains the identity a n d conjugation, but it also contains the m a p p i n g ~(z) = - z - 1 because g ( ~ z ) ) = ~Kg(z)): 2 ( - z - 1) 3 + 3 ( - z - 1) 2 + 2 ( - z - 1) = - ( 2 z 3 + 3z 2 + 2z) - 1. The two m o t i o n s ~ and - z Sym(2z 3 + 3z 2 + 2z). We h a v e
1 generate the g r o u p
Sym(2z 3 + 3z 2 + 2z) = {z, ~, - z - 1, - z -
1}.
Gsym(g) := {c~ ~ G I g(ch(z)) = ~b(g(z)) for all z E C}, Sym(g) := {~bE RR I g(~(z)) = ~K~(z)) for all z E C}. Both Gsym(g) and Sym(g) are groups. W e observe that every element of Dk-1 is in G s y m ( z k) because (e2"a'im/(k-1)Z)k = e2"a'irn/(k-1)Z k a n d (e 2 ~ m / ( k 1)~)k = e 2 ~ m / ( k - 1 ) Z k. It is s t r a i g h t f o r w a r d to c o m p u t e b o t h G s y m ( z k) and Sym(z k) a n 4 see that they equal Dk-1 for k m 2. Theorems 1 and 2 give that, for k m 3, G s y m ( z k) = Sym(z k) = Dk-1 = Gsym(M(k)) = Sym(M(k)). Based on this fact, one might conjecture similar behavior for m o r e general polynomials. H o w e v e r , s o m e care m u s t be taken. First, if g(z) is a n y p o l y n o m i a l with g(0) = 0, w e let fc(Z) = g(z) + c. The M a n d e l b r o t set M(g) associated to g(z) is defined as
N o w let us c o m p a r e our algebraic s y m m e t r i e s for 2z 3 + 3z 2 + 2z c o m p u t e d in the p r e c e d i n g p a r a g r a p h with the geometric s y m m e t r i e s for the M a n d e l b r o t set M(2z 3 + 3z 2 + 2z). By examining M(2z 3 + 3z 2 + 2z) in Figure 4(a), w e see that Sym(M(2z 3 + 3z 2 + 2z)) should consist of: reflection a b o u t the x-axis (~), reflection a b o u t the y-axis ( - ~ ) , rotation about 0 b y vr ( - z ) , and the identity (z). W e will s h o w that these are, in fact, elements of Sym(M(2z 3 + 3z 2 + 2z)) in T h e o r e m 4. But m a t t e r s are complicated. The geometric s y m m e tries Sym(M(2z 3 + 3z 2 + 2z)) cannot m a t c h either of our algebraic s y m m e t r y g r o u p s Gsym(2z 3 + 3z 2 + 2z) or Sym(2z 3 + 3z 2 + 2z). For instance, - z belongs to Sym(M(2z 3 + 3z 2 + 2z)) but not to Gsym(2z 3 + 3z 2 + 2z), so Gsym(2z3 + 3z 2 + 2z) is strictly contained in THE MATHEMATICALINTELLIGENCERVOL. 18, NO. 1, 1996 39
F i g u r e 4: (a) M ( 2 z 3 + 3 Z 2 + 2z), the M a n d e l b r o t s e t for g(z) = 2 z 3 + 3Z 2 + 2z. &(z) = - z - 1 is a s y m m e t r y o f g, h e n c e ~b(z) = - z is a s y m m e t r y o f M ( 2 z 3 + 3Z 2 + 2z). 0 is the c e n t e r of M ( 2 z 3 + 3 z 2 + 2z). (b) T h e f i l l e d - i n Julia set for 2Z 3 + 3Z 2 + 2z. Jo is the b o u n d a r y o f the f i l l e d - i n Julia set. Jo h a s s y m m e t r i e s : ~ z ) = - z - 1, c o n j u g a t i o n (and h e n c e r e f l e c t i o n t h r o u g h the l i n e x = - 1/2).
Sym(M(2z 3 + 3z 2 3- 2z)). But also one can s h o w that the algebraic s y m m e t r y &(z) = - z - 1 in Sym(2z 3 + 3z 2 § 2z) does not lie in the geometric s y m m e t r y g r o u p Sym(M(2z 3 + 3z 2 + 2z)). This follows because 0 belongs to M(2z 3 § 3z 2 § 2z) but ~ 0 ) = - 1 d o e s not belong to M(2z 3 + 3z 2 + 2z). So Sym(2z 3 + 3z 2 + 2z) Sym(M(2z 3 + 3z 2 3- 2z)). W e are left to h u n t for another algebraic candidate to give Sym(M(g)). We propose
S(g) := {q~ E G I 3 6 = ~b + t E Sym(g)}, w h i c h is obtained b y taking the "G c o m p o n e n t " ~b of each algebraic s y m m e t r y ~b = ~b + t. Note that for g(z) = 2z 3 3- 3z 2 3- 2z, we h a v e Sym(2z 3 + 3z 2 + 2z) = {z, E, - z - 1, - E - 1}. Thus, S(2z 3 3- 3z 2 3- 2z) = {G-component(z), G-component(E), G - c o m p o n e n t ( - z - 1), and Gc o m p o n e n t ( - E - 1 ) } . So S(2z 3 3- 3z 2 + 2z) = {z, E, - z , -E}. These are precisely the geometric s y m m e t r i e s w e s a w for Sym(M(2z 3 + 3z 2 + 2z)) in Figure 4(a). W h a t we can p r o v e is:
point of g(z) is equivalent to ~(r) being a critical point of g(z). N o w fC~(c)(~z)) = g ( ~ z ) ) + 4~(c) = tldg(z)) + 4~(c) = ch(g(z)) + t + oh(c) = ch(g(z) + c) + t = ~b(fc(z)), and b y induction f~(c)(~(z))= ~fc(z)), k Vk E N. Therefore, {fck(r)} remains b o u n d e d for all critical points r of g (qdfk(r))} remains b o u n d e d for all critical points r of g 9 there do not exist any more rational points . . . . " A s far as the configurazioni poligonali misti are concerned, he remarks that he has shown that Z / n Z cannot occur when n = 20 and that therefore n = 5(2k) cannot occur for any k - 2. He has shown that Z/2nZ does not occur for n = 7 and "it
is probable that they do not exist either for larger odd values of n." It is not difficult to see that these conjectures already imply that the above list for the groups of the type Z / n Z • Z / 2 Z should be complete: "Such configurations exist for n = 2, 4, 8, 6, but not for n = 12 and 10; one can argue that they do not exist for larger values of n." Apart from the group Z/24Z which he does not mention, this means precisely that Beppo Levi believed the list(*) to be complete. More than 40 years later T. Nagell made the same conjecture [Nagell 1952a]. s In our days the conjecture became widely known as Ogg's conjecture, after Andrew Ogg, who formulated it 60 years after Beppo Levi. The problem studied by Beppo Levi and later by Billing, Mahler, Nagell, and Ogg has been very important in the development of arithmetic algebraic geometry. In the years following 1970, rapid progress was made by invoking the arithmetic theory of modular curves. The cases n =-11, i5, and 24 had ~lready been taken care of before the fifties [Billing and Mahler]; [Nagell 1952b]. (It is touching to read Beppo Levi's review of [Billing and Mahler 1940] in Mathematical Reviews where he recognizes the equation of X1(11) that he had already published in 1908 but was unable to solve completely at the time.) Ogg shows that n = 17 does not occur, in his important paper [Ogg 1971] where the connection between this problem and the theory of modular curves is spelled out. Ligozat [1975] and Kubert [1976] take care of several small values of n. In 1973 Mazur and Tate [1973] show that there do not exist rational points of order n = 13 on elliptic curves over Q. This exceptional case is eliminated by means of a 19-descent (!), performed in the language of flat cohomology, on a curve of genus 2. Finally in 1976, Barry Mazur proves the conjecture. It is a consequence of his careful study of the modular curves X0(n), which are closely related to the curves Xl(n). His paper [Mazur 1977] is a milestone. The techniques developed therein are basic in the proof of the main conjecture in Iwasawa theory by Mazur and Wiles [1984], Ribet's reduction [Ribet 1990] reducing Fermat's Last Theorem to the Conjecture of Taniyama, Shimura, and Weil and Wiles' recent proof of this conjecture. The latest development concerning torsion on elliptic curves is Lo~c Merel's proof (Spring 1994) of the general boundedness conjecture. This is the statement that the K-rational torsion of any elliptic curve defined over a field K of degree d over Q is bounded in terms of d alone. Beppo Levi's conjecture is an explicit version of the special case d = 1 of this new theorem. Merel's proof builds upon Mazur's work and subsequent refinements by S. Kamienny, combining them with other recent results in the arithmetic of elliptic curves. SWe t h a n k P r o f e s s o r A. S c h i n z e l for b r i n g i n g t h i s p a p e r to o u r a t t e n tion. THE MATHEMATICALINTELLIGENCERVOL. 18, NO. 1, 1996
65
Figure 5. Barry Mazur. ParmamBolognamRosario Having treated the first 10 years of Beppo Levi's professional life rather extensively, we will be much shorter with the remaining 50 (!) years. This half century from 1908 to the end of the 1950s falls naturally into three periods: almost 20 years in Parma, 10 years at Bologna, and a good 20 years in Argentina. In 1909 Beppo Levi married Albina Bachi. She was from the town of Torre Pelice in Piemonte, the alpine north west of Italy, as was Beppo Levi. He had started visiting his future in-laws in 1906, the year of his nomination at Cagliari. Three children, Giulio, Laura, and Emilia, came from the marriage. At the end of 1910 the family left Cagliari: Beppo Levi was appointed at the university of Parma, on Italy's mainland. He stayed there until 1928. Among his uninterrupted production (increasingly also on questions of mathematics teaching), there is one remarkable number-theoretic contribution from this Parma period: a paper on the geometry of numbers in Rendiconti del Circolo Matematico di Palermo [Levi 1911] where he claimed to give a proof of a conjecture of Minkowski's concerning critical lattices in R n. However, [Keller 1930] mentions a letter of Beppo Levi in which he acknowledged a gap in his proof. A complete proof of this result was given only in the 1940s [Haj6s 1942]. It was in Parma that the Levis lived through World War I and the ensuing political transformation of Italy and of Europe. A reflection of these events--immediately painful for Beppo Levi through the death of two 66
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
of his younger brothers---can be found in his remarkable speech (11 January 1919) at Parma University for the opening of the academic year 1918-1919, on "Nations and Humanity" [Levi 1919]. Another nonresearch publication of his from roughly the same period is Abacus from One to Twenty [Levi 1922], a booklet conceived and illustrated by Beppo Levi himself. It is designed to introduce children to the first numbers and elementary arithmetic operations. In the explanatory notes at the end of the booklet the author sketches his general ideas about the concept of number as being the fundamental example of mathematical representation, distinct from the process of counting, as well as about teaching elementary arithmetic. In the 1920s, the first decade of Mussolini's rule, Beppo Levi was explicitly antifascist. He signed the Croce manifesto in 1925. Around that time his situation at Parma University became increasingly difficult because more and more disciplines had to be suppressed for budgetary reasons. In the end, the sciences were reduced to chemistry, and Levi was the only professor left at the mathematics department. It therefore came as a great relief when he obtained his transfer to Bologna-a town with a traditionally famous university--at the end of 1928, after all obstacles to his nomination there had finally been overcome. While in Bologna he held various posts in the Italian Mathematical Society (U.M.I.), and took care of the Bolletino dell'Unione Matematica Italiana for many years. It may have been through correspondence related to a paper submitted to this journal that Beppo Levi first entered into contact with a mathematician from Argentina. In spite of his personal opposition to fascism, Beppo Levi took the oath to fascism in 1931, like most other Italian mathematicians. This oath was generally considered a mere formality; even the church held that it was a legitimate claim by the government for obedience. The mathematician Levi-Civita added a private reservation, and the government showed that it was quite prepared to accept the substance without the form. Of roughly 1200 professors, only 11 refused to sign. The 71year-old Vito Volterra was one of them. Volterra, by the way, stayed in Italy, where he died in 1940--so the SS car that came to his house in 1943 to deport him had to leave empty . . . . It was only after the rapprochement between Hitler and Mussolini, in 1938, that Italian fascism adopted some of the racial policies of the Nazis which at that time were building toward their monstrous climax in Germany and German-controlled Europe. Thus LeviCivita, Beppo Levi, and a total of 90 Italian Jewish scholars lost their jobs in 1938, and most of them had to start looking for a country of refuge. Beppo Levi was 63 years old when he lost his professorship in Bologna. At age 64 he started as the director of the newly created mathematical institute at the Universidad del Litoral in Rosario, Argentina.
The founding of this institute at Rosario, upstream from Buenos Aires, 6 took place at a time of cultural expansion of several provincial Argentinian cities, mainly Rosario, C6rdoba, and Tucum~n. A relative prosperity helped in the development of more substantial groups of professionals, mainly lawyers, medical doctors, and engineers, who promoted local cultural activity in these cities and invited leading intellectuals and artists from Buenos Aires to lecture or visit there. These professionals were financially better off, and their clients were richer yet. Societies, orchestras, art galleries, and publishing houses began to emerge in this period in Rosario. The official opening ceremony of the mathematical institute in Rosario was held in 1940. Lectures were delivered by Cort6s PlY, Rey Pastor, and Beppo Levi. These two men had been the key to Beppo Levi's arrival in Argentina. PI~ was an engineer who taught physics and had an active interest in the history of science. PI~ was a friend and admirer of Rey Pastor, the Spanish mathematician who founded the Argentine mathematical school.
6We are grateful to Eduardo L. Ortiz, London, for the information on Argentina contained in the following paragraphs. For the opening of the Rosario institute, see Publicaciones del Instituto de Matemdticas de la Universidad Nacional del Litoral 35 (1940); cf. E. Ortiz (ed.), The works of Julio Rey Pastor, London 1988.
Figure 6. Beppo Levi: the Argentine years.
Figure 7. Beppo Levi (front row, fourth from left) at a meeting of the Argentine Mathematical Union, 1948.
THEMATHEMATICALINTELLIGENCERVOL.18,NO.1,1996 67
Beppo Levi was extremely active in Rosario, and the n e w responsibilities w h i c h he happily e m b r a c e d right f r o m the v e r y beginning of the job gave m o r e satisfaction to his life in emigration than one w o u l d expect. 7 Apart from organizing and managing the Institute (assisted by Luis A. Santal6) he f o u n d e d and edited a journal and a book series of his institute. The journal app e a r e d for the first time already in 1939, and as of 1941 was called Mathematica~ Nota~ (Boletin dej Instituto de Matematica). Roughly one-third of Beppo Levi's publications are in Spanish. These are his papers from the Argentinian period, m a n y of which a p p e a r e d in Mathematica~ N o t x . Beppo Levi continued teaching at Rosario until the age of 84. In 1956 (shortly before he turned 81) he was a w a r d e d the Italian Premio Feltrinelli. Unfortunately the official text of the prize committee [Segre 1956] shows a somew h a t uncertain appreciation of some of Beppo Levi's works. At the end of the evaluation the committee of this prize for Italian citizens congratulates itself that Beppo Levi has highly h o n o r e d the n a m e of Italy by his w o r k in A r g e n t i n a . . . Beppo Levi died on 28 August 1961, 86 years old, in Rosario, where his institute is n o w n a m e d after him. He was probably the shortest mathematician in o u r century, with the longest professional activity. Acknowledgments We w o u l d like to thank all colleagues and friends w h o gave us hints or sent us documents while w e were preparing this article. We are particularly indebted to Professor Salvatore Coen, Bologna, for freely sharing his extensive k n o w l e d g e of Beppo Levi's life and w o r k with us, and for making available some of the less accessible publications of Beppo Levi; and to Dr. Laura Levi, Buenos Aires, for a most interesting correspondence which conveyed a vivid impression of the personality of her father. Thanks to R a y m o n d Seroul, Strasbourg for drawing the figures on his computer. References Billing, G. and Mahler, K. (1940): On exceptional points on cubic curves, J. London Math. Soc. 15, 32-43. Bourbaki, N. (1974): Eldments d'histoire des mathdmatiques, Collection histoire de la pens6e, Paris: Hermann. Coen, S. (1991): Geometry and complex variables in the work of Beppo Levi; in Geometry and Complex Variables, Proc. Int. Meeting on the Occasion of the IX Centennial of the University of Bologna (S. Coen, ed.), New York: Marcel Dekker 111-139. Coen, S. (1994): Beppo Levi: la vita; in Seminari di Geometria 1991-1993, Universitt~ degli Studi di Bologna, Dipart. Math., 193-232. 7This is reflected already in some of the first letters back to relatives in Italy. The same letters also show that his wife found her life rather more difficult in Rosario. We owe this information to Dr. Laura Levi. 68
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
Deny, J. and Lions, J.L. (1953): Les espaces du type de Beppo Levi, Ann. Inst. Fourier 5, 305-370. Dieudonn6, J. (1974): Cours de gdomdtrie alge'brique, 1: Aperr historique sur le developpement de la gdomdtrie alge'brique, Paris: PUF. Dieudonn6, J. et al. (1978): Abrdgd d'histoire des mathdmatiques 1700-1900, tone II, Paris: Hermann. Fraenkel, A. and Bar-Hillel, Y. (1958): Foundations of Set Theory, Amsterdam: North-Holland. Fubini, G. (1907): Sugli integrali multipli, Rend. Acc. Lincei 16, 608-614. Gario, P. (1988): Histoire de la r6solution des singularit6s des surfaces alg6briques (une discussion entre C. Segre et P. del Pezzo), Cahiers Sdmin. Hist. Math. Paris 9, 123-137. Gario, P. (1992): Singolarita e geometria sopra una Superficie nella Corrispondenza di C. Segre a G. Castelnuovo, Arch. Hist. Ex. Sci. 43, 145-188. Goldstein, C. (1993): Preuves par descente infinie en analyse diophantienne: programmes, contextes, variations, Cahiers Sdmin. Hist. Math. Inst. Henri Poincard 2(3), 25-49. Haj6s, G. (1942): Uber einfache und mehrfache Bedeckung des n-dimensionalen Raumes mit einem Wfirfelgitter, Math. Zeitschr. 47 427-467. Hawkins, T. (1975): Lebesgue's Theory of Integration, Its Origins and Developments, 2nd ed., New York: Chelsea. Hironaka, H. (1962): On resolution of singularities (characteristic zero), in Proceedings of the International Congress of Mathematicians 1962, Djursholm, Sweden: Institut MittagLeffier, pp. 507-521. Hurwitz, A. (1917), Uber tern~ire diophantische Gleichungen dritten Grades, Vierteljahresschrifl der Naturforschenden Gesellschafi in Zfirich, 62, 207-229 (Math. Werke II, 446-468). Kazan (1906): Izvestiya fiziko-matematicheskogo obshtshestvo pri imperatorskom kazanskom universitet (Bulletin de la Socidtd physico-mathdmatique de Kasan) (2) 15, 105-118. Keller, O.-H. (1930). Uber die lfickenlose Erffillung des Raumes mit Wfirfeln. J:. Reine Angew. Math. 163, 231-248. Kiepert, L. (1888-1890): Uber die Transformation der elliptischen Funktionen bei zusammengesetztem Transformationsgrade, Math. Ann. 32 (1888), 1-135; Uber gewisse Vereinfachungen der Transformationsgleichungen in der Theorie der elliptischen Funktionen, Math. Ann. 37 (1890), 368-398. Kubert, D. (1976): Universal bounds on the torsion of elliptic curves, Proc. London Math. Soc. 33, 193-237. Lebesgue, H. (1906): Sur les fonctions d6riv6es, Rend. Acc. Lincei (5) 152, 3-8. Lebesgue, H. (1991): Correspondance avec Emile Borel, Cahiers Sdmin. Hist. Math. Paris 12. Levi, B. (1897a): Sulla riduzione dei punti singolari delle superficie algebriche dello spazio ordinario per trasformazioni quadratiche, Ann. Mat. Pura Appl. 26, 218-253. Levi, B. (1897b): Risoluzione delle singolarita puntuali delle superficie algebriche, Atti Reale Acc. Sci. Torino 33, 66-86. Levi, B. (1902): Intorno alla teoria degli aggregati, Rend. Reale Istit. Lombardo Sci. Lett. 35, 863-868. Levi, B. (1906a): Sopra l'integrazione delle serie, Rend. Reale Istit. Lombardo Sci. Lett. 39, 775-780. Levi, B. (1906b): Sul Principio di Dirichlet, Rend. Circ. Mat. Palermo 22, 293-360. Levi, B. (1906-1908): Saggio per una teoria aritmetica delle forme cubiche ternarie, Atti Reale Acc. Sci. Torino 42 (1906), 739-764, 43 (1908), 99-120, 413-434, 672-681. Levi, B. (1909): Sull'equazione indeterminata del 3~ ordine, in Atti del IV Congresso Internazionale dei matematici, Acc. dei Linc., Roma 1908 2, 175-177. Levi, B. (1911): Un teorema d el Minkowski sui sistemi di forme lineari a variabili intere, Rend. Circ. Mat. Palermo 31,318-340.
Levi, B. (1919): Nazioni e Umanit?l, Parma: E. Pelati. Levi, B. (1922): Abaco da uno a venti, Parma, B. Levi editore (60 pages, of which 44 illustrated). Levi, B. (1923): Sui procedimenti transfiniti (Auszug aus einem Briefe an Herrn Hilbert). Math. Ann. 90, 164-173. Levi, E.E. (1959-1960): Opere, 2 vol., ed. Cremonese: Unione Matematica Italiana (a cura di M. Picone). Ligozat, G. (1975): (Gourbes modulaires de genre 1. Thbse, Bull. Soc. Math. France, Mdmoire 43. Mazur, B. (1977): Modular curves and the Eisenstein ideal, Publ. Math. IHES 47, 33-186. Mazur, B. and Tate, J. (1973): Points of order 13 on elliptic curves, Invent. Math. 22, 41-49. Mazur, B. and Wiles, A. (1984): Class fields of abelian extensions of Q, Invent. Math. 76, 179-330. Moore, G. H. (1982): Zermelo's Axiom of Choice, Its Origins, Development, and Influence, Berlin: Springer-Verlag. Moss, B. (1979): Beppo Levi and the Axiom of Choice, Hist. Math. 6, 54-56. Nagell, T. (1952a): Problems in the theory of exceptional points on plane cubics of genus one; Den llte Skandinaviske Matematikerkongress, Trondheim 1949, Oslo: J. Grundt Tanums Forlag, pp. 71-76. Nagell, T. (1952b): Recherches sur l'arithm6tique des cubiques planes du premier genre dans un domaine de rationalit6 quelconque, Nova Acta Re. Soc. Sci. Upsaliensis, 15, 1-66. Neumark, M.A. (1959): Normierte Algebren, in Hochschulb/icher ffir Mathematiker 45, Berlin (VEB Verl. Wiss.). Nikodym, O. (1933): Sur une classe de fonctions consid6r6e dans l'6tude du probl6me de Dirichlet, Fund. Math. 21, 129-150. Ogg, A. (1971): Rational points of finite order on elliptic curves, Invent. Math. 22, 105-111. Parikh, C. (1991): The Unreal Life of Oscar Zariski, Boston: Academic Press.
Poincar6, H. (1901), Sur les propri6t6s arithm6tiques des courbes alg6briques, J. Math. 7,161-233 (CEuvres V, 483-550). Riesz, F. (1934): Zur Theorie des Hilbertschen Raumes, Acta Sci. Math. Szeged 7, 34-38. Ribet, K.A. (1990): On modular representations of Gal(Q/Q) arising from modular forms, Invent. Math. 100, 116-139. Schappacher, N. (1990): D6veloppement de la loi de groupe sur une cubique, in Sdminaires de Thdorie des Nombres, Paris 1988-1989, Boston: Birkh/iuser. Segre, B. (1956): Relazione per il conferimento del Premio "Antonio Feltrinelli" per la matematica per il 1956, riservato a cittadini italiani, di L. 1.500.000. Commisione: Francesco Severi (Presidente), Giu'lio Krall, Modesto Panetti, Mauro Picone, Enrico Pistolesi, Giovanni Sansone, Beniamino Segre (Relatore), Atti Acc. Lincei, Vol. 29, 663--664. Terracini, A. (1963): Commemorazione del Corrispondente Beppo Levi, Rend. Acc. Lincei 34, 590-606. Zariski, O. (1935): Algebraic Surfaces, Berlin: Springer-Verlag. Zariski, O. (1939): The reduction of the singularities of an algebraic surface, Ann. Math. 40, 639-689. Zermelo, E. (1904): Beweis, dat~ jede Menge wohlgeordnet werden kann, Math. Ann. 59, 514-516. U.F.R. de mathdmatique et informatique Universitd Louis-Pasteur 7 rue Rend Descartes F-67084 Strasbourg Cedex, France e-maih
[email protected] Dipartimento di Matematica 2 a Universitdt di Roma "Tor Vergata" 1-00133 Roma, Italy e-maih
[email protected] THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
69
Jet Wimp*
The Interpretation of Quantum Mechanics by Roland Omnes
phenomena. It was therefore assumed, by analogy, that an underlying medium, called the "ether," was necessary to support the propagation of electromagnetic Princeton NJ: Princeton University Press, 1994, 550 pp. waves. It was then possible to estimate the properties US $39.50 (softcover); US $95.00 (hardcover) of this medium. Estimates revealed very bizarre properties (more solid than steel, as unresisting as a vacuum). Reviewed by Robert Gilmore All attempts to measure its properties were trumped by The 20th century has witnessed two major revolutions mysterious effects which "exactly countered" the effect in physics that have radically changed our understand- being measured. The most famous experiment to proing of the world in which we live. These are the Theories duce a null result was the Michelson and Morley failure (1887) to measure the speed of the earth through of Relativity and of Quantum Mechanics. The Special Theory of Relativity has forced us to re- this medium. "Etherists" advanced a number of ingealize that space and time are not independent but rather nious explanations for the null result of the experiment. observer-dependent projections of a four-dimensional These ideas were all swept away by Einstein's radical continuum, space-time. This has generated a number of reformulation of the problem, in which he took as axparadoxes. These are predictions of the theory which iomatic that c (the speed of light) is the same in all inrun counter to our human intuition developed from ex- ertial reference frames. The consequences of this reforperience at speeds much less than the speed of light. We mulation were equally as bizarre as the properties of the have had to learn to live with these "paradoxes," be- "ether": "Ether" was a superfluous concept; one had to give up the idea of simultaneity; time was not absolute; cause all which have been tested have been verified. Quantum Mechanics (QM) has forced us to revise our one had to relinquish the idea of space and time as sepinterpretation of microscopic phenomena. The problem arate entities and replace them by a space-time continis that there was no consensus on the proper interpre- uum. Many consequences of the new theory were "paratation of QM when it was developed (-1925), nor is doxical" but, in the long run, insights obtained by abandoning the analogy between electromagnetic and there any consensus now. The source of the interpretation problem can be stated sound waves far outweighed the difficulties. We face the same problem with QM. Our interpretasimply. We develop theories from data and they must provide predictions consistent with experiment. tion depends on analogy. Our analogy involves shrinkHowever, we often interpret new theories by analogy ing macroscopic concepts down to the microscopic level. with older ones. Analogy can be very useful, but it can There are two principal concepts on which the analogy sometimes be a bad guide. Physicists have made bad is built: particles and fields. Particle motion, described analogies in the past; we are at present guilty of draw- by classical mechanics, is properly the quantum meing poor analogies in interpreting QM; future genera- chanical behavior of N (>>1) "particles" with m > 0. The electromagnetic field is properly the quantum metions will undoubtedly be guilty of this sin. A recent faulty analogy can serve as a cautionary tale: chanical behavior of N (>>1) quanta (photons) with the interpretation of Maxwell's equations (1864). These m = 0. We forget this--that N > > 1 in either case equations predicted that both light and radio waves when we want to interpret what a single quantum (N = were electromagnetic wave phenomena differing only 1, either m > 0 or m = 0) is and how it behaves. Is it a in frequency. Wave phenomena were already familiar "particle"? Is it a "wave"? How could it be both? Our from the propagation of longitudinal and transverse problem is that this analogy is not correct, like "ether" waves in fluids and solids. In the case of sound waves before it, and to the extent it is not correct, these quesan underlying medium was necessary to support these tions don't even make sense. Beginning shortly after QM evolved to the point at which predictions of experimental results could be *Column Editor's address: D e p a r t m e n t of M a t h e m a t i c s , D r e x e l U n i v e r s i t y , P h i l a d e l p h i a , P A 19104 USA. made and were found to be correct, the midwives of the 70 THE MATHEMATICALINTELLIGENCERVOL. 18, NO. 1 9 1996 Springer-Verlag New York
theory embarked on a discussion of how to interpret it, how to "understand" what is happening at an atomic level. This discussion is nowhere near resolution today. Interpretations of QM were proposed by (among others) Dirac, Feynman, Bohr and Heisenberg, and Einstein and Schr6dinger. In addition, there is a lunatic fringe not covered by these broad areas. Some of the fringe ideas turn out to be not so lunatic. The view of Dirac is not widely known and even less widely adopted. So far as I know, it has two adherents-Dirac and myself. Dirac points out that there is in QM a dimensionless ratio, the fine structure constant a = e2/hc = 1/137.03604. He suggests that in any final theory of matter, all dimensionless ratios must be derived quantities, not inputs. Thus, at some point in the future we must be able to derive one of the three quantities e (fundamental charge), c (speed of light), or h (Planck's con-stant/2~r). By not entirely convincing arguments, Dirac rates e and c more fundamental than h, so that h must be derivable from them. But Planck's constant is the lower bound for the uncertainty relations; Dirac concludes that it is fruitless to argue about the ~nterpretation of uncertainty relations in particular, and QM in general, until such time as we can derive th~s physical constant from a more basic theory. Feynman took a more pragmatic view. He warned young physicists that the question is a trap. The different interpretations give the same predictions and therefore cannot be distinguished by physical experiment. The vast majority of working physicists have adopted Feynman's viewpoint and Bohr's Copenhagen Interpretation (see below) as a default. Important arguments about the interpretation of QM have revolved around the view of Bohr and Heisenberg, on the one hand, and Einstein and Schr6dinger, on the other. Bohr and Heisenberg established an interpretation based on observables. This view evolved from Bohr's early attempts, and failures, to describe atoms more complicated than hydrogen in terms of planetary models (e.g., two electrons in orbit around a helium nucleus). He imposed the condition on his younger colleagues, most notably Heisenberg, that classical concepts such as position, momentum, orbit, and angular momentum, were not reliable constructs on which to build an atomic theory. A microscopic theory should be based only on observables. Heisenberg obliged, and constructed a theory in which observables were identified with matrices. Heisenberg's atomic theory was called Matrix Mechanics. Schr6dinger came to his formulation from a different route. Einstein observed (1905) that light of wavelength behaved, in some respects, like a collection of particles with momentum p = h/,~ (h = Planck's constant). A surprising time later (1923), de Broglie proposed that a particle of momentum p behaved, in some respects, like a wave of wavelength A = h/p. Schr6dinger decided that if deBroglie were to be taken seriously, then asso-
ciated with each particle there would be a wave and that wave must satisfy some equation. What equation? To guess this, he reached back to Maxwell's equations, deduced an approximate equation satisfied by each of the three components of the electric and the magnetic fields, and made identifications between 1/A and p/h in a particular representation (Hamilton-Jacobi) of classical mechanics. The result of this ad hoc process is now called the Schr6dinger equation. The approximate correctness of this equation for describing low-energy atomic phenomena, and its ease of use (compared with Heisenberg's matrices), guaranteed its central role in the modern description of matter. Schr6dinger also showed (1925) that his wave mechanics was equivalent ~to Heisenberg's matrix mechanics. Bohr and Heisenberg wished to say as little as possible about a system. They would discuss only observables. If "it" wasn't observable, "it" didn't exist. Even if "it" was observable but wasn't observed, "it" didn't exist. On the contrary, Einstein and Schr6dinger felt that particles were endowed with wavelike properties as well as particlelike properties, which only enriched the discussion of their spectrum of possible behaviors. As the discussion about the interpretation of QM continued, Bohr refined his interpretation. Whenever multiple interpretations of an experiment were possible giving different outcomes, Bohr's interpretation was always correct. It was as if he had a direct telephone connection to the atomic level. He acted as if he never lost sight of Hilbert's observation that a system governed by PDEs is not even defined until the boundary conditions are specified--and as if the observer formed part of the boundary conditions on a quantum system. Bohr's interpretation grew into what we now call the Copenhagen Interpretation. Einstein and Bohr continued their dialogue for the rest of their lives. The problem haunted Bohr, and at his death in 1962, equations were found on his blackboard which reflected his continuing obsession with it. The Copenhagen Interpretation was elegantly summarized in D. Bohm's book (1952). Every quantum system Q is described by a state vector 1@ in some Hilbert space. This state vector contains all the information that can ever be known about Q. The state vector evolves dynamically according to Schr6dinger's equation. In order to extract information about Q, an interaction must take place between Q and a measuring apparatus A. The measuring apparatus is a classical object and gives classical results. It is outside the scope of analysis. The entire system A + Q interacts to produce a measurement of Q. Replacing A by a different apparatus A' changes the entire system A + Q to a different system A' + Q, and the results of the measurement need not be similar. Only quantities which have been measured can be considered real. The only values of an observable which can be observed in a measurement are eigenvalues a, of the observable, which is a hermitian operator on the Hilbert THEMATHEMATICALINTELLIGENCERVOL.18,NO.1,1996 71
space. The probability of observing eigenvalue an when the experiment is carried out is I(~nl~)l 2, where I~n) is a unit eigenvector with eigenvalue an. After the measurement, if it gave the value an, the state vector of Q is I4~n) instead of I@. Von Neumann replaced Bohr's classical measuring apparatus by an apparatus obeying the laws of QM. This resolves some questions while raising others. For example, where is the dividing line between A and Q when both are quantum mechanical? There are three questions which should be addressed if the Copenhagen Interpretation is to be accepted as a satisfactory interpretation of QM. 1. What is the origin and meaning of the uncertainty relations, or more generally of the noncommutativity of observables? 2. Where is the boundary in A + Q? 3. How does the apparatus force the "collapse of the wave function" I@ ~ I~n~ during a measurement? Omnes's monograph is the latest in a long line of efforts to address these and other philosophical issues. It is not for the lay reader or the faint of heart. It is directed to physicists with a working knowledge of this field who would like to know how to interpret what they practice. It is also directed to philosophers who realize that profound questions posed by Greek and Renaissance philosophers may finally be answerable that answers may be provided by a deeper understanding of the world as understood by physicists. Question 1 above is Dirac's question. It is not even raised in Omnes's book. Neither is Question 3: Omnes simply states that 3 above is what happens and need not be explained by theory. In essence, this book studies only half of the interaction between A and Q. I found this to be a major disappointment. Even at this level, fundamental questions arise. One was raised by Schr6dinger. Suppose a radioactive nucleus has probability 0.5 of decaying in an hour. At the end of an hour the state of the nucleus is a linear superposition of undecayed and decayed states. One such nucleus and a cat are placed inside a box. The decay of the nucleus will initiate a series of reactions that will ultimately kill the cat. How does Copenhagen interpret the cat after one hour? Von Neumann would say that the cat is in a linear superposition of live and dead states: only our conscious observation of the cat after an hour would force its wave function to collapse to live or dead. Heisenberg would say that only when the box is open does it become true that the cat is live or dead. Bohr argued that the cat is a classical object, and as such must be live or dead at any time, and that we determine which when we open the box. All three contributed to the "Copenhagen Interpretation." To see how Omnes's approach differs from or improves on the Copenhagen Interpretation, it is first useful to state the Copenhagen Interpretation in axiomatic 72 THEMATHEMATICALINTELLIGENCERVOL.18,NO.1,1996
formulation. Postulates are presented in many of the texts available today. In general, all presentations are equivalent, but no two are identical. A useful approach is modeled on Newton's formulation of Classical Mechanics. It has three postulates dealing with kinematics, dynamics, and measurements. 1. Kinematics. The playing field on which QM takes place is a Hilbert space. A quantum system Q is represented by a (state) vector I~) in this Hilbert space, or more generally by a density operator p on this space. Observables are represented by hermitian operators acting on this space. The Hilbert space for two particles, or more generally two systems, is the tensor product of the Hilbert spaces for the individual systems. If the particles are identical bosons (fermions) the Hilbert space carries a symmetric (antisymmetric) representation of the permutation group. 2. Dynamics. The state I@ or p presenting a quantum system evolves under the Schr6dinger equation (or Pauli-Schr6dinger equation, or Dirac equation . . . . ). If two systems are noninteracting, the Hamiltonian for the combined system is the direct sum of the Hamiltonians for the two subsystems. If the particles are identical, the Hamiltonian commutes with the permutation group. 3. Measurement. The quantum system Q and the apparatus A measuring the value of an observable interact with each other during a measurement. Q acts on A to produce a unit eigenvalue an with probability Pn = I(q~nl~/)l2, where I~n) is a unit eigenvector of the hermitian operator representing the observable with eigenvalue an. A back reaction of A on Q guarantees that Q is in the eigenstate Ibn) of the hermitian operator immediately after the measurement. Omnes formulates his version of QM in five Rules, which differ somewhat from the postulates of the Copenhagen Interpretation. He accepts the Kinematic Postulate in his Rules I and 3, and the Dynamics Postulate as his Rules 2 and 3. He changes the first half of the Measurement Postulate (the effect of Q on A) in his Rule 4, but accepts the second half of the Measurement Postulate, dealing with the back reaction of A on Q and the "collapse of the wave function," as his Rule 5. Omnes's Rule 4 differs in a very substantial way from the Measurement Postulate. It is not stated at all in terms of a quantum system and a measuring apparatus. It is stated instead in terms of only the quantum system and a calculus of logics. A quantum system may obey more than one logic. For example, a system may obey logics L1 and L2. These two logics are compatible if there exists a third logic which contains both L1 and L2 as sublogics (possibly not proper sublogics). If no such logic exists, L1 and L2 are called complementary. Roughly speaking, a measuring apparatus specifies a logic. If two logics are compatible, the observables commute and their values can simultaneously be known; if comple-
mentary, the observables do not commute and their values cannot simultaneously be known. There is more to it than this. The probability that a quantum system undergoes event I is pr(1) = Tr PlpP1, where p is the system's density operator and P1 is the Hilbert space projection operator which defines the event 1. The probability that event 1 occurs at time tl and event 2 occurs at time t2 is pr(2, 1) = Tr P2PlPP1P2, where Pi is the projector associated with event i at time ti. This sequence of events is called a history. If one of these events is further resolved into two alternatives [such as the particle is in arm 1 of an interferometer at time tl (P~) or in arm 2 of the interferometer at time tl (P~')], then the projectors obey P1 = P~ + P~' and P~ Pi' = 0. The requirement pr(2, 1) = pr(2, 1') + pr(2, 1") places the following constraint on the projectors:
tem obeys the laws of classical mechanics is I - 9 where 9 - 10-Pq, and pq is a very large number. This formalism is then applied to the measurement problem. The measurement apparatus is assumed to be described by the laws of QM. Omnes assumes that there is a small number of "classical" degrees of freedom, and the remaining degrees of freedom are not accessible. The trace involved in determining the outcome or probabilities for an experiment is over these nonclassical degrees of freedom. Since they are never really seen, these degrees of freedom should be regarded as "hidden variables" in Omnes's presentation of measurement theory, although this term is usually reserved for other problems in the interpretation of QM. For example, for the Schr6dinger cat paradox, the state of the cat is described by the 2 • 2 density matrix
Tr P2P{PP~'P2 + Tr P2P~'PP{P2 = O. Logics L1 and L2 are compatible only if all such consistency conditions (for P~ and PI' obeying L1, P2 obeying L2) are satisfied. This reduces the compatibility or complementarity of two logics to a computation of consistency conditions, where the result of a consistency computation ranges from 0 (compatible) to 1. It then makes sense to consider nearly compatible logics for which the consistency conditions hold within 9 where 9 is a small number. This provides a major advance over the standard Copenhagen Interpretation, where the observables either commute or do not, and are simultaneously measurable or are not. This also provides a mechanism to answer the question: Under what conditions can we reliably talk about a quantum particle taking a trajectory from spacetime point a to point b? Bohr would say it makes no sense; Einstein would demur. Omnes responds with a nice computation for a particle traveling from the origin at time t = 0 and observed within a small sphere centered at 2d on the z axis at time 2t. He asks, "How reliable is it to speak about a trajectory from the origin to the region of observation?" This is done by computing the consistency condition for the particle within a somewhat larger sphere centered on the z axis at d at time t, subject to the boundary conditions: created at a, observed near b. This consistency condition computation is roughly equivalent to a Feynman sum over restricted paths between these regions of space and time. The result is that in many instances it makes a great deal of sense to talk about trajectories for quantum particles. Omnes asks if we can in fact regard classical mechanics as a limit of QM. His answer is in no way equivalent to the standard Copenhagen Interpretation approach to the classical limit given by the Ehrenfest equations, where Newton's equations of motion are satisfied by expectation values of quantum observables. The results of Omnes's computation are presented in a way which at first seems unsatisfactory. He argues that, for the system considered, the probability that the sys-
TrPlpPI=
[ pu Pld] PdZ
Pdd
"
Here pll is the probability that the cat is still alive at time t: Pu = 2-(t/T), T = 1 hour, Pdd is the probability that the cat is dead: Ped = 1 - PU. The off-diagonal matrix elements Pld = P~I represent quantum correlations. A density operator describing a classical result must be diagonal, with no quantum correlations. This is the nub of the cat paradox. Omnes estimates Pld ~ l O - t / D , where D is a very small number. He thus concludes that quantum correlations drop off very rapidly. His computation resolves the conflict among the views of Bohr, Heisenberg, and von Neumann within the Copenhagen Interpretation itself; once again, Bohr's intuition was on target. Regrettably, this approach still leaves unresolved the question: Given that the cat is either alive or dead, why can we only compute probabilities? Is it for the same reasons that we compute probabilities in statistical mechanics: insufficient knowledge of initial conditions? Or is there something yet deeper involved? Following the codification of the Copenhagen Interpretation, as for example in Bohm's book (1952), three major developments in the interpretation of QM have been made by Bohm, Bell, and Leggett. Bohm developed a version of the quantum theory in which it was possible to speak about the position of a particle, and the trajectory which it followed. Such a theory had previously been developed by de Broglie (1927), but was abandoned by him after criticism by the principals involved in the Copenhagen Interpretation. This development by Bohm was in remarkable contrast to a theorem by von Neumann stating that "hidden variables" could not occur in any correct version of quantum theory. Hidden variables were values assigned to dynamical variables (position, momentum, angular momentum, spin) before or between observations--things Bohr warned Heisenberg not to build a theory on; things that Einstein and Schr6dinger deeply felt should exist. Einstein felt this so deeply that he proposed an experTHE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
73
iment (the Einstein-Podolsky-Rosen experiment, 1935) to show that one could speak of such variables. The basic idea was to consider two correlated particles, to measure the position of one and the momentum of the other, then argue that this information meant that the position and momentum of both could be known. After extended discussion with Bohr, he conceded that the correctness of QM would force some kind of long-range (faster-thanlight) exchange of information between the two particles. Disappointed, he found it difficult to believe in any kind of "spooky action at a distance." Bohm's representation spurred Bell to wonder just how it was that von Neumann's theorem could be circumvented. This theorem had put a damper on searches for better interpretations for 20 years. It must be regarded as the first of the "no-go" theorems in QM. The problem with no-go theorems is that there may be somebody in the wings who is smarter than you--even if you are von N e u m a n n - - a n d finds some way to make "go" what you think cannot. Bell identified the point at which Bohm's representation of quantum theory evaded the constrictions of von Neumann's theorem. There is an assumption of locality implicit in the proof of the theorem. Bell formulated (1966) a number of inequalities that must be satisfied by any system for which locality holds. He showed that under certain conditions the results of quantum mechanical calculations could violate these inequalities. For once there was a clear-cut test to distinguish different philosophical interpretations of QM. The tests have since been done by Clauser (1972) and others. They show that Bell's inequalities are violated. They show more: the quantum mechanical predictions are correct well within experimental error. They also show that correlations among quantum systems do not obey locality. So Einstein's fear of "spooky action at a distance," influenced by his close connection with the causality and finite propagation speeds of Special Relativity, is in fact not shared by Nature. Somehow signals travel at one speed, correlations (symptoms of boundary conditions) travel at another. The third advance, proposed by Leggett, was that certain devices (SQUIDS) could be very large and still remain in a quantum state. Bohr's original dictum that a measuring device was classical because it was "large" could no longer be supported. Omnes's discussion of these points leaves something to be desired. For example, the manner in which Bohm's model evaded von Neumann's theorem is not clearly stated. He does not make clear precisely what hidden variables are: they remain mysterious throughout the discussion. His discussion of Bell's theorem leaves the impression that Bell proved hidden variables could not exist, whereas the crucial assumption is locality. Bell's theorem shows that locality implies inequalities; QM predicts these inequalities could be violated; experiment shows QM is correct. Therefore, quantum correlations do not satisfy locality. This feature is already present in 74
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
Bohm's model. So once again Bohr was correct and Einstein not, but everybody is surprised at this action at a distance, which is now a clearly recognized part of the quantum interpretation. This is not the first time that action at a distance has created a problem for interpretation. Newton's force of gravitational attraction acts at a distance. This was a great embarrassment to Newton, who was able to coexist with this force but never comfortably. It remained for Einstein to resolve Newton's dilemma more than 200 years later with his own theory of gravitation. Will we have to wait another 200 years for some future savant to resolve Einstein's dilemma? Omnes suggests that the existence of macroscopic quantum systems can aid in our study of the decoherence phenomenon. Decoherence, to which Omnes devotes an entire chapter, plays a central role in the measurement problem. It would have been nice if Omnes suggested how SQUIDS could be used to measure decoherence times. The measurement of such phenomena would add credibility to his interpretation. This book contains a number of other errors, less serious but annoying nonetheless. The first time a hamiltonian is presented, the equation is incorrect. The description of the spin-echo experiment is not correct. Expressions for the Wigner distribution are presented incorrectly more often than correctly. It has been the fashion, these days, to lay out the formulation of QM in an axiomatic format. Earlier presentations simply described the computational processes to be carried out. The push toward axiomatic formulation derives from our desire to make the assumptions of our subject precise so that, if disagreements occur, the possible culprits can more easily be identified. These axiomatic formulations of QM are modeled on earlier axiomatic formulations which underlie other branches of physics. So it would be useful, at this point, to investigate just how successful such formulations have been. The archetypical axiomatic formulation goes back to Classical Mechanics and Newton's three laws of motion: (1) a particle moves with constant velocity unless acted upon by forces; (2) F = dp/dt; (3) Fij + Fji = 0. The functions of these three laws are as follows. The first defines a coordinate system (inertial coordinate system) in which the actual laws of dynamics (F = dp/dt) are true. The third is a conservation (of momentum) law. Now it turns out that Newton's first law, as stated by Newton, isn't true: consider an observer on a merry-goround. A proper statement of this law is: In an inertial coordinate system a particle moves with constant velocity unless acted upon by forces. Only much later was it understood that it defined inertial coordinate systems. Then comes the pragmatic question: how do you know the forces on an object are zero? The standard answer "when the object is sufficiently far from other stuff" won't do because of the long-range nature of the gravitational force. Instead, it is more reasonable to
adopt Mach's principle (Boundary conditions for ODEs? Hilbert would be shocked!) to resolve the definition of inertial frames; that is, an inertial frame is defined by the distribution of matter in the universe. This not only defines an inertial frame, it does what Michelson and Morley failed to do, and what Einstein stated axiomatically cannot be done. It defines an absolute rest frame. We have recently been able to do this in practice as well as principle from our observation of the 3 ~ microwave background blackbody radiation. From the observed dipole distribution of this radiation we know that we are moving about 370 k m / s in the direction of Leo with respect to the cosmic background radiation (late 20th century ether). This radiation is statistically distributed, as would the graviton background radiation be, could we measure it. This reduces us to the situation that Newton's inertial rest frame can only be defined as a statistical average. Nor is it clear that the rest frame defined by the microwave background (m = 0, JP = 1-) and the gravitational background (m = 0, JP = 2 +) have any relation to each other (m = mass of photon or graviton, J = helicity, P = parity). The archetype for all axiomatic formulations is of course Euclid's formulation of the rules of surveying. The origins of this scientific endeavor are lost in antiquity. The honor of codification goes to Euclid, who came perhaps a millenium after the field became economically important in the postdiluvian Nile river valley. Even though Euclid's axiomatic formulation has been available for -2500 years, the consistency of this set of axioms has not always been clear. For example, it is only relatively recently that the role of the fifth postulate has (apparently) been settled. As a branch of mathematics, Euclidean geometry should not be subject to the messy kind of probability or approximation statements which the physical disciplines share. As an axiomatization of some practical endeavor, however, it is. For example, Euclid proves that the sum of the interior angles of a regular figure with four equal sides ("square") is 2r radians, where a surveyor would actually measure 2~1 + ~) radians, with E -IO-PE, pE small. This suggests that the axiomatic formulation of a field of physics as intuitive as Classical Mechanics amounts to a moving target, with perhaps no satisfactory resolution, even though it has been going on for 300 + years. In short, the axiomatization of any area of practical endeavor is filled with potholes and surprises. Approximations and probabilities apparently cannot be avoided. The statement t h a t Classical Mechanics holds with probability 1 - e, 9 = lO-Pq is far less shocking when we realize that pq > > PE. That is, Classical Mechanics is a much better approximation of QM (for a macroscopic system) than thermodynamics is of statistical mechanics, and this is a much better approximation than Euclidean geometry is of surveying. However, it is clear that the axioms provided by
Omnes are incomplete. How can they give the Classical Mechanics in the limit when they do not take into consideration either the first or third laws of Newton? How can they be complete when they ignore the collapse of the wave function? The current volume provides a useful contribution to the literature of interpreting QM. However, it will probably only convince those who need no convincing. The author presents many useful arguments, most of them correctly; but many more are not presented at all. It seems that there is yet another uncertainty relation lying just below the surface which no one has yet stated. This is a relation between the precision with which we do calculations and make predictions (Predictability) and our understanding of the underlying physics (Understanding). QM is the most precise theory ever developed; A(Predictability) is very small. As the dialogue of which this book is a part shows, &(Understanding) is very large. Perhaps A(Predictabitity) z~(Understanding) -> 1 in some suitable system of units, and this is what ultimately limits our ability to interpret QM.
Department of Physics & Atmospheric Science Drexel University Philadelphia, PA 19104 USA
Spirals: From Theodorus to Chaos by Philip J. Davis Wellesley, MA: A.K. Peters, Ltd., 1993. • + 237 p. ISBN 1-56881-010-5. US $29.95
Reviewed by Michele Emmer It would only be possible to imagine life or beauty as being "strictly mathematical" if we ourselves were such infinitely capable mathematicians as to be able to formulate their characteristics in mathematics so extremely complex that we have never yet invented them. Theodore Andrea Cook [1]
Spirals Everywhere I was particularly happy to receive Davis's book on spirals to review. Recently, I have been working with spirals and helices both for exhibitions on art and mathematics and for movies on visual mathematics. Moreover, in an introduction to a chapter on spirals of a book dedicated to Venice, I quoted a phrase ascribed to Madame de Sta61 [2]: L'esprit humain fait toujours des
progr~s mais, ce progr~s est spirale. The same phrase is quoted on the first page of Davis's book. I do not know whether he, like me, found the phrase in Theodore Cook's famous book on spirals, THEMATHEMATICALINTELLIGENCERVOL.18,NO.1,1996 75
written in 1914 [1, p. 417]. There is a more important reason for my current interest in the work of Davis. For the past 2 years I have held a position at the University Ca" Foscari (Ca' means casa, or palace) in Venice. The math department is located in a wonderful 18th-century palace called Ca' Dolfin near the second bend of the Canal Grande that divides Venice in two. The Canal Grande consists of two large bends of a spiral, one in one direction, from the railway station to the Rialto bridge, the second, from the Rialto bridge to Piazza San Marco. (See Figure 1.) So this review on spirals is written from the inside of a spiral, one of the most famous! Davis's book is divided into several parts: Part One consists of the text with amplifications of the three Hedrick Lectures delivered on the occasion of the 75th anniversary of the Mathematical Association of America (MAA); Part Two contains some technical developments by Walter Gautschi and Arieh Iserles; Part Three consists of some historical supplements. Finally, there are the notes. The organizing principle of the book? I have planned this book so as to exhibit a variety of things: some history, some philosophy, some anecdotes, a fair amount of mathematics, naturally; some things old, some things absolutely new, some things proved, and many things that invite exploration at a variety of levels of sophistication; many things that invite discussion, conjecture and proof. (p. 4) An essential part of the book is the notes, wherein the author has taken the liberty of free-associating on some of the ideas contained in the text. The notes are the more personal part of the volume; the author hopes he has been able to moderate his natural effusiveness so that it would not be possible to say of him, as it was of George Eliot's Casaubon, that "he dreams footnotes" (p. 5).
The starting point of this fascinating and amazing book is the question: "What is a Spiral?" It would seem to be a simple question, capable of a simple answer. We find examples of spirals everywhere. Davis, looking on the compact disc that contains the Mathematical Reviews of the past 50 years, turned up about 250 entries under spirals and an equal number under a variety of related designations. A few examples of what he has found: pythagorean spirals; lambda spirals; spiral galaxies; spiral waves; spirals and convexity theory; spirals and quasiconformal mappings; spirals generated by Diophantine Gauss sums; spiraMike analytic functions; spirals in stochastic processes; spirals in Hilbert space. There are the spirals of genetics, in the internal structure of a tornado, and so on. (See, for example, a recent conference and exhibition on spirals in science, art, and culture [3].) How is it possible to find one's way in this infinity of spirals? Only by starting from a clear definition! Davis begins with dictionaries; they do not help much. They give definitions for which counterexamples are easily provided. In the Encyclopaedia Britannica, 11th edition, a spiral is defined as a curve that winds around a fixed point. In the American Heritage Dictionary, it is the locus of a point moving around a fixed center at a monotonically increasing or decreasing distance. In the James and James Mathematical Dictionary there is no entry for spiral. We can add other questions to the first one: What is a discrete spiral? What is a fractal spiral? So the question is how to proceed from a common, perhaps visual, conception to a mathematical definition, or at least, to a common but perhaps unspoken set of agreements. These questions can be posed for entities other than spirals. What is a curve? What is a number? What is a set? What is probability? What is chaos?
Figure 1. Perspective map of Venice, in M. Giulio Ballino,
Disegni delle pill illustri citt~, Venice, 1569. (2 Vernou) 76
THE MATHEMATICAL INTELL1GENCER VOL. 18, NO~ 1, 1996
We could answer questions of this kind by making normative definitions, that is, definitions that preserve desirable properties. However, there must be a sense in which a definition, once promulgated, proves fruitful and stable. "This process may take centuries, millennia; and since a definition is necessarily a limitation--finitization--the process is always unsatisfactory and never ending" (p. 23). Which properties must characterize a spiral? Would you say that the equation of a spiral cannot, in rectangular coordinates, be algebraic? Would you say that the product of two spirals is a third spiral? And so on. The number of possible questions grows. There is the problem of distinguishing spirals from what some writers have called volutes, whorls, meanders, wanderings, doodles, noodles, tangles, or explosions (Fig. 2). Why all the wishywashiness about a definition? Isaac Barrow wrote (p. 25), "[mathematicians] take up for contemplation those features of which they have in their minds clear and distinct ideas; they give these appropriate, adequate and unchanging names ..."
Figure 2. Is this a spiral?
Is This a Spiral?
Why is it that, whereas, for example, the notion of a group has been quite stable for almost two centuries, mathematicians do not have clear and distinct ideas about spirals? The story is complex because, as Charles Sanders Peirce wrote (p. 26), technical words should be introduced that are "so unattractive that loose thinkers are not tempted to use them." The term spiral is an ancient one. "Everybody loves a spiral, wants to have a spiral of their own." Of course, mathematics has ways of accommodating all tastes. Mathematics slaps on its subjects such adjectives as almost, sub, semi, super, quasi, alpha-quasi, pseudo, true, faithful, fuzzy, ultra, meta, strong, weak degenerate, standard, generalized, pre-, -like, -oid, incomplete, and hundreds more. We can produce a weak, generalized, fuzzy, alpha-quasi pre-spiral, if we want. Another possibility is to look at the forms of spirals and try to understand how we should distinguish between spirals and nonspirals, perhaps performing experiments. We could use a modern approach to decide what a spiral is. Teach a computer what a spiral is. Build yourself a "spiral recognizer"; that is write a program that allows the computer to discriminate between spirals and nonspirals. If one of the major problems of the computerization of mathematics could be to get a computer to recognize automatically what is mathematics and what is not, Davis in a note (p. 211) states, "One thing is certain: mathematics is far too important a subject to be left to the mathematicians either for definition, extension or promulgation." The starting spiral of Davis's investigation is the spiral of Theodorus. Theodorus was born in Cyrene, approximate dates 465 to 399 BC. He was the teacher of Plato and Theaetetus. In Plato's dialogue Theaetetus,
Theaetetus tells Socrates, "Theodorus wag writing out for us something about roots, such as roots of 3 or 5, showing they are incommensurable by the unit: he selected other examples up to seventeen; there he stopped. N o w as there are innumerable roots, the notion occurred to us to include them all under one notion or class." A discussion of what is knowledge and what is abstraction follows, the problem of the particular and the general and the relationships between the two (see note 3, p. 194). So w h y does Theodorus stop at V ~ ? An answer was provided 70 years ago by J.H. Anderhub. He imagined that Theodorus constructed V2, V3, by a sequence of contiguous right-angled triangles. In each triangle, each outer leg is of length 1. Anderhub observed that the resulting snail-like figure is such that V ~ arises from the last triangle for which the total figure is nonself-overlapping. This spiral is, for Davis, the Ur-spiral, the granddaddy of mathematical spirals (see Fig. 3). If we now consider the spiral of Theodorus in the complex plane and define its vertices Zn in iterative fashion, we have Zn
.
Zn+l = zn + i iZnl ,
(1)
or replace it by a linear homogeneous difference equation with nonconstant coefficients, Zn+l = 1 +
Zn.
(2)
We can study on the basis of (2) what Davis calls the discrete spiral of Theodorus. H o w we can draw a curve through the sequence of points z0, zl, z2. . . . ? There are an infinity of ways to do it, for example, by connecting up the points automatically with straight-line segments. The resulting curve will then have corners. Smoothness THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
77
ematics may become impresarios of hardware and software instead of playwrights, composers or actors, with the more creative roles in the total mathematical production preempted by a small cadre of mathematical elite who work in conjunction with a large group of people skilled in visuals.
Figure 3. The Spiral of Theodorus, exhibiting the "solution" of Anderhub.
might be required; it might be required that the interpolant satisfy the difference equation (2) for noninteger values of n. The interpolant could be required to be analytic, and so on. Mathematicians should always generalize; so one writes down Zn+l = aZn + b zn
JZnl
(3)
for a and b arbitrary complex numbers. The dynamics of (3) are non-trivial and are considered in lecture 111, "Theodorus Goes Wild." I hope it is clear by now that the book of Davis is a genuine source of stimulating ideas on many topics. As the author himself writes (p. 170): "This small volume on spirals is an assemblage of theorematic material both old and new, theorematic conjecture, computer experience, computer graphics, historical monuments of mathematics that span two and one-half millennia, historical and philosophical animadversions." As I have already pointed out, it is in the notes that Davis puts together his reflections on mathematics, culture, and science. It is impossible to give an idea of the variety of stimulating observations contained in them. I suggest you read them while reading the text. Don't avoid them!
In another note (p. 225) he speaks of the possible recognition of a class of computer-generated visual theorems that might be incorporated into our mathematical experience, and poses the question whether this contradicts the idea that mathematics is a process of verbal and symbolic communication at both the input and output ends. He asks himself, "Do you think that you can see something in the figure that cannot be described in words? Can this visual theorem contribute to one's emotional life? And if visual theorems, why not auditory theorems? One might listen to a spiral, via its discrete Fourier transform" (see also Refs. 4 and 5). This book provides many questions, starting with the concrete question "What is a spiral?"--so apparently trivial. In some ways, the book bridges the gap between the two cultures, the humane and the scientific. The computer is the protagonist of this tale--a character both loved and detested, and in any case absolutely necessary to our age. The book ends on an optimistic note: "The computer and computer graphics have clearly widened old and opened up new creative sources of the mathematical spirit. A retrospective show of spirals in the year 2500 would amaze us. One hope that both the visual and the symbolic will endure in their individual aspects as well as in fruitful partnership."
References 1. Theodore Andrea Cook, The Curves of Life, London: Constable & Co. (1914); reprinted by Dover Publ. Inc., New York (1979), p. 428. 2. Michele Emmer, La Venezia perfetta, Venice: Centro Internazionale della Grafica (1993), p. 59. 3. Hans Hartmann and Hans Mislin (eds.), Die Spirale im menschlichen Leben und in der Natur: eine interdisziplin~'re Schau, Basel: Birkh/iuser (1985). 4. Michele Emmer (ed.), Visual mathematics, special issue, Leonardo 25(3/4) (1992); reviewed by Harold L. Dorwart, Mathematical Intelligencer 16(4) (1994), 70-72. 5. Michele Emmer (ed.), The Visual Mind, Cambridge, MA: The MIT Press (1993). Dipartimento di Matematica Universitdl Ca' Foscari Dorsoduro 3825/E, Ca' Dolfin 30123 Venice, Italy
Visual Mathematics?
A Photo A l b u m for Weierstrass edited by Reinhard Biilling
One of Davis's most trenchant observations concerns what could be called the imperialism of the visual.
Vieweg Verlag, Wiesbaden, 1994. ISBN 3-528-06602-4, DM 98
The tendency of our students to think only in terms of colored overlays in polywindowed displays, the decline of poetry and recitation, the condemnation of old-fashioned oratory as flatulent and suspicious, are all independent indications of a transition . . . . Our future teachers of math78
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
Reviewed by R.B. Burckel
This book is a reproduction of two photo albums presented by students, friends, and colleagues to Karl
Weierstrass on 31 October 1885, his 70th birthday. They lay slumbering in a Berlin museum since 1904, and the first one was serendipitously discovered in about 1980. The whole fascinating story of their provenance, plus many details about participants, was told to readers of this journal by Reinhard B611ing in an article entitled "A birthday present," Mathematical Intelligencer vol. 11 (1989), no. 4, 20-25. That article is the best possible review of this book, all the more since Dr. B611ing is a noted Weierstrass scholar. In that article he declared his intention to publish an edited version of the recently found albums. Now, some five years later, he has brought this to fruition with the help of Vieweg Verlag, and we are all in his debt. On the album's cover is a photo of the beautiful copper engraving that adorned the original. There is a short but intimate biography of Weierstrass, including the story of how the album came into being. These are given in both German and English. The original plan, proposed by Mittag-Leffier, was for a sculpted bust, together with a medallion if funds permitted. Solicitation was not confined to Germany, as Weierstrass's former students and admirers were well distributed throughout Europe and Russia. Somewhat more than 5000 marks were ultimately contributed (all from individua l s - n o big foundations in those days), enough money to finance the marble bust, a gold medallion, and the photo album, the idea for which came up rather late in the planning. Due to the well-known rivalries among the Berlin mathematicians and their adherents, some friction was anticipated over this project. There was
some, but it was not significant. Besides this saga, a bilingual account of the birthday celebration itself is included, as are many letters (untranslated) among t h e prominent organizers. Then the editor has undertaken the time-consuming task of ferreting out biographical data to accompany (most of) the photos. Here these are all approximately 4 x 6 cm., although the originals in the second, shorter album are considerably larger. There are altogether 334 of them. The frontispiece is a photo of Weierstrass, the latest known of him (taken sometime between 1865 and 1870), but unfortunately no photo of the marble bust is offered. However, the mathematical tourist can see the bust itself in the Weierstrass lecture hall of the Humboldt University in Berlin. In 1912 the Acta Mathematica (founded in 1882 by the great organizer and internationalist Mittag-Leffier) published a special 179-page issue which comprised the tables of contents of its first 35 volumes, plus photos and professional biographies of most of its contributing authors to that point (204.altbgether). Many of these appear in younger photographs in the Weierstrass album, and comparing photos from the two treasuries is fun. There is another excellent short biography of Weierstrass in English, which puts his mathematical accomplishments in perspective, by K.-R. Biermann; it appears on pp. 219-224, vol. XIV of the Dictionary of Scientific Biography (Charles Scribner's Sons, N e w York, 1971).
Department of Mathematics Kansas State University Manhattan, KS 66506-2602 USA
STATEMENT OF OWNERSHIP, MANAGEMENT, AND CIRCULATION (Required by 39 U.S.C. 3685). (1) Publication title: The Mathematical Intelligencer. (2) Publication No. 001-656. (3) Filing Date: 10/1/95. (4) Issue Frequency: Quarterly. (5) No. of Issues Published Annually: 4. (6) Annual Subscription Price: $39.00. (7) Complete Mailing Address of Known Office of Publication: 175 Fifth Avenue, New York, NY 10010-7858. (8) Complete Mailing Address of Headquarters or General Business Office of Publisher: 175 Fifth Avenue, New York, NY 100107858. (9) Full Names and Complete Mailing Addresses of Publisher, Editor, and Managing Editor: Publisher: Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 10010-7858. Editor: Chandler Davis, Department of Mathematics, University of Toronto, Toronto, M5S 1A1, Canada. Managing Editor: Springer-Verlag New York, Inc., 175 Fifth Avenue, New York, NY 10010-7858. (10) Owner: Springer-Verlag Export GmbH, Tiergartenstrasse 17, D~9121 Heidelberg, Germany, and Springer-Verlag GmbH & Co. KG, Heidelberger Platz 3, D-14197 Berlin, Germany. (11) Known Bondholders, Mortgagees, and Other Security Holders Owning or Holding 1 Percent or More of Total Amount of Bonds, Mortgages, or Other Securities: Dr. Konrad Springer, Heidelberger Platz 3, D-14197 Berlin, Germany. (12) The purpose, function, and nonprofit status of this organization and the exempt stat~s for federal income tax purposes: Has Not Changed During Preceding 12 Months. (13) Publication Name: The Mathematical Intelligencer. (14) Issue Date for Circulation Data Below: 10/94-9/95. (15) Extent and Nature of Circulation: (a.) Total No. Copies (Net Press Run): Average No. Copies Each Issue During Preceding 12 Months, 9207; Actual No. Copies of Single Issue Published Nearest to Filing Date, 8800. (b.) Paid and/or Requested Circulation: (1) Sales Through Dealers and Carriers, Street Vendors, and Counter Sales: Average No. Copies Each Issue During Preceding 12 Months, 1250; Actual No. Copies of Single Issue Published Nearest to Filing Date, 1981. (2) Paid or Requested Mail Subscriptions: Average No. Copies Each Issue During Preceding 12 Months, 4739; Actual No. Copies of Single Issue Published Nearest to Filing Date, 4755. (c.) Total Paid and/or Requested Circulation: Average No. Copies Each Issue During Preceding 12 Months, 5989; Actual No. Copies of Single Issue Published Nearest to Filing Date, 6736. (d.) Free Distribution by Mail: Average No. Copies Each Issue During Preceding 12 Months, 152; Actual No. Copies of Single Issue Published Nearest to Filing Date, 152. (e.) Free Distribution Outside the Mail: Average No. Copies Each Issue During Preceding 12 Months, 535; Actual No. Copies of Single Issue Published Nearest to Filing Date, 535. (f.) Total Free Distribution: Average No. Copies Each Issue During Preceding 12 Months, 687; Actual No. Copies of Single Issue Published Nearest to Filing Date, 687. (g.) Total Distribution: Average No. Copies Each Issue During Preceding 12 Months, 6676; Actual No. Copies of Single Issue Published Nearest to Filing Date, 7423. (h.) Copies Not Distributed: (1) Office Use, Leftovers, Spoiled: Average No. Copies Each Issue During Preceding 12 Months, 1673; Actual No. Copies of Single Issue Published Nearest to Filing Date, 1273. (2) Return from News Agents: Average No. Copies Each Issue During Preceding 12 Months, 858; Actual No. Copies of Single Issue Published Nearest to Filing Date, 104. (i.) (Sum of 15g, 15h(1), and 15h(2)): Average No. Copies Each Issue During Preceding 12 Months, 9207; Actual No. Copies of Single Issue Published Nearest to Filing Date, 8800. Percent Paid and/or Requested Circulation: Average No. Copies Each Issue During Preceding 12 Months, 89.71%; Actual No. Copies of Single Issue Published Nearest to Fliing Date, 90.74%. (16) This statement of Ownership will be printed in the Winter issue of this publication. I certify that all information furnished on this form is true and complete.
Craig Van Dyck Senior Vice President
THE MATHEMATICAL INTELLIGENCER VOL. 18, NO. 1, 1996
79
Robin Wilson* Mathematical Physics III Keith Hannabuss and Robin Wilson A quarter of a century after Planck's original quantum hypothesis, physicists still had no coherent understanding of quantum theory, but in 1925 two independent discoveries revolutionised the subject. Towards the end of that year, Erwin Schr6dinger (1887-1961) learned of de Broglie's idea that particles could also behave like waves, and found the appropriate partial differential equation to describe those waves. He showed that the energies of the light quanta emitted by a hydrogen atom were differences between eigenvalues of his partial differential operator. This resolution of the problem had, however, been preceded six months earlier by the more algebraic theory of Werner H e i s e n b e r g (1901-1976). Heisenberg realised that quantum-theoretical quantities like position, momentum, and energy could be represented by infinite arrays, which Max Born later recognised as matrices. The non-commutativity of matrix multiplication led Heisenberg to his famous
Erwin Schriidinger: wave functions for hydrogen
Uncertainty Principle. In the previous year, W o l f g a n g Pauli (1900-1958) explained the structure of the periodic table of elements by the introduction of an Exclusion Principle, which forbade the simultaneous occupation of the same state by two electrons. He showed how Heisenberg's new ideas could also explain the radiation properties of the hydrogen atom. In 1928 Paul Dirac (1902-1984) derived an equation for the electron which, unlike those of Heisenberg and Schr6dinger, was consistent with Einstein's Theory of Relativity. This equation explained electron spin, and led him to predict the existence of antiparticles, such as the positron which was detected four years later. Schr6dinger discovered that, despite the apparent differences, his theory was equivalent to that of Heisenberg. John y o n N e u m a n n (1903-1957) then showed that both theories are special cases of a more abstract approach based on the theory of linear operators in Hilbert spaces.
Werner Heisenberg: body-centered cubic lattice
Paul Dirac: cloud chamber track of a positron-electron pair
John yon Neumann
Erwin Schr6dinger * C o l u m n editor's address: Faculty of Mathematics, The O p e n University, Milton Keynes, MK7 6AA, England.
8 0 THE MATHEMATICALINTELLIGENCERVOL.18, NO. 1, 1996
Wolfgang Pauli