This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
rj , (Px
== r) . The next theorem allows us to use L(S) P in the
WI
same manner as a Skolem expansion is used in classical model theory. 5.7. r)4J(x,y) is S-equivalent to
v
V
O<m<w O r +1..m ,
or V V [ O<m<w O 0 m
The last formula belongs to Q . Now suppose 4J(X, y) € M has monotone complexity zero, that is, IJ!(X, y) is a quantifier-free formula of L(S)wp' 4J is S -equivalent to a
80
H.KEISLER
formula in disjunctive normal form,
.v /\ ((Tij(X,y) > O]Al(Uij(X,y)> 0]) 1=1 J=1 p
This in tum is
q
~
S-equivalent to
p q ~ ~ 1 V I\«T'j(X,y»O]A 1\ (Ui,(X,y) 0],,(- - U(x, y) > 0] , n
1\
O 0] n
Here
T(;;, y)
U(;;, v)
The terms Vn(X, y) monotonically decrease in value as n increases, Therefore (Px > r) ljJ (x, y) is V
S-equivalent to 1\
0< m<w 0< n<w
(Px
> r + 1.)( V m
n
We have already seen that the inside formulas are
(x, y)
> 0]
S-equivalent to formulas
81
KYPERFINITE MODEL THEORY of Q , so (Px > r)ljJ(x, y) is
S-equivalent to a
formula of Q .
Now assume the result for monotone complexity less than ~ 'P (x, y) in M have monotone complexity
01.
n monotone,
F (Py > r)
I\cP
n
Hence (Py > r) 'riCPn is
n
0',
and let
Since the conjunction is
1 ~ V 1\ (Py>r+-)cpn(x,y). O<m<wn m
(x, y)
S-equivalent to a
V'P (x) in M have monotone complexity
n n
formula in Q. 0'.
Finally, let
Then since the disjunction
is monotone,
F (Py > r) V cP (x, y) n n Thus (Py > r)
X'Pn
V (py
n
>
r) 'P (x, y)
n
is S-equivalent to a formula in Q.
This completes
our induction. The proof of the Quantifier Elimination Theorem gives an algorithm which, given a formula of L
wp
formula of L(S)
WI
p'
S-equivalent quantifier-free
The exact sense in which the algorithm is effective
will be discussed later. be quite complicated.
, produces a
The translation of a simple formula of Lwp may
For example, given the sentence (Px> r)(Py
~
of Lwp ' the algorithm produces the
s) F(x, y) > 0
S-equivalent
sentence
VI\VVI\I\VI\VV abcdefghij
1 1 1 1 - (r+-+-+-+-)] > 0 d g i a
5.8.
SL,
COROLlARY.
For all hyperfinite models
the following are equivalent.
~
and
'fI for
H.KEISLER
82 (i)
'13(S) are L(S) p-equivalent, 1. e., they satisfy the WI
(11)
same sentences of L(S)
WI
Proof:
If m,.,
P .
13 then m(S) and 13 (S) satisfy the same atomic
sentences, and hence the same quantifier-free sentences of L(S) p
WI
By the Quantifier Elimination Theorem they are L(S) P -equivalent.
WI
Thus
(i) implies (11). If (11), then for every constant term T and real r we have st(Tm) > riff iff '13(S) and hence m,., 5.9.
r] > 0 iff st(T'13) > r ,
13 .
WI
m and
13 are the reducts of ~(Sl and 13 (S) to L p' WI
eq symbol does not occur. ~.
13 -
and
Let If
tp
tp
-f
be a sentence L(S) P in which the
WI
has as-model m then it has a uniform
If ~ is two-valued we may take It to be two-valued.
Let ~ (J)
Proof: f:
13 are two- valued hyperfinite
13 , then m and 13 are L p -equivalent.
5.10. THEOREM.
S-model
If m,
COROU.ARY.
models and m,., Proof:
F [T -
m(S) l= [T - r] > 0
F
tp •
By Proposition 3.30 there is a refinement
=C m,., By 3.24, T T'13 for all constant terms T without the eq
~ and a uniform hyperfinite model It for SL such that B
13'" ~.
symbol. Therefore m(S) and '13(S) satisfy the same quantifier-free sentences of L(S) that
tp
WI
P without eq.
From the proof of Theorem 5.7 we see
is S-equivalent to a quantifier-free sentence IjJ of LcS) P
without eq , Therefore UI is a
S-model of
S-model of
WI
tp
and hence of 1jJ,
IjJ and hence of tp , and finally It is a uniform
13 is a
S-model of tp .-f
HYPERF1N1TE MODEL THEORY
83
We now consider the effect of changing the hyperreal universe '~[V(R)]
.
If the signature (I, cr) of L belongs to V(R) , then the set of
P belong to V(R). Let us write ~ E ':'[V(R)] if every reduct wI of m to a finite sublanguage of L belongs to *[ VCR)] . formulas of L(S)
s.u, Let
~ r) , (Px
~
a.
Notice that formulas in Lap contain only quantifiers
r) such that r e
n.
5. 13. ABSTRACT COMPLETENESS THEOREM. admissible set with
W
E
aC
Let
a
be an
HC , and let L have an a-finite signature.
Then the set of all valid sentences of Lap is a-recursively enumerable. Proof:
We give the proof in the case that L has a finite signature.
The a-finite case is similar.
Let
Co be the set of connectives of SL
consisting of the linear functions with rational coefficients, the rational polygonal functions in one variable, and the max and min functions. Then the set of terms of SL(C ) is a-finite. O
In order to obtain the result for arbitrary
a
with
W
E
a~
HC we
HYPERFINITE MODEL THEORY
85
need a weakening of the notion of a monotone formula.
A conjunction
/\ q, is said to be directed if for all 'PI' 'P2 "q, there exists 'P" q, such
that 1= 'P -
'PI" 'P2 • Directed disjunctions are defined dually.
formula 'P of
L(C O'
S)wrP
A
is directed if it is built up from finite quantifier-
free formulas, using only directed conjunctions and disjunctions and probability quantifiers.
Let J
a. Ja
which belong to
be the set of sentences of
a
is an a-recursive set.
L(C
o ' S)w 1p
By effectivizing the
proof of the Monotone Formula Theorem it can be shown that there is an
a- recursive function fl: Ja -
Ja such that for each sentence
fl'P is a directed sentence which is equivalent to 'P.
'P"
Ja '
The details of this
effectivization, which require some care in the inductive definition of f
are given in Keisler [21]. Let K be the set of quantifier-free a l, sentences in J a' The proof of the Quantifier Elimination Theorem gives
an a-recursive function
f 2: J
a
-
K
a
such that whenever ljJ is a
directed sentence of Ja' f 2ljJ is a sentence lent to ljJ •
e" K a which is
S-equiva-
Let f be the composition
Then f: Ja lent to 'Po
K
a is a-recursive and for each
'P"
Ja' f'P is
S-equiva-
(This is the precise sense in which the algorithm of Theorem
5. 7 is effective. ) Call a model ~ for SL hyperrational if all the values of are hyperrational numbers.
If
f1 and
Fj
~ is a finite model for SL and the values
of f1 and F. are all rational, we call J
m rational.
The rational models
are elements of HF, the set of hereditarily finite sets. For each constant term T " SL(C in the language of (HF, 8)
o) ,
such that
there is a finite formula eT(u, v)
H.KEISLER
86 if and only if
~
function T -
aT is a-recursive.
is a rational model for
(HF, e)!= if and only if ~ (S) 1= [T > 0]. K
a
SL
and
T~ ~
-k- '
and the
Moreover, V a ( ~ , m) 0< m < w T
Let g be the
a-recursive function from
into La such that g[T> 0]
V
0< m < w
aT(u, m)
Then g(l1J) has one variable u and is built from finite formulas using only finite and infinite connectives (no quantifiers).
Let
r be the set of all
L which hold in (HF. e). Since w € a, r ww is a-recursively enumerable. We claim that for each sentence tjJ EO K finite sentences y
€
a'
the following are equivalent. (1)
(gtjJ)(u) is consistent with
(Z)
tjJ has a
r.
S-model.
(1) holds if and only if (g tjJ)(u) is satisfiable in some model of
r.
Since (gtjJ) is built from finite formulas without quantifiers, (1) holds if and only if (g q),
COROLIARY.
where q is rational.
of valid sentences of Proof:
be the set of formulas of Lwp with only quantifiers
L~p
(Px 2: q)
a
Let
Suppose L has a hyperarithmetical
L~p
is a
1Ti
Then the set of Qjdel numbers
set of natural numbers.
be the smallest admissible set such that w
set X C w is hyperarithmetical iff it is a-finite, and
1Ti
€
G.
A
iff it is
a-recursively enumerable (see Barwise (2 J).
-j
We conclude with a result about the expanded Sus lin logic 1(S)\O( p .
5.15. has
0] for T(X)
E
SL- , and form L(r)
real-valued probability model
WI
~
p in the usual way.
Each
determines a two-valued probability
model~(S-) for L(S -) , and formulas of L(S -) P can be interpreted in
WI
the natural way in ~(S-). We do not have a natural expansion to a model for the full language L,(S).
92
H.KEISLER The standard part st ~ of a hyperfinite model is an especially well
behaved real-valued model, in which the full logic 6.7.
SL,
tenus T(x) of
Tst~[a]
the inductive definition of T
st~
SL.
For all
~
[a] makes sense and
= st(TU[a]) .
We argue by induction on the complexity of T(x).
Proof:
result holds for atomic tenus. U(x)
can be interpreted.
Let ~ be a hyperfinite model for
THEOREM.
~
SL
= ST(x,y)dy.
Assume the result for T(X, y).
Let a € An. fey) = T
st a
The
Consider
The function
~
~
~
[a,y] = steT [a,y])
is the standard part of an internal function and hence is measurable.
For
internal g(y) ,
J( st g (y» dOf.1(y)
st( ~b € A g(b) f.1(b» .
Therefore
U
st U
~ [a] =
f T st U[a,y]d ~
0
f.1(y) =
\' ~~ = st(Lib€AT [a,b]f.1(b» =
Now consider W(X)
= (max
v) T(
has a maximum value r , the set value st r.
x, y).
JsteT U[a,y])d f.1(y) ~
~~
st(U [a]) . Since the set {T U [a, b ] : b € A} ~
~
{st T [a, b] : b € A} has maximum
Therefore st r
~
~
st(IN [a]) .
The term (min Y)T(X, v) is similar. Assume the result for Tl(X), ... , Tm(X) and let
Then since C is continuous,
0
HYPERFINITE MODEL THEORY
C(stT
l
~
, ... , stT
~
m
93
)
-t 6.8.
COROLLARY.
Let ~ and 'j.\ be hyperfinite models for
Then ~ "" 'j.\ if and only if for every constant term T of
SL.
SL,
Two hyperfinite models for SL have the same standard part, st m " st'.jl, if they have the same universe A" B, the internal measures f.L
and
II
generate the same Loeb measures
they have the same
0f.L" 01',
constants a " b ' and the functions are infinitely close, i i
By 6. 8, if st m " st '.jl then
~
R!
'.jl.
Here is an easy characterization of models which are nearly twovalued. 6.9.
PROPOSITION.
Let m be a hyperfinite model for
following are equivalent. (i)
st ~ is a two-valued model.
(ii)
Fj(a)
(iii)
There is a two-valued hyperfinite model 'j.\ R!
(iv)
For all j
and
R!
0 or Fj(a) "" I for all j
€
J,
€
J, iii € AlT(j) • ~
•
SL.
The
H.KEISLER
94
We digress briefly to discuss the effect of removing the integral quantifier from 5L while retaining the (max xj and (min x) quanttffers. Let M be the set of all terms T(X) of 5L in which the integral quantifier 5 .. dx does not occur.
Each T(X)
€
M has a natural interpretation
T m[a] in a real-valued probability model m which is independent of the measures
jJ
measures.
•
n
We are thus led to consider real-valued models without
The model theory "for such models with the many-valued logic
M has been studied in [7] (with a compact Hausdorff space in place of the space of values [-1, 1]).
If m is a hyperfinite model and
x-saturated, the real-valued model st(m) turns out to be the sense of [7].
*[V(R)]
IC_ saturated
is in
The Isomorphism Theorem Without Integral Quantifiers
3.12', which we stated in this papeljfollows from the uniqueness theorem for saturated models in [7] . We now return to our study of the logic L(r)
WI
the proof that every probability model is L(r)
WI
p'
We work towards
p-equivalent to a hyper-
finite model. It is easy to check that each term T(x) liTII
a
€
which is the supremum of An.
Tm[a]
€
5L-
has a finite norm
~ and
for all probability models
Moreover, the Stone-Weierstrass Theorem holds as before.
The
Quantifier Elimination Theorem and Bernstein's Inequality can be proved in the following formulations using the arguments from Section 5. 6.10.
QUANTIFIER ELIMINATION THEOREM.
(H)[a, b
T']>(H)[a]
Thus when
Then
~
m]
B(H)n, T']>(H)[a] coincides with our previous definition.
We prove several facts by induction on the complexity of terms T(X) in
SL-.
In each case the steps for atomic terms and for T
=:
C(T
l,
" ., T
k)
are routine. Claim 1.
For every T(x) and G < w there exists K (T, G) < w l
such that whenever Kl(T, G)
S
H < w, a € An , and b,
C€
AW differ
on at most one place,
Assume Claim lfor U( X, y) and let T(x)
=:
SU(X, y)dy.
If b.
differ on at most one place and H == Kl(U, G) , and H == 1, then
c
HYPERFINITE MODEL THEORY
97
Therefore Claim 1 holds for T with KI(T, G) = max(K1(U, ZG), 4GIIUII) . Claim Z.
For every
rex)
and G < w there exists Kz(T, G) < w
such that whenever Kz(T, G) -s H < wand -a
€
An ,
The proof is similar to the argument for the Elementary Subsequence Theorem 4.6 and uses
6.IZ. Assume the claim for U(X, v) , let
T(X) = SU(x,y)dY and let KZ(U, G) -s H.
a
€
An, G < eo , Kl(U,G) -s H < co , and
Let Y be the random variable m
on the space (AW
,
vw ) . and let
x m (b) =
Y
m
(b) -
E(Y
)
m
Then
and the X are independent. By Claim I, for each b m conditional probability inequality
€
A we have the
H.KEISLER
98 Therefore, putting r" ZiG,
We now obtain a bound for the probability that using Corollary 6.12. G :s (H Ilog H)1/
4
Assume H is large enough so that
- Tm[a]
I ~ 3/G
3(211T 11)2 ,
. We have
P(I
:s P(
I T~(H)[a]
L:
H-luNH)[a, b - Tm[a] I that is, to find mechanical or, equivalently, formal rules: rr ~ trr' In short, the original question of justifying proofs (that is, principles of proof) is replaced by questions of handling proofs (mechanically). (a) Two examples of mechanical unwinding. They need only proof theory of first order predicate logic. They span a period of nearly 30 years. They answer genuine questions, (actually) asked by excel lent mathematicians about their own work.
(i) A theorem of Littlewood (proved in 1914, refuting a guess of Riemann): rr(x) - £i (x) changes sign infinitely often where n(x) is the number of primes < x and £i (x) = fX dx/log x . e
Of course, this means that there is some (recursive) method of determining a, in fact the least natural number n such that n In) > £i (n); since £i (n) is not l ntecr a l , we can compute n In) - £i (n) to sufficient accuracy to determine whether n(n) ;;; ~i (n), and then take the least N:rr(N) > 1i (N). Littlewood's proof consists of two
G.KREISEL
114
parts, one assuming the (still open) Riemann hypothesis, the other its negation. As Littlewood (1953), p. 113 puts it, 'it appeared later that this proof is a pure existence theorem and does not lead to any explicit numerical value [of N].' Unquestionably, the proof is indirect, since it appl ies the law of the excluded middle to an undecided proposition. Nevertheless, an essentially routine appl ication of proof theory (E-theorems or cut el imination) appl ied to Littlewood's original proof, extracts a bound for N; c f , Kreisel (195Z). Of course, this use cannot be expected to give optimal bounds for which further ideas are needed, and it doesn't: it gives a bound of about 10
10
10 34
compared to (1.65)10
1165
found in Lehman (1966). Actually, as Littlewood real ized (p , 113, 1. 13 - 14), the size of N is not the only issue here: by (iv) on p. lIS, a 'further idea' was alleged to be needed to extract a bound from Littlewood's proof, for example, by the switch from TI to W. This impression is refuted, beyond a shadow of doubt, by the proof theoretic analysis which makes a ~outine appl ication (to Littlewood's original proof for TI) of a gene~aZ (logical) method. Historical Remark. To be precise, the ideas involved in the general proof theoretic procedure were applied, not the procedure itself. For one thing, such a procedure operates on a formalized proof, and, of course, Littlewood's original proof was not. But also, at the time the relevant procedures had not been worked out for formal systems very close to the language of ordinary analysis. It turned out that what were obviously the only critical steps in Littlewood's proof could be transcribed into the formal ism of first order predicate logic, to which thencurrent proof theory appl ies; cf. Remark 5.Z p. 171 of Kreisel (1958). A significantly more systematic use of proof theory here would have to make the passage from informal proofs to their formal izations a principal object of study: this was premature before the advent of high-speed computers (at least, if the general impression is right that formalizations of such proofs as Littlewood's are too complex for humans); cf. §4(b). (i i) Unwinding a p~oof of Roth. in a conversation about Roth's theorem: \fn\fa3q (\fq;;'q )\fp(ia _ p/q[> q o
0
A. Baker brought up the following point - (Z+.!..)
n),
where a ranges over the irrational algebraic numbers (and the other variables over the natural numbers). Baker felt morally certain that the proof in Roth (1955) could be 'unwound' to yield some bound for the numbe~ of exceptionally close approximations to a, that is, the number of the set E (of rationals r): 1 n,a -(Z+-)
{r:[a- rl,,;;; q
n}
where r ~ p/q (in its lowest terms),
the bound depending on n and the height of a. But he also felt that this bound would be insufficient for the use he made in Baker (1964) of the bound by Davenport and Roth (1955), which requires a 'further idea.' As it happened the matter of unwinding had been considered in the 1iterature, in Kreisel (1970), pp. 135-136, as an application of Herbrand's analysis of logical theorems of the form 3x\fy R (x,y), R(t l, Yl) v R(t Z' yZ) v ... R(t k, Yk) , where the terms ti do not contain any variable Yj with j ;;. i. This appl ies in an obvious way to Roth's theorem for fixed n and a; with q in place of x and the pair (q,p) in place of y. Inspection then shows that k ?ields a bound on the
THEORY OF PROOFS
115
the number of En u' This supports the first part of Baker's impressions. Modulo the uncertainty about the passage from Roth's informal proof to its formal ization, discussed at the end of §la(i), I have convinced myself that the kn a suppl ied by Herbrand's analysis are indeed too large for the application in Baker (1964). Incidentally, this example provides one of the few useful appl ications of Herbrand's own formulation, specifically, in contrast to the no-counter example -- interpretation which introduces fuctionals; in the case of 3x Vy R(x,y) above, a new function symbol f is introduced (as a 'counter example' \fx-,R[x,f(x)]) and one gets Terms T containing f, such that i,
It will not have escaped the reader's notice that the interests of both Littlewood and Baker were so to speak contemplative, not activist. They had bounds -- and (as mentioned) there are better ones than those suppl ied by the original proofs considered. Contrary to activist propaganda, those interests are no less permanent than the search for better results, which, after all, will be superseded, too. (b) Hilbert's 17 th problem: a case of a knotty unwinding. Artin's solution of Hilbert's problem was stated for archimedean formally real, that is, orderable fields K in which every positive element is a sum of";;; k squares. If P is a polynomial with coefficients in K and p p
=
~(Pi/qi)2,
1";;;
~
0, then
i ,.;;; N, where Pi and qi are also polynomials with
coefficients in K -- How large is
N?
Because of the algebraic character of Artin's proof, it was clear that some such bound for N would depend only on k, the number n of variables and the degree of p; and not on, say, arithmetic properties of K (which might, however, permit 'better' bounds; cf. the end of §2). Historical Remark. In seminars Artin had raised the problem of finding bounds, ever since his original proof in the twenties. He mentioned the problem to me some 20 years ago. As it happened, around the same time, the matter was also considered by A. Robinson, but in model theoretic terms. His proof appeared in Robinson (1955), not long after the proof theoretic solution had been found. Model theory suppl ied a recursive method of determining a bound (by trial and error appl ied to derivations in predicate logic), proof theory a bound involving n-fold exponentiation. As in §l (a), or, for that matter, also in comparable uses of model theory in the fifties, Artin's proof was first transcribed into (first order) predicate logic, but with this difference: two not altogether trivial modifications had to be made, (i) Restatement of Artin's theorem. In contrast to the cases in §! (a), Ar t l n ' s formulation has to be modified since the condition that K be archimedean is not of first order. Inspection of Artin's argument showed that it was only used to ensure that p ~ 0 in all real closed extensions of K. So it could be replaced by this weaker condition. Incidentally, as observed in Henkin (1960), given this replacement a more elegant formulation is obtained for ordered K without the assumption that positive elements are sums of";;; k squares: U~der the new hypothesis, or, equivalently, if p ~ 0 in the (unique) real closure K of K, then c.sK I
and
c. > O. I
As.usual, the proof was uniform for given nand d. So there are polynomials Pi J, qi J in bo~h t~e variables and coefficients of the general polynomial p, and polynomials coJ ,c i J in the coefficients only (J ,.;;; j ,.;;; j) such that, for each j,
G.KREISEL
116 p
•
~
•
.
• 2
I(c.J/c J) (p.J/q.J) . and if P > 0 in I 0 I I J
K,
for some j, each (c.j/c j) > 0 I a (ii) Restatement of Artin's proof. Very much in contrast to §l (a), it can hardly be claimed that the passage from Artin's own proof to a proof in the language of predicate calculus is altogether obvious, even after a reformulation of his theorem is given,as in (i) above. Artin uses an infinite tower of field extensions; determining a bound on the relevant finite portion of it, is tantamount to finding N: and one needs that finite portion before one can even begin to formalize a version of his proof in predicate logic. -- Actually two quite different (first order) proofs of (i) are sketched in Kreisel (1960). One keeps quite close to Artin's proof, the other combines the algebraic identities used in his proof with the completeness of the axioms for real closed fields and the so-called uniformity theorem of logic, a corollary of Herbrand's theorem. The last five years have seen a considerable extension of proof theory; in particular, cut-el imination was extended to (impredicative) type theories in such a way that all cut-free proofs of first order theorems are themselves of first order. In other words, to an arbitrary (higher order) derivation of a first order theorem is associated a particular first order derivation: its cut-free (normal) form. Cut-elimination is then the procedure which mechanizes the mathematician's unwinding of proofs (or is, at least, a candidate for such a procedure). But the matter is not yet settled. Sure, if one insists on replacing Artin's ordinary (algebraic) language by the 'logical' language of type theory mentioned in the last paragraph, the passage from his own exposition to a formal ization of his proof in type theory is pretty unambiguous; certainly as much as for the parallel passages in §l(a). However, this insistence may involve a strategic mistake, in accordance with the ordinary mathematician's instinctive resistance to the language of type theory and to other 'foundational' languages. This issue is a principal topic of §4(b) below.
Remark. At least at present the issue is also contemplative, and not activist (in the sense of the remark at the end of §2(a) above). In particular, numerically better bounds for N itself have been obtained in Pfister (1967) independently of d when K itself is real closed. (Both the proof theoretic and model theoretic methods yield bounds simultaneously for N and the degrees of everything in sight.) More interestingly, for suitable topologies on (suitable) K, one would expect topological versions of Artin's solution, where the p.,q.,c.,c depend continuously on the coefficients of p or where c and q~ Aev~r ~anish 0 I at al I when p ~o in K. Such results are certainly not to be obtained by unwinding Artin's proof. ELABORATIONS AND DIMINISHING RETURNS 3. The main proof theoretic results on first order' predicate calculus used in §2 are. cut el imination and functional interpretations. They have been extended to stronger systemS (including infinitary ones), usually by use of some kind of transfinite iteration or hierarchy, for philosophical reasons discussed in (a) below. When the process which is iterated is 'elementary', the dOminant factor is ti) the 'length' or ordinal of the iteration; tacitly, together with a 'control' at limit ordinals by means of so-cal led fundamental sequences. When already each step of the process involves some kind of 'collection', one has (i i) functionals of higher type; tacitly, together with some defining schema for the latter. -- NB. The distinction between (L) and (t i ) is quite well illustrated by the familiar (i) constructible and (i i) natural hierarchies of sets generated by iterating the socalled predicative, resp. the full power set operation; 'quite well', not perfectly since in the case of proof theory there are additional relations. Specifically, the computability of terms (for functionals) in (ii), is proved by transfinite induction on (orderings with) the ordinals in (i); conversely, 'definitions' by transfinite recursion in (i) are, demonstrably, satisfied by suit-
THEORY OF PROOFS
117
able terms in (ii). -- NB. Many of the elaborations contain material of considerable mathematical charm, as illustrated very well by the lectures of Girard and Ershov at this conference, which state results in a form independent of the original proof theoretic aims. Make no mistake about it: though perhaps exceptionally polished, Girard's material is quite typical of the kind of ordinal structures developed in proof theory (categories of not necessarily well-founded orderings with suitable maps and functors between them), and Ershov's is typical of -- what are nowadays cal led -- cartesian 'closed categories used for functional interpretations when certain informal continuity properties are needed (made precise by Ershov's topology or by appropriate limit space structures); cf. footnote 1 of Kleene (1959) or the 'principal result' (2.4 and 5.1) of Kreisel (1959). Two obvious questions are prompted by what has just been said: (i)
Have we simply not elaborated enough? not put enough resources into the traditional aims? or,
(ii) Have the elaborations only heuristic value? by having led (some of us) to ideas which are effective only when divorced from the original, misconceived aims? Before giving -- what seems to me convincing evidence for (i i) in the case of current proof theory, readers may wish to have some
Examples concerning (i) and (ii) from successful parts of logic.
Ad (i): Mcintyre's lecture at this conference illustrates very well (my) misjudgments on the use of elaborations in model theory, specifically, in connection with Morley (1965), usually considered to be the first piece of 'hard' model theory. Some ten years ago, I had high hopes of glamorous appl ications; cf. p. 152 of the original (French) version of Kreisel and Krivine (1971). These were not fulfil led in the next five years, and so the reference to Morley (1965) was dropped from (p. 186of) the latest (German) version. Mcintyre's results more than fulfill the original hopes, when the new 'hard' model theory is combined with results for suitable mathematical structures (groups, not, e.g., rings, as Mcintyre mentioned), and emphasis is shifted away from simple-minded 'logical' cardinal ity properties to stability and the like. I -- Ad (ii): Gadel 's work on the constructible hierarchy, L, mentioned earl ier, provides an 'excellent object lesson here. L is an (elegant) extension of the ramified hierarchy introduced by Poincare and Russell for their programme of predicative foundations. But as stressed in the clearest possible terms on pp. 146-147 of Gadel (1944), his work drops a requirement which is absolutely essential to that progr9mme, since the relevant properties of the iteration process are not proved predicatively (but in axiomatic set theory itself, which is enough for Gadel's relative consistency results). A somewhat similar twist w.r.t. fundamental sequences for 'proof theoretic' ordinals was used successfully in Jensen (1972). Gadel 's somewhat nostalgic description of his twist, loco cit. as having primarily mathematical, not its original, .philosophical interest seems (to me) to overlook the principa) philosophical issue: Is the predicative programme, admittedly a philosophical affair, also of philosophical interest? And one test is to compare the results obtained by modifying it (as Gadel did) with those obtained by respecting it, as in certain autonomous transfinite progressions; cf. the survey in Feferman (1968). Of course, outside logic there are much more impressive examples of (i i). for example. from the famil iar theory of ideal fluids, The need for such combinations is also fami1 iar from applications of '50ft' model theory (p-adic fields) and of recursion theory (finitely generated groups), and of course from §2 above. -- Warning. The 'contemplative' results of §2 are to be compared to appl ications of model theory in the fifties which consisted in easy (general) proofs of easy (general) theorems, not to the more substantial applications in the sixties. In the fifties, model theorists familiar with the material of §2 found it to be as promising as then-current model theory (an impression which has not been justified so far); cf. A. Robinson's review of Kreisel (1958).
118
G.KREISEL
mentioned in the Introduction. This theory -- or, as one says, this' ideal ization' -- is simply not adequate for its original aim, for hydrodynamics. But the ideas that came from the development of the theory, especially in the two~dimen sional case, have permanent value provided they are suitably separated from the original aim (functions of a complex variable or harmonic functions, used to describe the potential and the flow of -- hydrodynamically pretty useless -- ideal fluids). After these illustrations concerning the alternative between (I) and (iii auuve we return to our principal concern, proofs. Here, as promised, is evidence for (i i), both w.r.t. (a) the particular hierarchies mentioned at the beginning of §4, and (b) traditional proof theoretic aims in general. (a) Two strategic assumptions in the construction of hierarchies. The first concerns justifications (of principles P of proof), and is restrictive: P should be justified 'from below', via a hierarchy, by some kind of reduction to more elementary principles than P. The second is permissive: with any (reduction) step, an arbitrary finite iteration is taken to be 'given' too; in short, w-iterations are not counted, nor their w-iteration, and so forth; for a fairly, but by no means absurdly broad sense of 'and so forth', cf. Girard!s lecture. -- NB. Trivially, idealizations are involved here. This Is not the issue at all. What is questionable is the impl icit assumption that they are even approximately adequate, for example, for studying rel iabil ity of proofs. In fact, there is a radical alternative: Don't we do better by reversing the strategy altogether? specifically, by not building up P from below at al I, but by reducing length, the (finite) number of iterations of anyone step. For example, suppose that, for some given P, the passage, in §2, from derivations d to explicit real izations td becomes compl icated. What use is then the possibility of such a passage? One would actually look for P+ which permit -- real istically -- simpler proofs than P, at the -- real istically negl igib!e -- cost of losing that possibi lity altogether. A less extreme example is famil iar enough from elementary arithmetic, where it is certainly futile to reduce numerical terms to numerals (0, sO, ssO, •.. ); instead one looks for new, more efficient notations; for an instructive appl ication in 'advanced' arithmetic, cf. Feferman (1971) and its review where the aspect relevant here is pointed out. -- For reference in §4: Statman (1974) reverses the -- usual -- aim of eliminating cuts in order to reduce not length, but genus. More generally, one might try to mechanize a good deal of the related, fami! iar routine of introducing suitable lemmas for 'cleaning up proofs': after al I, we learn to do this sort of thing almost automatically. Viewed in the I ight of the considerations above, the successes of §2 may well constitute a kind of I imit to useful apptications of the guiding ideas of current proof theory; a kind of optimal value for the ratio: additional information/additional effort in a traditional proof theor.etic analysis. To be a I ittle more specific, we conclude PaAt I of this article with some general ities about proof theoretic aims. (b) Mathematical reasoning and mathematical objects: a discovery. Trivially, the moment we make it our business to be self-conscious about our knowledge (of anything!), the so-called subjective elements of this knowledge become most prominent: they are thought of as particularly close to the thinking subject. In the case of mathematical knowledge, definitions and proofs -- as opposed to the objects defined or to the theorem proved -- are among those elements. As a matter of historical fact, whenever some branch of mathematics began to be analyzed, the first distinctions that came to mind concerned methods: projective and metric methods in geometry, algebraic and differential ones in analysis, and the like. It was a discovery that the particular differences mentioned were more profitably interpreted
THEORY OF PROOFS
119
by reference to the different, possibly novel notions or structures for which results proved by different methods are val ido 2 In other words, we have discovered an unexpected adequacy of 'objectivist' analysis. Historical Remark. Traditionally, and more dramatically, one speaks here of confl icting views of the nature of (mathematical) real ity, of a grand conflict between: objective and subjective. This becomes much less dramatic when specialized to famil iar examples: after all, there are projective and metric planes on the one hand, and there are projective and metric methods on the other. Far from being presented with a confl ict, with a choice between different views on what there is, we have a very close relation between methods and objects, so close that the objects concerned can be characterized in terms of the methods; cf. the distinction between those physical objects which are, and those which are not visible to the (ideal i zed ) naked eye. The distinction is objective enough (and not particularly hard to make precise). But -- on present evidence -- it is weak simply because visibi 1ity is not a significant factor in most physical phenomena at all (for which we have viable theories). In short, there is a very real issue here, but much more subtle than the hackneyed business of real ity. Do the reservations (a) and (b) above finish the subject? (of proof theory) Surely not, provided we look for phenomena of mathematical reasoning in which proofs are -- 1 ikely to be -- principal factors; in short, if proofs are to be principal objects of study; if we do not insist on standing on our heads, and think of proofs principally as a means, for example to analyze the 'meaning' of theorems. 0
PART II.
A FRESH START
INTRODUCTION To continue the view of scientific progress presented in the Introduction to P~ T, readers should recall that, at least in existing sciences, the mathematical elaboration of early conceptions and the restriction to imagined experiments soon reached a point of diminoishing returns. Generally -- this seems to be a fact of scientific 1 ife -- progress in the successful sciences really picked up only when striking laws were discovered which had been in doubt or had not even been suspected. In other words, they were found by genuine, not merely imagined experiments. For the present purpose it is not necessary to distinguish between such experiments and what are called observations: the latter concern phenomena that turn up in the course of nature, while experiments involve observations of phenomena in specifically designed situations. Of course, as in all matters of knowledge, observers are not passive: in experiments the external circumstances are manipulated (so to speak' interfered with' before the observation), in observations a selection is made among the raw data. This selection involves, in effect if not by intention, .the notions to which early speculations, discussed in the Introduction to p~ T, had drawn attention; in particular, in the branches of physics considered there, one looks for phenomena exhibiting 'perfect' shapes, 2 For readers interested in constructive mathematics, the corresponding reinterpretation concerns the particular (new) species of operations for which the theorems hold -- as opposed to the methods of proof used to establ ish that the identities hold which the operations are claimed to satisfy. -- Occasionally, there are candidates for the reverse procedure, for example, when Brouwer came up with a characterization of functions F on choice sequences f in terms of his 'fully analyzed' proofs of Vf3n R(f,n), and F satisfies the identity VfR[f,F(f)]. As at the end of (a) above, one expects only a narrow class of cases where the possibility of 'extracting' F is useful: in general, F must be so simple that it is worth writing out a definition in full.
120
G.KREISEL
resp. properties of 'ideal' fluids. Incidentally, even if this normative terminology, of perfection and ideals, is inappropriate if taken to mean what 'ought' to be the case, it occasionaly makes good sense as a hint on what ought to be studied. In short, early speculations come up with such hints.
Natural history wil I be used below for those parts of science which formulate laws close to our ordinary conception of the phenomena involved. This is somewhat broader, but more appropriate than the usual meaning of 'natural history' which excludes all but the most rudimentary mathematical developments of the (ordinary) conception.
As is well known, some early speculations introduced also extraordinary views. The best known such view goes back to the Greeks: the atomic structure of matter. (It is so far removed from our ordinary views that we do not see the world as atomic even after we have absorbed atomic theory). However -- and this is perhaps the principal point of p~ II -- the mere introduction of this extraordinary view was sterile. What was needed was massive progress in natural history,specifical ly the discovery of laws relating data belonging to sciences which are quite different for our ordinary conceptions, for example, laws in spectroscopy relating data from optics and chemistry. (Massive progress was needed, and not merely isolated relations of this kind; for example -- the substance with the composition of -- glass has long been known to be transparent). Naturally, a systematic science, covering phenomena from different branches of natural history has to use extraordinary conceptions(or combinations of ordinary conceptions, which are so compl icated that the difference in degree beCOmes enough of a difference in kind; enough to be consistent with the assumption in force, that we have to do with different branches of natural history). In general, laws relating different branches do not determine a particular extraordinary conception. But they pinpoint an area where such a conception may be tested, The simple observations above will guide the exposition and the interpretations of p~ II. PROOFS AS PRINCIPAL OBJECTS OF STUDY:
NATURAL HISTORY
4.
Explicit definitions have turned out to be fruitful here. They provide the sort of striking phenomena (in the area of mathematical proof) which are needed for natural history in the sense of the Introduction above, And, as required at the end of §3, these phenomena cannot be analyzed in 'objectivist' terms, that is, by anything 1 ike current reinterpretations of the theorems proved. -- NB. This article confines itself to val id proofs though, presumably, fallacious arguments are also important fora study of reasoning (as illusions are for perception). Incidentally, fallacies are often a~equately analyzed in objectivist terms, for example, in those paradoxes which come about when some of the axioms and inferences are val id for one notion, say, of 'set' or 'predicate', others for another notion; cf. the discovery discussed in §3(b). Uses of explicit definitions are conveniently grouped into those where (a) notions are introduced by means of such definitions, and (b) already understood notions with many famil iar properties are analyzed by being shown to be equivalent to others that are expl icitly defined (in some given language). Roughly speaking, in (a) it will often be obvious, in (b) some 'foundational' work is needed to show that the expl icit definitions can be el iminated (at all): but the quantitative differences between (a) and (b) will turn out to be quite del icate.
Examples. (a) An important strategy in applying modern abstract mathematics, say group theory to number theory, is this: (i) An operation 0 is defined number theoretically, say on lZ.xlZ. or modulo some prime. (ii) 0 is shown to satisfy the group laws. (iii) Special izing general facts about groups to 0 leads to number theoretic results. As a matter of empirical fact this strategy has often
THEORY OF PROOFS
121
been strikingly successful; probably the most spectacular use is in Weil (1967) which brought to I ight a computational oversight in Siegel (1952). At the same time, here and elsewhere in current practice, the strategy is logically tpivial, that is, el iminable in the following precise sense: Proofs of (ii) have turned out to be formal izable in elementary number theory (though, by incompleteness, this has to be checked), and the general facts used in (i ii) are naturally proved in conservative extensions of that theory (by completeness of predicate logic this is automatic if the facts involved are of first order). Thus the uses of this strategy in current practice are candidates for the~rogramme at the end of §3. -- (b) Much of traditional mathematics, for example, Eucl id's geometry was dominated by the search for correct definitions of famil iar notions (circle, tangent) in Eucl idean terms, for which fami 1iar (geometric) properties of those notions can be formally derived from Euclid's axioms. Clearly, these properties correspond to the facts about groups in (i i i) above, though -- in contrast to the case of groups -- geometric properties were used long before there was an 'official' list of axioms for those familiar notions. Bibl iographical remarks. The examples (a) and (b) above playa basic role in the particular brand of natural history (of mathematics) described, somewhat grandly, on p. 42 of Bourbaki (1948) in terms of 'intuitive resonances' (to mathematical structures). The discovery of (expl icit) definitions which are easy to grasp and to handle provides of course critical evidence for such 'resonances' (and for the significance of the notions defined): evidently, this appl ies to groups in (a) above. Presumably in order to stress the (obvious) similarities between the two kinds of uses of explicit definitions in (a) and (b), Bourbaki play down the differences. Finally, with the relentless logic of epigones, some followers want to suppress altogether the uses quoted in (b) above, for example, in the school curriculum. -- Less grand, but no less emphatic are the references to the importance of expl icit definitions in Wittgenstein (1976), for example. p. 33 or p. 111. The idea, in (b) of pecognizing the equivalence between an understood notion and an explicit definition,that is, of the correctness of the definition is as much anathema to Wittgenstein as to Bourbaki. It is perhaps worth noting that -- apart of course from the obvious literary differences -- there are several other significant similarities in the general views of Bourbaki (1948) and Wittgenstein (1976), above al I, concerning the superficial character of the properties of proofs stressed in traditional foundations. Wittgenstein illustrates his points by means of examples from the most elementary kind of numerical arithmetic, Bourbaki from 'advanced' mathematics. Wittgenstein's skepticism extends to set theoretic foundations, while Bourbaki (seem to) pay I ip service to the language of set theory on p.37. However, on p. 40, footnote (2), they stress the importance of their basic structures without eve~ mentioning the definability of those structures in set theory. These matters wi 11 be elaborated in a review of Wittgenstein (1976) for the Bull. A.M.S., including the errors of both Bourbaki and Wittgenstein due to their ignorance of logic (of the sixties). Granted then the -- obviously -- striking ro~e of expl icit definitions, one looks for specific factors involxed in the effective use of such definitions; tacitly, always together with the pertinent axioms; cf. (ii) in Example (a) above. The reader should recall here from the Introduction (to p~ II) that superficial or 'phenomenological', not 'hidden' factors are treated in (any) natural history; further, that their relevance is not expected to be establ ished purely mathematically from fami 1iar experience: the latter only helps one choose a sensible candidate for a (relevant) factor. -- NB. As in other scientific practice, the facts will be stated in terms of any notions we have learnt to use, without insisting on a premature analysis (of 'how' we know, for example, that a particular spectral I ine is blue); an extreme case is the use of 'metal I ic look' in early mineralogy. (a) Genus of proof figures and the particular use of expl icit definitions illustrated in Example (a) above. Fi rst of all, it is to be remarked that none of the famil iar measures of length is a particularly decisive factor here. The number
122
G.KREISEL
of steps (nodes) is reduced only (roughly) linearly by introducing new notions by explicit definitions. The number of symbols (in formulae) can be reduced exponentially, but striking uses of the strategy in Example (al occur also without such shortening. In any case many arguments are strikingly affected by mere rearrangement, if thereby, say intricate cross references are avoided. This observation is, qual itatively, consistent with using genus as a factor, introduced in Chapter 1 of Statman (1974) where details are to be found. (Roughly speaking, the genus of a proof figure is the least genus of any surface in which the figure can be embedded without crossing). Here are a few rather general features of genus which are not discussed in Statman (1974), but illustrate the kind of considerations needed for the natural history of proofs (and which tend to be disturbing to someone brought up on traditional mathematical logic). (i) Genus is sensitive to the style of formal ization. Specifically, all derivations in a calculus of sequents have genus 0, in contrast to natural deductions (provided the nodes are joined where an assumption is introduced and used).3 Far from being a defect, this sensitivity allows one to choose between the styles; cf. the shift to hel iocentric coordinates in astronomy at the time of Kepler (which was essential for natural history, despite all the business about the relativity of space). (ii) Genus is hard to calculate from the data: one needs a computer even for quite elementary derivations (and a good deal of effort to convert ordinary proofs into data suitable for high-speed calculation). An anecdote concerning such calculations: At Stanford a body of (natural) deductions had been stored in a computer, left over from the homework by students in a computer-assisted course on axiomatic set theory; information on the course can be found in Suppes (1975). Tarjan's algorithm was efficient enough for dividing this ready-made material lnto planar and non-planar deductions. The job would be formidable if instead written homework for an ordinary course had to be prepared for the computer. Far from being a drawback, (i i) seems (to me) promising, granted that genus is a significant factor at all. For one thing, since computers are new, only a short while ago it would have been simply premature to study our subject empirically at all. Less trivially, (ii) reduces a, if not the principal complication in studying human behaviour, whether it be psychological or physiological (which is in practice rarely an important difference). Specifically, knowledge of a theory is 1iable to influence the subject of the observation; in contrast to other sciences where of course the influence of such knowledge on the observer, not on the object of the observation, can be critical, and where one needs safeguards against conscious or unconscious faking (by the observer). If a theory involves notions 1ike genus, which are hard to calculate, there is a better chance that the subjects will not even try -- and so simply will not know the theoretical prediction (even if they know the theoretical Jaws). At the very least, the need for statistical tests is reduced in the case of such intractable theories. Finally -- and quite superficially -- the phenomena (of reasoning) considered here strike us as obscure: so if a theory uses superficial (phenomenological) notions 1ike genus such obscurity would go well with a difficulty in actually applying the theory (to the data); cf. also the last paragraph of the Introduction to Panx II. -- Incidentally, the actual computations of genus for the corpus of proofs by Stanford students mentioned earlier, were not particularly conclusive; at least partly because the whole material was just a bit too elementary to present any really striking differences to start with. What has been said so far is quite sufficient to show that there are plenty of data for a natural history of proofs; in fact, whenever we find striking phenomena 3 This distinction, in Statman (1974), pinpoints for the first time a measure for the difference between the two styles, though it had long been clear that there was some significance to that difference (as suggested in a general way by Gentzen's terminology 'natural deduction').
THEORY OF PROOFS
123
which do not invite analysis in objectivist terms (in the sense of §3). The role of explicit definitions discussed above is of course only one of many such phenomena. As observed in the last paragraph, the presentation of those phenomena for theoretical treatment, that is, the choice of data, is quite critical, especially if notions 1ike genus are involved which are so hard to compute that only automatic data-processing is real istic. 4 For a conventiona1 exposition of elementary logic, one of the best known 'reductions' is the expl icit definition of an arbitrary propositional operation (tacitly, together with proofs of its axioms) in terms of (the rules for) say ~ and v. What is lost by such reductions? Statman (1974) shows (actually for the Impl icational calculus, not for --, and v) that the genus can be increased unboundedly. In a lecture at Clermont-Ferrand (in 1975) he considered propositional quantifiers, and showed how length is increased too; those quantifiers allow expl icit definitions since \fpP is 'short for' p[p/T] " p[pll].
Example.
Correction. Though Statman (1974) refers to Kreisel (1973), tacitly, p, 267, his (successful) treatment followed totally different 1ines from what I intended at the time, that is, at the congress at Bucharest in 1971. My idea was that the choice of axioms for explicit definitions would be made with essential help of some kind of traditional philosophical analysis of the meaning of the defined notions, and the combinatorial side would look after itself as it were. Indeed, as J stressed expl ici tly on p. 258 and in the PS, I was looking for sustained philosophical analysis, in contrast to its use in current foundations where it is completely overshadowed by the mathematics (needed to develop the one-l iners expressing those analyses). Statman concentrates on combinatorial, not philosophical analysis. -- Incidentally, five years ago I did not think (consciously) of Bourbaki nor of Wittgenstein in connection with expl icit definitions, though J had read Bourbaki (1948), and I knew Wittgenstein in the forties. Actually, even today I do not remember his talking about the topic in my presence. Evidently he must have done this in view of the quotations from Wittgenstein (1976) given earl ier, and equally evidently, I was not ripe to profit from those remarks. (b) Foundational languages: artifacts. Trivially, the effectiveness of an expJ icit definition of a given notion will depend on the choice of language (to be used in the definiens). What seems to be less weI I-known is that the famil iar foundational languages tend to introduce artifacts, in particular, when notions from geometry including descriptive set theory are defined in the usual way in the language of higher order arithmetic, The examples below come from my own misjudgments. (i) There are two standard proofs of the theorem of Cantor-Bendixson; one identifies the perfect kernel of a (closed) set F (of reals) with its set of condensation points, the other with the I imit of its derived sets. If one insists on transcribing those proofs into second order arithmetic, when F is coded by its set of complementary intervals (with rational end points), then the difference between the proofs is most obviously expressed in model theoretic terms: the former uses IT 1 - comprehension (appl ied to the predicate of being a condensation 2 point), the latter uses only IT] 1 - comprehension; cf. Kreisel (1959A).5 This difference is illusory l nasrnucb as a so-to-speakmoregeometric formalization of the first proof (with additional primitives for real numbers, etc.) can be modelled
4 For example, in contrast to an exposition based on normal ization where the
data include normal ization rules, and so a 'reduction' must preserve normal ization steps, too. 5 The main result of Kreisel (1959A) does not concern the difference between the two proofs mentioned, but the relative complexity of F and of (any second order definition of) the perfect kernel of F.
124
G.KREISEL
by use of TIll - comprehension too. This was discovered by Friedman in recent developments (to third order theories) of his careful formal izations in Friedman (976). Corollary. The (striking) difference between -- careful formal izations of -- the two proofs above does not invite analysis in objectivist terms after all, and is thus a candidate for the programme at the end of §3.
Remark concerning the logician's (so far practically unsubstantiated) impression of a 'need' for higher types in ordinary mathematical practice. Of course, by a footnote 48 in Godel (1931) mentioned already in §l (a), the possibility of a
logical need exists: some arithmetic theorems of the (usual impredicative) theory of type n + 1 just aren't provable in the theory of type n , But the actual: basis for that impression may not be logical at all, but structural; cf. the structural improvements introduced by Friedman's third order language with axioms that are weaker (than those in the original second order formalization). It is certainly of interest for the natural history of proofs to present alternative hypotheses for such impressions (as was done above by distinguishing between logical and structural needs). (i i) Many theorems which are formulated in the language of first order logic (and therefore, by completeness, aan be proved by the usual rules of predicate calculus), are in fact proved by use of geometric and/or analytic methods. An obvious problem for the natural history of proofs is this: Are these mathematical methods needed to make the proofs manageable (by reduction of length, genus or whatever)? In Kreisel (1976) a candidate from the theory of real closed fields was suggested: the theorem establ ished by schut te and van derWaerden (1953) in their solution of Newton's problem of the 13 points (on the surface of the unit sphere). My specific suggestion was to formal ize their solution in usual type tyeory where, in particular, trigonometric functions are expl icitly defined. But closer inspection shows that elementary facts about geometric and topological notions (area of spherical triangles, or Euler's theorem on polyhedra) have quite unmanageable proofs in the language I had in mind. In other words, a representation in that language introduces complexities that are not at all intrinsic to the solution of Newton's problem.
Remark.
It cannot be assumed that to every solution of Newton's problem in a suitable geometric language there corresponds a particular proof in first order predicate calculus (in contrast to solutions in type theory where such a correspondence is provided by normal ization); cf. the transcription of Artin's own work on sums of squares into first order logic discussed in §2(b), as an issue where the 'instinctive resistance' of ordinary mathematicians to foundational languages is relevant. PROOFS AS PRINCIPAL OBJECTS OF STUDY:
SYSTEMATIC SCIENCE
5. To put first things first: How much natural history of the kind suggested in §4 do we need? Can we not learn from the history of successful sciences how to find a systematic scheme without wasteful detours via natural history? Historical Remarks. Much work in natural history was indeed scientifically wasteful; it was superseded by later, fundamental science without being used. One example was already menioned in (the bibl iographical remark of) §4: before X-ray crystallography, the natural history of minerals was preoccupied with shapes and colours or with the distribution of particular minerals in odd corners of the globe. The information obtained was sound enough, but simply does not lend itself to theoretical study. (The same appl ies to much of botany or zoology). A more topical example is provided by the subject of 'natural languages'. It has not inspired much confidence, perhaps because -- as somebody said -- these particular languages do not tell us much about the genuine possibilities (for human language), and even less about the actual world which is supposed to be described by language. -- Cer-
THEORY OF PROOFS
125
tainly, there have been bright ideas in the parts of natural history mentioned (perhaps as imaginative as much more successful ones); for example, d'ArcyThompson's on growth and form, or Chomsky's on transformational grammars. Not despite, but rather because of their familiarity and general ity, these ideas do not even smell 1ike germs for a systematic theory. -- As a memorable contrast: The superficially very special idea in genetics, to concentrate on bacteria and viruses, was not only successful for a systematic science of genetics, but immediately convincing: here one could observe -- in a humanly short time -- 50 many more generations that the difference in degree could be expected to produce a difference in kind (of the data involved). Despite all the excitement about the discovery of formalization, at present there do not seem to be any even mildly promising ideas for a systematic or fundamental science of proofs. Worse still, the two pillars of all current work, (a) and (b) below (which -- at the present stage -- are surely sound for the natural history of proofs), seem -- to me -- quite weak. (a) We look for conscious elements in (mathematical) reasoning, or, at most, the very mild extension to those elements which we are able to make conscious to ourselves. As usual, traditional philosophy discusses pedantic doubts (here: doubts about the reliabil ity of introspection, existence of mental objects and the 1ike), and thereby -- consciously or unconsciously! -- draws attention away from the critical issue,namely this: Do those (sub)conscious elements constitute adequate data for anything 1ike a systematic theory? cf. the situation in mineralogy before X-ray crystallography (in the Historical Remarks above), and more generally the inadequacy, for systematic physical theory, of the domain of phenomena that can be made visible. (b) Impressed by its unquestionably distinctive features, we separate mathematical reasoning from other intellectual activities; in humans, and of course in other species. Again, the issue is not the (official) alleged difficulty of making some such distinction precise, and even less the -- equally often alleged-prejudice against assuming intellectual abilities in subhuman species (after all, before very recent advances in electronic technology, we could not even begin to record the more intimate features of their behaviour). The issue is, once again, whether this particular part of intellectual activity, which as it were strikes the mind's eye particularly vividly, is right for (building a fundamental) theory. To repeat: (a) and (b) do not cast doubt on the natural history of proofs, where we make do with with a comparison between stages in the study of nature of) proofs and of matter. The comparison tion; of course, it is not suggested that proofs
poss i b iii ty of progress with the what we have. -- Let me conclude (what is usually called: the seems quite effective for orientaare material substances. 6
A parallel: What is proof? and What is matter? (i) An important step in the development of the atomic theory was the discovery of chemically pure substances, among the mostly impure substances which dominate natural history. This allowed the distinction between atoms and molecules. Bourbaki (1948) mentioned already in §4(a), sounds very much as if their structures - meres (basic structures) are to be compared to chemically pure substances, in particular, to atomic ones. But as stressed in §4(b), foundational analysis has not yet taken the discovery of those analogues to chemical purity into account: By insisting on one foundational language, such as set theory or type theory, one enforces a 1iteral, but purely
6 The comparison is also useful for expanding the discussion, at the Introduction, of the heuristic role of traditional philosophy. After idea of an atomic structure is one of the most striking contributions lative philosophy -- perhaps matched by the idea of the relativity of time, made prominent by critical philosophy.
end of the all, the of specuspace and
G.KREISEL
126
formal unity on mathematics. (On pp. 36-37 Bourbaki too begin with a fantare on the 'unity' achieved by means of their basic structures -- but eight pages later they stress that there are several such structures, and also that quite a lot of mathematics, for example, what is left out of their treatise, is not 'composed' of those structures anyway). Bourbaki, surely quite properly, do not mention in this context the hackneyed business about an objective or subjective view of mathematical real ity. After all, even granted the objectivity of those basic structures, the fact that they contribute to the 'profound intelligibil ity' of mathematics (p. 37) concerns specifically our intellectual equipment. (ii) Building on the discovery in (i), chemical atomism could be developed, and the molecular diagrammes of elementary texts in chemistry. This required only a quite crude idea of atomic structure, the valency of particular atoms; in fancy language, needed for a pun (on the representation of proofs by graphs, as in §4(a) above}, molecules were represented by a purely graph theoretic structure. This can hardly be said to tell us what matter is (possible, as it were), since it leaves open an enormous number of combinations of atoms which never turn up at all. (i ii) A really significant advance was made by relating valency to the intepnal structure of atoms, in Rutherford's theory refined by use of quantum theory. This not only made 'sense' of valency (as is well known). It did much more when Paul ing derived from it a metric structure, specifically, the lengths of and angles between chemical bonds. This additional structure cut down the number of possible combinations sufficiently to make it an effective scientific tool; probably, the most spectacular uses were made in molecular biology. -- It would be a I ittle too facile to compare the use of an internal structure of atoms to going beyond conscious elements in (a) above; or the quite essential information provided by such unfamil iar elements as radium to the need for broader experience in (b). However, it seems quite clear that in one direction the current graph theoretic representation of proofs has to be enriched (even though, presumably not by a conventional metric) since a proof is rarely convincing if the conclusion depends on something which is too 'distant' in our memory. In short, one expects that some hypothesis about memory structure is needed here (and, as mentioned already, the difficulties are much the same whether one thinks of memory in physiological or psychological terms). But also, after we have grasped a proof of a proposition, we use the proposition more easily: as somebody said, we know more when we have proved a proposition than when we merely know it is true. The functional interpretations mentioned in §§2-3 make expl icit some such additional information (which, by §2, is occasionally useful). What is missing is a convincing test for deciding whether this particular addition even approximates the factors which are actually operative. As so often, the trouble is not at all that we cannot think of any theory; if anything, we can think of too many which are roughly consistent with famil iar experience (of mathematical proofs). HEURISTIC VALUE OF TRADITIONAL AIMS:
A OISCLAIMER
6. The view of scientific progress adopted in this article and described in the introductions to p~ I and II disregards the principal specific claims of traditional philosophy -- and the conscious or unconscious hope that something like this tradition would provide a short-cut to a fundamental systematic science, without detours via natural history. In particular, the view disregards the claim that traditional analysis is needed to coprect eppors in our ordinary conceptions, and the associated hope that corrections would so-to-speak automatically lead to the extraordinary conceptions needed for a basic systematic science. In the case of proofs, our idea of a valid argument is claimed to be in need of correction (or at least of some kind of analysis). And the hope mentioned is impl icit in the socalled logical priority of validity which, being the 'essence' of proof, is assumed to be a basic element of any systematic theory of proofs. (A pun may be involved
THEORY OF PROOFS
127
here too, since unquestionably, discussions of val idity were temporally, that is. 1iterally prior to theories of proof.) Granted all this. the hope rests on the further assumption that an analysis of val idity (tacitly. in terms real istically available at the present time) would be rewarding. Of course. the parallel to validity in the case of physics is the business of real ity. which ,- in its originally intended generality -- has not been of much consequence for the progress of physics. -- The claims just mentioned are disregarded in this article, but not dismissed nor rejected. After all, we have evolved in the world in which we 1ive. So why shouldn't our built-in so-called a priori conceptions be a very good guide in science, both efficient and re l iable, and not primarily a I imitation, an obstacle between us and the Ding an sich? Certainly, a mild form of paranoia is needed to concentrate only on the limitation -- either with pride in our impotence or in horror of it. ACKNOWLEDGMENT My colleague, P. Suppes, has helped me with careful criticisms of an earl ier draft of this article. REFERENCES: Baker, A. (1964).
Acta Math. 111, 97.
Bishop, E. (196]). Foundations of constructive analysis. (McGraw-Hi 11, New York); reviewed by A. Stolzenberg (1970). Bull. Amer. Math. Soc. 76, 301. Bourbaki, N. (1948). Pp. 35-47 in: Les grands courants de la pensee mathematique ed. F. LeLionnais. (Cahiers du Sud, Paris). Davenport. H., and Roth, K. F. (1955). Mathematika 2. 160. Feferman. S. (1968). Pp. 121-135 in: Logic, Methodology and Philosophy of Science III. (North-Holland, Amsterdam). (1971). Actes Congr~s Int. Math. 1970, Vol. I. 229; rev. J. Symb. Logic 40, 625. Friedman, H. (1976). J. Symb. Logic 41, 558. Gentzen, G. (1969). The collected papers of Gerhard Gentzen. (North-Holland, Amsterdam). Godel, K. (1931). Monatshefte Math. Physik 38, 173. (1944). Pp. 123-153 in: The philosophy of Bertrand Russell (Northwestern Univ. Press, Evanston and Chicago). Henkin, L. (1960). Pp. 284-291 in: Surnrna r i es of talks presented Summer Inst. 5ymb. Logic, Cornell ·Univ. 1957. (Inst. Defense Analyses, Princeton). Ingham, A. E. (1932). The distribution of prime numbers. (Cambridge University Press, Cambridge). Jensen, R. B. (1972). Ann. Math. Logic 4,229. Kleene, S. C. (1959). Pp. 81-100 in: Constructivity in Mathematics. (North-Holland, Amsterdam) • Kreisel, G. (1952). J. Symb. Logic 17, 43. (1958). ibid. 23, 155; reviewed by A. Robinson (1966). ibid. 31, 128. (1959). Pp. 101-128 in: Constructivity in Mathematics. (North-Holland, Amsterdam) •
128
G.KREISEL (l959A) . Bu11. Acad. Sc , Po1. ?, 621. (1960). Pp. 313-320 in: Summaries of talks presented Summer Inst. Symb. Logic 1957 (Inst. Defense Analyses, Princeton). (1970). Springer Lecture Notes 125, 128. (1973). Pp. 225-277 in: Logic, Methodology and Philosophy of Science IV. (North-Holland, Amsterdam). (1976). Acta Phil. Fennica 28, 166. (in press). Proc. Fourth Scand. Logic Symp. (North-Hal land, Amsterdam).
Kreisel, G., and Krivine, J.-L. (1971). Elements of Mathematical Logic. (NorthHolland, Amsterdam). Lehman, R. S. (1966). Acta arithmetica
11,397.
Littlewood, J. E. (1953). A Mathematician's Miscellany. (Methuen, London). Morley, M. (1965). Trans. A.M.S. 114,514. Pfister, A. (1967). Inventiones Math. 4, 229. Robinson, A. (1955-6). Math. Ann. 130, 257. Siegel, C. L.
(1952). Math. Ann. 124, 17 and 364.
Statman, R. (1974). Structural complexity of proofs. (Dissertation, Stanford University). Suppes, P. (1975). pp , 173-179 in: Computers in Education, IFIP, Part 1 (NorthHolland, Amsterdam). Wei1, A. (1967). Acta Math. 113,1. Wittgenstein, L. (1976). Lectures on the Foundations of Mathematics, Cambridge, 1939 (Cornell Univ. Press, Ithaca, N.Y.).
R. Gandy, M. Hyland (Eds.l, LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
THE FOUNDATIONS OF MATHEMATICS IN POLAND AFTER WORLD WAR II by
, "
W. MAREK (WARSZAWA) ,
In the period after world war two, Foundation of Mathematics developed in Poland in three main directions:
Set Theory, Hodel Theory, and Recursion Theory.
Obviously the Foundations of Mathematics in Poland were not created in 1945; the roots of interest in these particular domains of sciences one must seek in the investigations of great mathematicians of the between the war period, 1918-1939 and in particular of Banach, Jaskowski, Kuratowski, Lesniewski, Lindenbaum, . ~ukasiewicz, Sierpinski, Tarski and Ulam.
Among those mentioned only Jaskowski,
Kuratowski and Sierpinski lived and worked in Poland after the war.
Banach,
Lindenbaum and Lesniewski vanished or died in the war, and~ukasiewicz, Tarski and Ulam carried over their activities outside of Poland.
It is worth mentioning
that Tarski has maintained contact with Polish scholars up to the present. From the historical point of view, the oldest among the fields listed above is set theory and its foundations.
Immediately after the war, important investi-
gations in this field were carried out by Sierpinski [35], who had already proved during the war the result (previously announced by Lindenbaum and Tarski) that the generalized continuum hypothesis implies the axiom of choice; he subsequently investigated equivalents of the continuum hypothesis. Immediately before the war, Mostowski discovered a general method of independence proofs in set theory with atoms (urelements).
This method, which is nowadays
called the Fraenkel-Mostowski method, is based on the following construction: Let
A
be a set of individuals (i.e. objects which are not sets).
a hierarchy of sets over these individuals as follows: A A U (R~ n P(R~»), for t; > 0 Finally set R = A ; Rt; O \!
such that increasing of the same length
~(A)
have exactly the same properties. Recently "abstract" model theory has been studied in many places.
In particular,
the study of generalized quantifiers was initiated by Mostowski [19] who considered the generalized quantifier "There exists infinitely many
x
such that ••• " •
Another important result by Mostowski which motivated abstract model theory was his paper [22], where he proves that Craig's and Beth's theorems fail in some extensions of first order logic.
The important results of Mostowski on quantifiers
(also [25]) were the inspiration for the investigations of Fuhrken and Vaught and, later, of Keisler and Krivine and McAloon. The field of Recursion Theory is represented by research on hierarchies of sets and functions, decidability and metamathematical investigations on second order arithmetic (analysis).
Of basic importance was the early work of Mostowski [14]
on the arithmetical (Kleene-Mostowski) hierarchy.
In the late forties and the
fifties Mostowski stUdied the incompleteness of Peano arithmetic.
Apart from
obtaining important extensions of GOdel results, Mostowski wrote a monograph "Sentences undecidable in formalized arithmetic" which is one of the most widely studied monographs of the subject. An important result of the forties was the proof by Szmielew [36], of the decidability of abelian groups. Later research in recursion theory was conducted by Grzegorczyk both on computable functionals and on the hierarchy of recursive functions [6], [7].
Grzegorczyk
also did some important work on decidability. Research in the metamathematics of analysis (on the borderline between recursion theory and set theory) were initiated by a paper of Grzegorczyk, Mostowski and Ryll-Nardzewski [8].
Formalized analysis or the theory dealing with properties
on integers and reals, like every first order theory, has numerous models.
Among
them one discerns models with a faithful interpretation of the natural numbers, these are the so called W-models.
The study of these models by the authors named
above, in COmbination with later deep results by Kreisel, Baiwise, Kripke, Platek, Sacks and others, led to the development of a completely new field; generalized recursion theory.
A slightly different line of research initiated by Mostowski
in [20], was investigation of so called B-models of analysis, which were intensively stUdied by a group of mathematicians around Mostowski in Warsaw [26], [1],
[25].
When we sum up the activity in the Foundations of Mathematics in Poland after W.W. II, it is quite clear that the individual prevailing over the development
W. MAREK
136
of this domain of mathematics in Poland was Andrzej Mostowski.
His efforts led
to the establishment and education of a large group of mathematicians dealing with foundations and the mathematization of the subject.
The lines of research
initiated by Mostowski were and are actively developed in foundational studies. Throughout the post-war period, foundational research was connected with the intensive investigations in computer science conducted by Ehrenfeucht and Pawlak and later by Blikle and Mazurkiewicz.
Inspirations from each side led to
interesting developments in both fields. Another important aspect of Foundations in Poland was an exchange of scientific thought with mathematicians from abroad. periods in Poland one lists:
Among logicians who spent longer
Addison, Benda, Hinman, Lopez-Escobar, Prikry,
Sayeki, Scott, Suzuki and many others. International activity of Polish logicians is also evident in the organization of international conferences and meetings.
Apart from the "Infinitistic methods"
symposium (Warsaw 1959) and an ASL meeting (Warsaw 1968) there was the Logical Semester at the Banach Center in Warsaw (Spring 1973) and more recently a series of three conferences on set theory and hierarchy theory (Sudety Mountains, 1974, 75, 76) was organized by Wroclaw group. The editorial activity of Polish authors - apart from the above mentioned monographs of Mostowski, Rasiowa and Sikorski - consisted in publishing handbooks and monographs including:
"Set Theory" by Kuratowski and Mostowski, "Mathematical
Logic" (in Polish) by Grzegorczyk. Foundational actiVity is conducted in Poland mainly in two centers:
Warsaw
and Wrocraw (although almost every University in Poland hires logicians and Logic and Foundations is a part of the courses required by teaching institutions). Recent research in Warsaw concerns chiefly algebraic logic, foundations of geometry, and foundations of set theory, whereas in
Wroc~aw
foundations of set
theory and model theory are the main subjects of interests. REFERENCES This bibliography does not exhaust important papers written in the post-war period in Poland.
It is just pertinent to the main lines of the article.
[1] Apt, K.R., Marek, W. (1974). Second order arithmetic and related topics Annals of Math. Logic ~, pp. 177-229. [2] Ehrenfeucht, A. (1961). An application of games to the completeness problem for formalized theories. Fund. Math. XLIX, pp. 129-141. [3] Ehrenfeucht, A., Mostowski, A. (1956). Models of axiomatic theories admitting automorphisms, Fund. Math. XLIII, pp. 50-68. [4]
Grzegorczyk, A. (1953).
[5] Grzegorczyk, A. (1955). 202.
Some classes of recursive functions, Diss. Math IV. Computable functionals, Fund. Math. XLII, pp. 168-
FOUNDATIONS OF MATHEMATICS IN POLAND
137
[6] Grzegorczyk, A. (1956). Some proofs of undecidability of arithmetic, Fund. Math. XLIII, pp. 166-177. [7] Grzegorczyk, A. (1974). Warszawa - Dordrecht.
Outline of Mathematical Logic, PWN-Reidel,
[8] Grzegorczyk, A., Mostowski, A., Cz. Ryll-Nardzewski (1958). The classical and w-complete arithmetic, Journal of Symb. Logic 22, pp. 188-206. [9] Kuratowski, K. (1948). Ensembles projectifs et ensembles singuliers, Fund. Math. XXXV, pp. 131-140. [10] ~s, J. (1954). On the categoricity in power of elementary deductive systems and some related problems, ColI. Math. III, pp. 58-62. [11] bas, J. (1955). Quelques remarques, theoremes et problemes sur la classes definissables d'algebres, In: Math. interpretation of formal systems, NorthHolland, Amsterdam, pp. 98-113. [12] Marek, W., 11ostowski, A. On extendability of the models of ZF set theory to the models of KM theory of classes, In: Springer Lecture Notes 499, pp. 460-522. [13] Mostowski, A. (1945). pp. 137-168.
Axiom of choice for finite sets, Fund. Math. XXXIII,
[14] Mostowski, A. (1947). XXXIV, pp. 81-112.
On definable sets of positive integers, Fund. Math.
[15] Mostowski, A. (1949). XXXVI, pp. 143-164.
An undecidable arithmetical statement, Fund. Math.
[16] Mostowski, A. (1951). Some impredicative definitions in the axiomatic set theory, Fund. Math. XXXVII, pp. 111-124. [17] Mostowski, A. (1952). Logic 17, pp. 1-31.
On direct product of theories, Journal of Symb.
[18] Mostowski, A. (1952). Sentences undecidable in formalized arithmetic, North -Holland, Amsterdam, 117 pp. [19] Mostowski, A. (1957). pp. 12-36.
On generalization of quantifiers, Fund. Math. XLIV,
[20J Mostowski, A. (1960). Formal system of analysis based on an infinitistic rule of proof, In: Infinitistic Methods, PWN - Pergamon Press, Warszawa - London. [21J Mostowski, A. (1965). Fennica12, 180 pp.
Thirty years of foundational studies, Acta Phil.
[22] Mostowski, A. (1968). Craig interpolation theorem in some extended systems of logic, In: Logic, Methodology and Phil. of Sciences III, North-Holland, Amsterdam, pp. 87-103. [23] Mostowski, A. (1967). Constructible sets with applications. North-Holland, Warszawa-Amsterdam, 269 pp. [24J Mostowski, A. pp. 220-282.
An exposition of forcing.
In:
PWN-
Springer Lecture Notes 450,
[25] Mostowski, A. (1975). Observations concerning elementary extensions of W-models I, Proceedings of MiS Symposia in Pure Mathematics XXV, pp. 349-355. [26] Mostowski, A., Suzuki, Y. (1969). Fund. Matho LXV, pp. 83-93. [27] Mycielski, J. (1964). pp. 205-224.
On w-models which are not S-models,
On the axiom of determinateness I, Fund. Math. LIII,
W. MAREK
138 [28] Mycielski, J. pp. 203-212.
(1966).
On the axiom of determinateness II, Fund. Math LIX,
[29] Mycielski, J., Cz. Ryll-Nardzewski, (1968). II, Fund. Math. LXI, pp. 271-281.
Equationally compact algebras
[30] Pacholski, L., Cz. Ryll-Nardzewski, (1970). products I, Fund. Math. LXVII, pp. 155-161.
On countably compact reduced
[31] Rasiowa, H. (1974). An algebraic approach to nonclassical logics. North-Holland, Warszawa-Ansterdam.
PWN-
[32] Rasiowa, H., Sikorski, R. (1950). A proof of the completeness theorem of Gadel, Fund. Math. XXXVII, pp. 193-200. [33] Rasiowa, H., Sikorski, R. (1963). PWN, Warszawa.
The mathematics of metamathematics,
[34] Ryll-Nardzewski, Cz. (1959). On the categoricity in power < ~' Bull. Acad. Pol. Sci. Sere Sci. Math. Astronom. Phys. VII, pp. 545 2 548. [35] Sierpinski, W. (1947). L'hypothese generalisee du continu et l'axiome du choix, Fund. Math. XXXIV, pp. 1-5. [36] Szmielew, W. (1955). XLI, pp. 203-271. [37] W)!glorz, B. (1966) • pp. 289-298. [38] WtglOrZ, B. pp. 9-93.
(1967).
Elementary properties of abelian groups, Fund. Math. Equationally compact algebras I, Fund. Matho LIX, Equationally compact algebras III, Fund. Math. LX,
POSTSCRIPT>I< Professor A. Tarski and Dr. S. Givant pointed to us certain inaccuracies and important omissions in our text. As this could lead to misunderstanding we notice most important of them: (1) Reporting the domains of foundational studies we omitted several branches of investigations such as:
al
Foundations of Geometry investigated in Warsaw and other places by Borsuk, Szmielew, Szczerba and others.
bl Universal Algebra as developed in Ryll-Nardzewski and others.
wroc~aw
by Marczewski, Mycielski,
(2) We generally omitted reference to parallel work of other investigators like in case of Ehrenfeucht games reference to parallel Fralsse work on elementary equivalence or in case of Ryll-Nardzewski result on ~o-categoricity independent work of Engeler and Svenonius. (3) We erronously stated that the Axiom of Determinataness implies the principle of dependent choices. As pointed by Professor J. Mycielski the axiom of determinateness implies only the existence of choice function for countable families of reals.
* Added on
March 20, 1977
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
A TRIBUTE TO A.MOSTOWSKI Helena Rasiowa Institute of Mathematics University o~ Warsaw Warsaw, Poland It is my privilege to speak about Andrzej Mostowski, a teacher, a colleague, and a friend of mine, and to pay homage to the work he left behind him which constitutes a vital contribution to the development of mathematical logic and the foundations of mathematics. Andrzej Mostowski was one of the greatest spirits and a cofounder of the Polish school of logic and the foundations of mathematics. He was one of those researchers inspired to undertake logical investigations in mathematics itself, yet at the same time his papers are shot through with philosophical ideas. He always maintened a lively interest in algebra and in this field also he wrote a number of books and expository articles. Professor Mostowski was the author or coauthor of 118 scientific publications, including monographs and textbooks, which form a lasting contribution to Polish and world science. The papers of Mostowski opened up new avenues of research and the results contained in them are among those most of~en cited in the literature. I will touch on only some of Mostowski s results which were the source of new lines of investigation. Mostowski's permutation models (1939) which he obtained by making precise an idea of Fraenkel have found wide application in proofs of independence results for set theory and of course in particular in the proof of independence of the axicm of choice. The method of permutation models is still important even after the discovery of forcing. The so-called Mostowski contraction lemma l1949) is one of the basic metamathematical results of set theory. The hierarchy of arithmetical notions, devised by Mostowski during the second wordl war independently of S.C.Kleene and published only in 1947, after the appearance of Kleene's paper on the same subject, is considered as one of Mostowski's greatest achievments. It became the source of many discoveries and the starting point of a very wide literature. Mostowski's paper on non-deducibility in intuitionistic functional calculus (194e), in which he proposed an algebraic interpretation of quantifiers, inspired later researc~ of other authors on an algebraic approach to various predicate calculi • The theory of indiscernible sets created jointly with A.Ebrenfeucht (1956) yielded one of the most frequently employed methods for constructing models and has found numerous applications both in the theory of models and in other parts of the foundations of mathematics. 139
140
HELENA RAS IOWA
His paper on generalized quantifiers (1957) had a strong influence on the beginnings of so-called soft model theory and also found application in research on second order arithmetic. The procedure invented by Mostowski in his paper "On direct products of theories" (1952) for reducing the truth of a sentence in a direct product to questions about the truth of related sentences in components of the product, was later extended in stages by other authors and applied many times in model theory. The paper "The classical and the ~ -complete arithmetic" (1958) written jointly by A.Mostowski, A.Grzegorczyk and C.Ryll-Nardzewski initiated the etudy of w-models of arithmetic. MostolVski's paper" Formal system of analysis based on an infinita~~ ~~~n~fo~~~~f~ri~~~~i~~fundamental for the study of ~ -models The monographs and textbooks of which Mostowski was the author or coauthor , characterized by their preciseness will always arouse the reader's interest. His first book "Mathematical logic" ,wri tten in Polish immediately after the second wordl war and publishea in the series Monografie Matematyczne in 1948, was an excellent textbook. He refused permisson for its translation and publication in other languages. In 1946 appeared "Outline of Galois theory" ,written in Polish, as an appendix to the textbook "Higher algebra" of W.Sierpinski. In 1952 appeared the monograph "The theory of sets" ,written in Polish, the joint 1V0rk of K.Kuratowski and A.Mostowski. The second edition completely revised was published in 1966 and an English translation appeared in the North-Holland series Studies in Logic and the Foundations of Mathematics in 1967. The third edition, completely revised and widely extended, will appear. Professor Mostowski finished the preparation of this edition in May 1975A Russian translation was published in Moscow in 1970. In the North-Holland series Studies in Logic and the Foundations of Mathematics also appeared the following monographs of MostolVski: 1952 "Sentences undecidable in formalized arithmetic" a very successful exposition of the work Qf G~del on the undecidability of arithmetic; 1953 "Undecidable theories" written with A.Tarski and R.M.Robinson, which presented a method for prOVing the undecidability of many mathematical theories a method used for many years afterwards; finally in 1969 appeared "Constructible sets with applications" the most complete acoount of the theory of constructible sets and of relative constructibility, inclUding independence results in set theory, which has appeared upto the present time. From amongst his more expository works one should also mention "Thirty years of Foundational Studies", Acta Philosophica Fennica (1965), "Mod~les Transitifs de la theorie des ensembles de Zermelo-Fraenkel", University of Montreal (1967), and "Models of Set Theory", C.loM.E. Lectures, Varenna (1968). With M.Stark , Mostowski wrote in Polish a three-volume textbook "Higher Algebra" (1953,1954,1954) as well as the textbooks "Linear
A TRIBUTE TO A.
~lOSTOWSKI
141
Algebra (1958) and "Introduction to Higher Algebra" (1955), of which the last named was published in English, Oxford (1963). One of the great contributions of Mostowski to Polish science was the substantial influence he exercised on the work of post-war Polish logicians the majority of whom were his students. Mostowski's influence was also felt in other countries. Young specialists from allover the word1 came to work with him. Among nonPolish logicians who spent some time in their careers with Mostowski are: J.W.Addison, M.Benda, M.Dickman, P.Hinman, M.Machover, K.Prikry, H.Sayeki, Y.Suzuki and G.Wilmers. Andrzej Mostowski was invited to lecture at many universities and gave invited addresses at many international congresses and symposia. All wished to talk with him and hear his advice. Professor Mostowski kept in touch with many of the most distinguished specialists in the wordl. He heard of the most interesting results before they were published directly from the authors. He knew all the problems in logic and the foundations which were currently the centrebf interest. He himself was an inexhaustible source of information in this field due to his exceptional memory and the ability to concentrate on problems discussed with him. Mostowski was not one of those scientists who confine themselves to their own specia1ity.Besides mathematics and philosophy he had a lively interest in physics, history, literature, painting,music and the theatre. Thomas Mann and Charles Dickens were two of his favourite authors. His favourite literary work was "The Pickwick Papers" from which he often quoted. As a teacher he was demanding and most demanding of himself. He did not rate highly his own achievments and successes. He hated any kind of ceremonial or even collegial expressions of the recognition he so richly deserved.When his colleagues at the University of Warsaw organized a reception in honour of his sixtieth birthday he did not turn up. The next day he felt very abashed and apologized to the organizers. Andrzej MostowsMi was born October lst,19l3 in Lwow. His father StanisIaw Mostowski an assistant in physiological chemistry at the University in Lwow died a year later. By the family and friends Andrzej Mostowski was called Staszek as his father. Besides the early childhood spent in Lw6w and Zakopane, a mountain resort in the Tatras,Mostowski's almost who~e life was connected with Warsaw. There he went to school from 1923 to 1931 and there studied mathematics at the University from 1931 to 1936. At that time at the University of Warsaw worked mathematicians and logicians some already with a wordl-wide reputation, some young and talented, cofounders of the Polish school of mathematics and logic: Wac1aw Sierplnski, Stefan Mazurkiewicz, Kazimierz Kuratowski, Karol Borsuk, Jan Bukasiewicz, Alfred Taraki, Adolf Lindenbaum, stanis~w Lesniewski. The scientific atmosphere of this environment and the trends of research, being focussed mainly on the theory of sets, topology~ mathematical logic and foundations of mathematics, shaped the scientific interests of Andrzej Mostowski.However, as a student he strove to obtain the widest possible mathematical education going beyond the programme of studie~
142
HELENA RASIOWA
The strongest influence on the creative work of Mostowski was exercised by his teacher Alfred Tarski. The years 1936, 1937 Mostowaki spent abroad studying mathematics at the Universi try of Vienna and at the Federal Polytechnic in Z~ich. In Vienna he was a student of Kurt G~del whom he regarded with warmth and admiration until the end of his life. In ZUrich he attended the lecture~ of Herman Weyl on ay,mmetry. Weyl'a personal charm, the beauty of his lectures and his treat work "Raum, Zeit, Materie ", from which Mostowski learned the theory of relativi try, left a permanent impression. After the return to Poland Andrzej Mostowski began a new phase of intensive research. At that time he was intrigued by problems surrounding the idea of finiteness. In 1938 at the University of Warsaw he defended his doctoral dissertation which was devoted to various forms of the definition of finiteness of a set. After his doctorate, being unable to find a position in the University because they were none vacant, he started work in the Gove~ ment Meteorological Institute whose director was the distinguished physicist Jan Blaton. During the occupation of Poland Andrzej Mostowski remained in Warsaw working as assistant to the accountant in tar-paper factory. During the years 1942-1944 he gave lectures in the underground university, the University of Warsaw having been closed by the. invader. After the Warsaw uprising all were forced to leave Warsaw whose destruction had been ordered and which already lay in ruins. Mostowski escaped from one of the transports by which the inhabitants of Warsaw were moved to Germany and located in concentration camps. Expositions of numerous results obtained by Mostowski during the war were burned when his appartment was destroyed. A large part of those results has never been reconstructed. Among them were results on decidability and on descriptive set theory parallel to those due to J.W.Addison. Immediately after the liberation of Poland Andrzej Mostowski had for several months a position at the Sl~sk Polytechnic which had temporary quarters in Krakow. He attained the rank of docent at the Jagiellonian Universi~ in 1945 on the basi3 of his habilitation thesis about the independence of the axiom of choice, which initiated research on the independence of various forms of the axiom of choice for finite sets. A year after his habilitation he returned to the University of Warsaw to become head of the section concerned with the philosophy of mathematics. At that time he was the only logician in the Department of Mathematics and Natural Sciences remaining from the strong prewar group of research workers. On him fell the responsibility for the future of the Polish school of logic and the foundations which before the war had attained", Lead.i.ng position in the wordl. Andrzej Mostowsk1, the young mathematician, eagerly threw lumaelf into teaching and research. He gave lectures and seminars in his o,vn speciality and at that tims also lectured on group theory and Galois theory which he always p'.J.rticularly Li ke d, His talent and personal charm attracted young mathematicians and students to him.
A TRIBUTE TO A. MOSlOWSKI
143
gladly discussed scientific problems with them, he always had time and patience. He suggested questions of interest, discussed them in an atmosphere of humour and good will, and kindled the enthusiasm of youth. He won the hearts and shaped the thoughts of his students. I am happy to recall that in 1946 I became his first assistant, and that in 1950 Andrzej Grzegorczyk and I graduated together as his first doctoral students. The number of his students quickly increased.In the early years were Henry Hiz, the talented logician Antoni Janiczak who died in 1953, Andrzej Ehrenfeucht,and later many others.
~e
Polish mathematics between the wars was characterized by concentration on certain chosen fields of mathematics: the theory of sets, topology, functional analysis, real analysis, mathematical logic and foundations of mathematics. In the period of reconstruction the Polish mathematical community felt an urgent need to develop every subdiscipline and in particular algebra. Understanding this need in 1953 professor Mostowski became head of the algebra section in the University of Warsaw and continued -~his function until 1969. The awakening of interest in algebra in Poland and the creation of a cadre of algebraists of international calibre was in considerable measure due to Andrzej Mostowski. The algebra textbooks which he wrote with Marceli Stark were basic material for our students for many years and are still in use together with others which appeared later. From 1969 to the end of his life Andrzej Mostowski was head of the section of the foundations of mathematics in the University of Warsaw and in his last years was also vice-director of the Institute of Mathematics. From the opening of the Government Mathematical Institute in 1949, later transformed into the Mathematical Institute of the Polish Academy of Sciences, Andrzej Mostowski was head of the section concerned with the foundations of mathematios. In 1956 Andrzej Mostowski beoame a oorresponding member of the Polish_Aoademy of Sciences, and in 1963 attained full membership, the highest scientific honour in Poland. Twice he received Government Prizes for his mathematical achievments. He received also Irzykowski prize being Polish American prize for d1stinguis~ ed Polish scientists. In 1973 he was elected to the Finnish Acadeny of Sciences an honour which he prized highly. Professor Mostowski fulfilled many administrative functions in Polish institutions and scientific societies. He also fulfilled a number of functions in the administration of the Association for Symbolic Logic. From 1964 to 1968 he was Vice-President, Division of Logic Methodology and Philosophy of Science, of the International Union of History and Philosophy of Science. From 1971 to 1975 he wa President. Andrzej Mostowski also found time to undertake many editorial duties.He was editor of the Bulletin of the Polish Academy of Sciences for mathematics, astronomy and physics. He was a member of the editorial oommittees of Fundamenta Mathematicae,Dissertationes Mathematicae, Studia Logica, the Journal of Symbolic Logic and others. From the beginning of fifties he was in a olose connection with North-Holland Publishing Company cooperating first with M.D.Frank and later with E.Fredriksson. From 1966 he was one of editors of North-Holland series Studies in Logic and the Foundations of Mathematics. He was also one of cofounders and editors of Annals of Mathematioal Logio.
144
HELENA RASIOWA
Of Mostowski's numerous visits to the scientific centres of other countries I will mention only the longest. The academic year 1948/ 1949 he spent at the Institute for Advanced Study in PrincetOn where he renewed contact with K.G~del. Ten years later he was a Visiting professor at the University of California, Berkeley,where he worked once again with A.Tarski. In the academic year 1969/1970 he was a member of All Souls College, Oxford. Andrzej Mostowski spent the summer months of 1975 at the Universi4Y of California, Berkeley. Hlis lectures awoke lively interest in his listeners as always, and he mentioned with pleasure the satisfaction they had given him. On the way from Berkeley to London,Ontarw, where he intended to participate in the V-th International Congress of Logic Methodology and Philosophy of Science, he stopped in Vancouver at the invitation of professor Alistair Lachlan of Simon Fraser University. On the 20th August at 3:30 he gave his last lecture "Set theory with classe~'. He spoke beautifully and as usual laced his presentation with flashes of humour. None of those present amongst whom were A.Lachlan, R.Harrop, S.Thomason, D.Pinc~ M.Dubiel and I, sensed that this would be his last lecture. Almost immediately after the lecture he suddenly fell ill and died two days later. At the Congress in London, Ontario, there was a special memorial session in his honour as part of the General Assembly of the Division of Logic Methodology and Philosophy of Science of IUHPS. Andrzej Mostowski was an unusually modest man of great personal charm and unfailing kindness. His modesty and simplicity made him universally liked. All who knew him personally found his death a great loss. REFERENCES Mostowski,A.(1939).Uber die Unabh~ngigkeit des Wohlordnungsatzes von Ordnun~sprinzip. Fundamenta Mathematicae XXXII,pp.20l-252. Mostowski,A.(1947).On definable sets af positive integers.Fundamenta Mathematicae XXXIV,pp.81-l12. Mostowski,A.(1948).Proofs of non-deducibility in intuitionistic functional calculus.Journal of Symbolic Logic 12,pp.193-203. Mostowski,A.(1949).An undecible arithmetical stalement.Fundamenta Mathematicae XXXVI,pp.143-164. Mostowski,A.(1952).On direct product of theories.Journal of Symbolic Logic 17,pp.1-31. Mostowski,A.·and Ehrenfeucht,A.(1956).Models of axiomatic theories admitting automorphisms.Fundamenta Mathematicae XLIII,pp.50-68. Mostowski,A.(1957).On a generalization of quantifiers.Fundamenta Mathematicae XLIV,pp.12-36. Mostowski,A.,Grzegorczyk,A.~dRyll-Nardzewski,Cz.(1958).The classical and GV-complete arithmetic. Journal of Symbolic Logic ~, pp.188-2Q6. Mostowski,A.(1960).Formal system of ~alysis based on an infinitary rule of proof.Proceedings of Warsaw Symposium on Infinitistic Methods, Warsaw-London,pp.141-166.
R. Gandy, M. Hyland (Eds.l, LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
BOLZANO'S CONTRIBUTION TO LOGIC AND PHILOSOPHY OF MATHEMATICS Jan Berg Department of Philosophy Munich Institute of Technology Munich, West Germany
The Wissenschaftslehre (1837) by Bernard Bolzano (1781-1848) is one of the masterpieces in the history of logic. In this encyclopedic work Bolzano intended to construct a new and philosophically satisfactory foundation of mathematics. The search for such a foundation brought forth valuable by-products in logical semantics and axiomatics. For example, Bolzano introduced the notion of abstract, non-linguistic proposition and described its relations to other relevant notions such as sentence, truth, existence and analyticity. Furthermore, he studied relations among propositions and defined highly interesting notions of validity, consistency, derivability and probability, based on the idea of "replacing" certain components in propositions. In set theory, he stated the equivalence of reflexivity and infiniteness of sets and considered isomorphism as a sufficient condition for the identity of powers of infinite sets. He conceived of a natural number
as
a
property characterizing sets of objects, even though he did not base his development of arithmetic on this notion, and analyzed sentences about specific numbers in a way reminiscent of Frege and Russell.
In a posthumous manuscript from the
1830's (recently pUblished) he developed a theory of real numbers, which differs from those of Dedekind, Weierstrass, Meray and Cantor. BOlzano's real numbers may be identified with certain sequences of rational numbers. Logic in BOlzano's sense is a theory of science, a kind of metatheory, the objects of which are the several sciences and their linguistic representations. This theory is set forth in Bolzano's monumental four-volume work Wissenschaftslehre (hereafter referred to as 'WL') .
JAN BERG
148
Bolzano's very broad conception of logic with its strong emphasis on methodological aspects no doubt accounts for the type of logical results which he arrived at. The details of his theory of science proper are given in the fourth volume of the WL and belong to the least interesting aspects of his logic. On the other hand, Bolzano's search for a solid foundation for his theory of science left very worthwhile by-products in logical semantics and axiomatics. His theory of propositions in the starting-point of these results. Bolzano became more and more aware of the profound distinction between the actual thoughts of human beings and their linguistic expressions on the one hand, and the abstract propositions and their components which exist independently of these thoughts and expressions on the other hand. Furthermore, he imagined a certain fixed deductive order among all true propositions. This idea was intimately associated with his vision of a realm of abstract components of propositions constituting their logically simple parts.
* For the following presentation of Bolzano's theory of propositions I have to define some terms. A concrete sentence occurrence is a sequence of particles existing in space and time, arranged according to the syntactic rules of a grammar, and contrasting with its surroundings. A simple sentence shape, on the other hand, is a class of similar concrete occurrences of simple sentences. A compound sentence shape is built up recursively from simple sentence shapes by means of syntactic operations. Not every compound sentence shape has a corresponding concrete sentence occurrence. Two compound sentence shapes may be considered identical if they are built up from identical simple sentence shapes in the same way. Two simple sentence shapes are identical if they contain the same sentence occurrences. Now consider the compound sentence containing the following concrete sentence occurrence:
'a simple sentence shape is a class of similar
sentence occurrences or it is not the case that a simple sentence shape is a
cl~ss
of similar sentence occurrences'. In another sense
one could say that this sentence shape, which is an abstract logical
149
BOlZANO'S CONTRIBUTION TO lOGIC
object outside of space and time, contains two sentence occurrences, i.e .• two abstract "occurrences" of the simple sentence shape containing the following concrete inscription:
'a simple sentence shape
is a class of similar sentence occurrences'. In the following, I will use the expression 'sentence occurrence' exclusively in the first, concrete sense.
* Bolzano's notion of abstract non-linguistic proposition (Satz an s i cn ) is a keystone in his philosophy and can be traced in his' writings back to the beginnings of the second
decade of the 19th centu-
ry. I shall try to characterize Bolzano's conception of propositions by means of certain explicit assumptions. These assumptions also give information about the relation between propositions and other logically interesting objects. In his logic Bolzano utilizes a concept which is an exact counterpart of the modern logical notion of existential quantification. Therefore, he could have stated that (1)
There exist entities, called 'propositions', which fulfill the following necessary conditions (2) through (15).
r cr ,
WL §§30
ff. ) Thus, propositions possess the kind of logical existence developed in modern quantification theory. However, (2)
A proposition does not exist concretely in space and time (WL § 19).
According to Bolzano, both linguistic and mental entities such as thoughts and jUdgments are concrete (WL
§§
34, 291). Hence, proposi'
tions could not be identified as concrete linguistic or mental occurrences. Furthermore,
(3)
Propositions exist independently of all kinds of mental entities (WL
§
19).
Therefore the identification between propositions and mental dispositions sometimes made in medieval nominalism cannot be applied to
150
JAN BERG
propositions in BOlzano's sense. A proposition in BOlzano's sense is a structure of ideas-as-such. Hence, an idea-as-such (Vorstellung an sich) is a part of a proposition which is not itself a proposition (WL
48). But to be able
§
to generate propositions we have to characterize ideas-as-such independently of propositions. This is in fact implicit in Bolzano. He worked extensively with the relation of being an object of an ideaas-such, which corresponds in modern logic to the relation of being an element of the extension of a concept.
In terms of this relation,
taken as a primitive by Balzano, certain postulates may be extracted from his writings which concern the existence and general properties of ideas-as-such. Independently of human minds and of linguistic expressions there exists a collection of absolutely simple ideas-as-such. As examples Bolzano mentions the logical constants expressed by the words 'and',
'some',
'to have',
'to be',
'ought'
(WL
§
'not',
78); but he admits
being unable to offer a more comprehensive list. He seems to mean that each complex idea A can be analyzed into a sequence ~ (A) of simple ideas which would probably include certain logical constants. I shall call this sequence .::r(A) the
'primitive form'
of A.
The man-
ner in which a complex idea is built up from simple ones may be expressed by a chain of definitions. So it appears that some complex ideas behave somewhat like the open formulas of a logical calculus. Bolzano assumes that two ideas are strictly identical if and only if they have the same primitive form (WL
§§
92, 119, 557).
We can now state two conditioqs for propositions, which are peculiar to Bolzano. The first one reads essentially as follows:
(4)
Not all propositions are simple. A compound proposition is built up recursively from simple parts by means of primitive operations (WL
558).
§
Furthermore, to each Bolzanian proposition P there corresponds uniquely a proposition i(p) containing only simple ideas. Such an f(P)
I shall call the
'primitive form' of P.
::rep)
may be found by
reducing all complex ideas of P to simple ones by means of chains of definitions. We can now express Bolzano's principle of individuation for propositions:
BOLZANO'S CONTRIBUTION TO LOGIC (5)
151
Two propositions are strictly identical if and only if they have the same primitive form (WL
§§
32, 558).
We may assume that we have a language of a more or less specified structure and that we know what a sentence shape is in this language. Bolzano apparently identified all linguistic entities with concrete occurrences (WL
§§
49, 334) and did not say anything explicit about
the abstract notion of sentence shape. It is possible, however, to reconstruct a concept closely related to our notion of linguistic shape and based entirely on concepts within Bolzano's philosophy. For if an object of the idea A is a linguistic expression in Bolzano's sense, then A may be a counterpart in Bolzano's theory of our notion of abstract linguistic shape. In particular, if the objects of A are sentence occurrences, then in some cases A may be an abstract sentence shape. Whereas a shape is usually taken as a class in modern logical syntax, Balzano's notion could be described as a corresponding intensional property. From the postulates (4) and (5) it appears that propositions in Balzano's sense behave somewhat like the closed formulas of a logical calculus. The following assumption, on the other hand, would seem to agree with Bolzano's intentions:
(6)
No linguistic entity should necessarily be a component in propositions.
Hence, it would not be possible to identify propositions as sentence shapes or as any other objects essentially involving sentence shapes or occurrences, such as, e.g., ordered pairs of sentence shapes and the interpretations of a language. Had Bolzano been faced with the notion of abstract linguistic shape, he would probably have approved (cf. WL
§
410) of the following gen-
eral assumption concerning sentences (whether shapes or occurrences):
(7)
There are more propositions than sentences.
If we suppose that the number of atoms in the universe is finite, then we have to accept the fact that there are only finitely many
JAN BERG
152
concrete sentence occurrences. From the definitions of simple and compound sentence shapes given above it follows that there are denumerably many sentences (whether shapes or occurrences). The distinction between the denumerably and superdenumerably infinite was not yet clear to Bolzano, although he had one notion of continuum that applied to the set of real but not to the set of rational numbers and had a clear understanding of the fact that isomorphism is a sufficient condition for the identity of powers of infinite sets. However, for a modern development of a system of non-linguistic Bolzanian propositions it is natural to presuppose that there are non-denumerably many such entities. Consider, for example, a denumberable domain of individuals and the non-denumerable infinity of subsets of this domain, and take any of these subsets M, and any individual x. We then have the proposition (true or false) that x is
an element of M.
We call a notion of truth for linguistic or non-linguistic semantic entities Y intuitively satisfactory if its definition implies the classical Aristotelian condition that X is true if and only if where
Y,
'X' stands for a name or description of the substituend of 'Y'.
Now Bolzano would require that (8)
An intuitively satisfactory notion of truth is definable for propositions (WL
§
19).
With respect to the relations between propositions and sentences the following four postulates hold: If a declarative principal sentence (shape or occurrence) expresses something,
P,
then P is a proposition (WL
§§
19, 28).
(10)
If the sentences 8, and 8 2 express the same proposition, then 8 1 and 8 may have different structures (WL § 127). 2
(11 )
If 8
and 8 express the same proposition, then 8 is 1 1 2 logically equivalent to 8 2"
The latter assumption underlies Bolzano's theory of the reduction of sentences to certain canonical forms. The converse does not hold, however, in Bolzano's system:
BOLZANO'S CONTRIBUTION TO LOGIC ( 12)
If the propositions P and Q are expressed by 8 respectively, and if 8
153 1
and 8
is logically equivalent to 8
1
P need not be identical with Q (WL § 32, Note).
2
2,
then
As a consequence, propositions cannot be identified as classes of logically equivalent sentences. In the WL Bolzano used a partly formalized language which embraces an ordinary language extended by constants, variables and certain technical expressions. He also investigated the relations of his semi-formalized philosopical language to colloquial language (WL
§§
121-146, 169-184). He believed that most sentences of collo-
quial language were "reducible" to sentences or sets of sentences of certain canonical forms expressed in the philosophical language. When Bolzano speaks of a sentence being reducible to another sentence, he seems to require that the sentences be synonymous in the sense of expressing the same proposition (WL §§ 127, 171). In Bolzano's theory of reduction, most sentences of ordinary language are reducible to sentences obtained by inserting expressions of particular ideas-as-such for
'A'
and expressions of particular
abstract ideas of second order (Which have a property as the sole member of their extension) for
'b'
in one of the following two
expressions (WL §§ 127, 136): (i)
'[The idea of],
A has b
(i i)
'[The idea or]
A has La c k-r o f'r-b
!
,
! ,
In a formalization of Bolzano's philosophical language, the vocabulary would include as predicates both substituends of 'b' and corresponding instances of 'lack-of-b'
in (i) and (ii), respective-
ly. Hence, the contradictory of a sentence of the form of (i) is another sentence of the form of (i), namely (iii)
'P(i) has falsity',
where P(i) is the proposition expressed by the particular instance of (i)
(WL
§
189).
JAN BERG
154
An example of Bolzano's application of his theory of reduction to ordinary language would be the set of sentences obtained by inserting a particular expression of an idea for
'A'
in 'There is an A'
and 'Nothing is an A'. These sentences are reducible to the canonical sentences 'The idea of A has non-emptiness' and 'The idea of A has emptiness', respectively (WL
§§
137, 170).
Had Bolzano's theory of reduction been completely developed it might have resulted in the construction of an ideal language for philosophical analysis. In this ideal language sentences of the canonical forms of (i) or (ii) would not play the same role, however, as the atomic sentences of modern
~uantification
theory on the basis of
which more complex forms are built up. It seems, on the contrary, that Bolzano intended even.the most complicated sentences to have canonical form or to be reducible to a set of such sentences. Yet there is a fundamental vagueness in Bolzano's theory of reduction, for nowhere in his work does he give any systematic rules for the construction of the extremely complicated names that would occur as substituends for the variables
'A' and 'b'.
Now the interesting point for the discussion of propositions is Bolzano's implicit assertion: (13 )
Sentences of the canonical form of (i) or (ii) mirror their correlated propositions in the sharpest way.
Hence, we have to presuppose that ( 14)
For all A and b within the appropriate ranges, the transition from A and b to the propositions expressed by (i) and (ii) must be explained.
In other words, it has to be explained how two concepts can be combined to make a statement. For propositions corresponding to sentences of the canonical forms of (i) and (ii) the notion of truth is defined substantially as follows
(WL
§
24):
BOLZANO'S CONTRIBUTION TO LOGIC (i+)
155
P(i) is true if and only if every object of A has the property which is the object of b,
(ii+) P(ii) is true if and only if every object of A does not have the property which is the object of b. Here A and b are indicated by the sUbstituends of 'A' and
'b' re-
spectively. Balzano apparently presupposes that the set of sentences of
collo~uial
language and of his own philosophical language could
be mapped into the set of propositions, so that an indirect definition of truth for these sentences would be forthcoming. Balzano elaborates his theory of reduction by imposing a necessary condition for the truth of sentences of the form of (i) and (ii). If A is non-empty (i.e., if there is at least one object of A), then the corresponding propositions, P(i) and p(ii), are also said to be non-empty. And if A is empty (i.e., if there is no object of A), then P(i) and p(ii) are said to be empty (WL P(ii) is true, then A is non-empty (WL ( 1 5)
§
§
146). Now,
if P(i) or
225.4). Hence,
If A is empty, then both P(i) and p(ii) are false.
Since the entity indicated by the substituend of 'A'
in (i) and (ii)
can be a class concept, Balzano's condition implies an existential interpretation of classical syllogistic. And the very same condition shows that Bolzano was heading in substance for a philosophical language without existence assumptions, where even empty domains are permitted. An attempt to grasp Bolzano's theory of propositions must comply with the assumptions (1) through (15). These assumptions function as an axiom system for which a model is sought. A domain of relatively well-known objects, which behave in the same manner as the entities to be explained, has to be described. The fulfillment of the conditions (1) through (15) also means that a philosophical, ideal language must be constructed, which differs considerably from the usual languages of logic. The reduction of the sentences of ordinary language to sentences of canonical form could be accomplished, for example, if the distinction between predicates and function symbols were abolished and if all expressions were considered as the result
156
JAN BERG
of applying a function symbol to a corresponding number of arguments. Furthermore, the assumption (15) would be satisfied if BOlzano's ideal language were syntactically constructed and semantically interpreted in such a way that a non-trivial existence predicate could be introduced.
* Bolzano observed that a proposition may change its truth-value upon variation of certain components. For example, the true proposition: (i)
Bolzano is mortal
can be turned into the falsity: (i i)
Bolzano is omniscient
by changing the predicate idea. But if the sUbject idea Bolzano in (i) is changed into other ideas within the (usually explicitly presupposed) range of variation, proposition (i) cannot be turned into a falsity. Thus we see that Bolzano implicitly utilized a replacement operation for propositions, sending Pinto P(A/B), where the latter proposition is similar to P except that it contains the idea B at all places where P contains the idea A.
This operation can be extended to a
simultaneous replacement in a proposition of the distinct ideas A" ••• ,A
n
by B"
••• ,B
n,
respectively, thereby sending Pinto
P(A , •.• ,An/B, , ... ,B The replaced ideas A,,'" n). 1
,An have to be
distinct, of course, whereas the replacing ideas B"
... ,B
n
need not
be distinct. Each idea occurring in a proposition has its particular range of variation. In doubtful cases this range may be indicated in the proposition itself by means of a description adjoined to the sUbject idea. For example, the proposition: The human being Bolzano is mortal is an explicit counterpart to proposition (i) above.
BOLZANO'S CONTRIBUTION TO LOGIC
157
The variation of ideas-as-such is an original and very fruitful conception. Bolzano exploits his innovation with great thoroughness in a logical theory which, on decisive points, anticipates notions expounded (independently by others) in the philosophical discussion of our time, especially in Logical semantics and axiomatics. First of all, the notion of validity of propositions is introduced essentially as follows (WL § 147):
('6)
P is valid with respe~t to the distinct ideas A" ... ,An in P if and only if P(A" sequences B, , ..• ,B
n
... ,An/B, , .•• ,B
n)
is true for all
, where every idea B. belongs to the range J
of variation of A .. J
Contravalidity is defined analogously: (17 )
P is contravalid with respect to A" P(A, , ... ,An/B1 , ... ,B
n)
... ,An if and only if
is false for all B , ... ,B 1 n.
Here we have to assume that P contains no defined ideas-as-such. Otherwise an unabbreviated proposition could be valid, while a corresponding proposition obtained by definitional abbreviation of certain ideas could be formally non-valid according to (16). Take P as the proposition: 7 + 5 < 7 + 6, and Q as: 7 + 6 c 13, obtained
=
5. Now p(7/N) is true for all natural number ideas N, whereas Q(7/8) is false; hence P but not from P by the definition:
13
def.
7 +
Q is valid with respect to the idea 7. Yet the conception that an unabbreviated proposition could be found for any given conceptual proposition (the content of which embraces concepts only and no intuitions, in the German sense of 'Anschauung') permeates Bolzano's whole philosophy. with regard to certain relations among propositions, Bolzano distinguishes the special logical case, where the ideas varied embrace all non-logical ideas of the propositions (WL We then define:
§§
148.3, 223).
158
JAN BERG
('8)
P is logically valid if and only if P is valid with respect to the sequence of all non-logical ideas of P.
Bolzano also considers logical properties of sets of propositions. He then utilizes what is basically a replacement operation on sets of propositions, sending the set {P"P 2" {PI ,P 2 , · · .}(A" P
2(A"
... ,A
,An/B,,'" ,B
n/B"
n),
.• } into the set
,B n), i.e., (P,(A,,'"
,An/B,,'"
,B n),
... }. That a set K of propositions is true
means, of course, that each member of K is true.
We may then extend
Bolzano's definition of validity as follows:
('9)
{P"P , . · . } is valid with respect to A,,'"
2
{P, ,P 2 , ... HAl" B1, ... ,B
n
.. ,An/B, , ..• ,B
, where every B.
J
n)
,An if and only if
is true for all sequences
belongs to the range of variation
of A .• J
The important notion of consistency is a generalization of this relation (WL
'54):
§
{P"P , ... } is consistent with respect to A , ... ,A if and 1 2 n
(20)
only if (P"P , ... }(A 2
sequence B , · .. ,B 1
1,
... ,A
n/B"
... ,B
n)
is true for some
n•
A corresponding relation between sets of propositions can be introduced:
(2')
{Pi} and {Qi} (i A"
... ,A
n
=
',2, ... ) are consistent with respect to
if and only if (P"Ql,P 2 , Q2 " " } is consistent
with respect to A"
... ,A
n•
As a special case of consistency in the sense of (2'), Bolzano now introduces the relation of derivability between sets of propositions (WL
§
,
55 ) :
BOLZANO'S CONTRIBUTION TO LOGIC (22)
159
{Qi} is derivable from {Pi} with respect to A,,'"
,An if
and only if {Pi} and {Qi} are consistent with respect to A".oo,A n and if {Qi}(A"oo.,An/B"
... ,B n) is true for any
sequence B".oo,B n that makes {Pi}(A"
... ,An/B"
... ,B n)
true.
Bolzano distinguishes even the special logical case of derivability, which is defined essentially as follows (WL (23)
§
223):
{Qi} is logically derivable from {Pi} if and only if {Qi} is derivable from {Pi} with respect to the sequence of all non-logical ideas-as-such of the propositions Pi and Q i (i
= , ,2, ... ).
Bolzano's relation of logical derivability is one of his most impressive discoveries and a highly interesting counterpart of the modern notion of logical consequence introduced by Tarski. There are certain differences, however, between the two relations. First of all, modern notions of consequence are defined for sentences within formalized languages, whereas Bolzano's derivability holds between abstract non-linguistic propositions expressed in a natural language extended by variables and certain constants. The difference is of vital importance for the study of the relationships between consequence and other logical notions. For example, the equivalence of logical consequence and syntactic provability within elementary logic could not be imagined without a highly developed conception of a formal calculus. Only after logic had become completely formalized by Frege was it possible to formulate a precise notion of syntactic provability. Secondly, BOlzano's condition that the sets {Pi} and {Qi} should be consistent with respect to the sequence of all non-logical ideas-as-such of every Pi and Q has no counterpart in the definitions i of modern logical consequence. As a result of this consistency clause, Balzano's theory is less general and more complicated than the modern theories of consequence. For example, consequence relationships with contradictions of the type: P and not P, as
ante~
cedent must be represented in Bolzano's theory by stating that the
160
JAN BERG
disjunctive proposition: P or Q,
is logically derivable from P,
where Q is the original consequent. Moreover, consequence relationships with contradictions of the type: A is not identical with A,
cannot be represented at all in Bolzano's system.
Thirdly, Bolzano's implicit semantics differs in a fundamental respect from Tarski's semantics. Following Tarski, we say that a formula S is a consequence of the set r of formulas if and only if every interpretation over any non-empty domain that makes all elements of
r true also makes S true. Here we generalize universally
over both interpretations and domains. An interpretation over a domain D is a function sending symbols for individuals into D and predicates onto sets of n-tuples of elements of D. 1\ semantics based on such functions may be called an
'interpretation semantics'.
It is
possible, however, to base a semantics exclusively on functions sending formulas of a language onto truth-values. We then get what may be called a
'truth-value semantics' or an
'evaluation semantics'.
Now, S is an evaluation-semantic consequence of a (possibly infi-' nite) set r of formulas of
if and only if there is a finite subset 6
r such that S is true under any evaluation for which all elements
of 6 are true. The decisive difference between an interpretation semantics and an evaluation semantics lies in the different treatment of the quantifiers,
For example, a formula ~ is interpreta-
tion-semantically true for the interpretation J Over D if
~
is true
for all interpretations J/over D which assign at most to ~ a different value from D than does J.
But under an evaluation seman-
tics, ~A is true under the evaluation E if A~ is true under E for -if. all possible substituents if. of the variable ~. In spite of this essential ontological difference, however, the notions of consequence with respect to both kinds of semantics are equivalent at the level of elementary logic, even if there are infinitely many antecedents.
Bolzano never referred to domains of individuals in
his semantics and never thought of combining quantification over domains with his generalization over ideas-as-such.
Instead, his
quantification over sequences of ideas-as-such would, if applied to a formalized language, correspond closely to the evaluationsemantic conception of quantification. Thus, Bolzano may be considered the first philosopher who characterized an evaluationsemantic notion of consequence with a finite number of antecedents.
*
BOlZANO'S CONTRIBUTION TO lOGIC
161
In connection with his definition of validity Bolzano also introduces the function degree of validity. We may write 'g(P,A"
.... A
n)'
to denote the value of this function for the proposition P upon variation of the ideas A,,'" call Q a P(A, ....
.A
n.
To simplify definitions I shall
'variant' of P with respect to A"
,Ani
B,,""
,B
... ,A
n
if Q equals
for some sequence of ideas B, . . . . . B Now n•
n)
Bolzano's definition of degree of validity can be expressed as follows (24)
(WL
§
'47):
g(P,A, , ..• ,A
equals the proportion of the number of true
n)
variants to the number of all variants of P with respect to
the
The domain
of~logical
meaSure function g is restricted to proposi-
tions with a finite number of variants. In order to extend the domain of g. Bolzano adds the further restriction that variants differing only in'ideas-as-such with the same extension should be identified. Within the domain of the function g it obviously holds that P is valid with respect to A"
... ,An if and only if g(P,A"
.. " ,A n
)=' .
Furthermore. P is contravalid with respect to A, •... ,An only in that case where g(P,A"
••. ,An)=O. Now it is possible to generalize
the relation of derivability within the domain of the function g. To say, then. that A,,'"
P
is derivable from {Qi} with respect to
,An is tantamount to saying that the conditional proposition:
If {Qi} is true then P is true, is valid with respect to A, , ... ,An in other words that
th~
, with respect to A,,""
degree of validity of this conditional is ,An'
It is natural, then, to consider weaker
cases of derivability where the conditional has a lower degree of validity.
In this way Bolzano is led to his notion of probability.
Let 'P(P.{Qi},A"
... ,A
n)'
denote the probability of the hypothesis
P relative to the set of propositions {Qi} and the sequence of ideas-s
a s-i s u c h A"
... ,An' Then Bolzano' s partial definition of the notion
of probability can be expressed as follows
(WL
§
'6'):
162 (25)
JAN BERG
If {Qi} (i
= , ,2, ...
,m)
is consistent with respect to ... ,A n) equals the proportion o}~e
A" ... ,A n, then P(P,{Qi},A"
number of true variants of {P,Q,,'" variants of {Qi} with respect to A
,Qm} to the number of true
1.···,An•
In analogy to BOlzano's treatment of the notion of derivability. we can even distinguish the special logical case of probability, p*, of P relative to {Qi}:
(26)
p * (P,{Qi}) equals P(P,{Qi}.A 1,··.,A n), where A" . . . . An are all the non-logical ideas of P and {Qi}'
Bolzano's notion of probability is explicitly conceived of as a relation among propositions and their constituents. The notion of logical probability immediately reconstructible within Bolzano's system is the logical relation of an hypothesis to its evidential support. These features and Bolzano's measurement of probability by the technique of variation are strongly reminiscent of Carnap's recent theory of regular confirmation functions.
Thus, Bolzano was
probably the first philosopher who characterized inductive probability.
* Modern logic distinguishes the three questions: ,
a consequencE of the formula $'- (ii) Is ,
• in the theory TI
(i) Is the formula
formally deducible from
(iii) Is the sequence F of formulas a deduction
in T from. to '1 The theory of derivability developed within Bolzano's logic of variation yields a method of handling the first question. As waS pointed out earlier, however, Bolzano had no precise way of dealing with questions of the second kind. Instead he turned to a theory which was intended to answer certain very special questions of the third type when they are transferred to the non-linguistic level of propositions. Bolzano's relation of ground to consequent (Abfolge) involves the notion of being directly deduced in a pre-existing system of true propositions, where the meaning of 'deduced' is not confined to
BOlZANO'S CONTRIBUTION TO lOGIC
163
purely logical relations but embraces even direct inferences based on causal and deontic implications. The Abfolge is an asymmetric and intransitive relation between sets of true propositions (WL
§§
203, 204, 213). We employ the notation '{Qi}=¢. pI to indi-
has the Abfolge relation to P. Bolzano further ~nd {U } ,.> P i assumes, inter alia, that {Qi}~ P~imply the identity of the two antecedent sets (WL
206), and that P is neVer a member of {Qi}
§
if
{Qi}~
P holds (WL
If
{Qi}~
P holds, the members of {Qi} may be conceived of as
§
204).
immediate s u b s i d i a r y truths of P, and for each of these members, in turn, we may look for its immediate subsidiary
trut~.
ceed in this manner until we reach what Bolzano calls i.e., true propositions P having no (WL
§
We can
pro~
'basic trutru;,
set {Qi} such that
{Qi}~
P
214). In this way we may discover a pre-existing unique
Abfolge hierarchy or "proof-tree" for P (WL
§§
216, 220). The
general notion of a subsidiary truth of P is now forthcoming as a representation of all truthsQ distinct from P and included in a proof-tree for P (WL
217).
§
In his logic of variation Bolzano was aware of an inference rule corresponding to Gentzen's "cut rule": from the proposition that P
2
is derivable from P
we can deduce that P
4
1
and that P
4
is derivable from P
is derivable from P
1
and P
3
(WL
2
and P
3,
155.24).
§
Gentzen's Hauptsatz asserts that under certain conditions any proof in elementary logic can be reduced to a normal form by eliminating all cuts. Bolzano is aware of a similar kind of "normal" proof in his logic of variation. This idea is carried through in his theory of the Abfolge structure. Here it is shown that the number of simple ideas-as-such involved in the propositions of a branch of a proof-tree'will in general increase downwards (WL
§
221).
It further follows from the assumptions of the theory that no unnecessary ideas-as-such are introduced into branches of the deductive hierarchy of propositions.
*
JAN BERG
164
After completing the WL, Bolzano returned in the 1830's to pure mathematics and started projecting a new encyclopedic work, the GroBenlehre, in analogy to his earlier treatments of theology and logic. It was intended that this great work should establish the foundations of mathematics, give a detailed exposition of most known branches of that science, and also pursue original investigations and inaugurate new domains of research. Bolzano's mathematical manuscripts contain a vast body of drafts of the various sections of the GroBenlehre, which deal with the elements of logic and semantics, with artihmetic, algebra, mathematical analysis, set theory and geometry. The second volume of the GroBenlehre is called 'The pure theory of numbers'. It ranges from an analysis of natural numbers to various attempts at constructing a theory of real numbers. It has recently been published in volume 8 of series IIA of the Bernard BolzanoGesamtausgabe (Stuttgart: Frommann-Holzboog, 1976). At the end of the 19th century, Frege's definition of natural numbers as classes of concepts with isomorphic extensions appeared in print. And a few years later Russell, independently of Frege, defined natural numbers as classes of isomorphic classes. But half a century before Frege, Balzano in the first part of his "Pure theory of numbers» defined natural numbers as concepts of isomorphic classes. However, Bolzano did not base his construction of arithmetic on this abstract notion of natural number. In the WL there is another approach to the notion of natural number, which is the outcome of the analysis of sentences about particular numbers, such as the following: (a)
There are exactly
~
objects falling under the concept C.
Bolzano's analysis of such sentences is strikingly reminiscent of Frege's and Russel's corresponding analyses (WL
§§
139, 243). Now,
if (a) is definable by ¢(C), then the abstract number
~
may be
constructed as the concept of all C such that ¢(C). In the second half of the 19th century, theories of real numbers were put forth by Dedekind, Weierstrass, Meray and Cantor. In the
BOlZANO'S CONTRIBUTION TO lOGIC
165
last part of his "Pure theory of numbers" BOlzano made a different kind of attempt in which he based the system of real numbers on a purely arithmetical foundation. According to Bolzano, a rational number is definable by means of finitely many applications of rational operations to natural numbers. He then considers those cases in which there are infinitely many applications of rational operations. Among the examples mentioned are the following: 1+1+ .•.
b
in info and
. Such constructions are called 'infinite number
in info
concepts'
1 + 2 + 3 +
and their expressions are called
'infinite number
expressions'. In his proofs, Bolzano computes infinite number concepts by applying formal arithmetical operations to infinite b
number expressions.
'1' q
For example, he subtracts 1+1+ •.. in info
and transforms the result into
'( 1+1-i". . . q( 1+1+.
in inf. )-gb' in inf.)
from
. This
procedure is possible if the infinite number expressions are conceived of as designations of sequences of rational numbers and if the arithmetical operations are applied to the members of these sequences. Under this interpretation the infinite number expressions ,
b
'1+2+3+ ... in info' and 1+1+ ... in info correspond to the sequences 1
0 when n > bq, q
n
corresponds to the expression of the result of the above-mentioned subtraction.
Such expressions Balzano calls
'essentially positive
number expressions'. Hence, an essentially positive number expression can be represented by a sequence <sn> which there is an no
o~
rational numbers, for
such that sn>O for all n>n
o'
Such a sequence
we may call an 'essentially positive sequence'. Now Bolzano considers those infinite number concepts S such that, for every natural number q, there is an integer p such that the following equations hold:
166
JAN BERG
E.q
S
+ Rand
S
I!.:t.l. q
-
R" •
Here Rand R"are essentially positive number expressions or R denotes O. Expressions of such infinite number concepts are called 'measurable number expressions'. Under the interpretation adopted here, Bolzano's equations state identity of sequences. The numerator p depends on the denominator q and the sequence S. To make this explicit, we may write
'p(q,S)'.
Furthermore, the sequences Rand R"depend on the sequence S, and from later proofs of Bolzano's it also appears that they depend on the denominator q. To make this explicit, we can write 'R(q,S)' and 'R"( 0, in both cases for sufficiently large n. From this there follows:
c n_
l2..l9.....tl, q
p(g,C )+1 q
-
c
n
But C is a null sequence with terms of alternating sign. Hence, we have the contradiction: p(q,C) < 0, p(q,C) > -1. Bolzano observed this gap in his theory, however, in a later amendment to his manuscript, in connection with the definition of equality between measurable numbers.
In the first version of his
168
JAN BERG
manuscript, Bolzano defines the measurable numbers A and B as equal if and only if it holds for all q that p(q,A) = p(q,B). It follows from the properties of the identity relation and the universal quantifier of elementary logic that this relation of equality is an equivalence relation. However, this notion of equality does not satisfy the law of trichotomy. to hold if and ~nly if A -
(A > B is said
B is positive and not infinitely small.)
A counterexample emerges from Bolzano's own text: let A = 1 -
1+1+ ...
~..i n f". and B
~n
and since for all q, to B.
p(q,A)
= 1. Then neither A > B,
=q
-
= q,
1 and p(q,B)
not B
>
A,
A is not equal
In the amendment to his manuscript, Bolzano proposes instead
the fOllowing definition: (28 )
A is equal to B of and only if
iA -
Bi
is infinitely small.
This notion of equality is an equivalence relation and satisfies the law of trichotomy and, moreover, saves many otherwise false statements in Bolzano's theory. This definition also shows that Bolzano was aware of a wider notion of an infinitely small number, Which may be identified as
the general concept of a null sequence.
The generalized notion of an infinitely
s~all
number requires a
change in the definition of a measurable number. This was in fact supplied by Bolzano. According to a late amendment to his manuscript, a measurable number expression corresponds to a sequence S (= <sn»
of rational numbers such that, for any natural number q
there is an integer p(q,S) such that for all n: (29 )
s
n
~ + q
R
(
n q,
S)
= P ( 9 , S ) +m -
q
R
(
n q,
S)
,
where m is a natural number parameter and ••• ), where lfx ranges over all objects. This is what Frege means by incorporating the condition-in the judgment. In Russell we have a stratified universe, but each stratum is a fixed domain. This universality of the Frege-Russell logic is discussed in ~ Heijenoort ~ . With a fixed universe the semantic notions of validity and satisfiability lose their applicability, while with a variable universe they come to the fore. As early as 1885, Peirce gives the correct definition of validity for the sentential calculus: 'To find whether a formula is necessarily true substitute i and ~ for the letters and see whether it can be supposed false by any such assignment of values.' (Moreover, for testing validity Peirce suggests a method that foreshadows refutation trees.) If Frege and Russell corne to consider the formula, say, ~~(~ :> ~, it is only as a temporary equivalent of "f\f~(~ => Fx), and for this latter formula the notions of validity and satisfiability come to coincide and collapse into the notion of truth. Closely connected with these considerations is, of course, the question of first-order logic as a separate system. Embarked in their grandiose logical reconstruction, Frege and Russell do not hesitate to go beyond first-order logic. And they have to, since the notion of the ancestral is necessary for their definition of natural number. When Frege passes from first-order logic to a higher-order logic (!§Zi, § 11; ~ Heijenoort l22L, 24), there is hardly a ripple. Not bound by large-scale requirements, Peirce, Schroder and Lawenheim can more distinctly feel the ground under their feet, and then, of course, the difference in complexity between first-order and higher-order logic is immediately apparent. The birth and development of model theory would hardly have been conceiva~l. without the separate consideration of first-order logic.
SET-THEORETIC SEMANTICS
185
Hilbert's position is somewhat between that of Frege-Russell and that of Peirce-Schroder-Lowenheim. Like the former, he works with axioms and rules. With his mathematician's instinct, however, he is inclined to consider quantifiers ranging not over 'everything', but rather over well-defined collections of objects. He also feels that the jump frOm first-order to second-order logic marks an important change in complexity; and, though second-order logic may be indispensable at a certain stage, it is important to see what can be done in first-order logic. So, when the fusion of the two currents took place in the Twenties, Hilbert was one of its agents. The notion of a formal deductive system supplemented by an interpretation appears in Hilbert's logical writings, and in Hilbert ~ Ackermann 1928 the problem of the completeness of first-order logic is posed in exactly the ~s in which it will be solved in Gadel 1930. Another agent was Skolem. He knew, of course, LOwenheim's work, which he had amended and generalized. But he had also read Principia. He had at hand the various arguments that constitute the proof of completeness, but the very import of the proof seems to have remained hazy for him. Rather than using axioms and rules, he kept looking for a proof procedure, perhaps similar to the one that Herbrand was going to propose. Skolem's work raises many delicate questions of interpretation, which I do not intend to discuss here. I will simply add that his work did not have, at the time, an impact comparable to that of Hilbert's; Skolem's papers, in the Twenties, were hardly read by the logicians that were then shaping logic. In 1930, when the proof of the completeness of first-order logic with respect to set-theoretic semantics was presented, there were two other seaantics, that of Brouwer and that of Herbrand. Brouwer was certainly concerned with meaning. In fact, a great part of his polemics against Hilbert turned around the notion of Inhalt. But neither he nor Heyting thought of systematizing the notion. I have ~ention of discussing intuitionistic semantics here. Herbrand, with his (feigned 7) acceptance of the Hilbert program, rejected set-theoretic considerations that were not decidable. Instead of the set-theoretic notion of satisfaction, he uses a notion based on an infinitely proceeding sequence of finite domains, existential quantifiers having a range different from that of the universal quantifiers. No doubt, Herbrand had the notion of set-theoretic interpretation present in the back of his mind all the time. Only thus can his errors be explained. The semantic argument for Herbrand's theorem is incomparably simpler than the syntaqtic argument, so that, being convinced by the first, Herbrand did not bother to check carefully the second (in the same way that, when we have a simple geometric argument, we are prone to make a mistake in the complicated calculations of the parallel analytic argument, simply out of boredom). Compared to the glamorous axiom systems., which, with so few axioms and rules, claimed to encompass a large tract, if not all, of mathematical practice, semantic considerations seemed to be a homely breed. Gentzen, in 1935, still prefers to establish the constructive equivalence of his calculus of sequents with some existing system, rather than to prove directly the completeness of the new system (he may have thought that merely giving such a proof was a bit like cheating). It is Tarski's codification of set-theoretic semantics, in the mid-Thirties, that finally gave an honorable status to semantic considerations and launched them toward model theory. Model theory has bloomed into an extensive science. But this new discipline does not seem to care much about what was the original object of semantics, namely meaning. It has become an abstract mathematical science, whose philosophical implications are scant. More precisely, model theory does not seem to have anything to contribute to a theory of meaning, if such a theory has to deal with
186
JEAN VAN HEIJENOORT
ordinary language. The only philosophical implications of model theory are at a highly technical level, where results in set theory may affect our views of whse is mathematics. Recently, attempts have multiplied to construct a semantics of ordinary language on a precise basis. In the last hundred years the views of the philosophical communi~ about natural language seem to have gone through a three-stage dialectical process. Frege and Russell wanted to replace natural language with a formal language, because of the imprecision, vagueness, haphazard complexity of the former. The natural language is not considered to be a worthy tool of philosophical investigation, not even a worthy object of study. We may get suggestions from It, but it is no more than a ladder that we kick back, once we have been able to reach the formal language. As a reaction against this point of view, Wittgenstein, in his second period, wanted to leave natural language as it is, not replace it by anything else. The niceties and convolutions of natural language express subtle differences in thought, which we must examine with care and patience. Today, we have entered a third phase. Philosophers have been reminded by linguists that natural language, in its syntactic aspects, can be an object of exact study, a study that, with its generalizations, is reminiscent of that of artificial languages. Then, more recently, this exact study attempts to encompass also the semantic aspects of natural language. I do not intend to survey here the whole field, but to touch upon a number of points. In Tarski's well-known paper on set-theoretic semantics, the only philosophical remarks deal with the Aristotelian realism of his definition of truth. But the semantics rests on specific ontological assumptions that have hardly been made explicit: we have bare individuals; these individuals have no inner structure, they are mere pegs. That is why we often speak of 'elements', even sometimes of 'points'. We assume that we have a collection of such indiViduals, distinguishable by its cardinality. Properties and relations corne subsequently and do not contribute to the identification of the individuals. On the contrary, the indiViduals have to be individuated so that properties and relations can be introduced. How are the individuals individuated 1 This is left out of the account. What we have here is a highly abstract scheme. Can this dessicated ontology encompass everything we want to talk about 1 Is it applicable to material objects, to events, to acts 1 Just to question the universal applicability of set-theoretic semantics I would like to raise here the question of mass terms. These terms, like ~ , gold, ~, are opposed in their syntax to the so-called count terms, like~, tables,~. With them we use the question How much 1, instead of How many 1 They obey peculiar syntactic rules. It is to be noted that in different languages, English and French for instance, the classes of mass terms are quite similar (that is, correspond under translation), while the syntactic rules that allow us to characterize these classes are quite different (that is, do not correspond under translation). This fact seems to give weight to the assumption that the class of mass terms is a stable and significant linguistic object. Mass terms can, in a sentence, be used before the copula (Iron is a metal) or after (This is gold). They can be u8ed with demonstratives (This water is dirty), possessives (My coffee is cold) or the defiaite article (The wine that you gave me was sour), to yield what I would call quasi-individual terms. They can also be used with so-called container words (three buckets of water, two pounds of sugar) in a variety of ways: sometimes the units are individualized, sometimes not. There is here a rich field for linguistic investigations. Concrete mass terms, like those we have mentioned so far, are important because of the linguistic problems they raise. But their importance is increased by the fact that abstract terms, like courage, wisdom, and so on, behave, linguistically, like them.
SET-THEORETIC SEMANTICS
187
Brave attempts have been made to deal with mass terms by means of set-theoretic semantics. Some have taken them as denoting (scattered) individuals (Quine), some as referring to properties (Strawson). These attempts can be pushed up to a certain point, then end up by doing violence to the linguistic facts. The most natural way to deal with mass terms seems to be to provide them with their own semantics. As we have a universal domain of individuals for count terms, we now consider a universal lump for mass terms, a lump out of which we take slices. The two semantics may have to be used concurrently. First, because a sentence may contain both count terms and mass terms, as The man drank some water. Second, because the quasi-individual terms that I just mentioned, obtained from mass terms by a variety of linguistic means, require individuals for their interpretation. There is also, with mass terms, the problem of mixtures, both on the syntactic and on the semantic plane. Since quantifiers cannot be interpreted except in a discrete domain, the proper treatment of mass terms requires a variable-free system. But several such systems are at hand today, and there does not seem to be any difficulty in adapting one of them to the semantics of the universal lump. One can only wonder why all that has not been done before. It is ironic that Tarski's famous example, 'Snow is white', involves a mass term, and Tarski's semantics would have met with difficulties if sentences containing other uses of 'snow' had been considered. My purpose in raising here the problem of the semantics of mass terms has been to try to perceiW& the limits, and l~itations. of set-~heoretic semantics. An ambitious attempt to apply to natural language the methods and procedures acquired in the study of formal systems was made by Richard Montague. His system went through several phases, and it is not my intention to discuss it here in its entirety. I would like to discuss a point on which it differs markedly from standard set-theoretic semantics, namely, on the notion of individual, and I take Montague's doctrine as presented by Barbara Partee after his death ( ~ 1975). For Montague an individual is the collection of all properties that are true of the individual, in our sense. The semantic interpretation of John is the collection of all properties that hold of John. This conception ~ertainly different from our set-theoretic semantics and offers an alternative semantics, which, moreover, has a long philosophical tradition (Leibniz). With this semantics, Montague is able to assign a semantic denotation not only to names (individual constants and definite descriptions), but also to expressions like every ~ or ~~. Or rather, these expressions become, in a way, names, on the same plane as~. The denotation of every ~ is the collection of all properties that hold of every man, that is, it is the intersection of the collections assigned to John, ~, and so on. The denotation of ~ ~ is the collection of properties that hold of at least one man, that is, the union of the collections assigned to John, Paul, and so on. ---Barbara Partee gives a list of contexts in which the substitution of expressions like every ~ or ~ ~ for ~ leads from a meaningful se,uence of English words to another. There are, however, limits to this parallelism, and these limits are so fundamental, linguistically, that the unification becomes doubtful. I now list three such limits. (1) Negation. The negation of John ~ is ~ did ~ ~ . The negation oE Every !!l.!!.!l ~ is not Ever:r: ~ did ~ ~ , but Not every ~ ~ . And so on. Examples could be multiplied. Negation is such an important linguistic and logical notion that these differences cannot be considered to be innocuous and chance peculiarities of the language. They are rooted in its functioning. (2) Scope. In John ~ Mary there is no problem of scope. But, with a and every, the problem of scope arises, as in ~ ~ loves every~. Here again
188
JEAN VAN HEIJENOORT
examples could be multiplied. And here again we are confronting something basic in the functioning of the language. The problems of scope of quantifiers in English are of a degree of complexity that is no less than in quantification theory, as one can convince oneself by a few examples. Hence we cannot hope to get by through the use of a semantics that does not respect this complexity. (3) Inference. From::!2!!!l gave exactly 2!!~ ~ !:!l!!Y. ~ ~ to anybody else, we can infer ~ gave exactly ~ book. But we cannot infer this last ;er;tence from John gave exactly ~ ~ ~ every ~ ~ ~ to anybo~~. The substitutions mentioned above disturb inferences. And here again the same remarks can be made as for (1) and (2). This list is certainly not exhaustive. Other limits, on the linguistic plane, can be found to the unification proposed by Montague. On the philosophical plane, a number of objections present themselves. Without entering here into a full discussion of the problem, let me mention that Montague's conception, at least as presented so far, would make all statements analytic. If the denotation of ::!2!!!l is a collection of properties, then John came simply states that one specific property is in the collection in whi~t-wa; put. A great deal of the present activity in the field of the semantics of natural languages is devoted to the study of the relations obtaining among terms, like between fox and vixen, in the hope that, if we could codify these relations, we would be-;ble to read off the logical entailments that obtain among sentences of a natural language like English. The entailment of (1)
by (2)
should emerge from the proper semantic analysis of mortal, to live and forever. And we should be able, progressively, to deal with ~nd-;o~omplexexamples. What is attempted, or rather contemplated, is the construction of a dictionnaire raisonne. But this an old enterprise. Descartes was already asking for an enumeration of all the idees simples and their hierarchization. Such an enterprise worl~--up to a certain point. There are in English perhaps a few thousand words that lend themselves pretty nicely to such a work of regimentation. But soon doubts and difficulties multiply. Try to unravel a skein of yarn; you will pull a good length of it, but then you will have a worse mess than before. One begins to grope for guidance and would like to have rules and principles. But these are notoriously lacking. The very idea of idee simple is not simple. Where is the line to be drawn between a dictionary and an encyclopaedia ? I do not want to claim that such work cannot be done. But, before it can be done (perhaps with the help of eomputers), certain fundamental questions have to be answered, questions that have not yet been formulated properly. A pervading assumption, implicit or explicit, underlying the construction of a rational dictionary is that we have to proceed from the simple to the complex. And how could it be otherwise? But the realities of a natural language force us to temper such a view. We do not simply proceed by addition, composition, conjunction. Consider the sentences ~ squeezed ~ ~ and He squeezed ~ hand. Don't we have here a kind of feedback effect, through which the meaning of the complement alters the meaning of the verb? Consider red in ~ ~ , red ~ and red hair. The meaning of the noun affects the meaning of the adjective. We do not simply have conjunction. When words are put together, they interact on one another, and there are effects that the inductive mode of definition, used for artificial languages, is not quite able to catch. Should we then consider words only in expressions? Should a dictionary be made up of examples, rather than of
SET-THEORETIC SEMANTICS
189
attempted definitions ? Consider now an example that has been much discussed by linguists. French translation of
The correct
(3 )
is
.ll traversa la riviere
(4 )
~ la nage ,
The word by-word translation of (3), which is (5 )
a
has no clear meaning in French. We translate to swim across by traverser la nage. We have here something reminiscent of idiom;:- ~e translate I im hungry by J'ai faim, we do not use a word-by-word transformation either, and linguists are prone to keep idioms out of their theoretical reconstructions (see, for instance, ~ ~ , page 50). However, he who has some practice in translating knows that the idiomatic aspect that reveals itself when we pass from (3) to (4) is no exception. One would be tempted to say that it is rather the rule. Once a translator has paired off a word of the target language with each word of the original text (perhaps with the help of a dictionary), he rereads the whole thing, catches the meaning, and writes down the translation. There is no upper bound upon the length of the rearrangement. We cannot isolate a definite class of idioms. The idiomatic effect pervades the whole language. But, if we do not have a rational dictionary, if we are not even sure of how such a dictionary can be written, does it mean that we cannot deal with entailments in a natural language, like that of (2) by (l)? Of course, we can. Logic, even today, allows us to deal with the semantics of a natural language locally. That is, we translate the sentences into the language of quantification theory, and then use standard set-theoretic semantics. This is what we do when, in a beginners' course in logic, we give examples. It is a rough job. Many things get bulldozed. Given a sentence that we consider to be logically atomic, we pick one, two or three parts in it, which, we decide, work as individual terms, and we push everything else into the predicate. Adverbs suffer, and many other things. But, nevertheless, within the limits of what we intend to do, it works. If somebody claims that there is a relation of entailment among certain English sentences, we can check, by translating these sentences into quantification theory, whether the entailment obtains or not (check, that is, within the limitations imposed by the decision problem; but it is rare that an example suggested by natural language is not in a decidable fragment). If one and the same sentence occurs in two different sets of sentences, we may have to translate it differently if we want to reveal the entailment corresponding to each set, but in each case the operation is possible. There is nothing strange and mysterious about checking that (2) entails (1). We simply adjoin the side assumption (6)
For every
~, ~
is mortal if and only if
~
does not live forever.
If our dictionnaire raisonne had been completed, we would read (6) off the dictionary. As long as the dictionary is not at hand, we adjoin one or several side assumptions. If we understand the language, this presents no difficulty.
we could compare the natural language to a non-Euclidean space, for which there is, at each point, a tangent Euclidean space. In a neighborhood, the tangent space can take the place of the given space. If we go too far, the distortion becomes too great.
190
JEAN VAN HEIJENOORT
The problem is to pass from the local treatment of the semantics of a natural language to a global one. One feels that it is possible and that it should be done. Why? Because the handling of words is an exact and precise activity. Whoever writes with some care knows that replacing one word by another, apparently close, may change the meaning of a sentence appreciably; even a comma, added or taken away, as its effect. The syntactic c~dification of such facts cannot be the full answer. Unless God-given, syntactic rules must be grounded somewhere. And where, if not in meaning? But between the exact working of a natural language, with the convolutions and intricacies of its meanings, and the logical skeleton representable in quantification theory there is a large gap, in which we are still groping.
References Ackermann, Wilhelm See Hilbert, David, and Wilhelm Ackermann. Bo1zano, Bernhard 1837 Wissenschafts1ehre. Fodor, J. A. 1972 Troubles about actions, Semantics of ~ language, edited by Donald Davidson and Gilbert Harman, 2nd edition, 48-69. Frege, Gott1ob 1879 Begriffsschrift, ~ ~ arithmetischen nachgebildete Formelsprache des reinen Denkens; English translation in ~ Heijenoort 1967, 1-82. GiSde1, Kurt 1930 Die Vo11standigkeit der Axiome des logischen Funktionenka1ku1s, ---- Monatshefte fUr Mathematik ~ Physik 1L, 349-360; English translation in ~ Heijenoort 1967, 582-591. Hilbert, David, and Wilhelm Ackermann ~ Grundzuge der theoretischen Logik. Lowenheim, Leopold tiber MOg1ichkeiten im Re1ativkalkUl, Mathematische Annalen 76, 447-470; 1915 ---- English translation in ~ Heilenoort ~, 228-251-.----Partee. Barbara 1975 Montague grammar and transformational grammar, Ligguistic inguiry £, - - 203-300. van Heijenoort, Jean 1967 ~ Frege !:2. Gildel, A source ~ 1n mathematical .!2s.!.s.. 1879-1931. 1967a Logic as calculus and logic as language, ~ studies in ~ philosophy ----- of ~~, 440-446.
R. Gandy, M. Hyland (Eds.l, LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
ON THE WORK AND INFLUENCE OF STANISLAW LEsNIEWSKI
Stanislaw J. Surma Jagiellonian University Krak6w, Poland
BIOGRAPHY OF STANISLAW LEsNIEWSKI /1886-1939/ The detailed biography of Stanislaw Lesniewski remains rather obscure. The Polish archives from which it could have been reconstructed were destroyed during the World War II and at least some information is known merely from hearsay sources. To some extend we must depend on trusting a few still alived witnesses. One of the sources is KOTARBInSKI /1965/. Numerous comments about Lesniewski and his work are contained in LUKASIEWICZ /1929/, KOTARBInSKI /1929/ and /1957/, TARSKI /1956/, CHURCH /1956/, SOBOCInSKI /1950/ and /1956/ and MOSTOWSKI /1948/. Compare also McCALL /1967/. A large number of details concerning historical and especially of philosophical aspects of Lesniewski's work can be found in LUSCHEI /1962/. Stanislaw Lesniewski was born on 28 March 1886 in Serpukhovo near the town of Ivanovo-Vosniesiensk somewhere beyond Moscow, Russia, as the son of a locally employed Polish rail-way engineer. He attended grammar school/classical division/ somewhere in Siberia, probably at Irkutsk where his father was building a rail-way. As a schoolboy he was shown to be adherent to any principles and intolerant of exceptions. His student years he spent partly at least in Muenchen, Germany, where he was listening to Hans Cornelius and was trying to get to the bottom of his peculiar melange of positivism and Kantianism with its guiding conception.apparently called "das immanente Ding an sich". Then Lesniewski came to Lvov in order to complete his doctoral dissertation under Professor K.Twardowski, the leading philosopher of the University of Lvov at the time. He accomplished his Ph.D. in philosophy in 1912. While in Lvov Lesniewski was also studying mathematics under Professor J.Puzyna and partly also under W.Sierpinski appointed as a professor in 1910. For a moment being influenced by Professor L.Petrazycki's works Lesniewski conceived his future as lying in psychology but when he suddenly appeared in Lvov as a newcomer among K.Twardowski/s doctoral students the book he was under a strong impact was Marty's "Untersuchungen zur Grundlegung der allgemeinen Grammatik und Sprachphilosophie". Another work turning Lesniewski out of psychology and psychologism were E.Husserl's "Logische Untersuchungen"
191
192
J. SURMA
At the time Lesniewski was convinced that one had first to understand the meaning of words and only then one could philosophize at all responsibly. During the last years preceding the World War I Lesniewski threw himself into the whirpool of philosophy. He was endeavouring to overturn the principle of the excluded middle, employing the tools of analysis and criticism against the language in which philosophers argue about the foundations of logic, theory of cognition etc. Later, however, he abandoned it. There was a breakthrough in his attitude. What diverted the direction of his thinking was most probably the reading in 1911 a logical appendix to LUKASIEWIC~ /1910/. This appendix contained Lukasiewicz's explication of a general theory of objects, a branch of logistic as it was then often called, expounded by means of a symbolic method somewhat similar to that of COUTURAT /1905/. That was the end of Lesniewski the commentator and probable translator of Marty and the beginning of Lesniewski the enthusiastic adherent of modern, mathematizing formal logic. What Lesniewski learned from Lukasiewicz was the famous B.Russell's antinomy which fascinated and preoccupied him for the next eleven years. Although the Lukasiewicz's work was a turning point in Lesniewski's career from philosophy to mathematical logic and the foundations of mathematics, for a few subsequent years he remained still particularly reluctant to the use of the logicomathematical apparatus. EARLY WRITINGS Between 1911 and 1914 Lesniewski published in Polish seven papers. All of them are listed below /For bibliographical details see REFERENCES/. /1911/
A contribution to the analysis of existential propositions.
This is Lesniewski's doctoral thesis. /1912/
An attempt to prove the ontological principle of contradiction.
A direct respons to the influence of LUKASIEWICZ /1910/. /1913a/ Logical studies. Russian version of the two above essays originally published in Polish. /1913b/ Is truth only eternab or both eternal and temporal? A respons to the influence of Kotarbinski's "Problem of the existence of future" published in Polish in 1913. /1913c/ A critique of the logical principle of the excluded middle. Although written under an influence of Twardowski and Husserl the essay contains nevertheless a refutation of "platonism" and a defence of the nominalistic principle that there are only individuals. /1914a/ Is the class of classes not subordinate to themselves subordinate to itself? An exposition of Lesniewski's early views on Russell's antinomy written at the time when Lesniewski's own theory of classes had not yet been worked out. /1914b/ The theory of sets on the "philosophical foundations" of B. Bo rris t.e Ln ,
WORK AND INFLUENCE OF LEsNIEWSKI
193
This is a review on B.Bornstein's paper "Philosophical foundations of the theory of sets" published in Polish in 1914. All these early writings were later repudiated by Lesniewski himself as immature or even philosophically unsound. Nevertheless, they are of some importance and, no doubt, they had influenced Lesniewski the philosopher. In particular, they encompass /1/ some interesting arguments against the existence of universals, /2/ a rather strong criticism of conventionalism, /3/ the first outline of the distinction between the language and the metalanguage, /4/ Lesniewski's comprehensive conception of semantic categories later opposed to Russell's theory of types, and /5/ the first analysis of the notion of set in the so called collective sense later developed within the system of mereology and applied to the solution of Russell's antinomy. During his lifetime Lesniewski read a number of papers. Between 1911 and 1915 he presented seven papers to the Warsaw Psychological Society, the papers dealing with the problems of existence, of contradictory objects, of creating the truth, of paradoxes of logic and mathematics, and of various aspects of the foundations of set theory. It happened just before the outbreak of the World War I that Lesniewski was to deliver a lecture on Russell's antinomy at a session of the Warsaw Psychological Society. They say that while preparing the lecture Lesniewski suddenly found an error in his criticism of the antinomy. As things were he decided to concentrate his attention to the utmost munching bars of chocolate for support. And thus chocolate gave birth to his new theory of classes later renamed mereology. It was with the idea of mereology in his mind that Lesniewski left for Russia to spend the years of the wartime there. He lived in Moscow from 1915 until 1918 and taught mathematics in a Polish grammar school /Mrs.Jakubowski's pension/. When in Moscow Lesniewski read a number of papers at meetings of various Polish groups and institutions. The list of the titles of the papers includes: Problem of non-contradictory set theory; Antinomies of formal theories and the Language; Basic problems of contemporary philosophy; and Philosophical foundations of Marxism. This period of Lesniewski's life and in particular his scholarly activities were observed by Professor W.Sierpinski who also spent the war years in Moscow. It was there and then that Lesniewski formulated and published in Polish the first outline of the formal system of m~reology. This publication is found to be somewhere between Lesniewski's early writings he later repudiated and the mature works to have come. THE FIRST OUTLINE OF MEREOLOGY /1916/
Foundations OL the general theory of sets. I.
This work deals with "parts", "ingredients", "sets", "classes", "elements", "subclasses" and certain interesting kinds of classes. Lesniewski developed and axiomatized here what he called the theory of parts or mereology. The work is written in colloquial Polish because Lesniewski had not yet overcame his distrust of technical symbolic notation and had not yet elaborated his own theory of symbolic reconstruction of colloquial languages. Part II of this work has never been published. THE MATURE ACTIVITY After the wartime Lesniewski returned to the liberated Poland. Well acquainted with mathematical logic yet having never had a formal
J. SURMA
194
higher mathematical educationhe joined a group of young mathematicians concentrating themselves on the foundations of mathematics. This group was led by a man of great mathematical gift, Z.Janiszewski soon to die, and S.Mazurkiewicz who were the very cofounders of the Polish school of mathematics. Before long Lesniewski was given a position at the University of Warsaw. That was the university where he was a professor of philosophy from 1919 until his death on 13 May 1939. In Warsaw a period of splendour in the field of mathematical logic and the foundations of mathematics had begun. Just then a new journal "Fundamenta Mathematicae" was created. In July 1920 the first volume of it appeared and this may be considered as a date of birth of the Polish school of mathematics. Lesniewski was a member of the editorial board of "Fundamenta" from the very beginning until 1928 when he resigned for personal reasons. Lesniewski's importance grew. Together with J.Lukasiewicz he was a cofounder of what in the course of time became known as the Warsaw school of logic, for long the biggest centre of logical research. Round Lesniewski and Lukasiewicz a large group of students were gathering: A.Tarski, A.Lindenbaum, B.Sobosinski, M.Wajsberg, M.Presburger, and later J.Slupecki, S.Jaskowski, Cz.Lejewski, W.Sadowski, J.Hossiason, H.Hiz and others. Tarski swiftly won a prominent position and joined Lesniewski and Lukasiewicz as one of the leaders. In 1927 Lesniewski published the first paper of a long series on the foundations of mathematics in which he redeveloped mereology first described in LEsNIEWSKI /1916/, elaborated it and incorporated other results from his earlier publications he foundworth preserving. Bibliographical description of the publication is given below. /1927-1931/ Foundations of mathematics. This series consists of twelve papers and it is divided as follows. /1927a/ Introduction. A brief characterization of Lesniewski's system as compared with other systems of logic and the foundations of mathematics, especially, from the point of view of antinomies and definitions. /1927b/ Section I. On certain questions concerning the meaning of "logistic" theorems. Lesniewski's account of initial difficulties with mathematical logic, and detailed criticism of ambiguities in "Principia Mathematica". /1927c/ Section II. On Rus s e Ll.f s "antinomy" concerning "the class of classes which are not elements of themselves". A brief description of Lesniewski's solution of Russell's antinomy. /1927d/ Section III. On different ways of understanding the words "class" and "set". A characterization of Lesniewski's collective conception of sets and a criticism of other interpretations /Cantor, Dedekind, Frege, Schroeder, Zermelo, Fraenkel, Hausdorff, Sierpinski, Russell, Russell and Whitehead/ . /1928/
Section IV. On the "Foundations of the general theory of sets. I".
A recapitulation of the work LEsNIEWSKI /1916/ with an improvement concerning the "existence of something" /In a number of previously stated mereological theorems the antecedents saying that objects are objects were added. It contains also a series of historical footnotes
WORK AND INFLUENCE OF LEsNIEWSKI
195
and a criticism of Whitehead's theory of events. Concerning the criticism see, e.g., SINISI /1966/. /1929/
Section V. Further theorems and definitions of the "general theory of sets" from the period up to 1920 inclusive.
A long series of mereological theorems and definitions as well as of historical footnotes. /1930a/ Section VI. Axiomatization of the "general theory of sets" from the year 1918. A description of mereology in terms of "part" as the only primitive. /1930b/ Section VII. Axiomatization of the "general theory of sets" from the year 1920. A description of mereology in terms of "ingredient" as the only primitive. /1930c/ Section VIII. On certain conditions established by Kuratowski and Tarski, necessary and sufficient for p to be the class of a. Three sufficient and necessary conditions for P to be the collective class of a, one of them found by K.Kuratowski, the two remaining found by A.Tarski. /1930d/ Section IX. Further theorems of the "general theory of sets" from the years 1921-1923. /1931a/ Section X. Axiomatization of the "general theory of sets" from the year 1921. A description of mereology in terms of "exterior" as the only primitive. /1931b/ Section XI. On "singular" propositions of the type "A £ b". A discussion of the origin and character of Lesniewski's ontology. Section XI above ends with a misleading indication "To be continued" which, however, has never been done. In the meantime Lesniewski published also two papers, independent of his own systems, both on group theory. /1929a/ Ueber Funktionen deren Felder Gruppen mit Rucksicht auf diese Funktionen sind. A reduction of group theory to an adequate single axiom containing a single primitive term f/A,B,C/ meaning A.B = C. /1929b/ Ueber Funktionen, deren Felder Abelsche Gruppen in bezug auf diese Funktionen sind. A single axiom for Abelian groups is given. The next Lesniewski's paper contains a complete design of his most general logical system, protothetic, and it is the most important paper on protothetic. /1929c/ Grundzuege eines neuen Systems der Grundlagen der Mathematik. §1-§1l. Its Introduction contains a brief characterization of the New System of the foundations of mathematics, consisting of three deductive theories: /1/ Protothetic, a generalization of the usual propositional logic; /2/ Ontology, a kind of a modernized traditional logic of names which resembles Schroeder's logic of classes enriched by the theory of individuals; and /3/ Mereology, outlined in LEsNIEWSKI /1916/. §1 includes a justification of equivalence as the sole primitive
196
J. SURMA
term /apart from quantifiers binding propositional variables and functions/. §2 characterizeS semantic categories as an alternative to the theory of types. §3 comprises a detailed completeness proof for pure equivalence based upon axioms /Al/ and /A2/, modus ponens for equivalence and substitution rule, and also the well-known Lesniewski's decidability criterion for equivalence. §4-§10 encompass a characterization of five systems of protothetic, denoted SI to S5, where the system S4 bases protothetic upon implication as the sole primitive. §11 describes successive simplifications of the single axiom for protothetic. In this connection compare SOBOCInSKI /1934/. The same paragraph contains also the well-known Lesniewski's terminological explanations and directives for protothetic, and also a derivation of contradictions in CHWISTEK /1923/ and von NEUMANN /1927/, the contradictions being caused by a careless formulation of rules for introduction of new expressions which do not guarantee the uniqueness condition for decomposing expressions. /1930/
Ueber die Grundlagen der Ontologie.
It contains the terminological explanations for ontology, followed by brief remarks concerning alternative axiomatizations. In 1931 Lesniewski published also a paper on definitions in the theory of deduction which is like papers LEsNIEWSKI /1929a/ and /1929b/ independent of his own systems. /1931/
Ueber Definitionen in der sogenannten Theorie der Deduktion.
It includes a directive permitting addition to the propositional logic of definitions together with the conditions to be satisfied by such definitions. It contains also the appended terminological explanations and examples illustrating mutual independence of the conditions imposed upon definitions. /1938a/ Einleitende Bemerkungen zur Fortsetzung rneiner Mitteilung u. d.T. "Grundzuege eines neuen Systems der Grundlagen der Mathematik". The paper comprises of the resume of Sections I-XI of LEsNIEWSKI /1927-1931/, followed by supplementary remarks on axiomatization of alternative "computative" systems of protothetic. It contains also Lukasiewicz's simplifications of the "deductive part" in Lesniewski's completeness proof for pure equivalence; a reduction to progressively shorter single axioms of protothetic /cf. SOBOCInSKI /1934//; a key to principles of Lesniewski's ideographic notation, the so called wheel and spoke notation /Concerning the subsequent use and adaptation of the Lesniewski's notation compare GOODELL /1952/ and MARTIN /1953//; further criticism of defective formalization of rules for introduction of new expressions, substantiated by additional proofs of contradiction in von NEUMANN /1931/. /1938b/ Grundzuege eines neuen Systems der Grundlagen der Mathematik, §12. This paper was to have followed the first eleven Sections of LEsNIEWSKI /1927-1931/ but after Lesniewski withdrew his manuscript from "Fundamenta Mathemaicae" in 1930 it did not appear until an opportunity was granted by Lukasiewicz and Sobocinski, the founders of a new journal "Collectanea Logica".
WORK AND INFLUENCE OF LEsNIEWSKI
197
In this paper more than 400 protothetical theorems are derived from the axioms /A1/, /A2/ and /A3/ given in LEsNIEWSKI /1929c/. LEsNIEWSKI AS AN ACADEMIC For Lesniewski his publications were not the only way of introducing his New System of the foundations of mathematics. He also attached a great importance to his university lectures though he lectured almost entirely about his own work. The following lecture courses were delivered by Lesniewski at the Univerity of Warsaw between 1919 and 1939. /1/ Foundations of the theory of classes /delivered during the acade-
mic year 1919-20; influenced by discussions with L.Chwistek Lesniewski introduced here for the firat time a logical symbolism as simpler and more precise than colloquial languages/,
/2/ Foundations of arithmetics /delivered in 1920-21; in this course Lesniewski presented a number of systematic and detailed observations and results of ontology/, /3/ Axiomatic foundations of science /1920-21/, /4/ Foundations of three-dimensional Euclidean geometry in the light of the new theory of classes /1921-24/, /5/ Foundations of logistic /1921-27/, /6/ Foundations of ontology /1925-27/, /7/ Axiomatics for group theory /1926-29/, /8/ Theoretical arithmetics /1926-27/, /9/ Introduction to mathematics /1927-28/, /10/ The problem of the primitive terms on the ground of arithmetics /1928-29/, /11/ Elementary outline of ontology /1929-32; students' notes of this lecture course form the basis for SLUPECKI /1955//, /12/ Directives of logistics and ontology /1929-31/, /13/ On the foundations of the co called theory of deduction /1930-31/, /14/ Chosen topics from the foundations of logistics /1930-32/, /15/ From the foundations of protothetic /1932-34; students' notes of this lecture course form the basis for SLUPECKI /1953//, /16/ Chosen topics concerning the axiom systems of geometry /1932-33/, /17/ Arithmetics of real numbers /1933-34/, /18/ Fundamental problems of mathematical logic /1934-35/, /19/ "Antinomies" of deductive sciences /1934-35/, /20/ Introduction to deductive sciences /1936-37/, /21/ Propositional calculus /1936-37/, /22/ On the so called many-valued logics /1936-37/, /23/ Antinomies of semantics /1936-38/, /24/ Traditional "formal logic" and traditional "theory of sets" on the ground of ontology /1937-38/, /25/ Basic publications in the so called many-valued logic /1937-38/,
J. SURMA
198
/26/ An outline of protothetic /1938-39/, /27/ An outline of ontology /1937-39/, /28/ Axiomatics for mereology /1937-39/. Apart from the lecture courses minars.
Le~niewski
also conducted several se-
/1/ On the foundations of mathematics /1919-20; 1930-32; 1936-38/, /2/ On Cantor's set theory /1919-21; Le~niewski put here the problem of the axiomatization of the copula "is" as in the expression "A is b", and made the first observations on ontology/, /3/ On Zermelo's "Untersuchungen ueber die Grundlagen der Mengenlehre" /1920-21/, /4/ Philosophy of mathematics /1938-39/. Between 1918 and 1927 Le~niewski read several papers. Two of them, "On certain theorem from the theory of relations" in 1918, and "On the grades of grammatical functions" in 1921, were presented to the Logical section of the Warsaw Philosophical Institute. The second paper put forth the theory of semantic categories as a counterpart for the hierarchy of Russell's logical types. The first outline of ontology, another Le~niewski's system was presented in two further papers both under the same title "On the foundations of ontology". The firts one was read to the Warsaw Psychological Society in 1921 and the second one to the Warsaw Scientific Society in 1930. Le~niewski also gave two lectures, "On the foundations of ontology" and "On the foundations of logistic" to the Logical section of the second Polish Philosophical Congress held in Lvov in September 1927. LE~NIEWSKI'S
NEW SYSTEM
The three-fold edifice of Le~niewski's New System of the foundations of mathematics consists of what he called protothetic, ontology and mereology. Mereology was developed between 1914 and 1916 and it was presented in written in LE~NIEWSKI /1916/. Then it was recapitulated and redeveloped in the Sections I-X of LE~NIEWSKI /1927-1931/. Broadly speaking mereology may be understood as a theory of the part-whole relation. Starting point for the development of ontology was 1919. As a formal system logically pressuposed by mereology it was constructed in 1920. It first became known through copies of Lesniewski's university lectures and was presented in written only in LE~NIEWSKI /1930/ though the last Section of LE~NIEWSKI /1927-1931/ and the Introduction of LEsNIEWSKI /1929c/ contained quite a lot of details concerning the origin and character of onbology. A survey of Le~niewski's unpublished results based on students' notes collected after the last war by T.Kotarbinski is contained in S~UPECKI /1955/. A very good informal outline of Le~niewski's elementary ontology is included in KOTARBInSKI /1929/. Broadly speaking ontology is a theory of the copula "is" and it comprises the traditional parts of logic: the theory of predicates, of classes, and of relations, including the theory of identity. Protothetic as the most general Lesniewski's system logically pressuposed by ontology started 1921 although the idea of it came back to Le~niewski's early writings from 1912-1914. It was axiomatized in LE~NIEWSKI /1929c/, resumed afterwards in LE~NIEWSKI /1938a/, and
WORK AND INFLUENCE OF LEsNIEWSKI
199
developed in LEsNIEWSKI /1938b/. A survey of unpublished Lesniewski's reSults on protothetic based on students' notes can be found in SLUPECKI /1953/. Broadly speaking protothetic is the logic of propositions with quantifiers binding propositional and functional variables. Notice that for Lesniewski there was no need to introduce the usual first order predicate logic apart from protothetic and antology as "it can be proved that ontology contains not only the first order logic but also the w-order predicate logic can be built up within ontology without additional axioms or rules of inference. PROTOTHETIC When the axiomatic foundations of mereology and then of ontology had been established Lesniewski turned to the problem of the logic of propositions pressuposed by the two theories. He wanted it to be as strong as possible. To construct it he introduced into the language of the usual propositional logic functional variables of arbitrary propositional types and allowed them to be bound by quantifiers and he called the resulting system protothetic or the theory of first principles. Thus protothetic is a kind of a theory of higher order propositional logic with quantifiers and at this point it resembles the higher order predicate logic. Here zero-oder propositional logic is just the ordinary one and first oder propositional logic is the logic with quantifiers binding only propositional variables. historical remark concerning quantifiers in the proposit~onal logic. Russell was presumably the first who in his "Theory of implication" published as early as 1906 studies the possibility of introducing quantifiers binding the propositional variables into the propositional logic. But he had not developed this idea and made no use of it in "Principia Mathematica". A propositional calculus with quantifiers was subsequently constructed in LUKASIEWICZ /1929/ and in LUKASIEWICZ-TARSKI /1930/.
~
Protothetic based on equivalence.Of course, protothetic as a formal system can be based on different propositional connectives as primitives. That based on equivalence is most popular. Lesniewski adopted equivalence as the only primitive in the consequence of an earlier discovery by A.Tarski. In 1922 Tarski, at that time a Lesniewski's doctoral student, discovered the possibility of defining conjunction in terms of equivalence and quantifiers. Notice that the first order propositional logic based on equivalence as the only primitive connective, i.e., the logic of pure equivalence with quantifiers binding only propositional variables gives nothing essentially new. In other words, quantifiers over propositional variables make it possible to define two propositional constants /1/ and /2/
(p)p,
the falsity or the "zero" element,
(p) p ;;. (p)p,
or
(p) (p:p)
the truth or the "one" element,
and four functions of one propositional argument /1/
p,
/2/
(p)p;p,
the assertion of p,
/3/ and /4/
p=p,
the negation of p,
the verum of p,
p .. [(p)(p;p)]
but nothing more.
the falsum of p
J. SURMA
200
The situation changes when we go one step further and pass to what might be called the second order propositional logic based on equivalence, admitting also quantification over functions of propositional variables. Then sixteen functions of two propositional arguments are definable. It suffices, of course, to define conjunction;that can be done in many ways, e.g., p"q _
(f) {p =-
pl\q'"
(f){
[f(P)
'" f(q)]}
or [f(P)
;; f(P"'P)] ;: q}
The first definition of conjunction is due to A.Tarski who published it in his doctoral thesis /Cf. TARSKI /1923a/, /1923b/ and /1924//. For Lesniewski it was a welcome discovery because equivalence constituted for him the most intuitive primitive term. However, Lesniewski was not the only man attaching to equivalence a rather great importance. Mention, for instance, F.Ramsey. In his "Foundations of mathematics" Ramsey wrote these sentences: "The preceding and other considerations led Wittgenstein to the view that mathematics does not consist of tautologies, but of what he called "equations", for which I should prefer to substitute "identities" .... /It/ is interesting to see whether a theory of mathematics could not be constructed with identities for its foundations. I have spent a lot of time developing such a theory, and found that it was faced with what seemed to me unsuperable difficulties" /Cf. RAMSEY /1925/,p.350/. Stanislaw Lesniewski was seemingly the first who realized this programm independently and successfully. Protothetic makes it possible to formulate within the system itself of many theorems which, with a different formalization of the propositional logic, are considered as belonging to its metasystem, e.g., The laws of
~eneralization,
for every propositional category.
/1/ The law of generalization for the lowest propositional category: Cf)[(p) f(p) ;: f(O)l\f(l)J where 0 and 1 stand for (p)p and 0;0 respectively, /2/ The law of generalization for the second propositional category, i.e., for functions of one argument: (A) ~ (f) [A(f) '" A(as) " A(neg) II A (ver) " A (falsum)Jj where as, neg, ver and falsum are the functions of one propositional variable: assertion, negation, verum and falsity respectively. The laws of generalization for functions of two propositional variables as well as for higher propositional categories can be formulated analogously. The laws of extensionality, for every propositional category. /1/ (p,q) { (p ,;:; s) '" (f) [f(p) ;. f(q)Jj /2/ (f,g) { (p) [f(P) ;: g(p)J =- CAJ [A (f) ~ A(g)]} and so on for higher propositional categories. Deductive formalization of protothetic. Lesniewski constructed several axiomatizations for protothetic based on equivalence as the sole
WORK AND INFLUENCE OF LEsNIEWSKI
201
primitive. For illustration we quote one such an axiomatization. Axioms: /PA1/ (p,q,r) (p ;. q) =. [(r =: q) =. (p =: r)J This is a single axiom of the theory of pure equivalence based on detachment and substitution rules.
1
l
/PA2/ (P,f){ f(p) ={fCP == 0) =' (q)[f(p) ;; f(q)JH Rules of inference: /PR1/ Detachment rule for equivalence, /PR2/ Distribution of general quantifiers: From
(x 1, ... ,x
n)
(A;: B)
infer (x 1, ..• ,x i) [(x i + 1,· .. ,X n)A;: (x i + 1, ... ,X n)B] where x ••. ,x 1, n tegory, /PR3/
are variables of arbitrary propositional ca-
Protothetical rule of definition: (x l' ... , x n) [ A (x l' •.. , x n)
=: B (x l' ... , x n )
J
where
A(X ••• ,x is a propositional formula with x •.. ,x 1, 1, n) n the only free variables of any propositional category and B is a new propositional constant distinct of any sign in the same propositional category,
/PR4/
Protothetical rule of extensionality, for all propositional categories /except for the lowest propositional category which is guaranted in the axioms/: (A,B) {(x 1""'xn) [A(X 1, ... ,x n) ;:: B(x1, ...
/PR5/
,xn~==(C)[C(A)
.,
c(~f
Substitution rule, which does not permit to replace complex expressions by other expressions but only to replace free variables representing propositional variables or functions of any propositional category by constants or variables of the same propositional category.
Several versions of protothetic based on equivalence are axiomatized by a single axiom. To give an example. /PA2/ (p,ql{C p:q):. (fJ{f{P,f[P'(S)s]} =: (r)[f(q,r) == (q == PJJ1} This axiom had been found as a result of a series of successive simplifications made by Lesniewski, Tarski, Wajsberg, Sobocinski and others, and it is the shortest one known as yet /Cf. SOBOCInSKI /1934//. Protothetic based on implication. Protothetic can be based on primitives other than equivalence. A system based on implication as the only primitive was developed mainly by A.Tarski. He defined the negation of p as p~(p)p and adopted the following axiom (q,f)i[Cp) (p~)J =l){f[(p) p]=-=) f(ql1} and reformulated the rules of inference accordingly. Protothetic as ~ generalization of the ordinary propositional logic. Protothetic contains all the theorems of the ordinary two-valued propositional logic which was proved by A.Tarski in 1922 /Cf. TARSKI /1923a/, /1923b/ and /1924//. Two-valuedness of protothetic. As a formal system protothetic is com-
202
J. SURMA
plete with respect to the two-element Boolean algebra, this algebra being considered together with all the Boolean functions stated explicitly. That makes protothetical theorems rather easy to predict. Protothetic allows to express within its own language the method of verification by the well-known zero-one matrices. The usual manner of proving theorems resembles the ordinary verification by the zero-one matrices and i t runs as follows: first we prove a theorem in question for the constant zero, then for the constant one and finally we apply a law of generalization. As a formal system protothetic is deductively consistent, decidable and deductively complete, i.e., for every closed propositional formula A either A or the negation of A is provable. In the case of the underlying logic being two-valued protothetic has been extensively examined. On the other hand, many-valued protothetics are not so fully investigated. One paper dealing with the problem belongs to T.Scharle /ef. SCHARLE /1971//. Scharle states without proof that finitely-valued protothetic can be constructed with the completeness property in the following sense: For every closed propositional formula A one of J.(A) is provable, where the functions J. are such that they take desIgnated values only in case the argument has value i. This statement can be proved in many ways, e.g., by using the finitely generated trees technique which has been given in SURMA /1976/. Definitions and extensionality in protothetic. Lesniewski's formalization of protothetic includes the rule of definition allowing to extend at will the variety of propositional categories. So protothetic is presented not as a completed system but as a system developing in the course of time. This property expresses Lesniewski's very conception of the deductive system. The set of well-formed propositional formulas in not actually given as a whole. Systems are being continually enriched by newly defined terms. All definitions are formal theorems of the system in question and these definitions can be creative. Only the translatability or eliminability condition is satisfied by the defined terms. The equivalence sign ~ is just used, not mentioned, i.e., it is of linguistic, not metalinguistic nature. Another peculiarity of protothetic closely related to the role played by definitions is that it includes the extensionality as an inference rule, not as the law of extensionality. By this the formal system of protothetic is finitely axiomatized. The construction of a constant of a certain propositional category automatically supplies the rule of extensionality for that category. remark concerning the theory of propositional semantic category. In order to state precisely the conditions to be satisfied by an expression to be a new theorem Lesniewski worked out a general theory of semantic categories. The theory formally resembles the simple theory of types but, in its intuitive credentials, has more in common with the traditional "parts of speech" or with Husserl's "Bedeutungskategorien". Unlike the authors of "Principia Mathematica" Lesniewski understood logical types as kinds of expressions and not of non1inguistic entities.
~
Let s denote the basic category of propositions. As it is generally known this basic category is common to all languages. All other propositional categories are obtainable by finite application of the following rule: If~, £o"."£n are propositional categories, then
WORK AND INFLUENCE OF LEsNIEWSKI ~/~o"'~n
203
is a propositional category. Every sign, i.e., constant or
variable belongs to one and only one semantic category. Protothetic contains variables, constants and functions of all categories for which the index can be constructed by using only the symbol s. Notice that the variables and constants in the language of protothetic are not provided with category indices so the belonging of symbol to a particular category follows from the part it plays in a formula under consideration, and two formulas are of the same category if they occur in brackets of the same shape or if they precede similar brackets. In Lesniewski's theory of categories there is the following rule of contraction: Any cate~ory of the form {•.• C(a/b '" -n b ) -0 b ) •.. -n b ) - -0
contracts into the category a. This rule represents the fact that when a function of the category alb .•. b is applied successively to n propositional formulas of the cat~gorI2s b , .•• ,b respectively, then a complex formula is formed which is oIOthe ca~egory ~. Notice
that quantifiers are treated by Lesniewski as pseudosigns, as signs of no category at all or in other words, as categorially open signs. No doubt, this is a deficiency of the discussed theory. The deficiency has often been mentioned in the literature /See for example TARSKI /1933/ or PRIOR /1961//.
remark concerning Henkin's theory of propositional ~ . In 1963 L.Henkin published his formulation of the theory of propositional types based on equality and functional abstraction as primitives /See HENKIN /1963//. It was improved by P.Andrews later on /Compare ANDREWS /1963//. This theory was a modification of A.Church's formulation of the simple theory of types /Cf. CHURCH /1940//. In 1975 Henkin converted the ideas of his 1963 paper into a formulation based on equality and relational abstraction as more familiar than the functional abstraction expressed in terms of Church's lambda-notation /Cf. HENKIN /1975//.
~
Henkin's theory of propositional types can be considered as a version of protothetic though it is not identical with Lesniewski's protothetic. In Lesniewski's there are variables for each propositional category, each variable can be bound by quantifiers that are of no category. Lesniewski adopts in his system only one equivalence sign denoting the usual connective of equivalence. Also Lesniewski's protothetic includes a special rule of definition which allows the introduction of new symbols as names. of arbitrary elements of any propositional category. Henkin also constructs a theory with a distinct set of variables for each propositional type but he adopts as many equality symbols as to have one for each propositional type. Then he introduces relational abstraction as /another/ primitive of no propositional type. He starts with names for only a relatively few elements of the hierarchy of propositional types and constructs new names by means of relational abstraction. He proves that in this way each element of a given propositional type has a name in the system, and he defines quantifiers in terms of equality and relational abstraction. The possibility of defining quantifiers is essentially due to Quine /Cf. QUINE /1956//. Henkin gives a deductive formalization of his theory and shows that it is complete in the sense that A is provable in this formalization if it has the value "true" for every assignement of values to its free variables under a given interpretation.
204
J. SURMA
Later on A.Grzegorczyk showed that every equality symbol in Henkin's theory is definable by means of relational abstraction and of the equality symbol for the lowest propositional type, i.e., the equality symbol denoting the usual equivalence /Cf. GRZEGORCZYK /1964//. Notice that this Grzegorczyk's reduction does not finitize the axiom system of Henkin's theory because we need the rule or denumerable many axioms having the form of extensionality. Henkin gives his theory of propositional types an elegant semantic interpretation. His interest for the theory was caused by the problem of constructing non-standard models for the full theory of types as described in HENKIN /1950/. He hoped for insight into the totality of models for the full theory of types through his study of all models of the much simpler theory of propositional types where each type is finite. Such an argument indicates another methodological advantage of protothetic. Similar arguments are widely known. Many problems of ordinary and higher order predicate logic can be reduced to questions about propositional logic as for instance in Herbrand's theorem. remark concerning pure equivalence. Lesniewski initiated investigations into the equivalential fragment of the ordinary propositional logic. He was the first to axiomatize the pure equivalence logic. In LEsNIEWSKI /1929c/ he stated that the axioms
~
/A1/
[(p .. r)
'"
and /A2/
[p;; (q
c.
(q
r)J
c.
p)]
.= (r c.
=[(p;;.
q).=
q)
rJ
together with substitution and detachment for equivalence form a complete system. To prove this Lesniewski formulated the so called now Lesniewski's decidability criterion for equivalence according to which any pure equivalence can be deduced from /A1/ and /A2/ if every propositional variable occurs in it an even number of times. A group-theoretic proof of Lesniewski's decidability criterion is given in STONE /1937/. A detailed discussion of the method Lesniewski used in his proof is given in SURMA /1973a/. A Henkin-style completeness argument for the pure equivalence can be found in SURMA /1973b/. A study of the equivalent fragment of the intuitionistic propositional logic by means of proof-theoretic methods was given in TAX /1973/. An algebraic reformulation of Tax's result can be found in KABZINSKI-WROnSKI /1975/. ONTOLOGY Deductive formalization of ontol~. Lesniewski constructed a formal system of ontology as based on the axioms and rules of protothetic. Ontology includes the following additional axiom: /OA1/
(a,b)[aEb
=l~c)(c£a){c)(c£a =} CEb) ,,(c,d)(c£a
x d t a ==>C2d)]
and the following rules of inference: /OR1/
(a,x1, ... ,xn){[ataAA(x1, ... ,xnl] .=[a€.B(x1, ... ,xnlJ} where B is the constant being defined and A is a propositional formula in which x ... ,x and possibly a are the only 1, n free variables of arbitrary semantic categories. This is the rule of ontological definition.
WORK AND INFLUENCE OF LEsNIEWSKI /OR2/
(A,BH(a,xl,··.,xJ[a~A(xl,···,xn)~ ==
(C )[c
where x
(A)
B(X 1,·.·,xn
205
lJ -
S C(B)]}
.•• ,x
are variables of arbitrary semantic category. 1, n This is the rule of ontological extensionality.
The axiom /OAI/ is the so called long axiom of ontology and according to Lesniewski it gives the precise meaning of the copuls "is" in its existential sense. This axiom can be equivalently substituted by the following one: /OA2/
(a,b)[a£b .. (~c)(aE.c" Cfb)J
called the short axiom of ontology. According to the intended intuitive meaning the rule /ORI/ of ontological definition is the ordinary definition of a set by indicating the unit sets it contains. While the rule /OR2/ of ontological extensionality corresponds to the theorem stating that sets composed of the same unit sets are identical. Notice that it was Lesniewski who gave the first exact description of the rules /ORI/ and /OR2/ which was an achievement of a historical value. Observe also that the above formalization of ontology is proved to be consistent. This particular result was obtained by Z.Kruszewski as early as 1925 /Cf. KRUSZEWSKI /1925/ / by interpreting the ontological e p s.j Lon as the conjunction connective. A rematk on definitions. Lesniewski did a pioneer work in applying new approach to the theory of definition. He rejected the view that definitions are merely conventional abbreviations accepted exclusively for economy and space saving in the presentation of a formal system. According to him definitions are an act of choosing the complex and important ideas that are to be the object of study.
a
He emphasised the fact that definitions require special formative rules and he was the first to give such rules. For him definitions are formal theorems of a system under consideration and they are an important way of introducing new semantic categories. AS to the methodological role of definitions they are allowed to be creative, i.e., in the formalized systems there are theorems that do not contain defined terms and yet cannot be proved without these definitions. Lesniewski affirmed his belief in the admissibility of creative definitions under an influence of J.Lukasiewicz who discussed the problem of creative definitions in his lectures summarized then in LUKASIEWICZ /1928-29a/ and /1928-29b/. The fact that definitions are to be treated as theorems of formal systems and can be creative has not been widely recognized in the foundational studies. Nevertheless there are authors sharing this Lesniewski's view /Cf. for example MORSE /1965//. Recently several interesting papers have been published concerning reative definitions. One of them is MYHILL /1953/ where the author constructs a system of arithmetic with infinitely many creative definitions. Another paper is RICKEY /1975/ where axiomatizations of the propositional logic containing a single creative definition, a finite number of creative definitions, and also an infinite number of creative definitions are given. RICKEY /1975/ gives also the proof
206
J.
SURMA
of the Lindenbaum theorem stating that the propositional calculus containing p=9p as a theorem has no creative definitions. However, there still remains more work to be done in the problem of creative definitions. Ontology and set theory. Ontology is a kind of the simple theory of types combined with the theory of a certain relation between objects synbolized "e" which is to abbreviate the copula "is" as it is used in Polish or in Latin. However, unlike Russell who refered to entitities rather than to symbols of entities, though his position was not free of certain hesitation, Lesniewski would prefer to say that "a is b", in symbols, "atb" is true if "a" is a name or more precisely "a" is a unit or singular name and the predicate "b" must apply to "a". This might suggest that ontology embodies what can be called distributive interpretation of names as symbols for sets. Is this the case will be seen from below. Observe first that according to Lesniewski the relation "£" covers all the cases like these: III Plato is wise where "is" stands for class-membership of the ordinary set theory, 121 The dog is an animal where "is" stands for class-inclusion, and 131 Socrates is the husband of Xantipa where "is" stands for set-identity. These examples show that Lesniewski tried to single out Ibut not to confusel some common properties of class membership, inclusion and identity. That, of course, does not imply there is no distinction between the three relations, and it was Frege who as the first logician pointed out the necessity of making the distinction. The fact that the epsilon of ontology differs from that of the ordinary set theory can be further illustrated by the two following theorems of ontology
III
(a,bIC aj b ~ ae.a)
expressing the semi-reflexivity gical epsilon, and
121
(a,b,c) (aEb
A
Ibut
not reflexivityl of the ontolo-
b ac ~ a sc]
expressing the transitivity of the epsilon. Notice that the ontological epsilon is not reflexive as the formul.a
131
(aJ(aea) is not true in ontology; it is even independent of ontology.
Taking this into account some authors ICf., for example, FRAENKELBAR-HILLEL-LEVY 11973/, p.2031 consider Lesniewski's ontology not as a variant of set theory in the sense of ZF but rather as a rival of set theory in the field of the foundations of mathematics. However, no matter how important a rival of set theory ontology is, or could be, this is a question still very difficult to be answered. The only certain thing is that the standard arguments leading to the logical antinomies cannot be reproduced in ontology: some of the counterparts of these arguments fail to comply with Lesniewski's theory of semantic categories while others require certain steps which are not viable in ontology. Readers interested in more details concerning Lesniewski's unpublished materials on antinomies are advised to consult SOBOCInSKI 119501.
WORK AND INFLUENCE OF LEsNIEWSKI
207
Ontology and the axiom of choice. Ontology has no axiom of choice, and the theorems provable in ZF with the aid of this axiom, in ontology appear as consequences of conditional sentences which antecedents contain the conditions corresponding to the axiom of choice. Lesniewski decided his ontology to be uncommited to the axiom of choice as its acceptance is equivalent to the well-ordering principle and the hierarchy of semantic categories would require then the axiom of choice for every category. but nevertheless Lesniewski studied the contents of the axiom and even established that it is effectively equivalent to each of the following transfinite arithmetics formulas:
/1/ /2/
m·n m+n
m
or
m·n
m
or
m+n
n
=
n
for every transfinite m and n /Cf. for this KURATOWSKI-MOSTOWSKI /1968/, p.297/. Recently several axiomatic extensions of Lesniewski's ontology have been examined. For instance, J.Canty added to ontology an axiom of infinity in order to derive from it Peano's arithmetic, and he showed that Goedel's incompleteness theorem is applicable to this extension /See CANTY /1969a/ and /1969b//. Ch.Davis discussed formulations of the axiom of choice that can be expressed in ontology, one of the axioms having the following very elegant form
(j nCa,g) [g(a) ~ g(f(a))] /Cf. DAVIS /1975//. J.Kowalski established that the axiom of choice, Kuratowski-Zorn's lemma and the well-ordering principle are equivalent in ontology /Cf. KOWALSKI /1975//. However, this interesting area has not been fully treated yet. Proper interpretation of ontology in set theory. Each of the following statements is independent from ontology: /1/ There are no individuals, /2/ There is only one individual, /3/ There are more than one individual, /4/ There are infinitely many individuals, and each of them can be used to extend ontology. Lesniewski himself, due to a strong philosophical belief, carefully avoided the postulating the existence of anything, and he put empty and non-empty names into the same semantic category. This raises the question of a proper interpretation of ontology in terms of the usual theory of sets. In set theory when giving a standard interpretation to a formal system one must specify a non-empty domain of discourse which is assumed to coincide with the range of the variables of the theory. In the usual set theory to be means to be the value of a variable. In Lesniewski's the domain of discourse may be arbitrary, even empty, and it would be non-empty only if ontology contained exclusively singular names. On the other hand, ontology includes the usual first order logic and therefore the classical principle of inference to which ontology adheres is that the particularly quantified statements follow from the universally quantified ones, in symbols, (x)A =*l~ x)A which implies that the range of the variables cannot be identified with the domain of discourse. For the discussion of some further peculiarity of ontology see PRIOR /1965/. There were many discussions on how to interpret ontology in set theory. The most persistent idea which was presented by many authors L's that the Lesniewskian formula a b is interpreted in set theory as:
e.
J. SURMA
208
[a} is contained in b Under this interpretation the axiom of ontology is a theorem of set theory and the ontological rules of inference are valid as certain set-theoretic conditions. In accordance with this interpretation the range of the variables can be assumed to be the power set of the domain of discourse. The elements of the power set assigned to the elements of the category of names are called sometime the extensions of the names. Using this terminology it is clear that if there are only n objects in the domain then there are 2 n extensions to be assigned to the names. If, on the contrary, the Jomain is infinite then there are more than denumerably many extensions. To end the section remark that only a very little has been done concerning model theory of for ontology. Some general model-theoretic aspects of ontology in the style of Henkin's "Completeness in the theory of types" /Cf. HENKIN /1950// have recently been discussed in BOUDREAUX /1976/, CANTY /1976/ and RICKEY /1976a/. Ontology as ~ Boolean algebra. Some authors like, for instance, A.Grzegorczyk /Cf.GRZEGORCZYK /1955// point out a formal resemblance between ontology and the theory of complete atomic Boolean algebras with constants and functions of an arbitrary high type. From this point of view expressions of the form "a E a" should be read "a is an atom" though Ledniewski himself would prefer to read this as "a is an object". The zero element of Boolean algebra is defined in ontology as follows a EO", (a t
a)~ ..... (a E
a)
and the epsilon of ontology can be defined in Boolean algebra as follows a £ b := (a is an atom)/'{b is an element containing a) In Boolean algebraic terminology the rule /ORI/ of ontological definition is the definition of an element of Boolean algebra by indicating its atoms while the rule /OR2/ of ontological extensionality says that elements of Boolean algebra composed of the same atoms are identical. A remark on quantifiers. Adopt the following definition of the empty na~
/1/ (a) [(a f. a)" "'(a E a) '" a t J\.] and observe that the following is a theorem of ontology
/2/
",,(I'. E1\.)
hence by the familiar rule of the existential generalization /3/
~(al(a€a)
Now it is evident that quantifiers as used in ontology cannot be interpreted referentially or objectually, i.e., with variables ranging over objects refered to by names. If they could the formula /3/ would mean that every object is not identical with itself which is false in set theory. Again ontology is·to be a formal system without any ontological commitment and therefore quantifiers need have nothing to do with existence. On the other hand, the substitutional interpretation of Lesniewski's quantifiers also cannot be unconditionally accepted. Remember that a universally quantified sentence is considered as true if from the fact that for any name when substituted for the variable in question it follows that the ope~ sentence after the quantifier comes out as true. First of all, to avoid the trivial case there had to be an infinite
WORK AND INFLUENCE OF LEsNIEWSKI
209
number of names. Secondly, the names need not to be unique or singular. Thirdly, the rule /ORI/ of ontological definition when applied it extends the language, and once the definition is asserted, quantification involving variables of the semantic category in question is permitted. But in Lesniewski's ontology sets, either empty, unit or not, are recognized as purely linguistic entities serving as extensions in the range of quantification, and not in the domain of discourse. The definitions of new linguistic expressions increase only the "logical power" of the system. They do not produce any change in the domain. So the following view on Lesniewskian quantifiers should be accepted: a universally quantified sentence is said to be true if, when any extension is substituted for the variable in question, the open sentence after the quantifier comes out as true For the further discussion of the Lesniewskian quantifiers consult KUENG-CANTY /1970/. comparison of Lesniewski'~ theory of semantic categories with other versions of ~ theory. In Lesniewski's system symbols "a" and "b" in the expression "a E b" belong to the same semantic category, namely, to the category n of names. In Russell's theory of types they denote sets as belonging to different types. In Lesniewski's the category of names includes not only singular names but also empty and shared or general names.
~
This property of Lesniewski's theory of semantic categories makes it more suitable for practical deduction than other versions of type theory are, e.g., Tarski's theory of classes as described in TARSKI /1933/ or that of Church's /Cf. CHURCH /1940//. In Church's we encounter only functions of one argument, i.e., only types of the form alb, and functions of more arguments are constructed by superpositions, e.g., to the category a/bb in Lesniewski's corresponds the type (~/£)/£ in Church's. In TarskI's we have variables of type ~, and variables of type sin denoting unary relations of type a where a is either n or sin. For-instance, to the category s/nnn in-Lesniewski's corresponds the type s/(s/(s/n)) in Tarski's. The-Ianguage of ontology seems to bethe most-appropriate when we employ but the lowest possible types or categories. From this point of view Tarski's language is less convenient. As we have already shown Lesniewski's system contains creative definitions. Now a system of the simple theory of types allowing definitions may be converted equivalently to a system not containing them but containing the so called axiom of definability lor the axiom of subsets or the Comprehension schema/ instead. Lesniewski himself investigated some anologies of the axiom and called them pseudodefinitions. Clearly systems without definitions are easier to be described and therefore more convenient in metalogical research. Application of the theory of semantic categories to Linguistics.Lesniewski's hierarchy of semantic categories has become a fruitful starting point in developing modern linguistics. The hierarchy begins with two basic categories, the category s of sentences, and the category n of names, and it consists of infinitely many complex categories, resembling "parts of speech" in ordinary traditional grammar. Lesniewski himself invented it to prevent logic and mathematics of antinomies.It remained almost unnoticed outside mathematical logic until 1935 when an important K.Ajdukiewicz's paper "Die Syntaktische Konnexitaet" appeared /Cf. AJDUKIEWICZ /1935//. Ajdukiewicz derived his conception of syntactic connection from Les-
210
J.
SURMA
niewski's conception of semantic categories. He introduced a special kind of notation of complex categories, the so called Ajdukiewicz's fractions. Ajdukiewicz's paper revealed an interesting interpretation and application of the theory of semantic categories in the field of formal linguistics though neither Lesniewski nor Ajdukiewicz were directly interested in the problem of the mechanical determination of syntactic structure of expressions of natural languages. Their theory of semantic categories became a popular and influential version of what is called today categorial grammars or categorially based grammars. The importance of the theory of semantic categories has been especially proved by Y.Bar-Hillel /Cf. BAR-HILLEL /1950//, H.Hiz /Cf. HIZ /1961//, D.Lewis /Cf. LEWIS /1972//, M.Cresswell /Cf. CRESSWELL /1973// and others. In this connection compare also MACHOVER /1966/. Bar-Hillel slightly modified the Lesniewski-Ajdukiewicz theory and he extended it in three ways: /1/ assignment of more than one category to the words was allowed, /2/ a new kind of operator category was introduced, the operators act upon arguments at their immediate left: if a and b are categories then a-b, read, "a sub b" is a new category, /3/-in thIs connection an additional rule 01 cancelation was introduced: from a and a-b infer b. Lewis also presented a very interesting extension of Lesniewski-Ajdukiewicz's ideas. As to Cresswell he returned to lambda-operator as a new operator instead of quantifiers. For a number of interesting problems in the formal theory of language based on the theory of semantic categories the reader may consult JAKOBSON /1961/. A detailed discussion of the work on grammatical categories done by Husserl, Lesniewski, Ajdukiewicz, Bar-Hillel, Hiz and others encompasses also LEHRBERGER /1974/. MEREOLOGY Analysis of Russell's antinomy convinced Lesniewski that he should distinguish between the distributive and the collective interpretation of the notion of set or class. The expression "a l b" when understood distributively means "a is an element of the class of b's". If, however, we interpret it collectively it means "a is a part /proper or improper of the whole which is b". The term "part" is understood here as a "piece" of some object and therefore parts cannot be empty. As concrete agregates mereological sets /but, of course, not ontological sets/ possess no cardinal number. Once the distinction between collective and distributive sets was made it was evident for Lesniewski that the presuppositions on which Russell's antinomy was based turned out to be false. Lesniewski constructed a deductive theory called first the "general theory of sets" which he subsequently renamed mereology. The reason was that mereology and set theory make use of different notions of set and of membership, and therefore he relinquished the idea of treating mereology as a set theory. The part-whole relationship has long been examined and apart of Lesniewski's mereology several other approaches were presented. To quote only two examples. J.Woodger independently developed a system very similar to Lesniewski's /Cf. FLOYD-HARRIS /1964//. The so called calculus of individuals of Leonard and Goodman was also a version of mereology apparently weaker than the lastone /Cf. LEONARD-GOODMAN /1940//. deductive formalization of mereology. Mereology may be formalized in terms of many primitives. The most familiar formalization uses "pr", a rnereological elementcalled part, as a primitive where the expression
~
WORK AND INFLUENCE OF LEsNIEWSKI
211
"a E. pr b " stands for "a is (a) part of b". The axioms of mereology are as follows: /MAI/ (a,b}[a~pr(bl ' 9 b H J The axiom states that only the objects having parts are individuals. /MA2/
(a,b,c)[a epr(b) " b ~pr(cl
=> a
E. pr(cJl
Transitivity axiom. To formulate the next axioms we first introduce "Kl", a mereological class, which is defined in the following way: /DEF/
(a,K){KtKl(al; K~K ~(L)[Lca ~Lepr(KJJI\ ~ (L) {L epr(K} =7 (3 M, N) [M fa liN Epr(L)~ N tpr(M)Jl}
Now the next axioms follow. /MA3/
(a,K,L)[KeKl(al~LfKl(a) =9KeL]
The axiom states that mereological classes are unique. /MA4/
(a,K) { K E.a
=>(3 L )
[L Hl(ajJ
1
The axiom postulates the class existence. No new rules of inference are added in mereology. One of the mereological theorems is the following formula (a) [a s a ~ a e pr(al] which states that only individuals are parts of themselves. At first this theorem was accepted as an axiom but afterwards it was proved to be redundant /Cf. CLAY /1970//. The definition of the notion of mereological class as given above is somewhat cumbersome and it might have been expected to be simplified. As a matter of fact a conjecture was made that it could be substituted by the following more intuitive definition. (a,K) { K EKl(a) ~ K £K II (L) [a E preLl '" K E pr(LJ]} But as R.Clay proved /Cf. CLAY /1966// it is weaker than the definition /DEF/ above. The above system of mereology is consistent which can be proved by many ways. One way is to supply it with an appropriately chosen model An interesting mode~ for mereology was given in CLAY /1968/. The domain of it consists of the set of all real numbers whose decimal expansions contain only zeros and ones with the exception of O. The relation "a is a part of b" means here that every position where a has a one in its decimal expansion, b has it too. As to the real number system it is introduced by Clay axiomatically into ontology. In the above axiomatization Kl(aJ is essentially the least upper bound of the a's. This remark was made by Cz.Lej~wski in 1960 /Cf. for this RICKEY /1976b//. The same was also studied in KUBInSKI /1971/. Many other axiomatizations of mereology have been given. For account see LEJEWSKI /1954/, /1955/ and /1962/. Extensions of mereology. The literature concerning mereology is very rich and it is richer than that on other Lesniewski's systems. It includes, among others, two long works by Lesniewski himself, LEsNIEWSKI /1916/ and /1927-1931/. One of the best introductions to mereology is SOBOCInSKI /1954/. An extension of mereology providing a basis for developing the arithmetic of natural numbers is given in S~UPEC KI /1958/. Other extensions of mereology are known under the names of
212
J. SURMA
atomistic and atomless mereologies respectively. Atomistic mereology is a result of adjoing to mereology the new axiom stating that every object is either an atom or a mereological class constructed from these atoms which are the mereological elements of this object. Atomless mereology, on the other hand, consists of mereological axioms enriched by the new axiom statying that no atom exists in the field of mereology. For account see SOBOCInSKI /1971/ where it is shown, among others, that the definitions of atoms given by Schroeder and Tarski /Cf. TARSKI /1935// are equivalent in mereology. Applications of mereology. First application of mereology goes back to 1929 when A.Tarski used it to form a foundation for the geometry of solids. He proved that axioms for Euclidean geometry may be given in mereology with the primitive "solid" added if solids were interpreted as open /solid/ spheres. Points were defined by Tarski as distributive las characterized in ontology/ classes of concentric spheres introduced by means of rather sofisticated definitions /Cf. TARSKI /1929//. These definitions have been simplified afterwards in JAsKOWSKI /1948/. Tarski's idea has been extensively developed by T.Sullivan who obtained similar results for affine geometry where solids are interpreted as parallelepipeds /Cf. SULLIVAN /1971// and for projective geometry where solids are interpreted as convex quadilaterals /Cf. SULLIVAN /1972/ and also /1973//. Mereology seems to serve not only in the geometry of time-space solids, but also in formalization of empirical sciences like geometry of events, some chapters of theoretical biology /Cf. WOODGER /1937//, phonology /Cf. BATOG /1967// and others. On relation of mereology to complete Boolean algebra. In TARSKI /1935/, Footnote 1, one may find: "The extended" /i.e., complete/ "Boolean algebra is closely related to the deductive theory developed by Stanislaw Lesniewski and called by him mereology. / ... /. The relation of the part to the whole, which can be regarded as the only primitive notion of mereology, is the correlate of Boolean-algebraic inclusion". This is then detailly discussed in GRZEGORCZYK /1955/ where the following formal statement can be found: "The models of mereology and the models of complete Boolean algebra with zero deleted are identical". This statement has been subjected to a strong criticism in CLAY /1974/. Clay states that Grzegorczyk's proof is faulty and that the system Grzegorczyk describes as mereology is in fact not Lesniewski's mereology.Lesniewski's description of mereology was presented in colloquial Polish since it was published during his pre-symbolic period. The Lesniewski's expression "a is an ingredient of b" that is "a is a part of b or a is identical' with b" is symbolized in GRZEGORCZYK /1955/ as "a ingr b", i.e., by considering "part" as a binary relation and by eliminating ontological epsilon from mereologyaltogetheL Clay insists, on the contrary, on symbolizing the Lesniewski's expression as "a E pr(b) ". The next Clay's criticism of Grzegorczyk says that the later describes ontology as an elementary theory and in fact proves that just this elementary ontology shares the same models with complete Boolean algebras with zero deleted. CONCLUDING REMARKS Stanislaw Lesniewski's efforts to solve the problem of antinomies resulted in the construction of what he called the New System of the foundations of mathematics, distinguished by originality, comprehensiveness and elegance, a pioneering achievement of the 20-ties. Lesniewski played a considerable role during the period of elabora-
WORK AND INFLUENCE OF LEsNIEWSKI
213
rating modern tendencies of mathematical logic and the foundations ofmathematics. He was the forerunner and originator of many ideas now incorporated into logical and foundational text-books. But Lesniewski's writings are done in a highly condensed and difficult style, most cumbersome in practice, his famous terminological explanations are hardly intelligible. He invented a special symbolism, the so called wheel and spoke notation the use of which was an additional factor determining his isolation on the international scene. This is why Lesniewski's systems have not been so popular as they deserved. Another reason is that general trends of logical research had meanwhile drifted away from "system building" to metalogical investigations mostly of first order languages. All this is quite unfortunate but the fact remains that Lesniewski·systems are not generally accepted as a tool in the foundational practice, and they" are not the systems a mathematician in the street makes use of. But on the other hand, Lesniewski'Swork has greatly influenced the very phflosophy of logic and of the foundational studies. In this field Lesniewski had worked out an original point of view he had called the "intuitionistic formalism" which he characterized by these sentences: "I might take this opportunity to point out that. for many months I have devoted a considerable expenditure of systematic work towards the formalization of these systems / ••• / through a clear formulation of their directives using a number of the auxiliary terms whose meaning I fixed in the terminological explanations / •.. /. Having no predilection for various "mathematical games" that consist in writing out according to one or another conventional rule various more or less picturesque formulae which need not be meaningful, or even - as some of the "mathematical gamers" might prefer - which should necessarily be meaningless, I would not have taken the trouble to systematize and to often check quite scrupulously the directives of my system, had I not imputed to its theorems a certain specific and completely determined sense, in virtue of which its axioms, definitions, and final directives / .•. / have for me an irresistably intuitive validity" and further "I know no method more effective for acquainting the reader with my logical intuitions tban the method of formalizing any deductive theory to be set forth. /Compare for this LEsNIEWSKI /1929/, p.78/.
214
J. SURMA
REFERENCES Abbreviations: A~ Acta Universitatis Wratislaviensis, Wroc!aw, Poland, CL Collectanea Logica, Warsaw, Poland. This journal and its first inaugural volume destroyed with its Warsaw printing house during the World War I, CR Comptes Rendus des Seances de la Societe des Sciences et des Lettres de Varsovie, Classe III, FM Fundamenta Mathematicae, JCS Journal of Computing Systems, JSL Journal of Symbolic Logic, NDJFL Notre Dame Journal of Formal LogiC, North-Holland Publishing Company, NH PF Przeglqd Filozoficzny /Philosophical Review/, poland, PSASA Polish Society of Arts and Sciences Abroad, London, RF Ruch Filozoficzny /Philosophical Movement/, Poland, Studia Logica, Poland SL AJDUKIEWICZ K. 1935 Die Syntaktische Konnexitaet. Studia Philosophica,1/1935/,1-27. ANDREWS P. 1963 A reduction of the axioms for the theory of propositional types. FM,52/1963/,345-350. BAR-HILLEL Y. 1950 On syntactic categories. JSL,15/1950/,1-16. BATOG T. 1967 The axiomatic method in phonology. Routledge and Kegan Paul, London, 1967. BOUDREAUX J. 1976 Set-theoretic models for Lesniewski's logical systems. XXllnd Conference on the History of Logic,5-9 July, 1976. Jagiellonian University and Polish Academy of Sciences. Krak6w,1976. pp.2-5 /Abstract/. CANTY J.T. 1969a The numerical epsilon. NDJFL,10/1969/,47-63. 1969b Lesniewski's terminological explanations as recursive concepts. NDJFL,10/1969/,337-363. 1976 The proper interpretation of ontology. XXllnd Conference on the History of Logic,5-9 JUly,1976. Jagiellonian Univ. and Polish Academy of Sciences. Krak6w, 1976,pp. 6-8 /Abstract/. CHURCH A. 1940 A formulation of the simple theory of types. JSL,5/1940/,56-68. 1956 Introduction to mathematical logic. Vol.I. Princeton University Press, 1956. CHWISTEK L. 1923 The theory of constructive types.
WORK AND INFLUENCE OF LEsNIEWSKI
215
Annales de la Societe Polonaise de Mathematique,2/1923/,9-48. CLAY R. 1966 On the definition of mereological class. NDJFL,7/1966/,359-360. 1968 The consistency of Lesniewski's mereology relative to the real numbers. JSL,33/1968/,251-257. 1970 The dependence of a mereological axiom. NDJFL,11/1970/,471-472. 1974 Relation of Lesniewski's mereology to Boolean algebra. JSL,39/1974/,638-648. COUTURAT L. 1905 L'algebre de la logique. Paris, 1905. CRESSWELL M.J. 1973 Logics and Languages. Methuen, London, 1973. DAVIS Ch.C., Jr. 1975 An investigation concerning the Hilbert-Sierpinski logical form of the axiom of choice. NDJFL,16/1975/,145-1B4. FLOYD W.F. and HARRIS F.T.C./Ed./ 1964 Joseph Henry Woodger, Curriculum Vitae; Form and strategy in Science, studies dedicated to Joseph Henry Woodger on the occasion of his Seventieth Birthday. D.Reidel, 1964. FRAENKEL A.A., BAR-HILLEL Y. and LEVY A. 1973 Foundations of set theory. NH,1973. GOODELL J.D. 1952 The foundations of computing machinery. JCS,1/1952/,1-13, Part 11,1/1953/,86-110. GRZEGORCZYK A. 1955 The systems of Lesniewski in relation to contemporary logical research. SL,3/1955/,77-95. 1964 A note on the theory of propositional types. FM,54/1964/,27-29. HENKIN L. 1950 Completeness in the theory of types. JSL,15/1950/,Bl-91. 1963 A theory of propositional types. FM,52/1963/,323-334. 1975 Identity as a logical primitive. Philosophia.Philosophical Quartely of Israel,5/1975/,31-45. HI~
1961
H.
Congrammaticality, Batteries of Transformations and Grammatical Categories. In: JAKOBSON /1961/,pp.43-50.
216
J. SURMA
JAKOBSON R./Ed./ 1961 Structure of language and its mathematical aspects. Proc.12thSymp.Appl.Math.Providence, R.I.,Amer.Math.Soc.,1961. JAsKOWSKI S. 1948 Une modification des definitions fondamentales de la geometrie de corps M.A.Tarski. Annales de la Societe Polonaise de Math.,21/1948/,298-301. KABZInSKI J.K. and WROnSKI A. 1975 On equivalential algebras. Proc.1975 Intern.Smp.on Multiple-Valued Logic,Indiana Univ., Bloomington. Indiana, May 13-16,1975,pp.419-428. KOTARBInSKI T. 1929 Elementy teorii poznania, logiki formalnej i metodologii nauk. Lvov, 1929. Translated into English as "Gnosiology - the scientific approach to the theory of knowledge", London, 1966. 1957 Wyklady z dziej6w logiki. £6dz, 1957. 1965 Garstka wspomnien 0 Stanislawie Lesniewskim. RF,24/1964/,155-163. KOWALSKI J.G. 1975 Lesniewski's ontology extended with the axiom of choice. ph.D. Thesis under B.Sobocinski. Univ.of Notre Dame,Indiana, USA, /unpublished/. KUBInSKI T. 1971 A report on investigations concerning mereology. AUW,139/1971/,47-68. KUENG G. and CANTY J.T. 1970 Substitutional quantification and Lesniewskian quantifiers. Theoria,36/1970/,165-182. KURATOWSKI K. and MOSTOWSKI A. 1968 Set theory. NH, 1968. KRUSZEWSKI Z. 1925 Ontologia bez aksjomat6w. PF,28/1925/,136 /Abstract/. LEHRBERGER J. 1974 Functor analysis of natural language. Mouton, 1974. LEJEWSKI Cz. 1954 A contribution to Lesniewski's mereology. PSASA,5/1954/,43-50. 1967 A note on a problem concerning the axiomatic foundations of mereology. NDJFL,4/1962/,135-139. 1955 A new axiom for mereology. PSASA,6/1955/,65-70. LEONARD H.S. and GOODMAN N. 1940 The calculus of individuals and its uses. JSL,5/1940/,45-55.
WORK AND INFLUENCE OF LEsNIEWSKI
217
LEsNIEWSKI S. 1911 Przyczynek do ana1izy zdan egzystencjalnych /A contribution to the analysis of existential propositions/. PF,14/1911/,329-345. 1912 Pr6ba dowodu ontologicznej zasady sprzecznosci /An attempt to prove the ontological principle of contradiction/. PF,15/1912/,202-226. 1913a Logiceskia razsuzdenia /Logical studies/. St. Petersburg, 1913, 83 p. 1913b Czy prawda jest tylko wieczna czy tez i wieczna i odwieczna? /Is truth only eternal or both eternal and temporal?/. Nowe Tory,18/1913/,493-528. 1913c Krytyka logicznej zasady wylqczonego srodka /A critique of the logical principle of the excluded middle/. PF,16/1913/,315-352. 1914a Czy klasa klas, nie podporzqdkowanych sobie, jest podporzqdko~ wana sobie? /Is the class of classes not subordinate to themselves subordinate to itself?/. PF,17/1914/,63-75. 1914b Teoria mnogosci na "podstawach Filozoficznych" Benedykta Bornsteina /The theory of sets on the "philosophical foundations" of Benedykt Bornstein/. PF,17/1914/,488-507. 1916 Podstawy og6lnej teo~ii mnogosci. I. /Foundations of the general theory of sets. 1/. Works of the Polish research group in Moscow, Mathematics and Science division, No 2. Printed by A.P.Poplawski. Moscow, 1916, 42p. 1927-1931 0 podstawach matematyki /Foundations of mathematics/. PF,30/1927/,164-206; 31/1928/,261-291; 32/1929/,60-101; 33/1930/,77-105; 34/1931/,142-170. 1927a Introduction. PF,30/1927/,164-169. 1927b Section I. PF,30/1927/,169-181. 1927c Section II. PF,30/1927/,182-189. 1927d Section III. PF,30/1927/,190-206. 1928 Section IV. PF,31/1928/,261-291. 1929 Section V. PF,32/1929/,60-101. 1930a Section VI. PF,33/1930/,77-81. 1930b Section VII. PF,33/1930/,82-86. 1930c Section VIII. PF,33/1930/,87-90. 1930d Section JX. PF,33/1930/,90-105. 1931a Section X. PF,34/1931/,142-153. 1931b Section XI. PF,34/1931/,153-170. 1929a Ueber Funktionen deren Felder Gruppen mit Ruecksicht auf diese Funktionen sind. FM,13/1929/,319-332. 1929b Ueber Funktionen,' deren Felder Abelsche Gruppen in bezug diese Funktionen sind. FM,14/1929/,249-251. 1929c Grundzuege eines neuen Systems der Grundlagen der Mathematik. §1-§11. FM,14/1929/,l-81. 1930 Ueber die Grundlagen der Ontologie. CR,23/1930/,lll-132. 1931 Ueber Definitionen in der sogenannten Theorie de~ Deduktion. CR,24/1931/,289-309. An English version of this paper has been published in McCALL /1967/. 1938a Einleitende Bemerkungen zur Fortzetzung meiner Mitteilung u.d.T. "Grundzuege eines neuen Systems der Grundlagen der Mathematik.
218
J. SURMA An
English version of this paper has been published in McCALL
/1967/. 1938b Grundzuege eines neuen Systemsder Grundlagen der Mathematik. §12. CL,1/1938/,61-144. LEWIS D. 1972 General semantics. Semantics of natural language, ed.by Davidson and Horman. D.Reidel, 1972. LUSCHEI E.C. 1962 The logical systems of Lesniewski. NH,1962, 361p. l.UKASIEWICZ J. 1910 0 zasadzie sprzecznosci u Arystotelesa. Studium krytyczne. Krak6w,1910. 1929 Elementy logiki matematycznej. Warszawa, 1929. English translation as "Elements of mathematical logic" edited by Pergamon Press, 1963. 1928-1929a Rola definicji w systemach dedukcyjnych. RF,11/1928-1929/,164 /Abstract/. 1928-1929b 0 definicjach w teorii dedukcji. RF,11/1928-1929/,177-178 /Abstract/. l.UKASIEWICZ J. and TARSKI A. 1930 Untersuchungen ueber den Aussagenkalkuel. CR,23/1930/,30-50. MACHOVER M. 1966 Contextual determinancy in Lesniewski's grammar. SL,19/1966/,47-57. MARTIN N.M. 1953 On completeness of decision element sets. JCS,1/1953/,150-154. McCALL S. /Ed./ 1967 Polish Logic. Oxford, At the Clarendon Press, 1967. MORSE A.P. 1965 A theory of sets. Academic Press, 1965, 130p. MOSTOWSKI A. 1948 Logika matematyczna. Warszawa-Wroclaw, 1948. MYHILL J. 1953 Arithmetic with creative definitions by induction. JSL,18/1953/,115-118. NEUMANN J. von 1927 Zur Hilbertschen Beweistheorie. Mathematische Zeitschrift,26/1927/,1-46. 1931 Bemerkungen zu den Ausfuerungen von Herrn St.Lesniewski ueber meine Arbeit "Zur Hilbertschen Beweistheorie". FM,17/1931/,331-334. PRIOR A. 1965 Existence in Lesniewski and Russell. Formal systems and recursive functions, ed. by J.Crossley and M.Dummett. NH, 1965. 1971 Objects of thought. Oxford, At the Clarendon Press, 1971. QUINE W. van Orman 1956 Unification of universes in set theory.
WORK AND INFLUENCE OF LEsNIEWSKI
219
JSL,21/1956/ RAMSEY F.P. 1925 Foundations of mathematics. Proc.of the London Math.Soc.,ser.2,25/1925/,338-384. RICKEY V.F. 1972 An Annotated Lesniewski Bibliography. Department of Mathematics,Bowling Green State University, Bowling Green, Ohio, USA. Preliminary version 1972. Supplement I 1976. 1975 Creative definitions in propositional calculi. NDJFL,16/1975/,273-294. 1976a Model theory for Lesniewski's ontology. XXIInd Conference on the History of Logic, 5-9 July, 1976. Jagiellonian University and Polish Academy of Sciences, Krak6w 1976, p.24 /Abstract/. 1976b A survey of Lesniewski's logic. Department of Mathematics, Bowling Green State University, Bowling Green, Ohio, USA, June 1976. SCHARLE T.W. 1971 Completeness of many-valued protothetic. JSL,36/1971/,363-364 /Abstract/. SINISI V.F. 1966 Lesniewski's analysis of Whitehead's theory of events. NDJFL,7/1966/,323-327. SLUPECKI J. 1953 St.Lesniewski's protothetic. SL,1/1953/,44-112. 1955 St.Lesniewski's calculus of names. SL,3/1955/,7-73. 1958 Towards a generalized mereology of Lesniewski. SL,8/1958/,131-163. SOBOCInSKI B. 1934 0 kolejnych uproszczeniach aksjomatyki "ontologii" proLSt.Lesniewskiego. Fragmenty Filozoficzne, /1934/,143-160. 1950 L'analyse de l'antinomie Russellienne par Lesniewski. Methodos,1/1950/,94-107; 2/1950/,220-228; 3/1950/,308-316; 2/1950/,237-257. 1954 Studies in Lesniewski's mereology. PSASA,5/1954/,34-43. 1956 In memoriam Jan Lukasiewicz. Philosophical Studies, Maynooth, Ireland,6/1956/,3-49. 1971 Atomistic mereology. NDJFL,12/1971/,89-103. SULLIVAN T.F. 1971 Affine geometry having a solid as primitive. NDJFL,12/1971/,1-61. 1972
The name solid as primitive in projective geometry. NDJFL,13/1972/,95-97. 1973 The geometry of solids in Hilbert spaces. NDJFL,14/1973/,575-580. SURMA S.J. 1973a A survey of the results and methods of investigations of the equivalential propositional calculus. Studies in the history of mathematical logic, ed.by S.J.Surma. Ossolineum, Wroclaw 1973,pp.33-61. 1973b A uniform method of proof the completeness theorem for the equivalential propositional calculus and for some of its extensions. Studies in the history of mathematical logic, ed.by S.J.Surma.
220
J. SURMA
Ossolineum, Wroclaw 1973,pp.63-79. An algorithm for axiomatizing every finite logic. Computer science and multiple-valued logic, ed.by D.Rine. NH,1976,pp.137-143. STONE M.H. 1937 Note on formal logic. American Journal of Mathematics,59/1937/,506-514. TARSKI A. 1923a a wyrazie pierwotnym logistyki. PF,26/1923/,68-89. 1923b Sur Ie terme primitif de la logistique. FM,4/1923/,196-200. 1924 Sur les truth-functions au sens de MM.Russell et Whitehead. FM,5/1924/,59-74. 1929 Les fondements de la geometrie des corps. Ksi~ga Pamiqtkowa Pierwszego Polskiego Zjazdu Matematycznego, Krak6w 1929, pp.29-33. 1933 Poj~cie prawdy w j~zykach naUk dedukcyjnych. Travaux de la Societe des Sciences et des Lettres de Varsovie, Classe III,34/1933/,116p. German translation as Der Wahrheitsbegriff in der formalisierten Sprachen published in Studia philosophica,1/1936/,261-405. 1935 Zur Grundlegung der Booleschen Algebra. I. FM,24/1935/,177-198. 1956 Logic, semantics, metamathematics. Papers from 1923 to 1938, translated by J.H.Woodger. Oxford, 1956, 469p. TAX R.E. 1973 On the intuitionistic equivalential calculus. NDJFL,14/1973/,448-456. WOODGER J.H. 1935 The axiomatic method in biology. Cambridge University press, 1937, 174p. 1976
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
INFERENCE WITHOUT AXIOM OR PARADOXES
George Temple (Oxford)
Theories of Implication The study of formal, propositional logic has known three great periods the Greek, the Mediaeval Scholastic and the Modern, which are, respectively, commonly associated with the names of their reputed founders:
Philo of Megara, Abelard and Frege.
In each period a number of different theories of implication have been advanced of which the most important are (1)
the theory of material implication;
(2)
the theory of incompatibility, and
(3)
the theory of inclusion.
All three of these theories are attributed to the logicians of the Greek school of Megara in the treatise by Sextus Empiricus ("Outlines of Pyrrhonism", Book ii, 110-112, ca. A.D. 200).
Philo is credited with the
theory of material implication, according to which, a propositionp always implies a proposition
~
unless
p
is true and
~
is false.
An unnamed
Stoic, perhaps Chrysippus, is said to have introduced the notion that implies~
if
/J
is incompatible with the negation of
~.
have not been identified are said to define the implication that
'1
is virtually included in
p
p
And aome who
P
~ ~
to mean
Among the Scholastic logicians all three views are expounded at great length and with considerable acuity in the works formerly attributed to Duns Scotus and especially in William of Ockham. In modern times the theory of material implication is to be found in a casual remark by C S Pierce (Collected Papers, vol. III, para. 441-3, first
221
GEORGE
222
TEt~PLE
published in The Monist, vol. VII, 1896) accepted by Russell (l~he Principles of Mathematics", 1903, para. 16) and formally expounded by Wittgenstein and Post.
The theory of incompatibility was advanced by
C I Lewis but sUbsequently withdrawn.
The theory of inclusion
reappears in the work of Pierce, and can be discerned in the writings of H McColl. The purpose of this note is to show that a careful analysis of these three theories shows that they are not merely mutually compatible, but essentially the same, the superficial differences exhibiting only a shift of emphasis.
Philonian Implication We do not possess any of the original works of the Megarian logicians and therefore do not know for certain how they formulated the theory of material implication, but it seems indubitable that they initiated the study of unanalyzed propositions, which were classified as either "True" or "False" accordingly as they corresponded or did not correspond with reality. This unique scheme of valuation was fatal to their theory of inference. Philo of Megara (ca. 300 B.C.) recognised three varieties of valid inference, viZ. from a true antecedent to a true consequent, from a false antecedent to a false consequent, and from a false antecedent to a true consequent.
This is undoUbtedly a complete classification, but it is
difficult to believe that it was accepted as a definition of inference. I cannot believe that any Greek politician, barrister or tradesman can ever have sought to persuade his adversary, his jUdge or his client that a false proposition implies any proposition (true or false), and that a true proposition is implied by any proposition (true or false). In fact what is called "Phil on ian" implication is completely ineffective as a definition, and the Megarian logicians used in its place various schemes of inference Which we should undoubtedly recognise today as completely satisfactory and sometimes of surprising subtlety. They also used the unique valUation of propositions as true or false to characterise disjunction and conjunction, but there is no evidence that they were under the illusion that they had prOVided formal definitions of these connecti ves.
223
INFERENCE WITHOUT AXIOM OR PARADOXES We owe so much to the historical researches of Jan Lukasiewicz (especially the paper "On the History of the Logic of Propositions", first published in 1934, Selected Works, 1970, translation by S McCall,
pp. 197-217), and I M Bochenski ("Formal Logic", 1956, Eng. trans. by Ivo Thomas, A History of Formal Logic, 1961) that it seems ungracious to criticise their accounts of Megarian logic, especiallY as they are followed by so many writers in generously giving Philo the credit for the introduction of "truth tables".
But in fact the theory of truth tables
depends entirely on a system of the multiple valuation of propositions, and cannot, be constructed from the system of unique valuation, by which each and every proposition is ineluctably labelled as "true" or "false". It was perhaps an obscure appreciation of this fact (or was it just ignorance of the Greek logicians?) that guided the mediaeval logicians to ignore the theory of Philonian inference and to replace it by a theory of incompatibility - which we discuss below.
Scholastic Conseguence The voluminous researches of mediaeval logicians are remarkable for their fertility and subtlety, and for their complete independence, and indeed, ignorance of the work of the Stoic and Megarian philosophers. Conjunction and disjunction were loosely described in terms of the truth or falsity of their component propositions, but the problem of inference, now called "consequence", was minutely examined, not in isolation but in relation to the close1y related concepts of incompatibility, compossibility and necessity.
To summarise an extensive literature, a proposition
said to be a consequence of a proposition incompatible wi th
~
is
p , if the negation of'f. is
p .
This doctrine is essentially the same as that given by C I Lewis ("Implication and the Algebra of Logic",
!:!!.!!.2,
21, 1912, pp. 522-531), and the
scholastics draw the same conclusions, viz. that an impossible proposition implies every other proposition and that a necessary proposition follows from any proposition. But whereas Lewis recoiled from these consequences, the Scholastics accepted them with enthusiasm. Neither the Scholastics nor Lewis gave any definition of incompatibility,
GEORGE TEMPLE
224
which was accepted as a primary concept.
Neo-Philonian Inference Although almost all the nineteenth century logicians, with the exception of C S Pierce (ccri ected Works, vol. III, "The Regenerated Logic", The ~,
vol. VII, 1896, pp. 19-40, paras. 441-443), were ignorant of the
Stoic and Megaric school, they unknowingly adopted the prime principle of that distinguished body, namely that any proposition is either true or false.
Now "truth" and "falsity" have no place in formal logic, nor,
strictly speaking has the concept of absolute "proof".
Formal logic is
concerned only with the inference from one proposition? to another proposition
~
, notwithstanding that it is equally concerned with
inferences from the negation of
~
to the negation of
p.
The gradual
realisation of this essential of formal logic is one main characteristics of the "modern" school of logicians.
Henceforward logic is concerned
with syntax rather than with semantics. It might appear that this exclusion of the concepts of truth and falsity would be fatal to the theory of Philonian implication, but this theory can be amended by replacing the concepts "true" and "false" by "deemed to be true" and "deemed to be false". Frege begins his "Begriffshrift" in 1879 by a neo-Philonian definition but it is significant that instead of saying "A is true" or "A is false" he says "A is affirmed"
or "A is denied".
Even Bertrand Russell who wrote as a convinced Philonian in his Principles of Mathematics seems to have accepted the neo-Philonian doctrine in Appendix C to the second edition of Principia Mathematica (vol. I, p. 661, 1925) where he writes When we say truth or falsehood is, for logic, the essential characteristic of propositions, we must not be misunderstood.
It does not matter, for mathematical logic,
what constitutes truth or falsehood;
all that matters is
that they divide propositions into two classes according to certain rules. I could wish that the language of Wittgenstein in his "Tractus Logico -
225
INFERENCE WITHOUT AXIOM OR PARADOXES
Philosophicus" (1921) was more transparent, but since he speaks of "truthpossibilities" I think he is fairly counted as a neo-Philonian.
McColl and Inclusion The transitive character of the relation of implication (viz. that and 'Y.--)o
implies P ~;z
'1
)
p"'"
'r
must have encouraged many logicians to define
implication as a species of "inclusion" by analogy with the theory of sets of points.
However, I have singled out Hugh McColl ("The Calculus of
Equivalent Statements and Integration Limits", Proc. Lond. Math. Soc., vol. IX (1871), pp. 9-20, 177-186, vol. X (1878), pp. 16-28, vol. XI (1880) pp. 113-21, XXVIII (1897), pp. 555-79, vol. XXIX (1897) pp. 98-109, Note, vol. XXX (1898), pp. 320-32.
"Symbolic Logic" (1906) ) as the representa-
tive of this school of thought because of two other characteristics of this remarkable and neglected worker:
firstly McColl must rank as an independent
creator of the propositional calculus, for he rediscovered the algebra of symbolic logic in complete ignorance of the work of Boole, Jevons and Venn and long before Pierce.
Secondly, he is to my knowledge the only logician
who has ever used his logic to solve any problems of practical utility to the working mathematician.
In fact McColl devised his system of symbolic
logic expressly "to determine the new limits of integration when we change the order of integration or the variables in a multiple integral". Considered in its full generality this is an intricate problem not discussed in any of the great treatises of analysis, but solved completely and algebraically by McColl, without any appeal to geometric visualisation. McColl bases his theory on the undefined relation of the "equivalence" of two proposi tions,
P and if , which he writes as p ~ '/- , and takes to be a
symmetric and transitive relation.
The conjunction (P'f-) and disjunction
(pU,/- ), called by McColl "compound" and -"indeterminate" statements are loosely defined by impli~it truth-tables, and the implication Leibnitzian relation for inc lusion, f'" P' , or 'f and.,' or '\. and
""l ',
To indicate which propositions are asserted we can introduce a "valuation function"
-reI-) and while
, such that '«ft): ( ~O,)",
'"C(f') -t-
0
-r((o ,)
if if
,.
is asserted
/
for all propositions
I-
is denied
p
INFERENCE WITHOUT AXIOM OR PARADOXES
227
and their negations Such a valuation function represents a simple valuation of the propositions taken as premises and provides a syntactic alternative to the unacceptable semantic concepts of "truth" and "falsity". (There is no need to introduce the arithmetic concepts of unity and zero, which can be replaced by elements
W
and
of a two element Boolean
~
algebra. ) The criterion for material implication can be expressed in terms of a valuation function in the form,
P
implies
~
if
"C:((» -r{'i) ,
"7:"((»
which yields at once the alternatives either //:""{p) --. 0 or "U) =: I and
'T{q) =
t ,
But this form of the criterion is just as useless as the verbal form so long as we are restricted to one and only one truth function, i.e. to a simple valuation of propositions. But there is no reason for such a restriction.
A truth function merely
provides an arbitrary classification of propositions into two exclusive classes:
those which are asserted and those which are denied.
In fact the
real significance of the familiar "truth tables" introduced by Wittgenstein and Post is that they display the four possible truth functions from a pair
"
of proposi ti ons ,
",.)
fl,', ...
then each truth function
""
is the "characteristic
function" or "indicator" of a certain collection of basic propositions, namely those for which
'n..(a)
=. /
~)
and that these collections poss eaa a
number of useful properties:(1)
each collection is self-consistent, in the sense that if then '(~ CV) ~ 0;
(2)
each collection is complete, in the sense that if
9
proposition not included in the collection, then
-c,;: (q)
(3)
any pair of collections are incompatiblp., in the sense that there is always some
p
(~L~) ~ /
is any z..
0;
such that if /> belongs to one such
collection, then (./ belongs to the other. I therefore propose to study these collections and call them "atoms", as they turn out to be the ultimate irreducible units of which the propositional calculus is composed.
Wittgenstein called the basic
propositions "atomic", but this seems inappropriate since, if
i'
and "l
are any basic propositions, then r > (f>{J'l) (p V'll) My theory of inference depends on a study of collections of atoms, which I call "clusters", and I therefore proceed to introduce this theory in an informal and suggestive manner, reserving a rigorous and formal treatment for a sUbsequent paper.
229
INFERENCE WITHOUT AXIOM OR PARADOXES The Theory of "Clusters" In the light of the preceding brief survey of the main theories of
inference it is possible to present a unified theory which harmonises the different definitions and, incidentally, eliminates all logical axioms. I propose to define inference as a species of inclusion, and equivalence as mutual inference.
I define inclusion in terms of compatibility and I
define compatibility by recurrence.
Thus I establish a mapping of a
propositional calculus into a Boolean algebra. The primary elements of a propositional calculus are unanalysed propositions (the "atomic" propositions of Wittgenstein).
These basic propositions have
no structure and hence cannot be treated as if they were sets, with obvious To make the basic propositions
definitions of union and intersection.
amenable to mathematical investigation they must therefore be provided with a structure, or rather, a superstructure. We therefore begin with basic propositions
p,/, ",/,...
PI) />"...
and their negations
(The suffixes 1, 2, • • . are mere indices and have no
numerical significance.)
We are in no position as yet to define
conjunction and disjunction but we can define a complete collection of compatible basic propositions
C as
a collection
(ql' '>< or />i< / belongs
where either
Thus if the basic propositions are
A.') 'fFP:..?
/>,/ )
the complete consistent collections are ((./,p,,)) (p,,?,') 0,/) Pa, ) , It is convenient and appropriate to call these complete consistent collections "atoms" for they will form the irreducible, or irresoluable units from which we shall build our algebra. From these atoms we next construct certain "clusters". atoms which "adheres" to a basi.c proposition the atoms which include
PI) the clusters are
;'1/)
?tt (or 1>.. , ;,./
p,,/).
CPu P,) , (PI! b,./) ci«, P... /) CP/J P,.) )
(p" ",)
CPt PI"
(Pl.' ;"/)
I I
(t>,/, f?/)
PtC
(or
The "cluster" of
p" / )
consists of all
Thus, if the basic propositions are adhering to adhering to
h, /,,',
adhering to f> ... J adhering to ["~f.
GEORGE
230
TEI~PLE
These clusters form the superstructure which enables us to make the transition to a species of set theory and to Boolean algebra. We shall denote the cluster adhering to a basic proposition
P.e
~/
PK
(or
p" /)
by
), and henceforward we shall define / pI' a basic proposition to be a couple (I>KI p,. ) or (/>1 of two basic propositions, a and b, as the triad consisting of (1)
a
(2)
b
(3)
the union of the clusters A and B.
This latter is defined as the collection of atoms which include either a or b. or both a and
b,
and will be written as A V 3.
Clearly we can proceed to define propositions in general by recurrence, with the obvious definition that the negation
LS~ 5) of a proposition
is the couple consisting of (1) S'
(2)
the complement S ' of the cluster ~... )
the latter being the collection of all atoms which do not include
s.
We thus succeed in mapping all propositions into an algebra and by a Bourbakian abuse of language we shall speak of the proposition
P ,where
is the collection of atoms specifying the proposition in question. for the basic set 13 defined above we identify {(PI}P.J)
(I'/If',') />2 with the cluster
fll>. with PI P,
or
t;
/>, or
P
Thus
with the cluster t; or
{O.,
P,), ((>'-' !>,'}
and
{(Put.)]·
The crucial step is to define two propositions P and Q to be equivalent if they consist of the same collection of atoms and then we write P"'Q. with an obvious extension of our notation for disjunction, (p~)1' U(PQ~ P'Q, [>'4J') for the collections
(/>''1/»)
form the complement of
(/,',9),
((';'1).
(P',,;')
Thus,
INFERENCE WITHOUT AXIOM OR PARADOXES
231
It is an easy exercise to verify that the propositions so defined form a Boolean algebra, in which conjunction and disjunction are commutative, Moreover each proposition is double
associative and distributive. idempotent in the sense that
P?
P }
s:
'?U'P '"
P.
Moreover there is a unique null proposition ~ which contains no atoms, and a unique universal proposition
pp
(
V
which contains all atoms, such that
v.
1\
s;
It is easily verified that the conjunction of all the propositions, say ~1,aL'
...
, in an atom consists of all these propositions themselves
together with the element of the Boolean algebra which is the intersection of the clusters
A" A",...
Hence each basic proposition is expressed in
the "disjunctive normal form".
For example p, is mapped into the
collection of atoms, say A,13/,_ each of which includes
P,
Each atom
corresponds to a conjunction of propositions consistent with
f,
and the
whole collection corresponds to a disjunction of these conjunctions.
A Synoptic View of Theories of Inference The theory of clusters enables us to show that there is essentially only one possible definition of inference, although there
ar~
a number of different
ways in which it can be expressed. Any two propositions,
f
and ~
,are represented by two clusters P
each of which consists of atoms.
and ~
There is only one possible transitive
relation from p to 'I , viz. 'P , the cluster of atoms which adhere to />
is
included in Q, the cluster of atoms which adhere to 1 (or, of course, vice versa!).
Therefore we
P =
~
the implication (>
~
'i
to mean that
PQ
This definition implies that P and that .-r:(p) ~ 'l:'(I')-r{?).
=
("y
Hence our definition agrees with the definition in terms of multiple valuation, and is clearly the same as McColl's definition by inclusion. The defect of the definition of "strict implication" is that Lewis gave no definition of "incompatibility". following definitions,
To remedy this defect we propose the
GEORGE TEMPLE
232
f> '" !\ ,
P is a "contradiction", if empty c.Lus t er-s of atoms;
thus!'r
1. e. if it is represented by the
,=
(2)
p and q are "incompatible" if f>1 =- 1\;
(3)
p is a "tautology" if f-:'1-
j
PI
are incompatible;
, Le. if p is represented by the cluster of
f'~V
all atoms.
11
thus p and
Then the definition of strict implication is that P~'Y if r i:> /\. To show that these definitions are equivalent, we note that =: ;\, pl\ 'i' ::then 1'1' =: if p '" I''/p(,!Uq') f> V = then and i f 1',' :- /\ P =
p,
(p,)
u({"I' )
s:
o«: U 1\
:;-
ri
As regards the "paradoxes" of implication, it is true that, if p is any proposition, then
Il
z
Il,o
whence P
whence
and
~
V'.
Thus there is a unique null proposition, viz. the contradiction
A,
which
implies any proposition, and there is a unique tautological proposition
V'"
(f>lJ;.')
which is implied by any proposition.
right and proper.
This however is only
It seems that Lewis recoiled from these truisms
because his system did not reveal that all contradictions are "equivalent" to
A and all tautologies to
~
All the usual axioms of inference can now be established as theorems.
For
example to establish the syllogism, we express the inferences ;. ... If and
1-'" L in r »>
the forms P =
(>'1
and '1 '" 11. whence
!' '"
f>li'):- (1'1h :: f> ~ and
An Assessment of Cluster Theory The advantages of the cluster theory are (1)
It provides effective definitions of the logical connectives, and especially of implication and equivalence, and
(2)
It eliminates all the usual axioms of equivalence and inference, and the usual rules of inference.
There is a further advantage which has not been exploited in this paper. Here we have taken the basic propositions to be undefined elements, except that the collection of basic propositions is "polarised", in the sense that, if it contains an element p it also contains a complementary element ~ {
INFERENCE WITHOUT AXIOM OR PARADOXES and
(p'/ ~ P.
233
Now the cluster theory can also be developed for an
"unpolarised" basic set, in which case the concept of negation is absent and we obtain "posi ti ve" logic.
Or the cluster theory can be developed
for a "layered" basic set. in the sense that if it contains an element p it also contains elements
!' I)
P
~
cr")
I
~
p,
in which case we obtain a three valued logic. be developed in the same way.
Clearly n-valued logic can
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76 © North-Holland Publishing Company (1977)
LEOPOLD LOWENHEIM: LIFE, WORK, AND EARLY INFLUENCE Christian Thiel Philosophisches Institut RWTH Aachen, Germany
Speaking at a logic colloquium in 1976, I may presume my audience is well acquainted with the fact that any first-order theory which has a model at all has also a denumerable model. There is abundant information on this result, on the background of the equivalent t h e c-r c m of Lowenheim and on its extensions clustering into something like a theorem of Lo w e n h e i m - Skolem - Tarski - Godel - Malcev - Takeuti - Vaught etc .. But very little seems to be known about the discoverer of the basic fact. I do not mean to claim this as a singluar case in the history of modern logic. I am pretty sure that one would meet with similar problems if he tried to write the biography of Heinrich Behmann, Adolf Lindenbaum, or Emil Leon Post, even though people who have talked to or corresponded with them or who were their colleagues are still amongst us. Out of pity for the despair to Which a future historian of logic might be driven, I am going to present some material on the life of Leopold Lowenheim as well as on his writings and - though scantily - on their reception with the experts. I hasten to add that my pertinent in~uir ies are by no means finished. As, on the other hand, I do not want to bore you with already published material, I will first give a duly shortened summary of my report of 1975 1 , and then produce the outcome of some more recent investigations. Finally, I will try to scan the scene in mathematical logic mainly between Lowenheim 1915 and Skolem 1920/1922, in hope of ending up with a fair judgement on Lowenheim's influence or his non-effectiveness during the blossoming of his research and publishing activity. Throughout, I will take care to indicate what I consider the more promising directions of further historical advances. Even the first steps have proved to be entangling: the available information on Lowenheim is not only scarce, it is also partially wrong. Till very recently, the only reliable biographical source was Poggendorff's Biographisches Handworterbuch which in volume 5, covering the time between 1904 and 1922, contains the few data submitted by Lowenheim himself. 2 But this' volume appeared in 1926, and from then on one had to rcly on circumstantial evidence like the date of receipt by the JSL of Lowenheim's 1939 article Einkleidung der Mathematik in Schroderschen Relativkalkul, signed "Berlin - Lankwi tz". For subsequent years, all biographical data seemed to have simply vanished. Among the most prominent reference works, Gillispie's Dictionary of Scientific Biography does not have an entry on Lowenheim in volume 8 where it should alphabetically occur,3 nor does Meschkowski's Mathematiker-Lexikon;" likewise, the Britannica of 1963 and 1968 5 and the German encyclopedias of Brockhaus 6 and Meyer 7 skip Lowenheim. We may disregard the fact that Zischka's Allgemeines Gelehrten-Lexikon 8 ignores him, too, for it does not mention Frege, Godel, Skolem, Tarski, or Zermelo either; but the gap in Kurschner's Deutscher GelehrtenKalender 9 is certainly deplorable.
235
236
CHRISTIAN THI EL
No wonder that people came to think that Lowenheim, especially as his surname clearly indicated some Jewish descendance, must have fallen victim to the Nazis and possibly have met his death through them like Adolf Lindenbaum." So, Lowenheim's name in the index of Fraenkel and Bar-Hillel's Foundations of Set Theory of 1958 11 came to be supplemented by the data "1878 - c.1940", the authors believing Lo w e n h e i m to be among the victims of the Third Reich and taking the date of Lowenheim's 1939 article as the greatest lower bound. Apparently, it was this information that was taken over in Hunter's Metalogic of 1911. 12 Now, Fraenkel and Bar-Hillel had been cautious enough to signalize the precariousness of their second date by the prefix "circa". Others did not take such care, and so today we find the ostensibly established data "Leopold Lowenheim (1818-1940)" in the 1965, the 1966 and the 1912 edition of the Kleine En z g k l.opd d i e / Mathematik, 13 the "source" presumably being the index of Bar-Hillel and Fraenkel - the latter of whom, incidentally, had himself been erroneously reported to have died at Jerusalem in Lense's book of 1949. ~ Even in 1914, this lack of data can be seen in the "Reminiscences of Logicians" at the Summer Research Institute on Algebra and Logic at Monash University at Clayton, Australia. 15 Actually, Lowenheim survived the years of darkness and was able to resume his position in Berlin in 1946, teaching up to the 72 n d year of his life. He died in Berlin on the 5th of May, 1957. We do not know very well (but see below) how Lowenheim found his way to the algebra of logic to which he devoted all of his published articles and abstracts (in the sense of Selbstanzeige), and most of his reviews and notices. It is tru~ that his father, Dr.Louis (or Ludwig) Lowenheim had been a teacher of mathematics at the polytechnic (Gewerbeschule) at Krefeld, but it seems he did not engage in mathematical research or publications, much less in such an exotic field as algebra of logic. Moreover, in 1881, he took his wife, a writer who is said to have acquired some renown already under her maiden name Elisabeth Rohn, and their three-year old son Leopold first to Naples in Italy and then to Berlin where he lived as a private scholar (Privatgelehrter). There he prepared a comprehensive work on Democritus' influence on modern science which he hoped would gain him a teaching posi tion at the Humboldt-Uni versi tat. 16 When he died in 1894, Leopold was sixteen years old and still attended the Gymnasium. Having taken his final examinations in 1896, he matriculated in mathematics and the sciences at the Humboldt-Universitiit, where he enthusiastically studied synthetic geometry with Hermann Amandus Schwarz, and at the Technische Hochschule in Berlin~Charlottenburg where he may have been a student of Emil Lampe, to whom I will return in a moment. Lowenheim finished his studies in 1900'and began the normal career of a German Gymnasiallehrer, teaching as Oberlehrer at the Jahn-Realgymnasium in Berlin for 15 years from 1904 onwards, interrupted only by WW I when he was sent to France, Hungary and Serbia as a soldier, and then being appointed Studienrat in 1919 and (according to Johannes Teichert, v.infra) Professor soon afterwards. Only in passing will I mention his biographically most important decision to join the Anthroposophische Gesellschaft in 1924 and his marriage to Johanna Teichert, a widow to whose son Johannes Teichert we owe the bulk of biographical data on Lowenheim. I will also skip over the circumstances of Lowenheim's dismissal as "25-% non-Aryan" in 1934, his penurious life as a teacher of eurythmics and, at the anthroposophical School for Eurythmics in Berlin, an instructor in geometry, as well as the death of his wife in 1937 and the years after the war - ~ll these events have been extensively documented in my above-mentioned paper of 1975. I will, however, take the opportunity to refer to further biographical data relating to Lowenheim's research
LEOPOLD LOWENHEIM
237
to which I now turn - with a warning that I will not be able to go very deep into the subject here. I am aiming rather at an informative survey than at a modernized systematic exposition or even at an evaluation which, in my opinion, desirable as it certainly is, will not be available without further long-term and thorough study. For my purposes, I will look at Lowenheim's writings systematically rather than chronologically. I will say something on 1. the 9 published articles (1908-1946), 2. the hitherto unnoticed self-written abstracts (1913-1919), 3. the reviews and notices, most of them unnoted so far (1909-1923), only mentioning the unpublished writings listed in the bibliography, i.e.those on geometry (except the paper "Logic of simplicity or logic of plenitude? "). Let me begin with a remark on the background of the published articles. I noted above that we do not know exactly how Lowenheim came to occupy himself with algebra of logic, although we have Lowenheim's autobiographical note stating that, shortly after taking up his teaching, he "became acquainted with the calculus of logic through reviews and from Schroder's books"17. But this sufficed to involve Lowenheim so deep in the subject matter as to reorganize it, extend it in different directions, discover his "development theorem" and his wellknown results on the satisfiability of formulae in first-order logic with equality, on the decidability of its monadic fragment, and the reduction of the decision problem to first-order logic with binary predicators only. I have not succeeded in finding anyone in the mathematics department of the Humboldt-Universitat or of the Technische Hochschule Berlin-Charlottenburg between 1896 and 1900 who had shown interest, let alone had offered courses in algebra of logic or related topics (as, e.g., Frege did at Jena). Being fully aware of a certain danger of over-simplification, I dare say that in Germany, algebra of logic waS almost exclusively a matter for Studienrate. Ernst Schroder, though having had his Habilitation at the Eidgenossische Polytechnikum at Zurich, had been teaching in Gymnasien from about 1865 to 1874 when he got a call to the Technische Hochschule Darmstadt, two years later changing to the Technische Hochschule Karlsruhe where he taught for the rest of his life. Eugen MUller, the editor of the later parts of Schroder's Vorle~ungen Uber die Algebra der Logik and of the Abriss der Algebra der Logik, did his editorial work in addition to his obligations as professor at the Realgymnasium at Constance, and had pUblished two earlier treatises on the algebra of logic as a supplement to the annual report of the Grand-ducal Gymnasium at Tauberbischofsheim in 1899/1900 and 1901. Alwin Korselt, another contributor to the algebra of logic, though better known for his defense of Hilbert against Frege in their controversy on the foundations of geometry, taught exclusively at secondary schools, eventually at the Realgymnasium at Plauen in Saxony. So Lowenheim, who by the way corresponded with both Muller and Korselt and discussed his 1910 paper with them before pUblication,~ fits well into this tradition. His first article "Uber das Aufl0sungsproblem im Zogischen Klassenkalkul" appeared in the Sitzungsberichte der Berliner Mathematischen Gesellschaft, having been presented - perhaps more extensively and in slightly different form - to the members of the society on the 29th of April and on the 27th of May in 1908, under the title
"Bericht Qber die Aufl0sung von Gleichungen und die Elimination in der Algebra der Logik". ~ Lowenheim is listed as a member of the so-
ciety (which was founded in 1901)H since 1906, and I have a suspicion
238
CHRISTIAN THI EL
that the aforementioned professor Emil Lampe, who edited the Jahrbuch Uber die Fortschritte der Mathematik "in cooperation with the Berlin
Mathematical Society"21 and was one of its prominent and active members, can be held responsible not only for Lowenheim's entry into that circle but also for Lowenheim's later contributions to the Jahrbuch, these suddenly coming to an end after volume 46.1 for 1918, the year of Lampe's death. u In a certain sense, this year signals at the same time the end of Lowenheim's publishing activity, for of the two later articles of 1939/1940 and 1946, the second one is only a revised version of a much earlier paper as I will show later on. First, I will sketch the content and meaning of the published articles of Lowenheim.
Item 1, Uber das AufZosungsprobZem im Zogischen KZassenkaZkuZ (1908) takes up some investigations of Schroder and Muller on the solvability of equations in the calculus of classes. Lowenheim proves several theorems stating necessary and sufficient conditions for the solvability of particular symmetric equations. An assertion of Schroder's concerning the symmetry of a certain solution is shown to be false. Result (4a) later came to be known as the "general development theorem" ("Das Lowenheimsche allgemeine Entwicklungstheorem") and constitutes § 127 of Schroder's Abriss der AZgebra der Logik. 2 3 Maybe item 2, Uber die AufZosung von GZeiehungen im Zogisehen GebietekalkuZ (1910), also belonged to the lecture given before the Berlin Mathematical Society in 1908. "Domains" (Gebiete) are the subsets of a given fixed set M (the universe); Schroder's Abriss is presupposed to be known. Making USe of the work of earlier authors, including Jevons and Johnson, three methods for the solution of equations in the cRlculus of domains are explained. Some theorems of Korselt and Muller on disjunctive (i.e., disjoint and complementary) systems are connected together and a more general form of the development theorem is formulated and proved to hold. Theorems 14' and 14" state that with the help of the development theorem, not only formulae of the calculus, but also metatheorems on deducibility and satisfiability can be proved - a feature later generally claimed by Lowenheim as an advantage of the Peirce-Schroder system of the algebra of logic over its competitors. Item 3, Uber Transformationen im GebietekaZkUZ (1913), is the continuation of item 2 which the reader is presumed to be familiar with. The Peirce-Schroder calculus of relatives is extended to a calculus of matrices of domains, of which the basic idea is as follows. In Peirce and Schroder, a binary relative, i.e. a binary relation, is determined iff for every ordered pair of elements from the field of the relation it is determined whether the relation holds of this pair, or not. Arranging the pairs in a matrix, (al ,a2)
(a 1, a 3 )
(a2 , a2 )
(a2 , a 3)
(a 3 ,a2 )
(a3,a3)
... ...
)
a relation R will be determined by putting "1" in place of each pair of elements standing in the relation R, and putting "0" otherwise. T~e Peirce-Schroder calculus of relatives may be viewed as a technlque of calculating with square matrices of ones and zeros Lowenheim states that all formulae valid in this Peirce-Schrod~r cal-
LEOPOLD LOWENHEIM
239
culus remain valid for arbitrary domains instead of 1 and 0, since, thanks to the verification theorem the validity of an equivalence or implication in X l , ' " ,x n is e q u i v a l e n t to its coming out true under all valuations of the domains Xl, ••• 'X n by 0 and 1. L6wenheim sets about to develop a theory for his generalized calculus of matrices of domains, the calculus of domains being included as a special caSe since every system of domains al,'" ,am and every single domain b can be written as
, resp ..
and
The verification theorem is formulated so as to hold for the extended calculus. As matrices can be used to represent transformations, they are put to use for solving the inversion problem for transformations of a given normal form F =
... ,
aN ]
x r , . •.
'X
n
into another given one G
the ai and b· being the coefficients of the occurring n-tuples of simple or negat~d variables as in
[a,
b,
C,
d ]
xy
=
a xy + b xy +
C
xy + d xy
.
As a by-product of the skilful exploitation of the verification theorem and of L6wenheim's development theorem, together with his USe of disjunctive systems and reduction, a considerable shortening and simplification of proofs is attained. Finally, L6wenheim proudly demonstrates, in § 3, how many theorems from Whitehead's Memoir on the Algebra of Symbolic Logic~ can now not only be established by much shorter proofs but also be essentially extended. The most important result, according to L6wenheim himself, is a metatheorem stating that, under certain precautions in the case of cardinality statements, every theorem about transformations of arbitrary objects is va1id if only its validity can be shown for transformations of domains (p.261). L6wenheim hopes "to turn the thoughts of this treatise to rich profit in the calculus of relatives" (p.212), but postpones the harvest to a later publication; except that a generalization of a theorem proved in the paper is communicated, without proof, at the end. (1913), Potenzen im Relativkalkul und Potenzen allgemeiner endlicher Transformationen [1J, an a c q u a i n t a n c e v i t h the previous paper being assumed. This proof was given in a second article of the same year
More important than the rather special theorem is the method employed. In the last paper, L6wenheim had already interpreted certain special elementary matrices as permutations, and implicitly suggested a generalization for elementary matricesin general. Still pushing on, he now further generalizes the concept of permutation by a notation for relatives where just the places of the Ones in each column are listed, o indicating that the column does not contain any 1 at all. E.g. ,25
CHRISTIAN THIEL
240
(i
0 0
1 0
1 1 1
0 0 0 0
0
)
[ 2 4
.......-
3 ,
o ,
1 2 3 ] .
'---"
Now, this relative can be written as a generalized permutation
(~; ~ ~)
-
representing the replacement of the number 1 by the pair 2 4 , the
--
number 2 by 3, the number 3 by nothing (!), and the number 4 by the triple 1 23 A calculus for this new symbolism for relatives is then established, Lowenheim claiming that in this way the theory of permutations is being embedded in the theory of relatives and thus extended to a general theory of transformations so far unheard of in mathematics, and likely to enrich mathematics proper by a plenitude of new theorems. To support this claim, the theorem mentioned is finally interpreted as a theorem in number theory. I am inclined to asSume that Lowenheim's lecture to the Berliner Mathematische Gesellschaft on November 12th, 1912, titled "The calculus of relatives in the algebra of logic as a generalization of the theory of permutations" (Der ReZativkaZkUZ in der Algebra der Logik alB Verallgemeinerung der Permutationslehre) anticipated the considerations of [7J Another continuation of the transformation paper of 1913 is contained in Lowenheim's paper Uber eine Erweiterung des GebietekalkUls, iael.c he auch die qeunhn l i ch e Algebra umfasst [16J, published in the Archiv fUr SyBtematische PhiZosophie for 1915. The principal idea is another extension of the calculus of domains of which the ordinary calculus of domains as well as the ordinary theory of numbers (transfinite ones not necessarily excluded) and elementary algebra appear as special cases. Following Whitehead, Lowenheim considers the elementary symmetric functions of n domains a 1,··. ,an' 81 8
8
8 All 8
v
2
3
n
I
a
K
K
L
a
KfA
L KfAfll
K
a
A
a K a A all
a 1 a 2· .. an
with v > n are defined to be 0, and the ordered system
(8 1,8
8 8 n + 1' ' ' ' ) is put in correspondence with a so-called 2,,,,, n, superdomain (Ubergebiet) a = a 1 ++ a 2 -t+ ••• ++ an ("++" read "superplus"). As each domain such as a 1 corresponda to the system of elementary functions 81
8
v
a1 ' 0 for V > 1
LEOPOLD LOWENHEIM
241
Liiwenheim identifies a1 with this superdomain (a1, 0, 0, ... ), thereby (with suitable definitions of equality, addition etc.) embedding the calculus of domains in the calculus of superdomains. 26 Having proved a selection of theorems in the latter, Liiwenheim proposes to visualize the concept of superdomain by "a certain similarity" with Riemann surfaces. The other paper of 1915, llb e i- MogZiahkeiten im ReZativkaZkUZ [13J, is the one containing the famous theorem. As an English translation by Bauer-Mengelberg is accessible in van Heijenoort's Source Book, with an illuminating commentary by the editor, and is perhaps Liiwenheim's best known paper, I do not plan to analyse the proof here. However, I feel somebody should some day devote himself to this task, and also try to find out whether the law of infinite conjunction, a proof of which is missing in Liiwenheim's paper, was not considered evident by all the writers on the algebra of logic, while the necessity of a proof for it may have escaped them just because of the symbolism favoured by the Peirce-Schriider tradition Which sometimes, one might say, thrived upon extrapolations from the finite to the infinite. Theorem 1, opening § 1 of the paper, states that not every equation in the language of first-order logic with equality can be expressed in the calculus of relatives, i.e. with symbols for relations only individual variables or constants (except 0 and 1) lacking. This result of Korselt's, which was emended by Tarski in 1941, is furnished with the outline of a complete proof and supplemented by a discussion of some examples of first-order equations valid or invalid only in certain finite individual domains. For Lowenheim, this leads up to the consideration of first-order equations which are valid in every finite individual domain but not valid generally, i.e. also in infinite individual domains; Liiwenheim calls them "fleeing equations". By a rather complicated argument, he shows that for every equation of this type there are values of its predicate letters such that the equation is not valid in any denumerably infinite domain: F finitely valid II
F valid
... F 1't o - v a l i d
or, classically equivalent, F satisfiable
... F finitely satisfiable
v F
~o-satisfiable,
which is the formulation more familiar today. Liiwenheim does not pay as much attention to this theorem in itself as we all feel it deserves; his first thought is that, as an application, all independence problems concerning axiom systems for the calculus of domains can in principle be decided in a denumerable universe. Liiwenheim reports to have shown this for the axiom system of MUller, the results being ready for publication. Likewise seemingly underestimated by Liiwenheim is his next result that amounts to the decidability of the monadic fragment of firstorder logic with equality. Only the third important discovery receives Liiwenheim's wholehearted attention. This is, as the heading of § 4 announces, the "reduction of the higher calculus of relatives to the binary", which turns upon theorem 6 stating that every relative or first-order equation is equivalent to a binary one. As Liiwen-
242
CHRISTIAN THIEL
heim is convinced that "every theorem of mathematics, or culus that can be invented, can be written as a relative (p.463l, it follows that "one can decide the truth of an mathematical proposition provided one can decide whether lative equation is identicallY satisfied or not" (ibid. l. this subject to turn to L5wenheim's paper of 1919.
of any calequation" arbitrary a binary reI leave
Simply titled Geb1~etsdeterminanten [27J, it parallels the paper on powers in the calculus of relatives in being another and, moreover, the proper continuation of the paper on transformations- [5J. It is interesting to note that the articles so connected,
Solution in CD (1910) Transformations in CD (1913) Powers in CD (1913)
Determinants in CD (1919)
are precisely the writings mentioned by Lo w e n h e i m in 1940 ([36J, p . 1 l as those which aimed at rendering parts of the logical calculus smoother and more flexible, but which had passed unnoticed. As for the last-mentioned paper of 1919, this can be easily accounted for: the subject matter, determinants for systems of linear equations in the calculus of domains, must have appeared quite esoteric to potential readers, in spite of close analogies to properties of the usual determinants of linear algebra which, for their part, would hardly have excited the readers of the Mathematisehe Annalen in 1919. So it is not hard to explain how parts of the paper also went unnoticed, in which theorems on transformations are shown to be valid far beyond the calculus of domains - namely, for very general systems of functions, such as those on arbitrary domains, not necessarily everywhere defined, and possibly many-valued.
Einkleidung der Mathematik in Sehrodersehen Relativkalkul [36} is that paper which I consider was composed last among the published writings of Lowenheim; it was published in the JSL in 1940 in German. In Lowenheim's opinion, the abandonment of the Peirce-Schroder calculus in favour of the Peano-Russell system has doubtless led to tremendously important results, but it meant, at the same time, the renunciation of harmony and beauty in mathematics which had promoted a kind of certainty of instinct for topics significant and fruitful for mathematics proper. Above all, however, the Schroder calculus avoids the set-theoretic antinomies since the concept of a set of sets is not legitimate in that system. Yet, all mathematics, including set theory, can be formulated within the Schroder system or, to use Lowenheim's unintentionally amusing expression, all mathematics can be "verschrodert", i.e. "schroderized". Referring to his result about the reducibility of ternary and higher relatives to binary ones, mathematice appears to Lowenheim like a building with three levels, level 1 comprised of quantifier-free or first-order propositions, level 2 consisting of all propositions with at least one quantifier over unary relatives, i.e. monadic predicates, and level 3 embracing the remaining propositions with quantifiers over binary relatives. Russell's logicisation of mathematics is in principle accepted; the problem is how to transpose it into the Schroder system or rather, into the modified and simplified system of Lowenheim. The question as to sete of sets is considered answered by 1-1-correlating with each
LEOpoLD LOWENHEIM
243
set M which is an element of another set M, an element m "representing" M, and studying, instead of the set M, the set M' of the representa+ives of the elements of M. For this purpose, a relative E is introduced such that M is represented by miff g(E i m = Mi)' Writing this as 1\ x.E(x,m)+-+xE: M., the affinity to the axiom of subsets or Aussonderung is obvious. Of course, not every subset of the universe has a representative (otherwise we would end up with Russell's antinomy). So, dealing with a particular domain of problems, one has to determine a suitable set of all representatives needed. It matters much to Lowenheim that in introducing his representatives of sets, his purpose was neither the axiomatization of set theory nor the es~ape from its antinomies, but solely the logicisation of mathematics by "s c h r o d e r i z a t i o n !". ~s examples of how this task could be performed, Lowenheim treats Cantor's diagonal procedure, some simple calculi like that of domains and of relatives, and a more sophisticated type of calculus. As Cantor's set theory happily survives in Schroder's system, "one need not let oneself be expelled from the paradise of Cantorian set theory by intuitionism", and intuitionism as a universal demand seems to Lowenheim to be a useless restriction. On the other hand, the concept of intuitionistic provability has also i's legitimate place in the Schroder calculus, and, as expressed by Lowenheim's beautiful formulation, "we need not let ourselves be expelled from the paradise of intuitionism either" (p.2). Lowenheim's last published paper appeared in 1946 in an English translation by w.V.Quine with the title On making indirect proofs direct [37J. As Quine says in his introduction, i L is a contribution to "elementary practical points of mathematical methodology" (p.125). Lowenheim discusses several types of proof by reductio ad absurdum and claims that every such indirect proof can be turned into a direct one by either (1) the method of complete contraposition, (2) the method of incomplete contraposition, or (3) the method of generalization. Actual proofs from mathematical texts are taken as examples. The manuscript of this paper found its way to the editors of Scripta Mathematica after quite an odyssey. Documenting this, there are two letters of Lowenheim to Bernays and one letter in the other direction which Professor Bern~ys was kind enough to let me read and quote. In his letter of May 25th, 1937, Lowenheim agrees ~o have his "treatise on indirect proof" published, and adds a word on its origin: "I lectured before the B~rlin Mathematical Society on November 28th 1917, and shortly afterwards the treatise was cast in its final form by adding the critical discussion on Hessenberg, a matter pointed out to me after the lecture."27 With his letter o'f March 15th, 1939, Bernays returns the manuscript which he had recommended for publication to the Mathematische Annalen without success; he finds it "very satisfying t'at Mr.Tarski will bring the treatise out". I assume that Lowenheim had himself asked for the manuscript back in order to send it to Tarski for Fundamenta Mathematicae, which would fit well with an autobiographical remark of Lowenheim's, saying: "Shortly before the Second World War, I published a paper on logical calculus in an American journal. Of another one destined for Warsaw, I had even read the proofs; whether the war has prevented its publication, I do not know." In April 1946, Professor Ginsburg told Professor Fraenkel that he had received "a German manuscript by Lowenheim"u some years ago, but could not find a translator. Fraenkel, who was to lecture at Harvard soon afterwards, offered to talk the matter over with Quine. In a letter
244
CHRISTIAN THIEL
of August 7th, 1965, Profe~sor Quine remembers that he got the manuscript from Fraenkel "with the request that I find some student of minG who might translate it. Knowing of no such student, I did it myself." So much ings, I new. Up or with
for the published treatises. On the rest of Lowenheim's writmust of necessity be briefer. There is, however, something to now, nobody seems to have met with either the abstracts the short reviews contributed by Lowenheim to the Jahrbuch Uber die Fortschri&te der Mathematik which I mentioned before. T would not call their discovery sensational in any respect, all the more as their length, or better shortness, ranges from 11 to 23 lines only, but they certainly enable us to form a considerably better picture of Lowenheim's fields of interest and competence. From the abstracts I have drawn information on Lowenheim's particular ranking of his own diverse results and, indirectly, on the self-appreciation of his work in general. The paper of 1908 on the solvability problem had been pUblicized by a substanceless seven-line review by Lampe, and the related 1910 article by twelve lines from the same reviewer, in the Jahrbuch. But there are abstracts written by Lowenheim on the transformation paper [5] of 1913 as well as of its continuations [7J of 1913 and ~7J of 1919, the latter being somehow misplaced in the volume for 1916 to 1918. Equally due, I presume, to the circumstances of the time is the appearance of two abstracts of the famous 1915 paper on possibilities in the calculus of relatives, both written by Lowenheim, one prematurely in the Jahrbuch volume for 1913 (published, however, in 1918), and a somewhat longer second one in the volume for 1914-15. Among Lowenheim's reviews and notices of works and papers by other authors, those in the Archiv der Mathematik und Physik were known before and should be separated from the necessarily shorter ones in the Jahrbuch. The reviews in the Archiv on Parts I and II of Schroder's Abriss der Algebra der Logik are very carefully and thoroughly written; Lowenheim's remarks on the Frege-Hilbert controversy over the axiomatical approach show his sense for philosophically important key subjects in mathematical and logical research. (Of course, they also mirror Lowenheim's simultaneous correspondence with Frege to be talked ab~ut in a minute.) Essentially, Lowenheim welcomes the Abriss as making a good advance, but he finds fault with the omission of Schroder's clear separation of the calculus of domains from the calculus of propositions which leads to a faulty conception of disjunction and negation depending on a theory of "validity points" for propositions in time. Secondly, Schroder's axiomatics for the calculus of domains is shown to suffer from a serious weakness, sum and product being defined only for two domains at a time. Thus Schroder is "unable to reach infinite products and sums without a f a L'l.a c y " (p.72). Even if the definitions are expanded, "the rules of calculation for infinite sums of propositions have to be proved", and this must be done, since "already in the general calculus of domains the extension of axioms and proofs to infinite products and sums is absolutely necessary, otherwise falling short of the most important applications" (ibid.). Spicy annotations, in face of the modern critique of Lowenheim's later proof of his famous theorem. H The third review in the Archiv is on Padoa's "La logique dfiductive dans sa derniere phase de developpement", where Some interesting remarks are made on the importance of a perspicuous notation in logic and mathematics and On the comparison of Peano's symbolism with that of the Schroder school (with which, of course, Lowenheim sides).
LEOPOLD LOWENHEIM
245
The twenty short reviews and notices in the Jahrbuch are mainly concerned with works on logic and foundations and their history, the only real exception being J.Pfluger's book on the beauty of simple geometrical figures (Die Formenschonheit einfacher geometrisaher Gebilde), showing an early interest on L6wenheim's side for related questions. Of the authors reviewed, I mention only Burali-Forti, Hugo Dingler, Philip E.B. Jourdain, Norbert Wiener, and Henry M.Sheffer. A look at the unnoticed writings is a suitable starting-point for my concluding reflection on Lowenheim's recognition or neglect by his contemporaries. A first conclusion can perhaps be drawn from the fact that Lowenheim's papers, it seems to me, had to be reviewed by himself in the Jahrbuch as nobody else was willing or even able to do it - in contrast to other journals, abstracts written by the author are not very numerous in the Jahrbuch. Reviews by others came remarkably late: only the 1940 paper on schr6derization got a review in the Jahrbuch by Karl Schr6ter, and a rather saucy one; but then Curry's review of the same paper in the Mathematical Reviews, the successful competitor to the Jahrbuch since 1940, is, though more Objective, not very favourable either. The same can be said of the four lines devoted by Heyting, in the same review journal, to Lowenheim's paper on indirect proof in 1947. Admittedly, the two last papers do not in fact .count among his strongest; but then the earlier ones should have enjoyed a louder echo while at first receiving almost none. Prior to Skolem's paper of 1920, only Eugen Muller quoted L6wenheim's development theorem in the second part of Schr6der's Abriss in 1910. No reference to L6wenheim or his theorem can be found between 1915 and 1922 in the Mathematische Annalen (except in L6wenheim's oWn paper of 1919), in the Jahresberichte der Deutsahen Mathematiker-Vereinigung, in the Mathematische Zeitschrift and of course not in Crelle's Journal which did not publish a single paper on foundations or set theory within those years. The first edition of Fraenkel's Einleitung in die Mengenlehre in 1919 does not mention Lowenheim at all, nor does the second edition of 1923 which has a footnote on p.187 that "Skolem's weighty lecture became accessible to the author only in the course of the printing of the book". No earlier than in Zehn Vorlesungen tiber die Grundlegung der Mengenlehre of 1927 does L6wenheim's name appear in any writing of Fraenkel. There is still some'unclearness regarding C.I..Lewis' Survey of Symbolic Logia (1918, abridged second edition 1966), the bibliography of which
lists all five papers of L6wenheim up to 1915, except the one on possibilities in the calculus of relatives - four of them decorated with an asterisk as "most important contributions to symbolic logic" (p. 307; see p.315 of the 1966 edition); yet none of them, nor even L6wenheim's name, seems fo appear in the text. The breakthrough of the theorem, if not of its discoverer, seems to have come more than five years after Skolem's continuation of L6wenheim's research. Skolem's paper on the axioms of the calculus of classes 30 published in 1919 but signed May 1917, shows that Skolem had read L6wenheim's paper of 1915 well and refers to L6wenheim's result on the nonexistence of fleeing equations in the monadic predicate calculus as following from a more general theorem proved in this new paper of Skolem who, as we know, worked quite in the Schroder tradition and even with the notation of that school. I take it as an indication of the neglect of L6wenheim personally that Skolem's papers were not reviewed by L6wenheim in the Jahrbuah - whatever the reason for his disappearance from the staff of contributors may have been.
246
CHRISTIAN THIEL
A last question remains: Why did Lowenheim withdraw from further research on the algebra of logic and, as his unpublished writings prove, shift to comprehensive but rather elementary expositions of special fields of geometry? Did Lo ve n h e i ra resign from logical research in bitterness? Let me first state that his oft-mentioned isolation was rather an isolation of the whole Peirce-Schroder tradition. Schroder had died in 1902, Peirce in 1914, Mliller lived to 1935, and Korselt, after his retirement in 1924, to 1947, but neither of them to my knowledge published anything on the calculus of logic after 1910. And Skolem, as the only and, geographically at least, isolated researcher in the movement, could not singly change the course of history. On the contrary, he was the one to meet with open hostility from Zermelo when the latter in 1931 made a fierce assault on "skolemism, the doctrine that every mathematical theory, the theory of sets not exempted, is realizable in a denumerable model", and severely criticized the "very odd consequences" of what he baptized "the finitist prejudice".n Personally, I would not view Lowenheim as so isolated as he is sometimes pictured. We must not forget that he loosened his contacts with logicians by turning to other fields of intellectual life as a consequence of what he called his spiritual turn. As I mentioned before, he had been in contact with Mliller, Korselt, and Zermelo during the blooming of his logical research, he received visits by Bernays before 1931 and by Scholz and Tarski about 1936, and he was still interested in publishing then. Most fascinating is his exchange of 20 letters with Frege in which he is reported by Bachmann and Scholz to have convinced Frege that an uncontestable foundation of formal arithmetic was possible. The correspondence must be considered completely lost, our knowledge being restricted to the general topic, the dates of most of the letters (1908-1910), a remark on this correspondence in Lowenheim's first letter to Bernays, and some circumstantial evidence to be found in a volume containing what is left of Frege's collected scientific correspondence.~ Did Lowenheim have unpublished manuscripts He did. Many of them were on geometry, one reason for his work in this field being Rudolf Steiner's inclusion of synthetic geometry in the mathematics curriculum for Waldorf schools. The mathematics teachers of the Waldorf mother school at Stuttgart asked Lowenheim to elaborate parts of synthetic geometry for teaching purposes as he was known to have been very proficient in synthetic geometry since the days of his study with H.A.Schwarz. In the bombing of Berlin on August 23rd, 1943, Lowenheim's flat at Berlin-Lankwltz was completely destroyed. Among other things, nearly 1200 geometrical drawings and models and two book cases with many unpublished scientific and literary manuscripts were burnt to ashes. We do not even know exactly what these manuscripts were about. It would be a rash conclusion to interpret Lowenheim's tiny remark of 1940 - "as long as I had time for 10gistics"33to mean that he had given up his logical work altogether; it may just have meant that he wanted to, or had to, devote the greater part of his time to other things. In 1915,1939 and 1940, Lo w e n h e i.m indicated that he still kept unpublished manuscripts on logic. And in autobiographical notes, he does not conceal a justifiable pride in his logical work. Let me come to an end with two quotations from these reflections which are kindly given to me in letters from Johannes Teichert: "I have a lively interest in logic and have pointed out new paths for science in the field of the calculus of logic which had been founded
LEOPOLD LOWENHEIM
247
by Leibniz but which had come to a deadlock. ( ... ) On an outing, a somewhat grotes~ue landscape stimulated my fantasy and I had an insight that the thoughts I had already developed in the calculus of domains might lead me to make a breach in the calculus of relatives. Now I could find no rest till the idea was completely proved, this giving me still a host of troubles. (oo.) This breach has been scarcely noticed, but all the more some other breaches which I made through my treatise 'On possibilities in the calculus of relatives' were noticed. This became the foundation of the modern calculus of relati v e s , But just in this paper I did not take much pride, for the point had only been to ask the right ~uestions while the proofs could be found easily without ingenuity and imagination. As I was then the only one working seriously in the field, I could gain a respected name among experts in a rather cheap way. My first paper on the calculusof relatives which did not find much attention had cost me much more intellectual exertion. The later works have remained unpublished and, with the eiception of a single one, fallen victim to the ~lames." In a letter to his step-son during the last war, he says: "The loss of the entire calculus of logic is the most painful thing for me." Thus I gather he was still serious about his work in logic. As there is no reasonable hope of enriching our knowledge of Lowenheim's writings extensionally by the discovery of hitherto unknown manuscripts or published writings, historians of modern logic interested in Lowenheim would be well advised to concentrate on clarifying and penetrating his available writings, connecting his ideas up with earlier and contemporary work, and finding the key concepts and methods in Lowenheim which were to become pregnant for the development of present-day lOgic and foundational research. There will be plenty of work to do. NOTES
2
3 4 5 6 7 8 9
Christian Thiel, Leben und Werk Leopold Lowenheims (1878-1957). Teil I: Biographisches und Bibliographisches. Jahresberiohte der Deutsohen Mathematiker-Vereinigung (subse~uently abbreviated "JDMV") 77 (1975) 1-9. On p.5, line 13, for "Reylgymnasium" read "Realgymnasium"; on p.7, line 16, for "1909" read "1896". CLalso note 16 below. Poggendorff's Bipgraphisch-literarisches Handworterbuch fUr Mathematik, Astronomie, Physik, Chemie und verwandte Wissenschaftsgeb i e t e , Band V: 1904-1922. II.Abt.: L-Z. Berlin 1926, p.759. The latest edition (under the title "Poggendorffs Biographisch-literarisches Handworterbuch der exakten Naturwissenschaften") , covering the years from 1932 to 1953, does not mention Lowenheim any longer (cf. Bd.Vlla, Teil 3: L-R: Berlin 1959). Charles Coulston Gillispie (ed.), Dictionary of Scientific Biography, vol. VIII (Lane-Macqueur), New York 1973. Herbert Meschkowski, Mathematiker-Lexikon, Mannheim/ZUrich 1964, 2.erw.Aufl. Mannheim/Wien/ZUrich 1973. Encyclopaedia Britannica, 1963 edition and 1968 edition. Brockhaus Enzyklopadie, Wiesbaden 1966ff; Elfter Band (L-Mah)1970. Meyers Enzyklopadisches Lexikon, Mannheim/Wien/ZUrich 1971ff; Bd. 15 (Let-Meh) 1975. Gert A. Zischka, Allgemeines Gelehrten-Lexikon. Biographisches Handworterbuch zur Geschichte der Wissenschaften, Stuttgart 1961. KUrschners Deutscher Gelehrten-Kalender. Erster Jg., 1925. Berlin/ Leipzig 1925. - 2.Jg. (auf das Jahr 1926), Berlin/Leipzig 1926. 3.Ausgabe 1928/29. - 4.Ausgabe 1931. - 5.Ausgabe 1935. - 6.Ausgabe 2.Band, Berlin 1941. - 7.Ausgabe Berlin 1950. - 8.Ausgabe Berlin
248
10
CHRISTIAN THIEL 1954. - 9.Ausgabe Berlin 1961. - 10.Ausgabe Berlin 1966 (Lowenheim not mentioned in the necrology of the two last-mentioned editions). Cf. J.H.Woodger's preface to Alfred Tarski: Logic, Semantics, Metamathematics. Papers from 1923 to 1938 (Oxford 1956), p i x . According to Abraham A. Fraenkel, Lebenskreise. Aus den Erinnerungen eines judischen Mathematikers, Stuttgart 1967, p.205, Lindenbaum lived from 1904 to 1941, according to Meschkowski (op.cit.) from 1905 to 1942. A.Fraenkel and Y.Bar-Hillel, Foundations of Set Theory, Amsterdam 1958, p.409. Geoffrey Hunter, Metalogic. An Introduction to the Metatheory of Standard First Order Logic. London etc. 1971, p.189. Kleine En z y k Lo p a d i e / Mathematik, hrsg. v. W.Gellert, H.Klistner, M.Hellwich und H.Kastner, Leipzig und Basel 1965, Zurich/Wien 1966, and Frankfurt a.M./Zurich 1972, p.785. Josef Lense, Vom Wesen der Mathematik und ihren Grundlagen, Munchen 1949, p.67. Reminiscences of Logicians, edited by J.N.Crossley, in J.N.Crossley (ed.), Algebra and Logic. Papers from the 1974 Summer Research Institute of the Australian Mathematical Society, Monash University, Australia, Berlin/Heidelberg/New York 1975 (Lecture Notes in Mathematics no.450), p.28. Louis L6wenheim, Die Wissenschaft Demokrits und ihr EinfluB auf die moderne Naturwissenschaft, hrsg. v. Leopold L6wenheim, Berlin 1914. I have not yet been able to find a copy of this book, but a 48 page paper (ending on page 48 in the middle of a sentence, the text presumably being nothing else than page 1 to 48 of the book) appeared, under the same title, as "Beilage zu Heft 4 des Archivs fur Geschichte der Philosophie, Band XXVI" in 1913. The title "Uber den EinfluB Demokrits auf Galilei", quoted on p.5 of my 1975 paper, is incorrect, this being the title of a paper by Louis L6wenheim published in Archiv fur Geschichte der Philosophie I (1894) 230-268. Copies of autobiographical notes by Leopold L6wenheim, including all passages quoted in this paper, were kindly provided by Mr. Johannes Teichert, with a letter of 14 September, 1976. The notes are undated, but all written after 1945. Cf.[3J, p.171. cr . JDMV 17 (1908) 2.Abt., p.62. Cf. JDMV 11 (1902) p.71. I t seems L6wenheim lectured before the Society for the first time on October 30, 1907, this lecture (titled "Einfuhrung in den logischen K'I a s s e n k a Lk ii L'"] being the only one preceding his lecture of 1908. Cf.the title pages, e.g: of the volume quoted in note 19. The Jahrbuch uber die Fortschritte der Mathematik was the immediate predecessor of the Mathematical Reviews and the zentralblatt fur Mathematik and played a comparable role. Emil Lampe died on September 4th, 1918. See Arthur Korn's obituary of E.L. in JDMV.!2 (1914-1915, pub1.1922) pp.I-VII. Lo w e n h e i m is still listed as a member of the Berlin Mathematical Society in 1918, but the publication of the Archiv der Mathematik und Physik, to which the Proceedings of the Berlin Mathematical Society were appended, had to be discontinued in 1920. :n Pt.2, pp.756f in the 1966 joint reprint of Vorlesungen andAbriss. Special cases of the development theorem were mentioned on p.746f, op.cit .. Lowenheim [lJ reports in 1908 that Zermelo claimed to have known the general theorem "for a long time", but it seems it had never been published before. A.N.Whit~head, Memoir on the Algebra of Symbolic Logic, American Journal of Mathematics ~ (1901) 139-165 and 297-316. i
11 12 13 14 15
16
17
18 19 20
21
22
23
24
LEOPOLD LOWENHEIM 25 26
27
28 29
30
31 32
33
249
In column 4 of the full matrix of the original paper, p.65, the of row 2 and the 1 of row 4 should be interchanged. Dr.Albert C. Lewis has pointed out to me that a comparison of Lowenheim's construction of superdomains with some ideas of Hermann Grassmann might be promising. I would like to take this opportunity to state my indebtedness and express my gratitude to Dr. Lewis for many helpful and stimulating discussions of the present paper as well as for a large number of more appropriate formulations suggested by Dr. Lewis for ~he final version of the text. Whereas the lecture of November 28, 1917, is documented by the Sitzungsberiehte (151st session), I am puzzled by the reports of the Berlin Mathematical Society in the JDMV where two lectures by Lowenheim under the title "tiber die Yerwandlung eines indirekten' Beweises in einen direkten" are listed: beside that on November 28th, 1917, an earlier one in a doubtful session on July 27, 1917, undiscoverable in the Sitzungsberiohte where June 27, ·1917, is given as the date of the 149th session and October 31, 1917, as the date of the 150th session. The only explanation I can think of is an error of the JDMV concerning the month, Lowenheim's lecture possibly having been scheduled for the session of June 27, 1917, but finally being postponed because of Lowenheim's absence (who at that time served in the army, might have announced his coming, and then been unable to appear). Professor Fraenkel in a letter to the present author of 20 August 1965. See, however, Hao Wang's "digression on Lowenheim 1915" in A Survey of Skolem's Work in Logio, in Th.Skolem, Selected Works in Logic, ed.by J.E.Fenstad, Oslo-Bergen-Tromso 1970, pp.17-52 (the digression on pp.27-29). Untersuchungen tiber die Axiome des Klassenkalklils und tiber Produktations- und Summationsprobleme, welche gewisse Klassen von Aussagen betreffen. Reprinted in Th.Skolem, Selected WorkS in Logic, op.cit. in n.29, pp.67-101. Ernst Zermelo, tiber Stufen der Quantifikation und die Logik des Unendlichen, JDMV 41 (1932) 2.Abt., pp.85-88, quotations from p.85. -Gottlob Frege, wissenschaftlicher Briefwechsel. Herausgegeben, bearbeitet, eingeleitet und mit Anmerkungen versehen von Gottfried Gabriel, Hans Hermes, Friedrich Kambartel, Christian Thiel, Albert ·Yeraart. Ijamburg 1976. (XXIX. Frege - Lo v e n h e i.m on pp.157161. ) [36], p.7.
o
BIBLIOGRAPHY OF LEOPOLD LOWENHEIM
2 3 4
5
tiber das Auflosungsproblem im logischen Klassenkalkul. Sitzungsberichte der Berliner Mathematischen Gesellsohaft 7 (1908) 89-94.
Review of Ernst Schroder, Abriss der Algebra der L~gik, Erster Teil [Elementarlehre. Leipzig 1909J. Arohiv der Mathematik und Physik, III.Reihe, 15 (1909) 194-195. tiber die Auflosung VOn Gleichungen im logischen Gebietekalkul. Mathematische Annalen 68 (1910) 169-207. Rezension von Ernst Schroder, Abriss der Algebra der Logik, Zweiter Teil [Aussagentheorie, Funktionen, 'Gleichungen lind Ungleichungen. Leipzig 1910J. Archiv der Mathematik und Physik, III. Reihe, 17 (1911) 71-73. tiber Tr~sformationen im Gebietekalklil. Mathematische Annalen 11 (1913) 245-272.
250
6 1 8 9
10
11 12
13
14 15 16 17
18
19
20
21
22 23
CHRISTIAN THIEL Abstract of [5], Jahrbuch uber die Fortschritte der Mathematik 44 (1913, publ. 1918) 11. Potenzen im Relativkalkul und Potenzen allgemeiner endlicher Transformationen. Sitzungsberichte der Berliner Mathematischen Gesellschaft 12 (1~13) 65-71. Abstract of [7T, Jahrbuch uber die Fortschritte der Mathematik 44 (1913, publ.1918) 17. Review of Alessandro Padoa, La logique deductive dans sa dernie~ re phase de developpement. Avec une preface de Giuseppe Peano rParis 1912], Archiv der Mathematik und Physik, III.Reihe, £1 (1913) 360-361. Review of Alfonso Del Re, Sulla indipendenza dei postulati dell' Algebra della Logica[Rendiconti dell'Accademia delle Scienze Fisiche e Matematiche, Napoli, (3) 11 (1911) 450-458], Jahrbuch uber die Fortschritte der Mathematik 42 (1911, publ.1914) 19. Review of E.Stamm, Beitrag zur Algebra-der Logik [Monatshefte fur Mathematik und Physik 22 (1911) 131-149J, Jahrbuch uber die Fortschritte der Mathematik 42 (1911) 19. Review of Balakram, The Gene~l Equation in the Algebra of Logic [Journal of the Indian Mathematical Society 2 (1911) 213-218], Jahrbuch uber die Fortschritte der Mathematik 42 (1911, publ. 1914) 80. -tiber M6glichkeiten im RelativkalkUI. Mathematische Annalen 76 (1915) 441-410. English translation by Stefan Bauer-Mengelberg: On possibilities in the calculus of relatives, with introduction by the editor, in Jean van Heijenoort (ed.), From Frege to G6del. A Source Book in Mathematical Logic, 1879-1931, Cambridge, Mass. 1967, pp.228-251. (First) Abstract of [13]. Jahrbuch uber die Fortschritte der Mathematik 44 (1913, publ.1918) 78. (Second) Abstract of [13], Jahrbuch uber die Fortschritte der Mathematik 45 (1914-15, publ.1922) 108-109. tiber eine Erweiterung des Gebietekalkuls, welche auch die gew6hnliche Algebra umfasst. Archiv fur systematische Philosophie ~ (1915) 131-148. Review of P.E.B.Jourdain, The development of the theories of mathematical logic and the principles of mathematics [QuarterlY Journal of Pure and Applied Mathematics ~ (1912) 219-314J, Jahrbuch uber die Fortschritte der Mathematik ~ (1912, publ.1915) 97. Review of Alessandro Padoa, Frequenza, Previsione, Probabilita [Atti della Reale Accademia delle Scienze di Torino ~ (1912) 878-886], Jahrbuch uber die Fortschritte der Mathematik ~ (1912, publ.1915) 98. Review of Cesare Burali-~orti, Gli enti astratti definiti come enti relativi ad un campo di nozioni [Atti della Reale Accademia dei Lincei (Roma)(5) £12 (1912) 677-682J, Jahrbuch uber die Fortschritte der Mathematik 43 (1912, publ.1915) 98-99. Joint review of W.H.Young, On a proof of a theorem on overlapping intervals [Messenger (2) .!±£. (1912-13) 113-118J and Arthur Schoenflies, Entgegnung [Messenger (2) .!±£. (1912-13) 119-121], Jahrbuch uber die Fortschritte der Mathematik 43 (1912, publ.1915) 112. Review of Theophil Henry Hildebrandt,~ contribution to the foundations of Frechet's Calcul fonctionnel [American Journal of Mathematics ~ (1912) 237-290], Jahrbuch Uber die Fortschritte der Mathematik 43 (1912, publ.1915) 113. Notice of G~e Vries, Calculus Rationum [Amst.Ak.Versl. 20 (1912) 1057-1073 and £1 (1912) 1183-1199], Jahrbuch Uber die Fortschritte der Mathematik 43 (1912, publ.1915) 227. Review of. Eugenio Maccaferri, Le Definizioni per Astrazione e la Classe di Russell [Rendiconti del Circolo Matematico di Palermo
LEOPOLD LOWENHEIM
251
12 24
25
26
27 28 29
30
31 32 33
35
36 37 38
39 40 41 42 43
(1913) 165-171J, Jahrbuch fiber die Fortschritte der Mathematik 44 (1913, publ.1918) 75.
Review of P.E.B.Jourdain, The development of the theories of mathematical logic and the principles of mathematics [Quarterly Journal of Pure and Applied Mathematics ~ (1913) 113-128J, Jahrbuch fiber die Fortschritte der Mathematik ~ (1913, publ.1918) 75-76. Review of Henry M. Sheffer, A set of five independent postulates for Boolean algebras, with applications to logical constants [AMS Trans. II (1913) 481-488], Jahrbuah fiber die Fortschritte der Mathematik ~ (1913, publ.1918) 76. Review of P.A.MacMahon, The indices of permutations and the derivation therefrom of functions of a single variable associated with the permutations of any assemblage of objects [American Journal of Mathematics 12 (1913) 281-322J, Jahrbuah fiber die Fortschritte der Mathematik 44 (1913, publ.1918) 76-77. Gebietsdeterminanten. Mathematische AnnaZen 79 (1919) 223-236. Abstract of [27], Jahrbuah fiber die Fortsahritte der Mathematik 46.1 (1916-18, publ.1923!24) 93. Review of Hugo Dingler, Das Prinzip der logischen Unabhangigkeit in der Mathematik, zugleich als Einfuhrung in die Axiomatik [Munchen 1915], Jahrbuch fiber die Fortschritte der Mathematik ~ (1914-15, publ.1922) 100-101. Review of Cesare Burali-Forti, I numeri reali definiti come operatori per le grandezze [Atti della Reale Accademia dei Lincei (Romal 24 1 (1915) 489-496L Jahrbuo n fiber die Por t e ch r i bt:e de» Mathematik 45 (1914-15, publ.1922) 100. Review of-cino Poli, Paradossi logici [Atti della Reale Accademia dei Lincei (Roma) ~2 (1914) 62-65], Jahrbuch fiber die Fortsahritte de r Mathematik 45 (1914-15, publ.1922) 107. Review of J.Pfluge~ Die Formenschonheit einfacher geometrischer Gebilde [Stuttgart 1915], Jahrbuch fiber die Fortschritte der Mathematik 45 (1914-15, publ.1922) 118. Notice of~orbert Wiener, Certain Formal Invariances in Boolean Algebras [AMS Trans. ~ (1917) 65-72J Jahrbuch fiber die Fortsahritte der Mathematik ~ (1916-18, publ.1923!24) 92. Notice of Percy John Daniell, The Modular Difference of Classes [BUlletin AMS gj (1917) 446-450] Jahrbuch fiber die Fortschritte de r Mathematik 46.1 (1916-18, publ.1923!24) 92. Notice of Lagne~Logique des propositions [Atti della Reale Accademia delle Scienze di Torino 21 (1918) 428-444] Jahrbuch fiber die Fortschritte der Mathematik 46.1 (1916-18, publ.1923!24) 92. ---Einkleidung der Mathematik in Schr6derschen Relativkalkul. Journa l: of SymboZia Logic 5 (1940) 1-15. On making indirect proofs direct. Translated from the German manuscript by w.V.Quine. Scripta Mathematica ~ no.2 (1946) 125-139. Die logarithmische Spirale. Eine ausfuhrliche geometrische Darstellung ihres Wesens und ihrer GesetzmaBigkeiten. Mimeographed (Vol.VI of the series "Geometrie als eine Bildersprache" edited by Johannes Teichert), Berlin-Charlottenburg 1957. 169 pp. with a separate Figuren-Mappe consisting of 145 plates with 204 fig .. FuBpunktkurven und Spiegelkurven. Unp ub l i e he d MS., 30 pp., 38 fig. Allgemeine Satze uber Zykloidalen und Trochoidalen. UnpubZished MS., 99 pp., 101 fig .. Rosenkurven und ihre Conchoiden (Eurythmiewellen). UnpubZished MS., 67 pp., 65 fig .. Pascalsche Schnecken. UnpubZished MS., 154 pp., 176 fig .. Jacob-Steiner-Kurven und Astroiden. UnpubZished MS., 141 pp., 210 fig ..
252 44 45
CHRISTIAN THIEL Die fiinf regelmiiBigen (Platonischen) Kiirper. UnpubZished MS., 206 pp., 470 fig •. Einfachheitslogik oder Reichtumslogik ? Unpublished MS., 29 pp. of the first version; some pages of an (incomplete) revised ver-
sion.
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76 © North-Holland Publishing' Company (1977)
RELATIVELY HOMOGENEOUS STRUCTURES David M. Clark
and
Peter H. Krauss
State University of New York New Paltz, New York 12561
~ -categoricity is a model theoretic notion. An elementary (that is firstorder) tgeory is called !fo-categorical if its denumerable (that is countably infinite) models are all isomorphic. A structure is called ~o-categorical if its elementary theory is ~o-categorical. The following question arises immediately: What morphisms are available to establish isomorphisms between the denumerable models of Tio-categorical elementary theories which are not axiomatizable (Henson [2]). Nobody claims that the >io-categoricity of such theories is interesting to the algebraist. On the other hand, many applied categoricity results ca~ be instantly augmented by an explicit axiomatization of the theory under consideration. In fact, such an axiomatization usually can be extracted from the argument establishing Ho-categoricity. Once this has been accomplished, an T. This is the clue to H o-categoricity.
e
Ryll-Nardzewski's Theorem Let be a complete elementary theory. Then @ is if and only if for every n be a complete elementary theory.
is Ho-categorical. For every 07.,~ E Mode,
fj)
property. (iii) For every Ol,CGfinite extension property.
Eo
EL(02,~)
~o-categoricity
Then the following are
has the finite extension
ModG there exists a
Ki:oLM( 01,£1-)
with the
Proof: (i) implies (ii) by Ryll-Nardzewski's Theorem and (iii) implies (i) by Lemma 1 .1. Anybody who studies applied T{o-categoricity results will notice that these investigations usually come in two parts. In a first part Ryll-Nardzewski's Theorem is used to earmark certain theories as candidates for Ho-categoricity. And then the second part consists of showing that these theories are in fact H 0categoricaL Now frequently this is accomplished by an algebraic construction of isomorphisms between the denumerable models of the theory. This is where Theorem 1.2 comes into play. It turns out that condition (ii) of this theorem in most cases is just as unwieldy for applications as Ryll-Nardzewski's Theorem because usually it is quite difficult to determine the family of all elementary local monomorphisms between two structures. However, frequently it:fs possible to purely algebraically construct families of local monomorphisms between the models of a theory and then to show that these families have the finite extension property. Now an application of condition (iii) of Theorem 1.2 yields a purely algebraic proof of ~o-categoricity. In this study we shall first develop a uniform algebraic method of constructing families of local monomorphisms with the finite extension property and then we shall verify that this method can be used to establish several major applied H ocategoricity results known from the literature (namely linear orderings, Boolean algebras, abelian groups and rings with identity and no non-zero nilpotent elements). To motivate our approach we shall begin with a few remarks on the notion of an elementary local monomorphism for the benefit of the algebraist. Let us consider an elementary local monomorphism f, where (01. ,x)x~.x
== (t;. ,f(x»xcx
The problem with this notion is that it does not just mean that f is a local monomorphism from 01. into;(;., but that it entails an extremely complex description of how the substructures of 07. and;{;- generated by X and f'(X) "sit inside" rJL and ;(Jrespectively. In particular, to explore the "environment" of these substructures inside en and ;{;- as it is described by an elementary local monomorphism requires an unbounded search on both sides. It appears that algebraists are only willing to conduct a bounded search into the environment of a morphism. This motivates the next definition which is due to Fraisse. For any two structures rJL and JJ-. we first define
if either 01. and iff. have no distinguished elements (that is elements denoted by individual constants) or the substructures of Ul. and ~ generated by their distinguished elements respectively (that is the substructures determined by the elements which are denoted by constant terms) are isomorphic. Next we define
259
RELATIVELY HOMOGENEOUS STRUCTURES 01. =r+1 ;t;if for every
a
lOll there exists bE: 1;(;,1 such that
Eo
(ill
and vice versa.
,a)=r
(~,b)
Finally we define
if Ol == t- for all r<w. Now we can define the central notion of this study. f is called a local "-monomorphism from 01. into ;& if for some non-empty finite Xs;. lOl.l , f:X ~I£..I and (07. ,x)x~X =0