PROBABILITY (ID
EXPERIMENTAL ERRORS IN
SCIENCE
u«
G
m
\\
Probability and
Experimental Errors in
Science
Scie...
136 downloads
877 Views
20MB Size
Report
This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
Report copyright / DMCA form
PROBABILITY (ID
EXPERIMENTAL ERRORS IN
SCIENCE
u«
G
m
\\
Probability and
Experimental Errors in
Science
Science Editions®
JOHN WILEY
and SONS, INC.
NEW YORK
PROBABILITY
AND EXPERIMENTAL ERRORS IN
SCIENCE An elementary survey
LYMAN
G.
PARRATT
Professor of Physics
Chairman of the Department of Physics Cornell University Ithaca,
New York
COPYRIGHT
r
1961
BY
JOHN WILEY
All Rights
& SONS, INC.
Reserved
This book or any part thereof must not
be reproduced
in
any form without the
written permission of the publisher.
Library of Congress Catalog Card Number: 61-15406 Printed in the United States of America First Science Editions printing 1966 Science Editions Trademark Reg. U.S. Pat.
Off.
DEDICATION This book
is
dedicated
to those timeless intellectuals
who
have so shaped our cultural pattern
that experimental science can live as a part of a science that seriously
tampers with
the plaguing and hallowed uncertainty in
man's comprehension of and of the universe.
his
gods
it,
"He
that
is
unaware of
his
ignorance will be
misled by his knowledge."
WHATELY
Preface
Although
the concepts of probability
everything in science, the student in
haphazard fashion. His
first
is
and
too often
statistics left to
underlie practically
acquire these concepts
contact with quantitative probability
may
be in extracurricular gambling games, and his second in some requirement of a laboratory instructor
who
insists arbitrarily that
a
±
number should
follow a reported measurement. In his undergraduate training, he
may be
introduced in a social science to sampling procedures for obtaining data, in a biological science to formulas for the transmission of genes in inherit-
ance characteristics, and in a physical science to Heisenberg's uncertainty
which he intuitively concludes is essentially nonsense. Such good as far as they go (except, of course, any excess in gambling and the conclusion as to nonsense), are left woefully disconnected and do not prepare him adequately to understand the intrinsic "open-ended" feature of every measurement and concept in science. Probability is the lens that brings science into philosophic focus. Without a fairly clear comprehension of this fact, the scientist cannot be really "at home" in his own field. And, at a time when as never before the results of science rush on to overwhelm society, the scientist, for lack of focus, is a poor ambassador. Not only is he spiritually uncomfortable in his own field, but science itself, as he portrays it, cannot fit comfortably in the society of other human activities and knowledge. In a very humble way, we are attempting at Cornell University to principle,
experiences,
introduce the undergraduate student to the unifying concepts of probability
Preface
vim
and
statistics as
they apply in science. This
The
a difficult task at this level of
is
broad scope is no doubt the most mature and sophisticated one with which man has ever struggled. But it is believed that the best time to instill a general attitude in a student he will have less trouble later in maturing properly. is when he is young This is admittedly the objective of a teacher rather than of a book. But experience shows the impracticality of trying to teach undergraduate students without a book. The present volume has been assembled to fill the student's development.
subject in
its
—
this
pedagogical need, at least to
fill it
be a base from which the teacher
A
is
patterned to
to discuss further aspects,
deepen and broaden the understanding of
especially those aspects that science.
The book
in part.
may go on
few suggestions are given of such excursions in the under-
standing of science, particularly in the
first
chapter.
comments on the different meanings of probability, then goes into the classical games of chance as examples of the classical or a priori meaning. Although these games have almost nothing to do with science, they provide a convenient framework for the teaching The book begins with
brief
of basic principles, for example, of combinatorial analysis which
is
funda-
probability reasoning, and of the sampling process inherent in
mental in
all
scientific
measurements.
The games
are also remarkably successful in
arousing the undergraduate's interest in the subject, and in providing
numerous problems
to help
quantitative concepts.
we
him develop
Once
his feelings for probability into
the basic principles are well established,
turn to their applications in problems
more
serious than gambling
games. In the bulk of the book, emphasis
is
placed on the experimental definition
of probability rather than on the classical definition. After the ideal games,
and
after
bility,
comments on
the role in science played by both kinds of proba-
namely, classical and experimental, the discussion
ments and to the general
maximum
statistical concepts.
shifts to
measure-
These concepts include
likelihood, rules for the propagation of errors, curve fitting,
several applications of the least-squares method, consistency tests, a
little
on the analysis of variance, a little on correlation, and so on. Then, the normal (Gauss) and the Poisson models of mathematical probability are explored both analytically and with typical problems. The normal and the Poisson models are given about equal weight, this weighting being roughly commensurate with their invocations in modern scientific measureof statistics our discussion is very an undergraduate student in an experimental science. But in both statistics and probability, the point of view taken in this book is somewhat different from that of the professional ments.
Especially in
elementary
the
subject
—
just the essentials for
statistician or
mathematician.
Preface
ix
Numerous problems
are given in each of the five chapters.
Many
of
these problems are intended to provoke discussion,
and the instructor should look them over carefully before he assigns them to the student. The most commonly used equations in statistics and probability are gathered together for convenience and placed at the end of the book, just before the index.
am
I
K.
I.
pleased to express
my
indebtedness and thanks to Professor
Greisen of Cornell University for reading the manuscript, for checking
and for making numerous helpful general suggestions. And, needless to say, practically all that I know about the subject I have learned from others.
the problems,
When 'Omer smote his bloomin' lyre, 'E'd 'eard men sing by land and sea; An' what he thought 'e might require, 'E went an' took the same as me!
—
Kipling In partial acknowledgment, the following books are listed and
them 1.
I
recommend
for collateral reading.
T. C. Fry, Probability
and
Its
Engineering Uses (D.
Van Nostrand
Co.,
New
York,
1928). 2.
3.
4.
G. Hoel, Introduction to Mathematical Statistics (John Wiley & Sons, New York, 1954), 2nd ed. A. G. Worthing and J. Geffner, Treatment of Experimental Data (John Wiley & Sons, New York, 1943). H. Cramer, The Elements of Probability Theory and Some of Its Applications P.
(John Wiley
& Sons, New
5.
A. M. Mood, Introduction York, 1950).
6.
B.
7.
E. B. Wilson,
York, 1955). to Theory of
Statistics
(McGraw-Hill Book Co.,
W. Lindgren and G. W. McElrath, Introduction (Macmillan Co., New York, 1959).
to Probability
and
New
Statistics
Jr., An Introduction to Scientific Research (McGraw-Hill Book Co., York, 1952). R. B. Lindsay and H. Margenau, Foundation of Physics (John Wiley & Sons, New York, 1943). William Feller, An Introduction to Probability Theory and Its Applications (John Wiley & Sons, New York, 1957), 2nd ed. R. D. Evans, The Atomic Nucleus (McGraw-Hill Book Co., New York, 1955), Chapters 26, 27, and 28.
New 8.
9.
10.
11.
Emanuel Parzen, Modern
New Ithaca,
May
Probability
and
Its
Applications (John Wiley
&
Sons,
York, 1960),
New
1961
York
LYMAN G. PARRATT
Contents
Chapter
I
EARLY DEVELOPMENTS: IDEAL GAMES A.
B.
INTRODUCTION,
1
1-1.
"Three" Meanings of Probability, 2
1-2.
Historical Perspective, 6
CLASSICAL (A PRIORI) PROBABILITY, 1-3.
Definition of Classical Probability, 8
1-4.
Probability Combinations, 9
8
Mutually exclusive events, 10 Independent events, 10
Compound
events: general addition theorems, 11
Conditional probability
:
multiplication
theorem, 12 1-5.
Inferred Knowledge, 17
1-6.
Problems, 20
1-7.
Combinatorial Analysis, 23 Permutations, 23
24 Sampling without replacement, 26 Sampling with replacement, 26
Stirling's formula,
Combinations: binomial
coefficients,
Binomial distribution formula, 30
Multinomial
coefficients,
35
Multinomial distribution formula, 37 XI*
27
Contents
xii
Sampling from subdivided populations without lottery problem and bridge replacement: hands, 39 1-8.
Classical Probability
and Progress
in
Experimental
Science, 41
Applications in statistical mechanics, 42 Classical statistics, 43
Quantum 1-9.
C.
bosons and fermions, 43
statistics:
Problems, 45
EXPERIMENTAL 1-10. Definition
(A POSTERIORI) PROBABILITY, 40
of Experimental Probability, 49
Number of "equally probable
outcomes''''
meaningless, 51
Chapter 2
1-11.
Example: Quality Control, 51
1-12.
Example: Direct Measurements
in Science, 52
DIRECT MEASUREMENTS: SIMPLE STATISTICS A.
B.
MEASUREMENTS IN SCIENCE: ORIENTATION, 2-1.
The Nature of
2-2.
Trial
2-3.
Random
a Scientific Fact, 55
Measurements and
Statistics,
2-4.
Probability Theory in Statistics, 61
2-5.
Computed Measurements, 62
2-6.
Conclusions, 63
BASIC DEFINITIONS: FIGURES, ETC., 63 2-7.
56
Variation, 59
ERRORS, SIGNIFICANT
Types of Errors, 64
Random
(or accidental) error, 64
Systematic error, 67 Precision and accuracy, 68
Discrepancy, 69 Blunders, 69 2-8.
C.
Significant Figures
and Rounding of Numbers, 69
FREQUENCY DISTRIBUTIONS AND PRECISION INDICES, 2-9.
71
Typical Distributions, 72
Terminology: types of distributions, 72 2-10. Location Indices,
76
Median, 76
Mode Mean
(most probable value), 76 (arithmetic average)
m
and
/i,
76
55 55
Contents
xiii
2-11. Dispersion Indices, 79
Range, 79 Quantile, 79
Deviation {statistical fluctuation), 79
Mean
{average) deviation, 80
Experimental standard deviation
s,
82
Moments, 84 Variance a2 : "universe" or "parent"" standard deviation a, 86
Degrees offreedom, 89 Variance: binomial model
Standard deviation error) s m
,
distribution, 91
mean {standard
in the
92
Skewness, 94
Other dispersion
indices,
95
Conclusions, 97 2-12. Problems, 98
Chapter
3
OF MEASUREMENTS FUNCTIONAL RELATIONSHIPS
STATISTICS
3-1.
IN 101
Method of Maximum Likelihood,
103
p
in the
binomial distribution, 105
//
and a
in the
/
(#)] = 1 - p(A) - p(B) + p(A) p(B) (1-6) = p(A) [1 - p(B)] + p(B) [1 - p(A)] = p(A) + p{B) - 2p(A) p(B) (1-7) = p(A) p(B) + ^(either A or B, not both) = 1 — ^(neither A nor B) = p(A) + p(B) - p(A) p(B) (1-8) •
^(either
A
^(either
or B, not both)
A
or
B
or both)
•
•
•
•
Equations
1-6, 1-7,
theorems.
Equation
and 1-3
1-8 are is
commonly known
as the general addition
the special case for mutually exclusive indepen-
dent events.
The concept of "sample space"
is
a very convenient one.
In a given
probability situation all possible outcomes comprise sample space.
For
example, in the drawing of one card from a deck of 52 cards, there are 52 points in sample space. (Sample space
may be
visualized as points appro-
on a sheet of paper.) The convenience of the concept is readily seen in the idea of overlapping component events. Consider the 52-point space just cited: of the four points representing aces, two also are found among the 26 points representing red cards. Other examples are given in the problems of Sections 1-6 and 1-9. priately arranged
Conditional
multiplication
probability:
multiplication theorem, of
which
Eq. 1-5
is
theorem.
branch of the subject
general
This leads to a
events, involves the ideas of partially dependent events.
known
The
the special case for independent
as conditional probability.
If
event
B cannot
occur unless some condition is imposed, say unless event A has occurred, then the probability for B must include the probability that the condition
Classical (a priori) Probability is
13
In this simple example, the probability for the
satisfied.
event (A and B)
may
compound
be written
=
p(A and B)
p(A)pA (B)
(1-9)
on the assumption that A has p A (B) is written as p(B A). Equation 1-9, in its general form in which B depends upon more than one condition, is known as Bayes' theorem. (Bayes' theorem is usually stated a little differently, viz., as the probability that B was preceded by the where p A (B)
to be read as the probability,
is
B
already occurred, that
specified events
Ax A2 ,
Often,
will occur.
•
•
•
,
;
this is also
|
known
as inverse probability.)
Incidentally, the definition for the independence of events
p(B
|
A)
=
and
p(B)
p(A
=
B)
\
A and B
is
p(A)
Consider the following example. Three white balls and one black ball and one white ball and two blacks are placed in an
are placed in a jar identical jar.
If
withdrawn from
We
one of the two jars it,
what
selected at
is
argue that on the condition that the
f ; and that the white probability is |.
probability
is
on the condition Either jar
Hence, the probability that a white
and from the second white ball
is
the
jar, \
random and one
ball
the probability that this ball will be white?
is
•
The
\.
first
jar
chosen the white
is chosen chosen with a probability of \.
is
drawn from
is
is
that the second jar
the
first
over-all probability for
jar
is
\
•
f
drawing the
sum A)(^l)
= 2*4+2'3 =
24
The
subscript on p indicates that this is the probability based on our knowledge before any ball has been drawn; we make use of the subscript
notation
later.
Another example of conditional probability is the following. Suppose that in a jar are two balls of which either is black (B) or white (W), and suppose that we have no additional a priori information about the particular color complex of the balls in the jar. What is the probability that the first ball drawn will be white? In the absence of any additional a priori information, it is customary to presume that there is an equal
p that each of the possible hypotheses is the correct one. We might further presume that there are three hypotheses, viz., two whites in probability
the jar, a white
we would
and a black, and two blacks. With these two presumptions,
write
p (Hyp
WW) =
p (Hyp
WB)
= j^Hyp BB) =
Accordingly, the over-all probability for a white on the
by the sum Po\ "l)
=
3
'
I
+
3
'
2
+
'
3.
t
=
2
±
first
draw
is
given
Probability and Experimental Errors in Science
14
since the three hypotheses are mutually exclusive.
chosen at random, It is
in the
In this problem, the
argument to the two jars, one of which
three hypotheses are similar in the
is
preceding problem.
to be emphasized that the presumption of equal probabilities for
the three different hypotheses
is
really
made
in
desperation since
information on which to form a better judgment. All we (1) that the probability for
range of
to
hypotheses
is
1
and
each possible hypothesis
sum of
(2) that the
is
in fact are
somewhere
the probabilities for
all
in the
possible
unity.
Depending upon our view
This example has a further complication.
of the conditions under which the balls were placed or the jar,
we have no
know
we might presume
somehow
got into
that there are four, instead of three, equally
probable hypotheses. According to this view we would write
WW) =
p (Hyp
For our purpose now, no
Hyp BW,
made between
=\ Hyp WB and
would be assigned a probability of
\ instead of \
p (Hyp
but as a unit
it
WB)
=
p (Hyp
BW) =
distinction need be
p (Hyp BB)
of being the correct one. Accordingly, the over-all probability for a white
on
the
first
draw
is
a probability that
given by the
is
the
equally likely hypotheses. the ball replaced
sum
same as But
if
that determined
the color of the
on the basis of three ball drawn is noted,
first
and the jar shaken, and then a second
ball
is
to be drawn,
it can be easily shown that the numerical value of the white probability number of equally likely hypotheses assumed 2 ) is dependent upon the Pi( at the time of the first draw. (As an exercise, the student should show this
W
dependence.) as to the "proper" number of a priori equally probable an inherent part of problems of this sort. This is one non-
The ambiguity hypotheses trivial
is
aspect of the arbitrariness inherent in the concept of probability as
mentioned in Section 1-1. For practice, let us rephrase the problem and extend it, knowing however that the numerical values from here on depend upon the initial number of equally probable hypotheses assumed. Consider a thin metal disk which we propose to toss as a true coin. Suppose that there are only three hypotheses: (a) that the disk has a mark on both sides that we may call heads, Hyp HH; (b) that it has a different mark on both sides that we may call tails, Hyp 7T; and (c) that it has heads on one side and tails on the other, Hyp HT.* With no a priori information as to which of these *
Again,
owing likely
if
Hyp
in any way from Hyp HT, e.g., we would start with four equally
77/ were recognized as different
to the process of manufacture of the disk,
hypotheses instead of three.
Classical (a priori) Probability
three hypotheses
is
the correct one, suppose that
an equal probability,
we
write
p H h(H\) as trie Hyp HH is
condition that toss
first
is
correct, then toss
first
is
Po(tfi)
/>
(Hyp HT)
we may
=
p (Hyp TT)
probability that the correct,
Pht(^i)
heads on the condition that
as the probability that the is
we assume
in
desperation
viz.,
HH) -
p (Hyp If
IS
first
toss
is
Hyp
first
=
toss
}
is
heads on the
as tne probability that the
HT
is
correct,
and p TT {H^)
heads on the condition that
Hyp TT
write the expression for the probability that the
heads as
=
Po(Hyp
HH) p HB(HJ + p (Hyp HT)
+
Wi)
p (Hyp TT) pTT (H x )
where the subscript on p refers to the probability before the outcome of any toss is known. Substituting the known quantities as discussed above, />(# i)
= i-i +
as expected.
Next,
we
Now we
toss the thin metal disk
W + i-o-i
and observe
that the
outcome
is
heads.
have some information about the relative probabilities of the
three hypotheses; viz.,
we know
that
A(Hyp TT)
=
and, furthermore, that* Pl
(HypHH)>p (UypHT)(Hyp bad) would be the same as here given for/>(Hyp HH) and the restriction n ^ 1 would be removed. This is the case in the "sunrise" example discussed presently. t C. S. Pierce, The Collected Papers of Charles Sanders Pierce (Cambridge, Mass., 1931-35), ed. Hartshorne and Weiss, said in effect, "All beliefs and all conclusions, however arrived at, are subject to error. The methods of science are more useful than old wives' gossip for achieving stable and reliable conclusions, but science offers no * If the
sides,
i.e.,
a
access to perfect certitude or exactitude.
We can
never be absolutely sure of anything."
"There is no absolute certainty" is itself inconsistent, Pierce answered "If I must make any exception, let it be that the assertion 'Every assertion but this is fallible" is the only one that is absolutely infallible."
Then
to the objection that the proposition
Probability and Experimental Errors in Science
18 limited
number of observations.
a scientific generalization rests
than one experiment
is
Usually, however, the evidence on which
complex that the outcome of more
so
is
needed to topple
it
Rather than
completely.
when confronted with an unexpected outfurther developed) to include the new infor-
toppled, a well-based theory,
come, is usually altered (i.e., mation as "expected." Such is the progress of inferred knowledge of any sort; and such is the central feature of experimental probability asdiscussed later.
upon the "proper" number of equally an inherent part of each problem of inferred knowledge. Let us explore this a little further and ask the queswas
It
said earlier that deciding
probable a priori hypotheses
What
is
tomorrow? One two hypotheses to be considered cither assumption is the sun will rise or it will not rise analogous to the two outcomes in the toss of a coin. These two hypotheses are presumed in desperation to be
tion,
the probability that the sun will rise
is
—
that there are only
—
equally probable, each probability being t at the start of things,
before the
sun
first
Some
sunrise.
will rise again, after
having risen n days in a row,
obviously erroneous (notice
an increase
in
i.e.,
people argue that the probability that the
how
small
it
is!)
because
it
is
(|)
n+1 but this ,
the sunrise probability as experience accumulates.
other people argue that, after the
is
does not allow for So,
and more sunrise observations, the
first
the probability decreases that the hypothesis "the sun will not rise"
the
is
correct one. This argument is identical to the one in the thin metal disk problem as discussed above, and the desired probability is (2" + 1)/ n (2 + 2). As a third and last argument, we might consider that at the dawn of history or at whatever time n = 1, all hypotheses in the entire are equally probable. This assumption to range of probabilities from infinite number of hypotheses is again each of an of equal probabilities for 1
a desperation-in-ignorance type of assumption. universe was chosen at
random from
It is
to the effect that
our
a collection of universes in which
all
conceivable universes in regard to the sunrise probability were equally probable.
On
bility is
+
*
(/?
This result
this \)\{n
may
argument we would conclude that the desired proba+ 2).* Laplace advanced this last argument in 1812,
be derived along lines of conditional probability without specifi-
cally evaluating hypotheses as such.
white balls such that the
/th jar
Imagine
contains
/
N+
1
jars,
black and
each containing
N—
i
white
balls,
N
black and
i
taking on
to N. A jar is chosen at random and n balls drawn one by one with integer values from replacement after each draw. Suppose event (nB) has occurred, i.e., that all n balls are black. What is the probability that the next ball drawn from the jar will also be black? If
we choose
the
;'th
jar, the probability for
(nB)
is
p,(nB)
=
(i/N)".
Therefore,
Classical (a priori) Probability
19
and the expression for the probability
(n
+
+
l)/(n
2)
is
called the Laplace
law of succession. Laplace offered publicly to bet anyone 1,826,214 to that the sun would rise tomorrow (he reckoned n as 5000 yearsf).
1
These three arguments conclude with quite different numerical values of the desired probability, aside from the question of the proper value of n,
and
test
serve to illustrate the inherent difficulty in the development
of the
when
reliability
a bit of
of knowledge. The problem
new knowledge
is
is,
and
of course, most acute
just being conceived.
At
this time,
what
are the equally likely hypotheses, or what are the particular hypotheses
Think of the plight of the observer, who was born during the night 5000 years ago and who has never seen or heard of the sun or of a tomorrow, contemplating the prospect that the sun will rise tomorrow, or, if he has seen it just once, contemplating the probability that it has regular habits. Of course, now, with our accumulated experience, confidence in our knowledge that the sun will rise tomorrow is great, and the difficulties in the origin of this knowledge may be amusing. But the alert student will see immediately many modern examples of such even worth considering?
inherent difficulties in new hypotheses or theories and of the inherent arbitrariness in the probability or reliability of a prediction in terms of a
—
new theory or, indeed of any theory, old Further comment is in order in regard This assumption possible tion
is
or new. to the desperation assumption.
with no information whatsoever, each of only two
that,
outcomes should be assigned a probability of \. say that the probability of "life on Mars"
we would
since choices of jars are mutually exclusive events.
=
pUiB) ^
+
!»
+
1
jn+i
The required
+
l)B)
+
N"(N
and, likewise, the probability that n
p((n
2"
=
probability, viz., that («
balls
+
+
assumpwould
We
AT"
in a
row
are
all
black
is
t-N«+i
-j
N n+1 (N + +
this
-|.
1)
drawn
2»+i
is
Then,
•
+
On
1)
1)5 occurs after we
know
that
nB has
occurred,
is
p({n
+
\)B)
lim jy_*oo
p(nB)
=
n n
+ +
1
2
equivalent to evaluating and using the approexample with the thin metal disk, but this may not be obvious.] t The modern scientist would find numerous inferred reasons" for believing that the sun had been rising regularly for many years before 5000 years ago. For example, we may invoke generalizations about planetary motion, interpretations of archeological and geological records of sun-dependent life preceding the earliest records of man, of the
[Dividing by p{nB) in the
last step is
priate hypotheses in the
time involved
in the
evolution of stars,
etc.
Probability and Experimental Errors in Science
20
on Mars
also say that the probability of cats
of every individual form of
the probability of at least one form
is §,
of elephants
is
indeed,
.>,
N
life is J. is
If there are different forms of life, A which, if 1 is large, is very (A)
—
N
What
near certainty and
much
wrong? Nothing
wrong. As soon as we know or profess to know that
there
we
is
a reasonable probability for
is
no longer
are
greater than the
Nor
question.
is
in
answer of
first
more than one form of
\.
life
such complete desperation as to answer
on Mars,
to the
\
is
first
our ignorance quite so profound as for us to answer
any of the additional questions, although we are admittedly rather when confronted with such questions. We must be very careful making the complete desperation assumption. There are numerous in classical "paradoxes" that have been expounded on this point. Additional knowledge always modifies a probability, except for the Omniscient for \ for
disturbed
Whom
the answer to any probability question
is
always either zero or
unity.
1-6.
Problems
Note:
A
numerical answer to a problem
is
not a complete answer;
the
student should justify the application of the equation(s) he uses by giving an
how
analysis of the problem, pointing out
conditions on which each equation
is
the problem meets satisfactorily the
based.
To develop
his "intuition," the
student should contemplate the comparison of the correct answer with his
To
a priori expectation. 1.
What
this end,
answers are given to most problems.
the probability of drawing at
is
jar containing 3 red
and
random each of
the following
from a
5 black balls:
(ans. 2-g)
(a) 2 red balls simultaneously,
(b) 3 red balls in successive
draws with replacement
after each
draw, and (ans. 6-jt-2 )
2 reds and a black in a single draw of 3 balls?
(c)
Ten people are arranged
2.
at
random
the probability that 2 given people will be
by
1
person between them?
3.
Two
(i)
in a ring.
next to each other, and
[ans. (a)
cards are drawn
(ans. 5^)
(a) in a row, and (b)
(i) |, (ii)
(ii)
0.178; (b)
simultaneously from a 52-card deck.
What
is
separated
(i) f, (ii)
What
is
f]
the
probability that
one of them is a spade, and an ace and the other a black ten?
(ans. 34)
(a) at least
(b) they are both red, (c)
one
is
4.
With two dice
(a)
an even number on each
(b) either a 7 or (c)
neither a
1
2,
cast together,
an
II,
nor an
what
is
(ans. x*%\ (ans. 8 f 3 )
the probability for (ans. ])
die,
and 1
1
,
nor a 7
(ans. |) in the first cast,
and a 7
in the
second cast
?
(ans. i)
Classical (a priori) Probability
What
5.
is
the probability that, in tossing a penny,
heads appear
(a) 5
21
(ans. 3A2 )
in the first 5 tosses,
second head appears on the fifth toss, and 5 tosses, a head appears exactly twice?
(ans. |)
(b) the in
(c)
In
6.
how many throws of
die
1
is
there a probability of less than 0.2 of (ans.
(a) seeing a 5,
(b) seeing a 5 for the first time in the last throw,
7.
odd,
2
k2
since
=
—
n
k x and
M =
I
which there are three groups of
same argument
as in the
Now
like objects,
P
1
k2
(n
kx
—
kx
—
-
.k 2 .k 3
k2
—
«
=
Ar
x
+
Ar
2
+
kz
-
k 2 )\
By
.
the
2
k3
)\
n\
k x \{n
consider the next step in
first step,
_lA(n-k\ln-k -k
^*"Wl
since (n
1.
kx )\ k 2 \(n
-
k.V.
-k x
(/i
-
fci
-k -k -
k 2 )\ k 3 \(n
x
2
k3 )\
3 .
k 3 )\
Pkk
n^ = k
0!
=
=
1.
By 5!
generalizing,
we
= -5L»
=
see that
(1-29)
l
It is perhaps clearer in this derivation than in the former that the order is preserved among all the r groups, each group being a single combination.
37
Classical (a priori) Probability
The symbol
Pk
n
k
...
appears as the coefficient of each term in the
k
+
algebraic expansion of (a x '
+
a2
•
•
+
•
a r) n
and
,
for this reason
is
it
that it is called a multinomial (or polynomial) coefficient. Consider a few illustrative examples. First, how many different ways can five letters be arranged if three of the letters are x and two are p.
The answer
is
5^3,2
=
=
5!/3!2!
and these ten arrangements are and jjxxx. Note
10,
xxxjj, xxjxj, x/'xxj, jxxxj, xx/jx, xjxjx, jxxjx, xj'jxx, jxjxx,
again that the group sequential order
from
jjxxx, although the order
is
important;
In an earlier example, we inquired as to how bridge hands could be dealt from a 52-card deck.
many
permutations are possible
normal way.
We
when
xxxjj
e.g.,
not preserved in the
is
z's
many
is
different
or in the
y's.
different single
Now
let
us ask
how
the four hands are dealt in the
are concerned with permutations because, in the
game
of bridge, the sequential order of North, East, South, and West is important. There are four combinations, each of size 13 cards, to be selected
from a population of 52 cards. Hence, n
52!
P*!,*8 ,*3,*4 =
13! 13 13! 13 1
which
is
a very large number,
viz.,
(5.3645
outcomes,
to the case in
i.e.,
is
.
distribution for-
which the object population or outcome
subdivided into more than two groups of like elements.
For example, a
may
die has six different sides, a jar
different colors than two, a
in a gas
28
)10
•
be generalized to the case of more than two possible
easily
population
•
The binomial
Multinomial distribution formula.
mula can
•
contain balls of
deck of cards has four different
have many different values of velocity,
Ax A2 A3 ,
,
•
•
Ar
•
,
,
Let
.
more
molecules
etc.
Consider a probability situation in which there are possible outcomes, viz.,
suits,
r
mutually exclusive
p be
the probability
t
outcome A occurs at a trial and let n independent trials be made. The probability that outcome A x occurs exactly k 1 times, that outcome A 2 occurs exactly k 2 times, etc., is calculated in a manner identical to that used in deducing the binomial formula. The probability/? of obtaining a partickr ular sequence of outcomes is pfrp^p** p r and if we are not interested in the sequential order in which the A outcome occurs in the k times it is observed, and if we do wish to preserve the sequential order in which the various groups of like outcomes occur, we must multiply by the multinomial coefficient n Pk k 1-29. Thus, k from Eq. that
t
'
'
'
,
i
x
^(particular sequence)
=
:
«!
=
M[(/c x
;
n, p^){k 2
;
./c
n,
2
kr
.
p2 )
•
•
Pi^pf*
'
' :
Pr
lr
.
(k r
;
n,
p r)~\
(1-30)
Probability and Experimental Errors in Science
38
which may be read as the probability that in n independent trials A 1 A 2 occurs exactly k 2 times, etc., when the respective outcome probabilities are p v p 2 etc. Here, k is any integer from to n
occurs exactly k x times,
,
t
=
=
1. with the condition, of course, that ^; = i^, «• Also, of course, ]£• x=1 p in Eq. 1-30 stands for multinomial. It can be shown easily The symbol i
M
that the
sum over
all
values of
A:
Eq. 1-30 gives the expression for the
in
multinomial (ft
and Eq. 1-30
may
is
known
+ ft +
all
*
*
+
PrY
=
(1-31)
1
as the multinomial formula.
be put in graphical form
a graph for
'
if
the graph has r
+
1
Equation 1-30
dimensions; such
values of k represents the multinomial distribution of
probabilities.
An
understanding of the multinomial coefficients and distributions
imperative
if
is
the student seeks an understanding of the kinetic theory of
gases or, indeed, of any physical theory involving statistical mechanics.
Note
well that such theories are of increasing importance
in
the
all
physical sciences.
We may
point out in the interests of general perspective that
the analysis of errors of experimental measurements, we
later, in
shall conceive
of some probability distribution as being the subdivided population of "objects" from which a sample, i.e., a single measurement or a limited
number of trial measurements,
is
sample, with replacement, from the
same
taken.
Each measurement or
trial set is
a
a rather specially subdivided population of
our considerations of the multinomial and the probability per outcome, e.g., the probability of a
sort as that described in
coefficients,
particular measurement,
is
given by the population distribution probability.
This distribution, for which n
remain unknown or
it
is
very large, infinite in
may be assumed to be known. Commonly assumed parent
some It is
instances,
may
also called the
an Poisson and the experimental science are the normal (Gauss) distribution distribution, both of which may be considered as special cases of the binomial distribution as stated earlier in this section. The statistical problem in experimental measurements is generally to infer from a limited number of trial measurements (a) what is the most appropriate parent probability distribution, e.g., normal or Poisson, and (b) what are the quantitative values of its descriptive parameters. Help in the answer to (a) is usually afforded from a priori experience and the particular type of measurement or from statistical analysis from a rather large number of trials; obtaining the answer to (b) is often solely an a posteriori problem. The features of the measurement problem should become clear as the "parent" distribution.
reader progresses in this book.
distributions in
Classical (a priori) Probability
39
Sampling from subdivided populations without replacement: problem and bridge hands. A basic condition in the binomial distribution and in the multinomial distribution is that the component lottery
probability p be constant for
This condition restricts applications
all trials.
to sampling with replacement. But, the use of the binomial coefficient as
giving the
extended
number of combinations can be
to, a
common
replacement from a subdivided population. in
further illustrated by, or
type of problem in which sampling
is
done without is one
This type of problem
which we ask for the probability that a random sample of size/ contains i elements of a specified type k from a population of n elements
exactly
subdivided into n
— k + k2 + x
'
+k
* '
r,
with
r
)
and
girls,
(ans. 6\)
the children will be boys.
(c) exactly half
(ans. re)
A
box contains 90 good and 10 defective screws. If 10 screws are selected as a sample from the box, what is the probability that none in the sample is 18.
defective
if
(ans. 0.330 ••
sampling is done without replacement, and (b) with replacement? (a)
How many
19. (a)
different
(ans. 0.348
outcomes (permutations) are possible
k
=
12,
appearing twice
)
(ans. 6*0
what
(i.e.,
the probability for the event of every face
is
number
(ans. 0.003438
2 aces, 2 deuces, 2 treys, etc.)?
How many
20. In a lottery of 10,000 tickets there are 100 prizes.
must a person buy so that the probability of his winning 50%? An approximate answer suffices.
at least
1
•
tickets
prize will
(ans: 69)
exceed 21.
•
••
in the cast
of k dice together ? (b) If
•
A certain professor always carries 2 match boxes, each initially containing
25 matches. Every time he wants a match he selects a box at random. Inevitably a moment occurs when, for the first time, he finds a box empty. Then, what is the probability that the other
box contains
r
=
0, 1, 2,
•
•
•
matches?
22. In a laboratory experiment, projectiles (small steel balls) are shot (at
random
The screen
times) through a screen (the spokes of a rotating wheel).
(spokes) deflects or scatters
some and allows
others to pass through undeflected.
Suppose 8 projectiles are shot. Suppose that the probability of each passing through undeflected is 0.8. Compute and plot the probability distribution for traversals without deflection. If the experiment were to be repeated many times, what proportion of the trials would yield results within ±2 of the mean value? This
is
a typical "scattering cross-section" experiment in which, usually, the is determined from the observed numbers of undeflected
basic event probability/?
When
projectiles. it is
the experiment
is
performed for the purpose of determining/?,
a typical a posteriori probability experiment.
23.
Among
(a) If
N
two
TV different keys,
probability that the lock will be (b)
What
will
open a certain
100 and half of the keys are selected at
is
is
lock.
random
to try,
opened?
what
is
the
(ans. 0.752)
the limiting value of this probability as
N increases
indefinitely? (ans. |)
(c) If TV is 100,
how many
keys should be selected to try in order that there
should be just more than an even chance of opening the lock? 24. (a) If the
on any
odds are k to
particular day,
show
1
(ans. 35)
against a machinist's meeting with an accident
that the
odds are
(1
+
\jk) n
—
1
to
1
against
escaping injury for n days. (b) If
k
=
1000,
escaping injury?
what
is
the greatest value of n giving favorable odds for (ans. 693)
48
Probability
The A gene
25.
Aa have
A
the
is
and Experimental Errors
dominant, the a recessive;
characteristic,
i.e.,
and of type aa the a
Science
in
A A and Assume {, i,
organisms of types
characteristic.
and \ to be the a priori probabilities for the gene types AA, Aa, and aa (Assume Aa = aA). (a) If both parents are A, what is the probability that an offspring will be a?
respectively.
(ans. £)
(b) If all
4 grandparents and both parents are A, what
the second generation will be 26. In testing
ESP
is
the probability that
A?
(ans. if)
(extrasensory perception), an experiment
is
conducted
with 4 red and 4 black cards. The cards are thoroughly shuffled and placed face
down on
the table.
black cards, but he
The person A to be tested is told that there are 4 red and 4 knows nothing as to their arrangement. Person B draws a
card and, without either looking at If
A
answers "red,"
B places
it
it
on one
himself or showing
it
side of the table
A
on the other side. This process drawn. Let us assume A has no ESP. places
(a)
it
What
is
is
;
if
repeated until
the probability that there will be just
1
to A, asks
its
color.
all
cards have been
black card in the "red"
pile?
A)
(ans.
(b) If the first card to
appear
is
black but
is
called red,
what
is
the probability
that there will be exactly 3 red cards in the "red" pile at the
experiment? the
(c) If
end of the 4
(ans. first
card
is
called correctly,
what
is
B
answers "black,"
3 5)
the probability of having
exactly 3 correct cards in each pile?
(ans. §§)
27. In the game of craps, the person casting 2 dice wins if he gets a 7 or an II on the first cast or, alternatively, if the first sum is a 4, 5, 6, 8, 9, or 10 and the same sum reappears before a 7 appears in any cast after the first. (a) What is the win probability when the game is defined in this way? (ans. 0.49293
•
•
)
Sometimes the game is defined so that the player does not automatically lose if he casts a 3 on the first throw, and 3 is then added to the winning sums for succesive throws. What is the win probability in this case? (ans. 0.50682 (b)
•
28.
A
poker hand of
5 cards
is
dealt
•
from an ordinary 52-card deck. What
•
is
the probability for each of the following: (ans. 0.422)
(a) a single pair,
(ans. 0.0476)
(b) 2 pairs, (c) 3
of a kind,
(ans. 0.0211)
(d) straight (5-card sequence, ace permitted at either end, including a flush), (ans. 0.00394) (e) flush (5 (f) full
(g)
cards in a single
suit,
including a straight),
4 of a kind,
(ans. 0.00024)
(h) straight flush (including a royal flush), (i)
royal flush, and
(j)
"opener"
(a pair
(ans. 0.00198) (ans. 0.00144)
house,
(ans. 0.0000155) (ans. 0.0000015)
of jacks or better)?
(ans. 0.206)
49
Experimental (a posteriori) Probability
29. Consider the "pros" and "cons" of the following system of betting: Suppose in successive games, in each of which the odds are 50-50, you bet SI. At any time that you win, you pocket the winnings and start betting again at SI At any time that you lose, you bet double the amount on the next game. No matter how long the series of consecutive losses, when you win you are $1 ahead as though the losses had not occurred. (a) If you were the owner of a gambling house, under what conditions would you allow a client to use this system? (b) How would you alter the system if the odds were known to be 75-25?
(In considering both parts of this problem, ignore the usual bias
the house in
C.
its
own
EXPERIMENTAL 1-1 0.
imposed by
interest.)
(A POSTERIORI) PROBABILITY
Definition of Experimental Probability
Suppose that for some reason we wish to check the
classical (a priori)
idea that the probability for observing a head with a tossed coin
The obvious
thing to do
keep a record of the
results.
We
moderately large number n obs of independent ratio
u' ot)S /tf obs is,
|.
We
trials.
say that the
for this value of « obs the best experimental value of the ,
probability for heads in any single toss,
e.g.,
in this value increases as n oX)S is increased.
experimental probability
is
fluctuate rather erratically
probability steadies
down
the next toss.
Indeed,
if
Our confidence
the value of this
plotted as a function of « obs
when n ohs
is
small, but, as
/?
,
it
is
Fig. 1-3.
By
the
definition, the experi-
mental probability (sometimes called the frequency probability)
becomes
seen to
obs increases,
to an apparently constant equilibrium value.
A typical graph of this sort is shown in this ratio as « obs
is
number of times and to observe heads u obs times after some
to toss the coin a large
is
is
simply
indefinitely large, viz.,
pobs
=
limit
^
(1-39)
nobs— 00 Hobs the outcome of each trial (toss) is (a) independent of all preceding trials, and (b) determined entirely by chance. There are four difficulties with this definition. First, how can we be sure that all the trials are independent? The practical problem here is that the coin may wear out asymmetrically or that the person (or device) tossing the coin gradually but inadvertently acquires a "system" which favors a particular outcome. It should be noted here that we do not require the absence of a "system," but merely that if it is present it must
if
Probability and Experimental Errors in Science
50 Heads
Tails
400
Fig. 1-3. Experimental probability (frequency ratio for "heads") steadies
apparently equilibrium constant value as n
bs increases.
down
to
an
(Note the logarithmic abscissa
scale.)
Second,
remain constant. trial
is
how can we
be sure that the outcome of each
determined entirely by chance?
related to the
The
practical
one for the independence of successive
problem here
trials.
is
Third, the
limit n obs -*> oo is obviously impractical. In this regard, we substitute a conceptual extension of the experiment after « obs has become "satis-
However, the value of p obs for any large but finite n obs it as a consequence of the fact that n obs not strictly converge mathematically no ratio does finite. Fourth, the is matter how large « obs becomes. This is because, after any specified n obs
factorily" large.
contains some small uncertainty in
,
there
The
is
a
finite
chance that a long run of heads (or of
tails) will
occur.
experimentalist points out that as n obs increases, such a run must be of
increasing length to have a given effect in the value of p ob9
,
and
that after
verv ' ar g e trie probability for having a significant effect of this sort >*obs This has been proved mathematically in is so small as to be negligible. terms of the so-called strong law of large numbers. It is important, i
s
nevertheless, that n obs be very large indeed if p obs is to be expressed with very high precision. Later we shall show that the standard deviation,
a measure of the
statistical uncertainty, in the
proportional to the square root of n oba
.
measure of p obs
is
inversely
Experimental (a posteriori) Probability
SI
Even with these difficulties, the experimental definition is the one that must be invoked to "prove" that the coin is "satisfactorily honest," i.e., that the a priori probability is reasonably valid, or sometimes even to prove that a very complex combinatorial analysis is indeed correct.
Number of "equally probable outcomes" meaningless. Outside the realm of ideal games numerous probability situations exist in which the number of equally probable outcomes is entirely meaningFor these situations the classical probability, Eq. 1-1, cannot be Examples are legion: A marksman shoots at a target; evaluated. what is the probability of a hit? What is the probability that a particular person of given age will die within one year? What is the probability that a given house will be ravaged by fire within a specified time? If a baby is to be born, what is the probability that it will be a boy? What is the probability that John Doe, a candidate for public office, will be elected? What is the probability that the next measurement of cosmic-ray intensity will differ by a given per cent from the immediately preceding measurement? What is the probability that two different measurements of the velocity question of the
less.
of light agree within the experimental errors? In such probability situations
we
are at a complete loss in trying to apply the classical definition for
the probability.
Rather than rely on "armchair" reasoning, or make a
basic desperation-in-ignorance guess,
we may experiment, make
actual
measurements, and use the experimental definition of probability. I-II.
Example: Quality Control
Determining the experimental probability of a specified outcome generally involves rather intricate statistical reasoning in order to achieve
satisfactory numerical value with a
minimum of
the heads probability in tossing a coin
is
effort.
very simple.
a
The example of
To
illustrate the
problem discussed problem, a random sample of limited
typical complexity, let us consider the lottery type of in the last part
sizey
was
of Section
selected
from a
1-7.
In this
large population n subdivided into n
We
=
kx
+
k2
how many elements of the kind k x may we expect to have in the sample j. Suppose now that we alter this problem as follows. The numerical value of n is known but the division of n between k 1 and k 2 is not known, and we wish, from
with
all
numerical values known.
inquired then as to
an observation of the number of k x elements in j, to determine the ratio kjn. This is a typical problem in what is called "quality control." It is instructive to consider this type of problem a little further because it illustrates one essential feature of the measurement problem in an experi/'
mental science.
A factory turns out a very large number n of supposedly identical items,
Probability and Experimental Errors in Science
52 but some
unknown
fraction are defective,
whether or not
infer
can be discussed
in
this fraction
and we wish, by sampling,
to
exceeds a specified value. The problem
terms of the equation for the probability for having
i
defectives in sample j, viz.,
" (M ( '~ M //l; /; n
KexactlyO='
(;)
As
a
first
approximation,
this expression for /^(exactly
"equal" to the observed ratio
i/j,
i)
may
be placed
and a value of the defective fraction kjn
deduced therefrom. The difficulty is that a different value of kjn is obtained from each different sample ratio i/j. Of course, the reliability of the deduced value of kjn increases as the sample size increases or as the
number of independent samplings increases to provide a more reliable mean value of i/j. The problem really is to determine, for preassigned reliability in k x jn, the optimum sample size and number of samples commensurate with a minimum of effort in examining the samples. There are various statistical arguments in treating quality control problems of this sort, and discussion of them is beyond the scope of this
But one approach to this problem, in case n is very much larger mentioned now because it ties together some of the concepts discussed earlier in this chapter. In this case, the problem can be approxibook.
than
y, is
mated by one of sampling with replacement. Then, the binomial equation can be used, Eq.
1-20, viz., ]-i
\\)l
and the problem becomes one of determining the parameter p (= kjri) from the observed ratio i/j. Suppose that a guess as to the true value of p puts it in the range 2 to 4%, and that it suffices to know/? to one significant figure.
hypotheses,
One procedure then is to make five mutually exclusive 2, 3, 4, or 5 % and to guess initially (in desperation) p = all equally likely, i.e., probability \. The binomial probability
viz.,
that they are
1
,
j, p) may be calculated for each value of p, and comparison with the outcomes of successive independent samples serves to
distributions B(i;
increase the probability that one of the hypotheses
is
to be favored over
the others.
1-12.
Example: Direct Measurements
Now
let
in
Science
us extend the quality control problem so as to
to a typical
measurement problem.
make
it
similar
This illustrates a most significant
Experimental (a posteriori) Probability
S3
application of the multinomial probability distribution in the science of
measurements. As a
first
step in this extension, suppose that the definition
of "defective" involves an upper and a lower limit of tolerance in the pertinent aspect of quality, e.g., in a linear dimension such as the diameter of ball bearings. With n
=
+
kx
k2
+
The problem,
k3
if
n
this extension, n r
i.e.,
,
=
3,
subdivided into three categories,
is
with category k 2 being "nondefective."
very large compared toy, becomes one of multinomial
is
p x p 2 pz unknown. In this optimum sample size and number of the compromise between reliability and
probabilities with the respective probabilities
determination of the
the
case,
samples, with consideration for
,
,
even more complicated than in the case in which n was divided
effort, is
two
into only
categories,
and we
shall
not attempt
it
further here.
Next, suppose that the n elements are subdivided into a
much
larger
number of different categories. Suppose that these categories are ordered in terms of some numerical characteristic of the elements, perhaps the diameter of the ball bearings. Our objective is to infer the average or arithmetic mean value of the entire population of n possible values from a sample of
In the determination of a length
size j.
(e.g.,
the balls as measured with a pair of calipers or with instrument),
we
take a
number j of independent
a sample of size/, from an essentially
From i
z
+
•
'
,
h
-,
i
r
'
K
=j
and
the arithmetic mean.
+
^1
+h+
^2
of k x
,
i
2
of k2 ,
large that adjacent numerical
limit
is
'
'
kr
ks
,
Probability and Experimental Errors in Science
84
convenient expression for
in the calculations, a
(x (
— mf in
Eq. 2-9 and by using Eq.
=
5
I
(*
co, with ju as the Variance a 2 :
/li
reference value
is
t
is
known
=
denoted by
—
l
as the variance,
a,
= a
and
matical model.
it
is
square
= 2
*=*
is
standard deviation. The variance distribution whether
its
fo.
- n?Pi
(2
-
20 )
also called the "universe" or "parent" is
a parameter of the universe or parent
of the "imagined
Incasethe parent distribution
real-life" is
type or
is
a mathe-
continuous, thesummation
Frequency Distributions and Precision Indices
may
Eq. 2-20
in
87
be replaced by an integration.* Thus, using/as the con-
tinuous frequency function from
to oo,
fxffdx Jo
(2-21)
f
fdx
Jo
The
integral in the
denominator
that the frequency function
function
included in this expression to be sure
is
normalized;
is
when a model
used, the integral in the denominator
is
The variance a 2
,
i.e.,
the second
moment about //,
is
probability
unity by definition.
is statistically
the
most
important parameter in describing the dispersion of any universe or parent distribution, including any mathematical model distribution. With either
the
"real-life"
distribution,
imagined universe
we must assume
that
//,
is
The value of [x, hence of a 2 can never be ,
set
We
of measurements.
* In
distribution
or
the
model a2
known
in order to calculate
exactly
known from any
.
real-life
use the "best" estimate that can be obtained.
going from a discrete to a continuous frequency distribution,
we
use the basic
For some students, it may be helpful to review this argument. Consider the 12 measurements listed in Table 2-1 and graphed in the discrete distribution of Fig. 2-1. Or consider any list of a large number n of measurements. The range of the abscissa or x axis of interest can be arbitrarily divided into a large number N of equal increments Ax,. The number of measurements that fall into the ith interval is «,, and the normalized frequency (or the experimental probability) with which a measurement is observed within this interval is argument of
calculus.
N i
The normalized frequency distribution
the graph of/>, vs. x,, where x, is the coordinate taken as the average of the measurements within this interval. of course, discrete so long as Ax, is finite in size.
of the interval Ax, and This distribution
is,
n
I-, =i
is
is
We wish now to approximate the discrete frequency distribution/?,(x,) by a continuous function /(x), defined by
i.e.,
one for which Ax,
means of the
->-
in the limit as n
-*
oo.
This function can be
relation
which says that the value of/(x) at x = x, is to be made such that the product of this value and the width of the interval Ax, is equal to the normalized frequency of the observed measurements within this interval. Actual real-life measurements are far too few in number in any given situation to determine /"(x) in fine detail (i.e., Ax, small, zero in the limit) by direct use of this definition. We are usually content to guess the "way/"(x) should go" reasonably consistent with the actual measurements at hand. This gives a continuous function that approximates not only the actual discrete frequency distribution but also the presumed-to-becontinuous parent distribution.
Probability and Experimental Errors in Science
88
For a
set
of n measurements, as stated above, the best value of
generally taken as the experimental value
m; and
to a 2 that can be
of n measurements
deduced from a
set
/x is
the best approximation is
generally
taken asf n
9
o
(2-22) 1
one estimator of a, but Vn/(n — \)s is generally considered to be a better estimator of a because the radical factor corrects for a bias inherently present in s. This bias was mentioned [The sample standard deviation s
earlier, in
connection with the
is
mean
deviation,
and
is
discussed again
below.]
Combining Eqs.
2-9
and
2-22,
we note
that
(n
\
'-^1
n
M (2-23)
/
—
/
1
In practice, this is commonly put in the form of a guess that a particular known continuous function satisfactorily "fits" the finite set of actual measurements; in other words, the guess is made that more measurements, were they to be taken, would merely increase our satisfaction that the analytic function fits and describes the parent distribution.
Then, the
common problem becomes one
of determining the best guesses
as to the important parameters of the continuous function.
of the sample and of the parent distribution respectively,
For example, for the means
we
write
N
m = i—L_ = n
^ />.(*)>
tl
wm
>
ancl
f*
=
f(x) d" Jo
i=i
and for the sample and the continuous-parent k moments about the mean
(see Eq. 2-17),
/*00
«*/(*) dx
N ek m
=£(*,(*
ii
Next, n and
p
—
k\(n-k)\
sum
1.
is
(2-25)
us that the term for
by changing the lower
0,
pY~ k
are factored out, giving
I,
and
7]
=
/"
n
=
(n
= i (k
—
1
nP
— ;
-
see that the
summation
fc
—
_! n _ fc
k)\
then K
(A,
Then
xt).
the probability
equal to the product of the n factors
a single experimental observation.
pjm) =
trials
are
all
II (A,
x)
is
t
independent (see Eq.
and
A and
B,
is
x { ), each factor being
n #a *d
(3-3)
=1
written with the assumption that the n 1-5).
To compare the reliability of two different hypotheses A and B, we may write the ratio, from Eqs. 3-1, 3-2, and 3-3,

$$\frac{p_{\mathrm{mod}}(A)}{p_{\mathrm{mod}}(B)} = \frac{p_{\mathrm{in}}(A)\,\prod \phi(A, x_i)}{p_{\mathrm{in}}(B)\,\prod \phi(B, x_i)} \qquad (3\text{-}4)$$

Equation 3-4 is recognized as the "betting odds" and is often called the likelihood ratio; the product Π φ(A, x_i) is known as the likelihood function [it is a normalized probability if all the factors are proper probabilities]. Numerical evaluation of the likelihood function is straightforward if the functional form of each hypothesis is known. And, as mentioned in Chapter 1, if we have no a priori knowledge favoring A or B, we usually resort to the desperation-in-ignorance guess that p_in(A) = p_in(B).
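A minimal sketch of Eq. 3-4 for two specific hypotheses: here A and B are taken, purely for illustration, to be normal populations differing only in their means, and the desperation-in-ignorance guess p_in(A) = p_in(B) is made, so the betting odds reduce to the ratio of the two likelihood functions.

```python
import math

def gauss(x, mu, sigma):
    """phi(x) for a normal population with mean mu and std dev sigma."""
    return math.exp(-(x - mu) ** 2 / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))

# Hypothetical observations and two rival hypotheses A and B (illustrative).
xs = [10.2, 9.9, 10.4, 10.1, 10.3]
like_A = math.prod(gauss(x, 10.0, 0.3) for x in xs)   # product of phi(A, x_i), Eq. 3-3
like_B = math.prod(gauss(x, 9.5, 0.3) for x in xs)

# With p_in(A) = p_in(B), Eq. 3-4 is just the likelihood-function ratio.
print("betting odds p_mod(A)/p_mod(B) =", like_A / like_B)
```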
3-1. Method of Maximum Likelihood
As an example of the use of the likelihood ratio, Eq. 3-4, suppose that we wish, from n independent trial measurements of x, to find the most likely estimate (or estimator) g of a true parameter y in a known mathematical functional form φ(x; y). Assume that there is only one parameter to be determined. We set up some function g = g(x₁, x₂, ···, x_n) of the n trial values of x from which the estimate g is to be deduced.

There are several methods for setting up such g functions, and each method gives a different degree of goodness of the estimate g. The statisticians rate these methods in terms of their relative efficiencies. As stated in the discussion of the mean deviation in Section 2-11, the relative efficiency is defined as follows. If N sets of samples, each of size n, are taken from the parent population, N different values of g are obtained. These N values of g themselves form a distribution, and let us say that the standard deviation of this g distribution is noted. This process is repeated for each of the methods for estimating g, and the standard deviation obtained with each of the respective methods is noted. The relative efficiency of two methods is taken as the inverse ratio of the squares of the standard deviations. Of many possible methods, that method having the smallest standard deviation has its g values clustered most closely together and is said to be the most efficient.

Also, with any method, if the mean of the g distribution for samples of size n tends to a value different from the true value y, the estimate is said to be biased. If the estimate g converges to the true value y as n → ∞, the estimate is said to be consistent, i.e., free of bias as the sample size increases without limit. (An example of bias is mentioned presently.) For scientific work, it is generally agreed that a good estimate must have zero (or at most small) bias as well as reasonably high efficiency.
For most parametric estimation problems, the method of estimation known as the method of maximum likelihood is the most efficient, and, if n is large, the estimate is usually satisfactorily consistent. The likelihood function, the product of all n values of φ(x_i; y), is written

$$L(x_1, x_2, \cdots, x_n; y) = \phi(x_1; y)\,\phi(x_2; y)\cdots\phi(x_n; y) \qquad (3\text{-}5)$$

Especially in the case of a discrete population, certain values of x_i are observed with a frequency f_i which is greater than unity. In such a case the actual frequency f_i appears as an exponent on the corresponding factor, and the total number of factors is then r, with $\sum_{i=1}^{r} f_i = n$.
Suppose that the variable k is known mean /u unknown. The m, can be obtained from the likelihood function
the Poisson distribution.
to have a Poisson distribution, Eq. 1-26, with the
estimate of
by
//,
viz.,
differentiating with respect to
[x
and
setting the derivative equal to zero,
=0=i4^-l)
flogl)
(3-16)
and solving for m, 1
-Ik f = -Xx n i=i n
™=
i
i
i
(3-17)
i=i
in
agreement with Eq.
Can this be interpreted in any sense to confirm statistically the statement that the mean is the best location value for an asymmetric distribution?

Instrumental parameter. As a final example in this section of the method of maximum likelihood, suppose that t is the time interval between counts in a Geiger counter measurement of the intensity of cosmic rays, and suppose that the frequency function for t is …
… = √2 s. (This test is further developed in Section 5-8 as a quantitative test to determine whether or not a measured signal is "really" above the background noise.)

An even better criterion for consistency is the following one. Assume that the two sets of measurements, n₁ of x₁ and n₂ of x₂, are consistent and are pooled. Then the best estimate of the standard deviation of the parent population, of which the n₁ + n₂ measurements are a compound random sample, is

$$\sigma \approx \left[\frac{n_1 s_{x_1}^2 + n_2 s_{x_2}^2}{n_1 + n_2 - 2}\right]^{1/2}$$

where n₁ + n₂ − 2 is the number of degrees of freedom. As a working parameter in this test, which is called the t test, we write

$$t = \frac{\bar{x}_1 - \bar{x}_2}{\sigma\left(\dfrac{1}{n_1} + \dfrac{1}{n_2}\right)^{1/2}} \qquad (3\text{-}64)$$

In the special case for which n₁ = n₂ = n, Eq. 3-64 becomes

$$t = \frac{\bar{x}_1 - \bar{x}_2}{\sigma}\left(\frac{n}{2}\right)^{1/2}$$

On our assumption that the sets x₁ and x₂ are consistent, the value of t, as different pairs of sample sets are considered, is expected to fluctuate, for the reason that they are sample sets each of finite size.
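Equation 3-64 translates directly into code; the two sample sets below are hypothetical, and s² is taken as the mean-square deviation, as defined in the text.

```python
import math

def t_statistic(x1, x2):
    """Working parameter of the t test, Eq. 3-64, with the pooled parent
    estimate of sigma using n1 + n2 - 2 degrees of freedom."""
    n1, n2 = len(x1), len(x2)
    m1, m2 = sum(x1) / n1, sum(x2) / n2
    ss1 = sum((x - m1) ** 2 for x in x1)            # n1 * s1^2
    ss2 = sum((x - m2) ** 2 for x in x2)            # n2 * s2^2
    sigma = math.sqrt((ss1 + ss2) / (n1 + n2 - 2))  # pooled estimate
    return (m1 - m2) / (sigma * math.sqrt(1 / n1 + 1 / n2))

# Two hypothetical sample sets recorded on different days (illustrative).
a = [10.1, 10.4, 9.9, 10.2, 10.3]
b = [10.6, 10.8, 10.5, 10.9, 10.7]
print(f"t = {t_statistic(a, b):.3f} with {len(a) + len(b) - 2} degrees of freedom")
```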
If an infinite number of pairs of sample sets are imagined, the corresponding values of t constitute a t distribution.

Fig. 3-1. t distribution curves for different numbers of degrees of freedom ν.
This distribution can be expressed in rather simple analytic form (not easily derived, however) if x₁ and x₂ are from a normal parent distribution, viz., as

$$f(t) = c\left(1 + \frac{t^2}{\nu}\right)^{-\frac{1}{2}(\nu + 1)} \qquad (3\text{-}65)$$

where c is a constant chosen to make the integral of f(t) equal unity, and ν is the number of degrees of freedom. For ν = ∞ the distribution is normal. This t distribution, illustrated in Fig. 3-1, is symmetrical in shape about the mean t = 0 and, for ν < ∞, is relatively higher in the tails than is the normal distribution. Knowing the analytic form of the t distribution, we can calculate the probability
is
the
that the value of
range,
e.g.,
is
of the next sample pair of sets
outside a range set by the values
made of ±t c to bound
tion
/
±t c
will fall outside a specified
this
range
the calculated probability
is
is
arbitrary.
0.05,
i.e.,
This calcula-
in Fig. 3-1.
The
by integrating Eq. 3-65 over the set range.
Commonly,
t
c
specification
chosen so that
is
that out of 100 sample values of
/
only 5 on the average will fall outside the bounds of ±t c Note that the calculated probability is based on the assumption that x\ and x2 are consistent. If it turns out that the magnitude of the experimental value of t as deduced from Eq. 3-64 is larger than the magnitude of t c this fact does not prove that the two means are inconsistent but it argues rather strongly in favor of a suspicion that they are inconsistent. The argument is even .
,
stronger if t_c is set at any limit less than the 5% limit, e.g., the 1% limit. Inconsistency would be caused by the presence of any significant systematic error affecting the observed deviations differently in one set of measurements than in the other set. Values of t that are exceeded only 1 (or 5) in 100 times on the average, if no significant nonconstant systematic errors are present, are
listed in Table 3-1 for several different numbers of degrees of freedom n₁ + n₂ − 2.

Table 3-1. Values of t_c in the t Test, 1% and 5% Limits
Given two or more sample sets of measurements of the same quantity, we would like to pool them so as to increase the total number of measurements and thereby increase the reliability of the mean according to Eq. 3-45. The pooling of such sets of measurements requires that the sets be consistent not only in regard to their means but also in regard to their standard deviations. Or, we may wish to test the internal consistency of the precision of two sets of measurements (rather than of merely the means) recorded on different days, with different apparatus, or by different observers. Again, the standard deviations of the various sets or samples are expected to differ somewhat among themselves, even if they are consistent, because of the fact that they are merely samples. We seek a probabilistic test, of course, for consistency of standard deviations.
In the t test for consistency of two different means, as just described, the assumption is made that both sample sets of measurements are from the same population (from a normal population if Eq. 3-65 is used). Then the probability of the validity of this assumption is tested. We shall make this assumption again, but this time use the so-called F ratio as the working parameter, which is defined in terms of the standard deviations of the two sets. (The t parameter was defined in terms of the difference in the means.) Incidentally, strictly speaking, F (or t) is a "statistic," not a "parameter."
Suppose that s_{x₁} and s_{x₂} are the respective sample standard deviations in the n₁ measurements of x₁ and in the n₂ measurements of x₂. Then σ̂_{x₁} and σ̂_{x₂} are the best estimates of the standard deviations of the parent populations. The F ratio is defined as

$$F = \frac{\hat\sigma_{x_1}^2}{\hat\sigma_{x_2}^2} = \frac{n_1 s_{x_1}^2/(n_1 - 1)}{n_2 s_{x_2}^2/(n_2 - 1)} \qquad (3\text{-}67)$$

As in the method of the t test, if an infinite number of pairs of sample sets are imagined, the corresponding values of F constitute an F distribution. This is a continuous distribution whose analytic form, if the two sets of measurements are from the same normal parent distribution, is

$$f(F) = c\,F^{\frac{1}{2}\nu_1 - 1}\left(\nu_2 + \nu_1 F\right)^{-\frac{1}{2}(\nu_1 + \nu_2)} \qquad (3\text{-}68)$$

where c is a constant and where ν₁ = n₁ − 1 and ν₂ = n₂ − 1 are the respective numbers of degrees of freedom. The shape of the F distribution is asymmetric, as shown in Fig. 3-2 for a typical pair of values of ν₁ and ν₂. As in the t test, we can calculate with Eq. 3-68 the probability that the value of F of the next sample pair of sets will fall outside a specified range, e.g., outside the range set by the arbitrary values F₁ and F₂, as indicated in Fig. 3-2. This calculation is an integration of Eq. 3-68 with the particular limits.
Fig. 3-2. F distribution curves for different pairs of numbers of degrees of freedom.
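Equation 3-67, with the larger estimate placed in the numerator as described below, becomes in code (the sample values are hypothetical):

```python
def f_ratio(x1, x2):
    """F statistic of Eq. 3-67, using the bias-corrected parent estimates
    sigma^2 = n s^2 / (n - 1); the larger estimate goes in the numerator."""
    def parent_var(x):
        n = len(x)
        m = sum(x) / n
        return sum((xi - m) ** 2 for xi in x) / (n - 1)
    v1, v2 = parent_var(x1), parent_var(x2)
    return v1 / v2 if v1 >= v2 else v2 / v1

# Two hypothetical sample sets (illustrative values only).
a = [10.1, 10.4, 9.9, 10.2, 10.3]
b = [10.0, 10.9, 9.5, 10.6, 9.8]
print(f"F = {f_ratio(a, b):.2f}")
```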
Now, however, since f(F) is not symmetric, it is convenient further to define F as being greater than unity, i.e., to have the larger standard deviation always in the numerator of Eq. 3-67. Suppose that the probability is chosen to be the arbitrary value 0.05, i.e., so that out of 100 sample values of F only 5 on the average will be larger than F₂. Note that the calculated probability is based on the assumption that σ̂_{x₁} and σ̂_{x₂} are consistent, i.e., are estimates of the true value σ_x of a common parent population. If it turns out that the experimental value of F as deduced from Eq. 3-67 is larger than F₂, we may say that, statistically, the standard deviations are not consistent.

Table 3-2 lists the limits for F, if the chances that it will not exceed the limit are 5%, for different numbers of degrees of freedom in determining σ̂_{x₁} and σ̂_{x₂}.
Table 3-2. Limits for F in the F Test, 5% Level [ν (= n − 1) for the denominator in Eq. 3-67]
The importance of the problem often justifies considerable effort in the analysis of the measurements. If there are K constants to be determined in a specified functional relationship between the dependent and the independent variables, and if there are K pairs of measurements, then there are K simultaneous equations which may be solved for each constant. This is a so-called "exact" determination with no degrees of freedom for the evaluation of the precision. But if more than K pairs of measurements are available, the constants are said to be "overdetermined"; the errors in the measured values of the variables prevent an "exact" determination but do provide a basis for evaluating the precision.

The usual procedure is to make a graph of the measured quantities and to "fit" a curve to the data as best we can. If we rely chiefly upon the eye in making this fit, there is a strong tendency to give undue weights to the end points. As a matter of fact, the end points are often the least reliable because of experimental factors in the extremes of the range. By the method of least squares, however, we can give either equal or unequal weights, as desired, to the various points of the graph.

The method of least squares does not tell us in a practical way the best functional relationship; it does tell us precisely the best values of the constants appearing in the equation. Also, it does allow us to choose between two different functional relations, as is seen presently.

Best fit of a straight line. Many functional forms can be expressed as a linear relation,

$$y = a + bx \qquad (3\text{-}69)$$

The photoelectric equation cited above is an example of this form. The constant a that relates the variations in electrical resistivity ρ with the temperature T is given by the expression a = (T/ρ)(dρ/dT), and this expression can be put in the linear form log ρ = A + a log T. The Cauchy equation for the refractive index of a substance is n = a + b/λ², which is seen to be linear when x is written for 1/λ². The exponential decay law, I = I₀e^{−μx}, can be rephrased to be log I = log I₀ − μx.
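As an illustration of such linearization, the following sketch fits the exponential decay law by least squares after taking logarithms. The intensity data are hypothetical, and the equal-weight straight-line formulas used are the standard normal-equation results, assumed here rather than taken from the text's own derivation:

```python
import math

def fit_line(xs, ys):
    """Least-squares slope b and intercept a of y = a + bx (deviations
    reckoned along the y axis, all points weighted equally)."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    b = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    a = (sy - b * sx) / n
    return a, b

# Exponential decay rephrased as a linear relation: log I = log I0 - mu*x.
# Hypothetical intensity data (illustrative values only).
x = [0.0, 1.0, 2.0, 3.0, 4.0]
I = [100.0, 60.5, 36.8, 22.4, 13.5]
a, b = fit_line(x, [math.log(v) for v in I])
print(f"I0 ~ {math.exp(a):.1f}, mu ~ {-b:.3f}")
```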
It usually turns out that, of the two general constants a and b in Eq. 3-69, we are more interested in b than in a, but this is not always so. b gives the slope of the graph and a the intercept.

Consider the graph of measured values of x and y, such as in Fig. 3-3, and a straight line

$$y_0 = a + bx \qquad (3\text{-}70)$$

such that the sum of the squares of the deviations from it shall be a minimum. In what direction should the deviations be reckoned?
Fig. 3-3. Graphs of experimental points to be fitted by a curve: (a) by a straight line, (b) by a parabola.

Ideally, only if random errors are present in both x and y should the deviations be
reckoned perpendicular to the straight line. But the arithmetic involved in the determination of the constants a and b is rather formidable in this case and, in general, the result depends upon the choice of the scale of each coordinate axis; correction for even this effect is possible but is very laborious. The usual procedure is to choose either the x or the y direction for the deviations, recognizing that the price paid for the simpler arithmetic is a sacrifice, usually negligibly small, in the accuracy of the best fit of the line. The choice between the x and the y direction is made in favor of that direction in which the larger standard deviation is found; in order to make this comparison in the same dimensions and units, s_y is compared with bs_x. In almost all cases in experimental science, x is taken as the independent variable whose values are selected with practically negligible error, and in these cases the deviations are reckoned along the y axis.
We shall assume in the following text that all the deviations are taken along the y axis, i.e., that b²s_x² ≪ s_y².

… the exact value of y, viz., … (3-108)
Fig. 3-5. Scatter diagrams illustrating different correlation coefficients and regression lines.
Covariance. The term covariance is commonly used. This is a characteristic of the parent population of which x and y are individual observations. It may be written σ_xy, and the best evaluation of it from n actual observations is

$$\sigma_{xy} \approx \frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})(y_i - \bar{y}) \qquad (3\text{-}109)$$

The covariance may be divided by the product of σ_x and σ_y to give the best determination of the parent or universe correlation coefficient.
It is often convenient in statistics to speak of the experimental covariance, which is given by

$$s_{xy} = \frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})(y_i - \bar{y}) = r\,s_x s_y \qquad (3\text{-}110)$$

from Eq. 3-107.
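Equations 3-109 and 3-110 in code, with hypothetical paired observations:

```python
def covariance_stats(xs, ys):
    """Experimental covariance s_xy (Eq. 3-110), the best estimate of the
    parent covariance sigma_xy (Eq. 3-109), and the correlation coefficient."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cross = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    s_xy = cross / n                 # experimental covariance, Eq. 3-110
    sigma_xy = cross / (n - 1)       # parent estimate, Eq. 3-109
    sx = (sum((x - mx) ** 2 for x in xs) / n) ** 0.5
    sy = (sum((y - my) ** 2 for y in ys) / n) ** 0.5
    r = s_xy / (sx * sy)             # since s_xy = r * s_x * s_y
    return s_xy, sigma_xy, r

# Hypothetical paired grades (illustrative values only).
math_g = [70, 75, 80, 65, 85, 72, 78]
phys_g = [72, 78, 83, 70, 84, 75, 77]
s_xy, sigma_xy, r = covariance_stats(math_g, phys_g)
print(f"s_xy = {s_xy:.2f}, sigma_xy = {sigma_xy:.2f}, r = {r:.2f}")
```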
The
Interpretation.
help
—
given by
is
usefulness of the correlation coefficient
provides in answering such questions as:
it
mathematics
what grade may he expect
75,
is
such a question can be worked out simply
The answer to and b are known.
in physics?"
if r, y, x,
The equation of the least-squares fitted straight line is

$$y' = bx' \qquad (3\text{-}111)$$

where x′ = x − x̄ and y′ = y − ȳ. This is the line M indicated in Fig. 3-4. The expected value of y for a given value of x is readily obtained from Eq. 3-111. Then the value of s_{y′} is computed from Eq. 3-105 as

$$s_{y'} = s_y\sqrt{1 - r^2} \qquad (3\text{-}112)$$

and the answer to the question is simply

$$y = \bar{y} + bx' \pm s_{y'} \qquad (3\text{-}113)$$
where the plus or minus value is the standard deviation in the expected physics grade. If the x and y′ frequency distributions are normal, the probable error in the expected value is 0.675 s_{y′}, i.e., the chances are 50-50 that the student's physics grade would be ȳ + bx′ ± 0.675 s_{y′}. Lines A and B in Fig. 3-4 are parallel to the line M and mark the 50-50 limits; calculation of lines A and B also presumes that the parent y distribution is independent of x.
In the question as posed, the student's grade in mathematics is 75. To make the calculated answer numerical, suppose that r = 0.60, x̄ = 70, ȳ = 80, s_y = 10, and b = 8/7. Then

$$y = 80 + \tfrac{8}{7}(75 - 70) \pm 10\sqrt{1 - (0.60)^2} = 85.7 \pm 8$$

where the reliability ±8 is expressed as the standard deviation, and as ±0.675 × 8 = ±5.4 if expressed as the probable error.
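The same computation in code; the numbers are the illustrative values quoted above, with the slope b = 8/7 as reconstructed there (an assumption of this sketch):

```python
import math

# Expected physics grade from a mathematics grade, per Eqs. 3-111 to 3-113.
r, x_bar, y_bar, s_y = 0.60, 70.0, 80.0, 10.0
b = 8.0 / 7.0                               # assumed slope (see text above)
x = 75.0

y_expected = y_bar + b * (x - x_bar)        # central value, Eq. 3-113
s_y_prime = s_y * math.sqrt(1 - r * r)      # Eq. 3-112

print(f"expected grade = {y_expected:.1f} +/- {s_y_prime:.1f} (standard deviation)")
print(f"probable error = +/- {0.675 * s_y_prime:.1f} (50-50 limits)")
```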
8. The force p required to raise a weight w by means of a pulley block is measured, and the following values are found:

w (lb) = 50   70   100   120
p (lb) = 12   15   21   25

(a) Find a linear law of the form p = a + bw.
(b) Compute p when w = 150 lb.
(c) Find the sum of the deviations.
(d) Find the sum of the squares of the deviations of the given values of p from the corresponding computed values.
Note significant figures in all parts of this problem.
9. In a determination of h/e by the photoelectric method, the following stopping potentials were found, after correction for the contact potential difference, corresponding to the various wavelengths of incident light:

λ (Å) = 2535   3126   3650   4047   4339   5461
V (v) = +0.520   −0.385   −0.915   −1.295   −1.485   −2.045

Using the least-squares method, determine h/e and its standard deviation. Assume errors in V only, weighted equally, and with a fractional standard deviation of 0.5%.

10. If R is the resistance to motion of a car at speed V, find a law of the form R = a + bV² from the following data, (a) weighted equally, and (b) weighted in proportion to the speed V:

V (mi/hr) = 10   20   30   40   50
R (lb/ton) = 8   10   15   21   30
11. The α-ray activity of a sample of radon, expressed in terms of its initial activity as unity, is measured after each succeeding 24-hr interval to be: 0.835, 0.695, 0.580, 0.485, 0.405, 0.335, 0.280, and 0.235. On the assumption that the activity obeys an exponential decay law, find the equation that best represents the activity, and determine the decay constant and the half-life.
(ans. y = e^{−0.1815t} with t in days, 0.1815/day, 3.82 days)

12. What is the best value of α in the relation E = αT⁴, determined from n pairs of measurements? (a) Solve this problem without knowledge of the precision of the measurements by writing the relation in each of the following forms:

(i) E = αT⁴
(ii) ln E = ln α + 4 ln T
(iii) α = E/T⁴

and give a qualitative reason for the differences in the answers. (b) Solve the first …