
4. Binomial Random Variable Approximations, Conditional Probability Density Functions and Stirling's Formula

Let X represent a Binomial r.v. as in (3-42). Then from (2-30)

$$P(k_1 \le X \le k_2) = \sum_{k=k_1}^{k_2} P_n(k) = \sum_{k=k_1}^{k_2} \binom{n}{k} p^k q^{n-k}. \tag{4-1}$$

Since the binomial coefficient

$$\binom{n}{k} = \frac{n!}{(n-k)!\,k!}$$

grows quite rapidly with n, it is difficult to compute (4-1) for large n. In this context, two approximations are extremely useful.
4.1 The Normal Approximation (DeMoivre-Laplace Theorem)

Suppose $n \to \infty$ with p held fixed. Then for k in the $\sqrt{npq}$ neighborhood of np, we can approximate

$$\binom{n}{k} p^k q^{n-k} \simeq \frac{1}{\sqrt{2\pi npq}}\, e^{-(k-np)^2/2npq}. \tag{4-2}$$

Thus if $k_1$ and $k_2$ in (4-1) are within or around the neighborhood of the interval $(np - \sqrt{npq},\ np + \sqrt{npq})$, we can approximate the summation in (4-1) by an integration. In that case (4-1) reduces to

$$P(k_1 \le X \le k_2) = \int_{k_1}^{k_2} \frac{1}{\sqrt{2\pi npq}}\, e^{-(x-np)^2/2npq}\, dx = \int_{x_1}^{x_2} \frac{1}{\sqrt{2\pi}}\, e^{-y^2/2}\, dy, \tag{4-3}$$

where

$$x_1 = \frac{k_1 - np}{\sqrt{npq}}, \qquad x_2 = \frac{k_2 - np}{\sqrt{npq}}.$$

We can express (4-3) in terms of the normalized integral

$$\operatorname{erf}(x) = \frac{1}{\sqrt{2\pi}} \int_0^x e^{-y^2/2}\, dy, \tag{4-4}$$

which has been tabulated extensively (see Table 4.1).

For example, if $x_1$ and $x_2$ are both positive, we obtain

$$P(k_1 \le X \le k_2) = \operatorname{erf}(x_2) - \operatorname{erf}(x_1). \tag{4-5}$$
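Note that the erf in (4-4) is the area under the standard normal density from 0 to x; in the more common notation it equals $\Phi(x) - \tfrac12$, i.e., half the standard error function evaluated at $x/\sqrt{2}$. A minimal Python sketch (the function names are my own) that evaluates (4-4) and the approximation (4-3):

```python
import math

def erf_pillai(x: float) -> float:
    """Normalized integral (4-4): area under the standard normal
    density from 0 to x. Equals math.erf(x / sqrt(2)) / 2."""
    return 0.5 * math.erf(x / math.sqrt(2.0))

def normal_approx(k1: int, k2: int, n: int, p: float) -> float:
    """DeMoivre-Laplace approximation (4-3) to P(k1 <= X <= k2)."""
    q = 1.0 - p
    s = math.sqrt(n * p * q)
    x1 = (k1 - n * p) / s
    x2 = (k2 - n * p) / s
    # erf_pillai is an odd function, so this difference is correct
    # for any signs of x1 and x2, as in (4-5) and Fig. 4.1.
    return erf_pillai(x2) - erf_pillai(x1)
```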

Example 4.1: A fair coin is tossed 5,000 times. Find the probability that the number of heads is between 2,475 and 2,525.

Solution: We need $P(2475 \le X \le 2525)$. Here n is large, so we can use the normal approximation. In this case $p = \tfrac12$, so that $np = 2500$ and $\sqrt{npq} \simeq 35$. Since $np - \sqrt{npq} \simeq 2465$ and $np + \sqrt{npq} \simeq 2535$, the approximation is valid for $k_1 = 2475$ and $k_2 = 2525$. Thus

$$P(k_1 \le X \le k_2) = \int_{x_1}^{x_2} \frac{1}{\sqrt{2\pi}}\, e^{-y^2/2}\, dy.$$

Here

$$x_1 = \frac{k_1 - np}{\sqrt{npq}} = -\frac{5}{7}, \qquad x_2 = \frac{k_2 - np}{\sqrt{npq}} = \frac{5}{7}.$$

$$\operatorname{erf}(x) = \frac{1}{\sqrt{2\pi}} \int_0^x e^{-y^2/2}\, dy = G(x)$$

  x    erf(x)      x    erf(x)      x    erf(x)      x    erf(x)
0.05  0.01994    0.80  0.28814    1.55  0.43943    2.30  0.48928
0.10  0.03983    0.85  0.30234    1.60  0.44520    2.35  0.49061
0.15  0.05962    0.90  0.31594    1.65  0.45053    2.40  0.49180
0.20  0.07926    0.95  0.32894    1.70  0.45543    2.45  0.49286
0.25  0.09871    1.00  0.34134    1.75  0.45994    2.50  0.49379
0.30  0.11791    1.05  0.35314    1.80  0.46407    2.55  0.49461
0.35  0.13683    1.10  0.36433    1.85  0.46784    2.60  0.49534
0.40  0.15542    1.15  0.37493    1.90  0.47128    2.65  0.49597
0.45  0.17364    1.20  0.38493    1.95  0.47441    2.70  0.49653
0.50  0.19146    1.25  0.39435    2.00  0.47726    2.75  0.49702
0.55  0.20884    1.30  0.40320    2.05  0.47982    2.80  0.49744
0.60  0.22575    1.35  0.41149    2.10  0.48214    2.85  0.49781
0.65  0.24215    1.40  0.41924    2.15  0.48422    2.90  0.49813
0.70  0.25804    1.45  0.42647    2.20  0.48610    2.95  0.49841
0.75  0.27337    1.50  0.43319    2.25  0.48778    3.00  0.49865

Table 4.1

Since $x_1 < 0$, from Fig. 4.1(b) the above probability is given by

$$P(2475 \le X \le 2525) = \operatorname{erf}(x_2) - \operatorname{erf}(x_1) = \operatorname{erf}(x_2) + \operatorname{erf}(|x_1|) = 2\,\operatorname{erf}\!\left(\frac{5}{7}\right) = 0.516,$$

where we have used Table 4.1 ($\operatorname{erf}(0.7) \simeq 0.258$).


Fig. 4.1: the standard normal density $\frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}$ with the area between $x_1$ and $x_2$ shaded: (a) $x_1 > 0,\ x_2 > 0$; (b) $x_1 < 0,\ x_2 > 0$.
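As a numerical check of Example 4.1 (a self-contained sketch; the "exact" value is a direct summation of (4-1), carried out in log space to avoid floating-point underflow):

```python
from math import erf, exp, lgamma, log, sqrt

n, p, k1, k2 = 5000, 0.5, 2475, 2525

# Normal approximation (4-3), written with the standard error function.
s = sqrt(n * p * (1 - p))
x1, x2 = (k1 - n * p) / s, (k2 - n * p) / s
approx = 0.5 * (erf(x2 / sqrt(2)) - erf(x1 / sqrt(2)))

def log_pmf(k: int) -> float:
    """log of the binomial pmf C(n,k) p^k q^(n-k)."""
    return (lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
            + k * log(p) + (n - k) * log(1 - p))

exact = sum(exp(log_pmf(k)) for k in range(k1, k2 + 1))
print(approx, exact)  # ~0.525 vs ~0.529; the coarser table lookup gave 0.516
```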

4.2 The Poisson Approximation

As we have mentioned earlier, for large n the Gaussian approximation of a binomial r.v. is valid only if p is fixed, i.e., only if $np \gg 1$ and $npq \gg 1$. What if np is small, or if it does not increase with n?

Obviously that is the case if, for example, $p \to 0$ as $n \to \infty$, such that $np = \lambda$ is a fixed number.

Many random phenomena in nature in fact follow this pattern: the total number of calls on a telephone line, claims in an insurance company, etc., tend to follow this type of behavior. Consider random arrivals such as telephone calls over a line. Let n represent the total number of calls in the interval (0, T). From our experience, as $T \to \infty$ we have $n \to \infty$, so that we may assume $n = \mu T$. Consider a small interval of duration $\Delta$ as in Fig. 4.2. If there is only a single call coming in, the probability p of that single call occurring in that interval must depend on its relative size with respect to T.

Fig. 4.2: the interval (0, T) with call arrivals marked and a small subinterval of duration $\Delta$.

Hence we may assume $p = \Delta / T$. Note that $p \to 0$ as $T \to \infty$. However in this case

$$np = \mu T \cdot \frac{\Delta}{T} = \mu \Delta = \lambda$$

is a constant, and the normal approximation is invalid here.

Suppose the interval $\Delta$ in Fig. 4.2 is of interest to us. A call inside that interval is a success (H), whereas one outside is a failure (T). This is equivalent to the coin tossing situation, and hence the probability $P_n(k)$ of obtaining k calls (in any order) in an interval of duration $\Delta$ is given by the binomial p.m.f. Thus

$$P_n(k) = \frac{n!}{(n-k)!\,k!}\, p^k (1-p)^{n-k}, \tag{4-6}$$

and here $n \to \infty$ and $p \to 0$ such that $np = \lambda$. It is easy to obtain an excellent approximation to (4-6) in that situation. To see this, rewrite (4-6) as

$$P_n(k) = \frac{n(n-1)\cdots(n-k+1)}{n^k}\, \frac{(np)^k}{k!}\, (1 - np/n)^{n-k}
= \left(1 - \frac{1}{n}\right)\left(1 - \frac{2}{n}\right)\cdots\left(1 - \frac{k-1}{n}\right) \frac{\lambda^k}{k!}\, \frac{(1 - \lambda/n)^n}{(1 - \lambda/n)^k}. \tag{4-7}$$

Thus

$$\lim_{\substack{n \to \infty,\ p \to 0 \\ np = \lambda}} P_n(k) = \frac{\lambda^k}{k!}\, e^{-\lambda}, \tag{4-8}$$

since the finite products $\left(1 - \frac{1}{n}\right)\left(1 - \frac{2}{n}\right)\cdots\left(1 - \frac{k-1}{n}\right)$ as well as $(1 - \lambda/n)^k$ tend to unity as $n \to \infty$, and

$$\lim_{n \to \infty}\left(1 - \frac{\lambda}{n}\right)^n = e^{-\lambda}.$$

The right side of (4-8) represents the Poisson p.m.f., and the Poisson approximation to the binomial r.v. is valid in situations where the binomial r.v. parameters n and p diverge to two extremes ($n \to \infty$, $p \to 0$) such that their product np is a constant.
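A small numerical illustration of the limit (4-8) (the parameter values are my own choice): with $np = \lambda$ held at 2, the binomial p.m.f. approaches the Poisson p.m.f. as n grows.

```python
from math import comb, exp, factorial

def binom_pmf(k: int, n: int, p: float) -> float:
    return comb(n, k) * p**k * (1 - p) ** (n - k)

def poisson_pmf(k: int, lam: float) -> float:
    return exp(-lam) * lam**k / factorial(k)

lam, k = 2.0, 3
for n in (10, 100, 1000, 10000):
    print(n, binom_pmf(k, n, lam / n), poisson_pmf(k, lam))
# The binomial column converges to e^-2 * 2^3 / 3! ~ 0.18045.
```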

Example 4.2: Winning a Lottery. Suppose two million lottery tickets are issued, with 100 winning tickets among them. (a) If a person purchases 100 tickets, what is the probability of winning? (b) How many tickets should one buy to be 95% confident of having a winning ticket?

Solution: The probability of buying a winning ticket is

$$p = \frac{\text{No. of winning tickets}}{\text{Total no. of tickets}} = \frac{100}{2 \times 10^6} = 5 \times 10^{-5}.$$

Here n = 100, and the number of winning tickets X among the n purchased tickets has an approximately Poisson distribution with parameter $\lambda = np = 100 \times 5 \times 10^{-5} = 0.005$. Thus

$$P(X = k) = e^{-\lambda}\, \frac{\lambda^k}{k!},$$

and (a) the probability of winning is

$$P(X \ge 1) = 1 - P(X = 0) = 1 - e^{-\lambda} \simeq 0.005.$$

(b) In this case we need $P(X \ge 1) \ge 0.95$. But

$$P(X \ge 1) = 1 - e^{-\lambda} \ge 0.95 \quad \text{implies} \quad \lambda \ge \ln 20 \simeq 3,$$

and $\lambda = np = n \times 5 \times 10^{-5} \ge 3$ gives $n \ge 60{,}000$. Thus one needs to buy about 60,000 tickets to be 95% confident of having a winning ticket!
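Part (b) can also be solved directly (a sketch):

```python
from math import ceil, log

p = 100 / 2_000_000          # probability that one ticket wins, 5e-5
# 1 - exp(-n*p) >= 0.95  is equivalent to  n*p >= ln 20.
n_needed = ceil(log(20) / p)
print(n_needed)              # 59915, i.e. about 60,000 tickets
```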
Example 4.3: A spacecraft has 100,000 components ($n \to \infty$). The probability of any one component being defective is $2 \times 10^{-5}$ ($p \to 0$). The mission will be in danger if five or more components become defective. Find the probability of such an event.

Solution: Here n is large and p is small, and hence the Poisson approximation is valid. Thus $\lambda = np = 100{,}000 \times 2 \times 10^{-5} = 2$, and the desired probability is given by

$$P(X \ge 5) = 1 - P(X \le 4) = 1 - \sum_{k=0}^{4} e^{-\lambda}\, \frac{\lambda^k}{k!} = 1 - e^{-2} \sum_{k=0}^{4} \frac{2^k}{k!}
= 1 - e^{-2}\left(1 + 2 + 2 + \frac{4}{3} + \frac{2}{3}\right) \simeq 0.052.$$
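A quick check of this number using only the standard library (sketch):

```python
from math import exp, factorial

lam = 100_000 * 2e-5    # lambda = np = 2
p_at_most_4 = sum(exp(-lam) * lam**k / factorial(k) for k in range(5))
print(1 - p_at_most_4)  # ~0.0527, matching the 0.052 above
```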

Conditional Probability Density Function

For any two events A and B, we have defined the conditional probability of A given B as

$$P(A \mid B) = \frac{P(A \cap B)}{P(B)}, \qquad P(B) \ne 0. \tag{4-9}$$

Noting that the probability distribution function $F_X(x)$ is given by

$$F_X(x) = P\{X(\xi) \le x\}, \tag{4-10}$$

we may define the conditional distribution of the r.v. X given the event B as

$$F_X(x \mid B) = P\{X(\xi) \le x \mid B\} = \frac{P\{(X(\xi) \le x) \cap B\}}{P(B)}. \tag{4-11}$$

Thus the definition of the conditional distribution depends on conditional probability, and since it obeys all probability axioms, it follows that the conditional distribution has the same properties as any distribution function. In particular

$$F_X(+\infty \mid B) = \frac{P\{(X(\xi) \le +\infty) \cap B\}}{P(B)} = \frac{P(B)}{P(B)} = 1,$$

$$F_X(-\infty \mid B) = \frac{P\{(X(\xi) \le -\infty) \cap B\}}{P(B)} = \frac{P(\emptyset)}{P(B)} = 0. \tag{4-12}$$

Further,

$$P(x_1 < X(\xi) \le x_2 \mid B) = \frac{P\{(x_1 < X(\xi) \le x_2) \cap B\}}{P(B)} = F_X(x_2 \mid B) - F_X(x_1 \mid B), \tag{4-13}$$

since for $x_2 \ge x_1$,

$$(X(\xi) \le x_2) = (X(\xi) \le x_1) \cup (x_1 < X(\xi) \le x_2). \tag{4-14}$$

The conditional density function is the derivative of the conditional distribution function. Thus

$$f_X(x \mid B) = \frac{dF_X(x \mid B)}{dx}, \tag{4-15}$$

and proceeding as in (3-26) we obtain

$$F_X(x \mid B) = \int_{-\infty}^{x} f_X(u \mid B)\, du. \tag{4-16}$$

Using (4-16), we can also rewrite (4-13) as

$$P(x_1 < X(\xi) \le x_2 \mid B) = \int_{x_1}^{x_2} f_X(x \mid B)\, dx. \tag{4-17}$$

Example 4.4: Refer to Example 3.2. Toss a coin with X(T) = 0 and X(H) = 1, and suppose B = {H}. Determine $F_X(x \mid B)$.

Solution: From Example 3.2, $F_X(x)$ has the staircase form shown in Fig. 4.3(a); we need $F_X(x \mid B)$ for all x.

For x < 0, $\{X(\xi) \le x\} = \emptyset$, so that $\{(X(\xi) \le x) \cap B\} = \emptyset$ and $F_X(x \mid B) = 0$.

For $0 \le x < 1$, $\{X(\xi) \le x\} = \{T\}$, so that $\{(X(\xi) \le x) \cap B\} = \{T\} \cap \{H\} = \emptyset$ and $F_X(x \mid B) = 0$.

For $x \ge 1$, $\{X(\xi) \le x\} = \Omega$, so that $\{(X(\xi) \le x) \cap B\} = \{B\}$ and

$$F_X(x \mid B) = \frac{P(B)}{P(B)} = 1$$

(see Fig. 4.3(b)).

Fig. 4.3: (a) the unconditional distribution $F_X(x)$; (b) the conditional distribution $F_X(x \mid B)$.


Example 4.5: Given $F_X(x)$, suppose $B = \{X(\xi) \le a\}$. Find $f_X(x \mid B)$.

Solution: We will first determine $F_X(x \mid B)$. From (4-11), with B as given above, we have

$$F_X(x \mid B) = \frac{P\{(X \le x) \cap (X \le a)\}}{P(X \le a)}. \tag{4-18}$$

For $x < a$, $(X \le x) \cap (X \le a) = (X \le x)$, so that

$$F_X(x \mid B) = \frac{P(X \le x)}{P(X \le a)} = \frac{F_X(x)}{F_X(a)}. \tag{4-19}$$

For $x \ge a$, $(X \le x) \cap (X \le a) = (X \le a)$, so that $F_X(x \mid B) = 1$. Thus

$$F_X(x \mid B) = \begin{cases} \dfrac{F_X(x)}{F_X(a)}, & x < a, \\[2mm] 1, & x \ge a, \end{cases} \tag{4-20}$$

and hence

$$f_X(x \mid B) = \frac{d}{dx} F_X(x \mid B) = \begin{cases} \dfrac{f_X(x)}{F_X(a)}, & x < a, \\[2mm] 0, & \text{otherwise.} \end{cases} \tag{4-21}$$

Fig. 4.4: (a) $F_X(x)$ and the conditional distribution $F_X(x \mid B)$, which reaches 1 at $x = a$; (b) $f_X(x)$ and the conditional density $f_X(x \mid B)$, which vanishes for $x \ge a$.

Example 4.6: Let B represent the event $\{a < X(\xi) \le b\}$ with $b > a$. For a given $F_X(x)$, determine $F_X(x \mid B)$ and $f_X(x \mid B)$.

Solution:

$$F_X(x \mid B) = P\{X(\xi) \le x \mid B\} = \frac{P\{(X(\xi) \le x) \cap (a < X(\xi) \le b)\}}{P(a < X(\xi) \le b)}
= \frac{P\{(X(\xi) \le x) \cap (a < X(\xi) \le b)\}}{F_X(b) - F_X(a)}. \tag{4-22}$$

For $x < a$, we have $\{X(\xi) \le x\} \cap \{a < X(\xi) \le b\} = \emptyset$, and hence

$$F_X(x \mid B) = 0. \tag{4-23}$$

For $a \le x < b$, we have $\{X(\xi) \le x\} \cap \{a < X(\xi) \le b\} = \{a < X(\xi) \le x\}$, and hence

$$F_X(x \mid B) = \frac{P(a < X(\xi) \le x)}{F_X(b) - F_X(a)} = \frac{F_X(x) - F_X(a)}{F_X(b) - F_X(a)}. \tag{4-24}$$

For $x \ge b$, we have $\{X(\xi) \le x\} \cap \{a < X(\xi) \le b\} = \{a < X(\xi) \le b\}$, so that

$$F_X(x \mid B) = 1. \tag{4-25}$$

Using (4-23)-(4-25), we get (see Fig. 4.5)

$$f_X(x \mid B) = \begin{cases} \dfrac{f_X(x)}{F_X(b) - F_X(a)}, & a < x \le b, \\[2mm] 0, & \text{otherwise.} \end{cases} \tag{4-26}$$

Fig. 4.5: the conditional density $f_X(x \mid B)$ compared with $f_X(x)$: the density is confined to $(a, b]$ and renormalized.
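A short sketch of (4-26) (my own illustration, taking $f_X$ to be the standard normal density): conditioning on $\{a < X \le b\}$ truncates the density to $(a, b]$ and rescales it by $F_X(b) - F_X(a)$.

```python
from math import erf, exp, pi, sqrt

def f(x: float) -> float:
    """Standard normal density, playing the role of f_X(x)."""
    return exp(-x * x / 2) / sqrt(2 * pi)

def F(x: float) -> float:
    """Standard normal distribution function F_X(x)."""
    return 0.5 * (1 + erf(x / sqrt(2)))

def f_cond(x: float, a: float, b: float) -> float:
    """Conditional density f_X(x | a < X <= b) from (4-26)."""
    return f(x) / (F(b) - F(a)) if a < x <= b else 0.0

# Midpoint-rule check: the conditional density integrates to ~1 over (a, b].
a, b, m = -1.0, 2.0, 10_000
h = (b - a) / m
print(sum(f_cond(a + (i + 0.5) * h, a, b) for i in range(m)) * h)  # ~1.0
```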

We can use the conditional p.d.f. together with Bayes' theorem to update our a-priori knowledge about the probability of events in the presence of new observations. Ideally, any new information should be used to update our knowledge. As we see in the next example, the conditional p.d.f. together with Bayes' theorem allows systematic updating. For any two events A and B, Bayes' theorem gives

$$P(A \mid B) = \frac{P(B \mid A)\, P(A)}{P(B)}. \tag{4-27}$$

Let $B = \{x_1 < X(\xi) \le x_2\}$, so that (4-27) becomes (see (4-13) and (4-17))

$$P\{A \mid (x_1 < X(\xi) \le x_2)\} = \frac{P((x_1 < X(\xi) \le x_2) \mid A)\, P(A)}{P(x_1 < X(\xi) \le x_2)}
= \frac{F_X(x_2 \mid A) - F_X(x_1 \mid A)}{F_X(x_2) - F_X(x_1)}\, P(A)
= \frac{\int_{x_1}^{x_2} f_X(x \mid A)\, dx}{\int_{x_1}^{x_2} f_X(x)\, dx}\, P(A). \tag{4-28}$$

Further, let $x_1 = x$ and $x_2 = x + \varepsilon$, $\varepsilon > 0$, so that in the limit as $\varepsilon \to 0$,

$$\lim_{\varepsilon \to 0} P\{A \mid (x < X(\xi) \le x + \varepsilon)\} = P(A \mid X(\xi) = x) = \frac{f_X(x \mid A)}{f_X(x)}\, P(A), \tag{4-29}$$

or

$$f_{X|A}(x \mid A) = \frac{P(A \mid X = x)\, f_X(x)}{P(A)}. \tag{4-30}$$

From (4-30), we also get

$$P(A) \int_{-\infty}^{+\infty} f_X(x \mid A)\, dx = \int_{-\infty}^{+\infty} P(A \mid X = x)\, f_X(x)\, dx, \tag{4-31}$$

or

$$P(A) = \int_{-\infty}^{+\infty} P(A \mid X = x)\, f_X(x)\, dx, \tag{4-32}$$

and using this in (4-30), we get the desired result

$$f_{X|A}(x \mid A) = \frac{P(A \mid X = x)\, f_X(x)}{\int_{-\infty}^{+\infty} P(A \mid X = x)\, f_X(x)\, dx}. \tag{4-33}$$

To illustrate the usefulness of this formulation, let us reexamine the coin tossing problem.

Example 4.7: Let p = P(H) represent the probability of obtaining a head in a toss. For a given coin, a-priori p can possess any value in the interval (0, 1). In the absence of any additional information, we may assume the a-priori p.d.f. $f_P(p)$ to be a uniform distribution in that interval (Fig. 4.6).

Fig. 4.6: the uniform a-priori density $f_P(p)$ on (0, 1).

Now suppose we actually perform an experiment of tossing the coin n times, and k heads are observed. This is new information. How can we update $f_P(p)$?

Solution: Let A = "k heads in n specific tosses." Since these tosses result in a specific sequence,

$$P(A \mid P = p) = p^k q^{n-k}, \tag{4-34}$$

and using (4-32) we get

$$P(A) = \int_0^1 P(A \mid P = p)\, f_P(p)\, dp = \int_0^1 p^k (1-p)^{n-k}\, dp = \frac{(n-k)!\, k!}{(n+1)!}. \tag{4-35}$$

The a-posteriori p.d.f. $f_{P|A}(p \mid A)$ represents the updated information given the event A, and from (4-30)

$$f_{P|A}(p \mid A) = \frac{P(A \mid P = p)\, f_P(p)}{P(A)} = \frac{(n+1)!}{(n-k)!\, k!}\, p^k q^{n-k}, \qquad 0 < p < 1, \tag{4-36}$$

which is a $\beta(n, k)$ density (Fig. 4.7).

Fig. 4.7: the a-posteriori density $f_{P|A}(p \mid A)$, peaked around $p = k/n$.

Notice that the a-posteriori p.d.f. of p in (4-36) is not a uniform distribution, but a beta distribution. We can use this a-posteriori p.d.f. to make further predictions. For example, in the light of the above experiment, what can we say about the probability of a head occurring in the next, (n+1)th, toss?

Let B = "a head occurring in the (n+1)th toss, given that k heads have occurred in the n previous tosses." Clearly $P(B \mid P = p) = p$, and from (4-32),

$$P(B) = \int_0^1 P(B \mid P = p)\, f_{P|A}(p \mid A)\, dp. \tag{4-37}$$

Notice that unlike (4-32), we have used the a-posteriori p.d.f. in (4-37) to reflect our knowledge about the experiment already performed. Using (4-36) in (4-37), we get

$$P(B) = \int_0^1 p \cdot \frac{(n+1)!}{(n-k)!\, k!}\, p^k q^{n-k}\, dp = \frac{k+1}{n+2}. \tag{4-38}$$

Thus, if n = 10 and k = 6, then

$$P(B) = \frac{7}{12} \simeq 0.58,$$

which is more realistic compared to p = 0.5.
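The update (4-36)-(4-38) is easy to reproduce numerically (a sketch with my own function name; the closed form above is exact, so the grid integration is only for illustration):

```python
def prob_next_head(n: int, k: int, m: int = 100_000) -> float:
    """P(head on toss n+1 | k heads in n tosses), uniform prior on p.

    Integrates p * f_{P|A}(p | A) dp on a grid; exactly (k+1)/(n+2)."""
    dp = 1.0 / m
    ps = [(i + 0.5) * dp for i in range(m)]
    lik = [p**k * (1 - p) ** (n - k) for p in ps]  # (4-34), uniform prior
    norm = sum(lik) * dp                           # P(A), as in (4-35)
    return sum(p * l for p, l in zip(ps, lik)) * dp / norm

print(prob_next_head(10, 6))   # ~0.5833
print((6 + 1) / (10 + 2))      # exact value 7/12 from (4-38)
```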

To summarize, if the probability of an event X is unknown, one should make a noncommittal judgement about its a-priori probability density function $f_X(x)$. Usually the uniform distribution is a reasonable assumption in the absence of any other information. Then experimental results (A) are obtained, and our knowledge about X must be updated to reflect this new information. Bayes' rule helps to obtain the a-posteriori p.d.f. of X given A. From that point on, this a-posteriori p.d.f. $f_{X|A}(x \mid A)$ should be used to make further predictions and calculations.

Stirling's Formula: What is it?

Stirling's formula gives an accurate approximation for n! as follows:

$$n! \sim \sqrt{2\pi}\; n^{n + \frac12}\, e^{-n}, \tag{4-39}$$

in the sense that the ratio of the two sides in (4-39) is near one; i.e., their relative error is small, or the percentage error decreases steadily as n increases. The approximation is remarkably accurate even for small n. Thus 1! = 1 is approximated as $\sqrt{2\pi}/e \simeq 0.9221$, and 3! = 6 is approximated as 5.836.

Prior to Stirling's work, DeMoivre had established the same formula (4-39) in connection with binomial distributions in probability theory. However, DeMoivre did not establish the constant term $\sqrt{2\pi}$ in (4-39); that was done by James Stirling ($\simeq$ 1730).
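A short numerical check of (4-39) (sketch):

```python
from math import exp, factorial, pi, sqrt

def stirling(n: int) -> float:
    """Stirling's approximation (4-39): sqrt(2 pi) n^(n + 1/2) e^-n."""
    return sqrt(2 * pi) * n ** (n + 0.5) * exp(-n)

for n in (1, 3, 10, 20):
    print(n, factorial(n), stirling(n), stirling(n) / factorial(n))
# The ratio climbs toward 1: ~0.9221 at n=1, ~0.9727 at n=3, ~0.9917 at n=10.
```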
How to prove it?

We start with a simple observation: the function log x is a monotone increasing function, and hence we have

$$\int_{k-1}^{k} \log x\, dx < \log k < \int_{k}^{k+1} \log x\, dx.$$

Summing over k = 1, 2, ..., n we get

$$\int_{0}^{n} \log x\, dx < \log n! < \int_{1}^{n+1} \log x\, dx,$$

or, since $\int \log x\, dx = x \log x - x$,

$$n \log n - n < \log n! < (n+1)\log(n+1) - n. \tag{4-40}$$

The double inequality in (4-40) clearly suggests that log n! is close to the arithmetic mean of the two extreme members there. However the actual arithmetic mean is complicated and involves several terms. Since $(n + \frac12)\log n - n$ is quite close to the above arithmetic mean, we consider the difference¹

$$a_n \;\triangleq\; \log n! - \left[(n + \tfrac12)\log n - n\right]. \tag{4-41}$$

This gives

$$a_n - a_{n+1} = \log n! - (n + \tfrac12)\log n + n - \log(n+1)! + (n + \tfrac32)\log(n+1) - (n+1)$$
$$= -\log(n+1) - (n + \tfrac12)\log n + (n + \tfrac32)\log(n+1) - 1
= (n + \tfrac12)\log\frac{n+1}{n} - 1 = (n + \tfrac12)\log\frac{1 + \frac{1}{2n+1}}{1 - \frac{1}{2n+1}} - 1,$$

where we have used $\frac{n+1}{n} = \frac{n + \frac12 + \frac12}{n + \frac12 - \frac12}$.

¹According to W. Feller, this clever idea of using the approximate mean $(n + \frac12)\log n - n$ is due to H.E. Robbins, and it leads to an elementary proof.

Hence¹

$$a_n - a_{n+1} = (n + \tfrac12)\log\frac{1 + \frac{1}{2n+1}}{1 - \frac{1}{2n+1}} - 1
= 2(n + \tfrac12)\left[\frac{1}{2n+1} + \frac{1}{3(2n+1)^3} + \frac{1}{5(2n+1)^5} + \cdots\right] - 1$$
$$= \frac{1}{3(2n+1)^2} + \frac{1}{5(2n+1)^4} + \frac{1}{7(2n+1)^6} + \cdots \;>\; 0. \tag{4-42}$$

Thus $\{a_n\}$ is a monotone decreasing sequence, and we let c represent its limit, i.e.,

$$\lim_{n \to \infty} a_n = c. \tag{4-43}$$

From (4-41), as $n \to \infty$ this is equivalent to

$$n! \sim e^c\; n^{n + \frac12}\, e^{-n}. \tag{4-44}$$

To find the constant term c in (4-44), we can make use of a formula due to Wallis ($\simeq$ 1655).

¹By Taylor series expansion,

$$\log(1 + x) = x - \tfrac12 x^2 + \tfrac13 x^3 - \tfrac14 x^4 + \cdots, \qquad \log\frac{1}{1-x} = x + \tfrac12 x^2 + \tfrac13 x^3 + \tfrac14 x^4 + \cdots.$$

The well known function $\frac{\sin x}{x} = \operatorname{sinc} x$ goes to zero at $x = n\pi$; moreover, these are the only zeros of this function. Also $\frac{\sin x}{x}$ has no finite poles (all poles are at infinity). As a result we can write

$$\frac{\sin x}{x} = \left(1 - \frac{x^2}{\pi^2}\right)\left(1 - \frac{x^2}{4\pi^2}\right)\cdots\left(1 - \frac{x^2}{n^2\pi^2}\right)\cdots$$

or

$$\sin x = x\left(1 - \frac{x^2}{\pi^2}\right)\left(1 - \frac{x^2}{4\pi^2}\right)\cdots\left(1 - \frac{x^2}{n^2\pi^2}\right)\cdots$$

[for a proof of this formula, see chapter 4 of Dienes, The Taylor Series], which for $x = \pi/2$ gives the Wallis formula

$$1 = \frac{\pi}{2}\left(1 - \frac{1}{2^2}\right)\left(1 - \frac{1}{4^2}\right)\cdots\left(1 - \frac{1}{(2n)^2}\right)\cdots
= \frac{\pi}{2}\left(\frac{1 \cdot 3}{2 \cdot 2}\right)\left(\frac{3 \cdot 5}{4 \cdot 4}\right)\left(\frac{5 \cdot 7}{6 \cdot 6}\right)\cdots\left(\frac{(2n-1)(2n+1)}{(2n)(2n)}\right)\cdots$$

Hence

$$\frac{\pi}{2} = \left(\frac{2^2}{1 \cdot 3}\right)\left(\frac{4^2}{3 \cdot 5}\right)\left(\frac{6^2}{5 \cdot 7}\right)\cdots\left(\frac{(2n)^2}{(2n-1)(2n+1)}\right)\cdots
= \lim_{n \to \infty}\left(\frac{2 \cdot 4 \cdot 6 \cdots 2n}{1 \cdot 3 \cdot 5 \cdots (2n-1)}\right)^2 \frac{1}{2n+1}.$$

Thus as $n \to \infty$ this gives

$$\sqrt{\pi} \simeq \frac{2 \cdot 4 \cdot 6 \cdots 2n}{1 \cdot 3 \cdot 5 \cdots (2n-1)}\, \frac{1}{\sqrt{n + \frac12}}
= \frac{(2 \cdot 4 \cdot 6 \cdots 2n)^2}{(2n)!}\, \frac{1}{\sqrt{n + \frac12}}
= \frac{2^{2n}\, (n!)^2}{(2n)!}\, \frac{1}{\sqrt{n + \frac12}},$$

so that as $n \to \infty$,

$$\log\sqrt{\pi} = 2n\log 2 + 2\log n! - \log(2n)! - \tfrac12\log(n + \tfrac12). \tag{4-45}$$

But from (4-41) and (4-43),

$$\lim_{n \to \infty} \log n! = c + \lim_{n \to \infty}\left\{(n + \tfrac12)\log n - n\right\}, \tag{4-46}$$

and hence letting $n \to \infty$ in (4-45) and making use

of (4-46), we get

$$\log\sqrt{\pi} = c - \tfrac12\log 2 - \lim_{n \to \infty} \tfrac12\log\left(1 + \tfrac{1}{2n}\right) = c - \tfrac12\log 2,$$

which gives

$$e^c = \sqrt{2\pi}. \tag{4-47}$$

With (4-47) in (4-44) we obtain (4-39), and this proves Stirling's formula.
Upper and Lower Bounds

It is possible to obtain reasonably good upper and lower bounds for n! by elementary reasoning as well. To see this, note that from (4-42) we get

$$a_n - a_{n+1} < \frac13\left[\frac{1}{(2n+1)^2} + \frac{1}{(2n+1)^4} + \cdots\right] = \frac{1}{3\left[(2n+1)^2 - 1\right]} = \frac{1}{12n(n+1)} = \frac{1}{12n} - \frac{1}{12(n+1)},$$

so that $\left\{a_n - \frac{1}{12n}\right\}$ is a monotonically increasing

sequence whose limit is also c. Hence for any finite n,

$$a_n - \frac{1}{12n} < c \quad \Longrightarrow \quad a_n < c + \frac{1}{12n},$$

and together with (4-41) and (4-47) this gives

$$n! < \sqrt{2\pi}\; n^{n + \frac12}\, e^{-n}\, e^{1/12n}. \tag{4-48}$$

Similarly from (4-42) we also have

$$a_n - a_{n+1} > \frac{1}{3(2n+1)^2} > \frac{1}{12n+1} - \frac{1}{12(n+1)+1} > 0,$$

so that $\left\{a_n - \frac{1}{12n+1}\right\}$ is a monotone decreasing sequence whose limit also equals c. Hence for any finite n,

$$a_n - \frac{1}{12n+1} > c \quad \Longrightarrow \quad a_n > c + \frac{1}{12n+1},$$

or

$$n! > \sqrt{2\pi}\; n^{n + \frac12}\, e^{-n}\, e^{1/(12n+1)}. \tag{4-49}$$

Together, (4-48) and (4-49) give

$$\sqrt{2\pi}\; n^{n + \frac12}\, e^{-n}\, e^{1/(12n+1)} \;<\; n! \;<\; \sqrt{2\pi}\; n^{n + \frac12}\, e^{-n}\, e^{1/12n}. \tag{4-50}$$
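The sandwich (4-50) is tight enough to verify by direct evaluation (sketch):

```python
from math import exp, factorial, pi, sqrt

def stirling_bounds(n: int) -> tuple[float, float]:
    """Lower and upper bounds on n! from (4-50)."""
    base = sqrt(2 * pi) * n ** (n + 0.5) * exp(-n)
    return base * exp(1 / (12 * n + 1)), base * exp(1 / (12 * n))

for n in (1, 5, 10):
    lo, hi = stirling_bounds(n)
    assert lo < factorial(n) < hi
    print(n, lo, factorial(n), hi)   # e.g. n=5: 119.97 < 120 < 120.003
```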

Stirling's formula also follows from the asymptotic expansion for the Gamma function, given by

$$\Gamma(x) = \sqrt{2\pi}\; e^{-x}\, x^{x - \frac12}\left[1 + \frac{1}{12x} + \frac{1}{288x^2} - \frac{139}{51840x^3} + O\!\left(\frac{1}{x^4}\right)\right]. \tag{4-51}$$

Together with $\Gamma(x+1) = x\,\Gamma(x)$, the above expansion can be used to compute numerical values for real x. For a derivation of (4-51), one may look into Chapter 12 of the classic text by Whittaker and Watson (Modern Analysis).

We can use Stirling's formula to obtain yet another approximation to the binomial probability mass function.

Since

$$\binom{n}{k}\, p^k q^{n-k} = \frac{n!}{(n-k)!\,k!}\, p^k q^{n-k}, \tag{4-52}$$

using (4-50) on the right side of (4-52) we obtain

$$\binom{n}{k}\, p^k q^{n-k} > c_1 \sqrt{\frac{n}{2\pi (n-k)k}} \left(\frac{np}{k}\right)^{k}\left(\frac{nq}{n-k}\right)^{n-k}$$

and

$$\binom{n}{k}\, p^k q^{n-k} < c_2 \sqrt{\frac{n}{2\pi (n-k)k}} \left(\frac{np}{k}\right)^{k}\left(\frac{nq}{n-k}\right)^{n-k},$$

where

$$c_1 = \exp\left\{\frac{1}{12n+1} - \frac{1}{12(n-k)} - \frac{1}{12k}\right\}, \qquad
c_2 = \exp\left\{\frac{1}{12n} - \frac{1}{12(n-k)+1} - \frac{1}{12k+1}\right\}.$$

Notice that the constants $c_1$ and $c_2$ are quite close to each other.
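These bounds are easy to evaluate (a sketch; the function name is my own):

```python
from math import comb, exp, pi, sqrt

def binomial_pmf_bounds(k: int, n: int, p: float) -> tuple[float, float]:
    """Lower/upper bounds on C(n,k) p^k q^(n-k) obtained from (4-50)."""
    q = 1.0 - p
    core = (sqrt(n / (2 * pi * (n - k) * k))
            * (n * p / k) ** k * (n * q / (n - k)) ** (n - k))
    c1 = exp(1 / (12 * n + 1) - 1 / (12 * (n - k)) - 1 / (12 * k))
    c2 = exp(1 / (12 * n) - 1 / (12 * (n - k) + 1) - 1 / (12 * k + 1))
    return c1 * core, c2 * core

lo, hi = binomial_pmf_bounds(40, 100, 0.4)
exact = comb(100, 40) * 0.4**40 * 0.6**60
assert lo < exact < hi
print(lo, exact, hi)   # the three values agree to about five decimal places
```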
