Академический Документы
Профессиональный Документы
Культура Документы
and thus P (S1 ) = P (S2 ), i.e., the probability of success is the same in each trial. However,
3 4
P (S2 |S1 ) = 6= P (S2 ) = ,
9 10
1
Interestingly, if the population size N is large relative to the sample size n (and if r is
not too close to 0 or N ), then the dependence between draws is weak.
Example. Suppose in the last example, that N = 100 and r = 40, but still n = 2. Then
we still have P (S1 ) = 0.4 = P (S2 ), but now
39
P (S2 |S1 ) = = 0.393939
99
is must closer to P (S2 ) = 0.4. And if we increase N to 1000 and r to 400, then still
P (S1 ) = 0.4 = P (S2 ), while
399
P (S2 |S1 ) = = 0.399399399 ≈ P (S2 ) = 0.4.
999
With the dependence between draws becoming weaker as N grows larger, we might
expect that when N is large (relative to n), the hypergeometric distribution can be well
approximated by the binomial, and this is indeed the case, as demonstrated by the example
in Table 1 and the corresponding Figure 1.
r N −r N
y 5−y 5
N = 10 N = 20 N = 100 N = 1000
5
(.4)y (.6)5−y
y r=4 r=8 r = 40 r = 400 y
2
N = 10 r = 4 N = 20 r = 8
0.5
0.5
0.4
0.4
0.3
0.3
p(y)
p(y)
0.2
0.2
0.1
0.1
0.0
0.0
0 1 2 3 4 5 0 1 2 3 4 5
y y
0.5
0.4
0.4
0.3
0.3
p(y)
p(y)
0.2
0.2
0.1
0.1
0.0
0.0
0 1 2 3 4 5 0 1 2 3 4 5
y y