Академический Документы
Профессиональный Документы
Культура Документы
4.1 Introduction
Definition: If a sample of size n is drawn from a population of size N such that
every possible sample of size n has the same chance of being selected, the
sampling procedure is called simple random sampling. The sample obtained is
called a simple random sample.
Student y x
1 1 0
2 0 1
3 0 1
.
.
.
98 0 1
99 0 1
100 1 1
100 100
∑ 𝑦𝑖 = 15 ∑ 𝑥𝑖 = 65
𝑖=1 𝑖=1
Sample size required to estimate p with a bound on the error of estimation B
𝑁𝑝𝑞
𝑛=
(𝑁 − 1)𝐷 + 𝑝𝑞
𝐵2
where = .
4
Type 1 lakes are oligotrophic (balanced between decaying vegetation and living
organism, Type 2 lakes are eutrophic (high decay rate and little oxygen), and Type
3 lakes are mesotrophic (between the other two states). The table also shows
whether the lake is formed behind a dam. The summary statistics are in the
following table:
Type Count Mean Median Standard deviation
1 4 0.22 0.20 0.103
2 15 0.74 0.68 0.583
3 16 0.50 0.44 0.272
a) Comparing lake Types 1 and 2, what is your best estimate of the difference in
the mean mercury levels for these 2 types of lakes?
b) Is there sufficient evidence to conclude that the mean mercury level for lakes
type2 differs from that for lakes of type 3?
Solution:
Notes: When comparing means, we consider only the independent sample case
because the dependent case becomes too complicated to handle at this level
For the two sample proportions arising from a multinomial sample of size n
𝐸(𝑝̂1 − 𝑝̂ 2 ) = 𝑝1 − 𝑝2
and
𝑉(𝑝̂1 − 𝑝̂ 2 ) = 𝑉(𝑝̂1 ) + 𝑉(𝑝̂2 ) − 2𝐶𝑜𝑣(𝑝̂1, 𝑝̂2 )
Example: The notion of banning smoking from the workplace has been around
from a long time. A time poll of 800 adults carried out on April 1994 asked:
‘Should smoking be banned from workplaces, should there be special smoking
area, or should there be no restrictions?’
The results are as follows:
Non-smokers (%) Smokers (%)
Banned 44 8
Special areas 52 80
No restrictions 3 11