Академический Документы
Профессиональный Документы
Культура Документы
BUSINESS
STATISTICS
by
AMIR D. ACZEL
&
JAYAVEL SOUNDERPANDIAN
7th edition.
Prepared by Lloyd Jaisingh, Morehead State
University
Chapter 16
Sampling Methods
McGraw-Hill/Irwin
16-2
16 Sampling Methods
Using Statistics
Nonprobability Sampling and Bias
Stratified Random Sampling
Cluster Sampling
Systematic Sampling
Nonresponse
16-3
16 LEARNING OBJECTIVES
After studying this chapter you should be able to:
Apply nonprobability sampling methods
Decide when to conduct a stratified sampling method
Compute estimates from stratified sample results
Decide when to conduct a cluster sampling method
16-4
16-5
16-6
Population
Stratum 1
N1
Stratum 2
N2
N N
i
N i i N
Stratum m
Nm
i 1
16-7
Instratified
stratifiedrandom
randomsampling,
sampling,
weassume
assumethat
thatthe
thepopulation
populationof
ofNNunits
unitsmay
may
sampling we
In
sampling
bedivided
dividedinto
intommgroups
groupswith
withNNiunits
unitsinineach
eachgroup
groupi=1,2,...,m.
i=1,2,...,m. The
Themmstrata
strata
be
i
arenonoverlapping
nonoverlappingand
andtogether
togetherthey
theymake
makeup
upthe
thetotal
totalpopulation:
population:NN1++NN2
are
1
2
+...+NNm=N.
=N.
+...+
m
Nii
nii
Population Distribution
Group
Group
Sample Distribution
proportionalallocation,
allocation,
therelative
relativefrequencies
frequenciesininthe
thesample
sample(n
(n/n)
arethe
thesame
same
allocation the
i/n)are
InInproportional
allocation
i
asthose
thoseininthe
thepopulation
population(N
(N/N)
i/N). .
as
i
nni
Samplingfraction
fractionin
instratum
stratumi i: :f f i
Sampling
i i nn
Truemean
meanof
ofpopulation
population: :
True
Truemean
meanin
instratum
stratumi i: :
True
ii
22
True
variance
of
the
population
:
Truevariance
varianceof
ofstratum
stratumi i:
: 22
True
ii
Samplemean
meanin
instratum
stratumi i: :XX
Sample
ii
22
Sample
variance
in
stratum
i
:
s
Sample variance in stratum i : s i
i
Theestimator
estimatorof
ofthe
thepopulation
populationmean
meanin
instratified
stratifiedrandom
randomsampling
sampling::
The
mm
X
W XX
X st W
i11 ii ii
st i
16-8
VV((XXstst))==
i=1 i nn
i=1
ii
ii
Whenthe
thesampling
samplingfractions,
fractions, f f , , are
aresmall
smalland
andmay
maybe
beignored,
ignored, we
wehave:
have:
When
ii
2
mm 22 2 i i
WWi
VV((XXstst))==
i=1 i nn
i=1
ii
16-9
NN i
thesample
sampleallocation
allocationisisproportional
proportional nn nn i for
then
4.4. IfIfthe
i i NN forallalli i , , then
1--ff mm
1
22 i
V
(
X
)
=
W
V ( X stst) = n
Wi i i
i=1
n i=1
whichreduces
reducestoto
which
1 mm
1
22 i
V
(
X
)
=
W
V ( X stst) = n
Wi i i
i=1
n i=1
whenthe
thesampling
samplingfraction
fractionisissmall.
small.
when
addition, ififthe
thepopulation
populationvariances
variancesininall
allstrata
strataare
areequal,
equal, then
then
InInaddition,
22
VV((XXstst))== n
n
whenthe
thesampling
samplingfraction
fractionisissmall.
small.
when
16-10
m Wi iS i
2
m
2
i (1(1 f f))
SS ((XX st))==
st
i=11 nn
i=
16-11
st
2
Whenthe
thesample
samplesizes
sizesare
aresmall,
small,and
andthe
thepopulation
populationvariances
variancesare
areunknown,
unknown,
When
usethe
thet t--value
valueininthe
theabove
aboveformula.
formula.
use
Theeffective
effectivedegrees
degreesof
offreedom
freedom: :
The
2
2 2
2
s
m
s
m
N ( N n ) ii
N (N n )
i i==11 i i i i i i nni
i
Effective
df
=
Effective df =
2
N ( N n )/n 2 s44
N i( N i n i)/n i s i
mm
i i
i i
i
i i11
((nni11))
i
16-12
16-13
Example 16-2
Population
Population
Group
Group
Diversifiedservice
servicecompanies
companies
1.1.Diversified
Commercialbanking
bankingcompanies
companies
2.2.Commercial
Financialservice
servicecompanies
companies
3.3.Financial
Retailingcompanies
companies
4.4.Retailing
Transportationcompanies
companies
5.5.Transportation
Utilities
6.6.Utilities
True
True
Weights
Weights
(W)i)
(W
i
0.20
0.20
0.20
0.20
0.30
0.30
0.10
0.10
0.10
0.10
0.10
0.10
Number
Number
Firms
ofofFirms
100
100
100
100
150
150
50
50
50
50
50
50
500
NN==500
1 f
Stratum
1
2
3
4
5
6
Mean
52.7
112.6
85.6
12.6
8.9
52.3
Variance
ni
Wi
97650
20
0.2
64300
20
0.2
76990
30
0.3
18320
10
0.1
9037
10
0.1
83500
10
0.1
Estimated Mean:
66.12
Estimated standard error of mean: 23.08
Wixi
10.54
22.52
25.68
1.26
0.89
5.23
532.582
2
W
si
i
n
156.240
102.880
184.776
14.656
7.230
66.800
Sample
Sample
Sizes
Sizes
20
20
20
20
30
30
10
10
10
10
10
10
100
nn==100
Sampling
Sampling
Fraction
Fraction
(f(fi)i)
0.20
0.20
0.20
0.20
0.30
0.30
0.10
0.10
0.10
0.10
0.10
0.10
95%Confdence
ConfdenceInterval:
Interval:
95%
xx st zz ss((XX st))
st
st
22
66.12
.12(1(1.96
.96)()(23
23.08
.08))
66
66.12
.1245
45.24
.24
66
20.88
.88,111
,111.36
.36]]
[[20
16-14
WWiPPi
PP
stst i
i 11 i i
Theapproximate
approximatevariance
varianceofofPPst, ,
The
st
m 2 PPiQQi
m
V(PPst)) WW2i i i
V(
st
i i11 i nni i
Whenthe
thefinite
finite--population
populationcorrection
correctionfactors,
factors, f fi, , must
mustbe
beconsidered:
considered:
When
i
m 2
PiQQi
1
m
P
i i
V(PPst)) 1 NN2 ((NN nn ))
V(
st
2i i11 i i i i i i ((NN 11))nni
2
N
i
N
ii
Whenproportional
proportionalallocation
allocationisisused:
used:
When
1 f f mm
1
WWi PPiQQi
V(PPst))
V(
st
i 11 i i i
nn i
16-15
Number
Wi
ni fi
Interested
0.65 130 0.65
28
0.35
70 0.35
18
Estimated proportion:
Estimated standard error:
Wp
i i
st
2
0.14
0.09
0.23
W
i pi qi
n
0.0005756
0.0003099
0.0008855
0.0297574
16-16
16-17
16-18
Frequency (fi)
1
16
25
4
9
f(x)
1
4
5
2
3
Cum f(x)
5
5
5
16-19
Optimum Allocation
Foroptimum
optimumallocation
allocationofofeffort
effortininstratified
stratifiedrandom
randomsampling,
sampling, minimize
minimizethe
the
For
costfor
foraagiven
givenvariance,
variance, ororminimize
minimizethe
thevariance
variancefor
foraagiven
givencost.
cost.
cost
TotalCost
Cost == Fixed
FixedCost
Cost ++ Variable
VariableCost
Cost
Total
CC== CC 0CC inn i
0
i i
(W ))/ / CC
nn i (W
i ii
i
i
i
i
Optimum
Allocation:
Optimum Allocation:
(W ))/ / CC
nn (W
ii ii
ii
thecost
costper
perunit
unitsampled
sampledisisthe
thesame
samefor
forall
allstrata
strata(C
(C ==c):
c):
IfIfthe
i
i
NeymanAllocation:
Allocation:
Neyman
(W ))
nn i (W
i
ii ii
(W ))
nn (W
ii ii
16-20
ii
Wi i
W
ssii
11
22
33
0.4
0.4
0.5
0.5
0.1
0.1
11
22
33
CCii
44
99
16
16
Wss
W
i ii
i
W
s
Wi si ii
CC i
i
0.4
0.4
1.0
1.0
0.3
0.3
1.7
1.7
0.200
0.200
0.333
0.333
0.075
0.075
0.608
0.608
Optimum Neyman
Neyman
Optimum
Allocation Allocation
Allocation
Allocation
0.329
0.329
0.548
0.548
0.123
0.123
0.235
0.235
0.588
0.588
0.176
0.176
16-21
16-22
Group
Population Distribution
Sample
Sample Distribution
Distribution
Instratified
stratifiedsampling
samplingaa
In
randomsample
sample(n
(n)i)isis
random
i
chosenfrom
fromeach
eachsegment
segment
chosen
ofthe
thepopulation
population(N
(N).i).
of
i
Incluster
clustersampling
sampling
In
observationsare
aredrawn
drawnfrom
from mm
observations
outof
ofM
Mareas
areasor
orclusters
clustersof
ofthe
the
out
population.
population.
nni iXXi i
i i
11
X
X clcl mm
nni i
i i
11
Estimatorof
ofthe
thevariance
varianceof
ofthe
thesample
samplemean:
mean:
Estimator
mm
s
(
X
)
s ( X clcl)
22
where
where
M mm
M
2
2
Mmn
Mmn
mm
nn
i 1
nn == i 1m
m
ii
nn ((XX XX ))
i i
11
22
ii
ii
mm11
clcl
22
16-23
n
i
P
i i
i
PPclcl i 1mm
nni i
i i
11
Estimatorof
ofthe
thevariance
varianceof
ofthe
thesample
sampleproportion:
proportion:
Estimator
mm
22 (
22
n
P
P
)
ni i ( Pi i Pclcl)
2
M
m
i
1
ss2 ((PPclcl)) Mmn22 i 1 m 1
m1
Mmn
16-24
16-25
ni
ni x i
xi-xcl
(xi-xcl)2
21
22
11
34
28
25
18
24
19
20
30
26
12
17
13
29
24
26
18
22
8
8
9
10
7
8
10
12
11
6
8
9
9
8
10
8
8
10
10
11
168
176
99
340
196
200
180
288
209
120
240
234
108
136
130
232
192
260
180
242
-0.8333
0.1667
-10.8333
12.1667
6.1667
3.1667
-3.8333
2.1667
-2.8333
-1.8333
8.1667
4.1667
-9.8333
-4.8333
-8.8333
7.1667
2.1667
4.1667
-3.8333
0.1667
0.694
0.028
117.361
148.028
38.028
10.028
14.694
4.694
8.028
3.361
66.694
17.361
96.694
23.361
78.028
51.361
4.694
17.361
14.694
0.028
xcl =
3930
21.83
s2(Xcl)=
2
2
M m ni ( X i X cl )
m1
Mmn 2
0.00118
0.00005
0.25269
0.39348
0.04953
0.01706
0.03906
0.01797
0.02582
0.00322
0.11346
0.03738
0.20819
0.03974
0.20741
0.08738
0.00799
0.04615
0.03906
0.00009
1.58691
95%Confdence
ConfdenceInterval:
Interval:
95%
xx zz ss((XX ))
clcl
clcl
22
21.83
.83(1(1.96
.96)()( 11.587
.587))
21
21.83
.8322.47
.47
21
[19.36
.36,24
,24.30
.30]]
[19
16-26
16-27
16-28
N n 2
sy ) Nn S
When the mean is constant within each stratum of k elements but different between strata :
n
( X i X ik )2
s2 ( X sy ) N n i1
2(n 1)
Nn
When the population is linearly increasing or decreasing with respect to the variable of interest :
n
( X i 2 X ik X i2k )2
s2 ( X sy ) N n i1
6(n 1)
Nn
16-29
0
.
5
s
X sy
sy
0.5
s 0.36
n
n
N nn 2 2100
100
2
N
2
2 2100100
s
(
X
)
S
.3600.0034
.0034
s ( X sy
sy) Nn S ( 2100)(100) 00.36
( 2100)(100)
Nn
95%confidence
confidenceinterval
intervalfor
forthe
theaverage
averageprice
pricechange
changefor
forall
allstocks:
stocks:
AA95%
.96))ss ((XX sy))
XXsy
sy ((11.96
sy
.96)()( 00.0034
.0034))
00.5.5((11.96
.114
00.5.500.114
.386, 0, 0.614
.614]]
[[00.386
16-30
16-6 Nonresponse
Systematic nonresponse can bias estimates
Callbacksof
ofnonrespondents
nonrespondents
Callbacks
Offersof
ofmonetary
monetaryrewards
rewardsfor
fornonrespondents
nonrespondents
Offers
Random-responsemechanism
mechanism
Random-response