Академический Документы
Профессиональный Документы
Культура Документы
Basic Concepts.
Types of sampling.
Sampling Process
Sampling Error.
Sampling Distribution
Mean μ x
Standard deviation σ s
Proportion p p
No. of elements N n
Need for Sampling
• Reduces cost of research
• Partial data is all that is available, eg. fossils and historical records, climate
change.
Sampling Error
The sampling error is the difference between the value of the statistic and the
value of the parameter.
A researcher hopes to minimize the sampling error, but all samples have some
such error associated with them.
Sampling
Methods
Probability Non-
Samples probability
Simple
Systematic Random Convenience Snowball
Stratified
Cluster Judgment Quota
Random Sampling (OR)Probability Sampling
• A Probability Sampling is a method of sampling that ensures that every unit
in the population has a known non-zero chance of being selected. This
makes us to assess objectively the estimates of the population
characteristics that results from our sample.
• There are various ways of random selection ex., lottery method, random
tables, Computer generated random numbers, etc.
• There are different sampling methods:
• After the first unit is drawn record it and replace it in the population.
• Draw the second unit in the same way as the first unit . Continue this
procedure until all the required ( n ) units of the sample are selected.
• This is how we ensure that all elements have an equal probability being
included in the sample.
• After the first unit is drawn record it and do not replace it in the population.
Systematic Sampling
• In Systematic Random Sampling, the units are drawn from the
population at regular intervals clearly defined.
• Systematic Sampling differs from SRS in that all units have equal chance
of being selected but sample does not have equal chance of being
selected.
• The steps involved in Systematic Sampling are:
• In such situations Stratified Random Sampling technique is used. This involves dividing
the population into homogeneous subgroups (strata) and then taking a simple random
sample in each subgroup, thus ensuring representation from all relevant sub groups.
- Divide the population into non-overlapping groups (i.e., strata) N1, N2, N3, ... Ni,
such that N1 + N2 + N3 + ... + Ni = N.
- Divide the sample into i non overlapping groups: n1, n2, ……,ni.
• If the variances of the strata are not equal then disproportionate method
of stratified random sampling techniques shall be used.
• In this method the stratum which has more variance will have
proportionately more sampling units compared to stratum which have less
variance.
• The formula to compute the no of sampling units of the i th strata is given
by
ni = qi σi n
∑ qi σi
Where n = is th number of sampling units in ith stratum.
σi = is the variance of the ith stratumn.
qi = Ni/N
n = sample size
Stratified Sampling
Advantages
Assures representation of all groups in
sample population needed.
Characteristics of each stratum can be
estimated and comparisons made.
Reduces variability.
Disadvantages
Requires accurate information on
proportions of each stratum.
Stratified lists costly to prepare.
Cluster Sampling (or) Area Sampling
• It is for precisely this problem that cluster or area random sampling was
invented.
• We use stratified sampling when each strata has small variation within itself but
there is wide variation between stratas
Area sample:
Primary sampling unit is a
geographical area.
• This method involves gathering a set of people who have the knowledge and
expertise in certain key areas that are crucial for decision making.
3.Quota Sampling:
• In quota sampling, we select units non randomly according to some fixed quota.
• There are two types of quota sampling: proportional and non proportional.
• The mean of each sample is a random variable …with each mean varying
according to the laws of probability.
• The mean of each sample is a random variable …with each mean varying
according to the laws of probability.
We have a population.
• The central limit theorem describes the relation ship between the shape of
population distribution and the shape of sampling distribution of mean.
X or x-bar
Distribution
Caution: only applies to shape and not to the mean or standard deviation
x x x x x x x x x x x x x x x x
Population Distribution
Student’s t Distribution
If the population standard deviation, , is unknown, replace
with the sample standard deviation, s. If the population is
normal, the resulting statistic: t X
s/ n
has a t distribution with (n - 1) degrees of freedom.
• An SRS is not always practical to obtain, for instance for a highly diverse
population, or really large population.
• Stratified random sampling involves dividing population into homogeneous
subpopulations and drawing SRS from each one. This is useful when you
want to do statistics both on sub population and whole population.
• Multistage cluster sampling. “Tree structured” sampling, units are
different at each stage. Useful for sampling of large of populations.
e.g. 1) Draw states SRS of states
2) Draw SRS of districts from states
3)Draw SRS of mandals from districts
4) Draw of people from each mandal