Вы находитесь на странице: 1из 58

DR ANKITA CHATURVEDI

SAMPLING TECHNIQUES
CONTENTS :
 Introduction
 Parameter and statistics
 Sampling Errors
 Sampling Process
 Methods of Sampling
>>Probability Sampling
>>Non Probability Sampling
 INTRODUCTION

POPULATION

SAMPLE

STATISTICAL
INFERENCE
POPULATION
 The entire aggregation of items from which samples
can be drawn is known as a population.
 In sampling, the population may refer to the units,
from which the sample is drawn.
 Thus, population could consists of all the persons in
the country, or those in a particular geographical
location or a special economic group, depending on
the purpose and coverage of the study.
 This is one of the first things the analyst needs to
define properly while conducting a business
research.
 “N” represents the size of the population.
SAMPLING
 In simple words, sampling consists of obtaining
information from a portion of a larger group or an
universe. Elements are selected in a manner that they yield
almost all information about the whole universe, if and
when selected according to some scientific principles and
procedures.
CENSUS
A complete study of all the elements present in the
population is known as a census. The national population
census is an example of census survey
Merits
1. Data obtained from each and every unit of population.
2. Results: more representative, accurate, reliable.
3. Basis of various surveys.
Demerits
1. More effort ,money , time.
2. Big problem in underdeveloped countries.
SAMPLE
A Sample is a selection of units from the entire group called
the population or universe of interest. It is Subset of a
larger population
Merits
1. Less resources (time, money)
2. Less workload.
3. Gives results with known accuracy that can be
calculated mathematically.
Demerits
1. Difficulties of a representative sample
2. Need for specialized knowledge
3. Chances of bias
 The population criteria establish the target population;
that is, the entire set of cases about which the researcher
would like to make generalizations.

Statistical Inference
 Statistical inference is the process of drawing
conclusions about the entire population based on
information in a sample.
(Population) Parameters

 A parameter is a value used to describe a certain


characteristic of a population. It is usually unknown and
therefore has to be estimated.
 For example, the population mean is a parameter that is
often used to indicate the average/typical value of a
variable in the population.
(Sample) statistics

 A statistic is an estimate, based on a sample of observed


data, of a population parameter.
 The statistic is drawn from the measurement of the
elements of the sample.
 A statistic is used to estimate a parameter. The more
samples you have, the better your estimation.
Sampling Error

 Since a sample does not include all members of the


population, parameter estimates generally differ from
parameters on the entire population (e.g., use mean height
of a sample of 1000 people to estimate mean height of
Indian population).

 The difference between the (sample) statistics estimate


and the (population) parameter is sampling error.
SAMPLING DESIGN PROCESS
PROBABILITY
SAMPLING
PROBABILITY SAMPLING
 A probability sampling scheme is one in which every
unit in the population has a chance (greater than zero) of
being selected in the sample, and this probability can be
accurately determined.
 Universe is identified.
 . When every element in the population does have the
same probability of selection, this is known as an 'equal
probability of selection' (EPS) design. Such designs are
also referred to as 'self-weighting' because all sampled
units are given the same weight.
 Only Parametric Tests can be applied
 Results may be generalized
PROBABILITY SAMPLING
Probability sampling includes:
 Simple Random Sampling,
 Stratified Random Sampling,
 Cluster Sampling
 Systematic Sampling,
 Multistage Sampling.
SIMPLE RANDOM SAMPLING
• Applicable when population is small, homogeneous &
readily available
• All subsets of the frame are given an equal probability.
Each element of the frame thus has an equal probability of
selection.
• It provides for greatest number of possible samples. This
is done by assigning a number to each unit in the sampling
frame.
• A table of random number or lottery system is used to
determine which units are to be selected.
SIMPLE RANDOM SAMPLING

LOTTERY METHOD RANDOM NUMBER TABLE


SIMPLE RANDOM SAMPLING
Merits
 No personal bias.
 Sample more representative of population.
 Accuracy can be assessed as sampling errors follow
principals of chance.
Demerits
 If sampling frame large, this method impracticable
 Requires completely catalogued universe.
 Cases too widely dispersed - more time and cost.
STRATIFIED RANDOM SAMPLING
 Stratified random sampling is a method of probability
sampling in which the population is divided into different
subgroups and samples are selected from each of them.
 Applicable when population is large and heterogeneous
 Where population embraces a number of distinct
categories, the frame can be organized into separate
"strata." Each stratum is then sampled as an independent
sub-population, out of which individual elements can be
randomly selected.
 Every unit in a stratum has same chance of being selected.
STRATIFIED RANDOM SAMPLING
 Using same sampling fraction for all strata ensures
proportionate representation in the sample.
 Adequate representation of minority subgroups of interest
can be ensured by stratification & varying sampling
fraction between strata as required.
STRATIFIED RANDOM SAMPLING
Merits:
 It provides better coverage of the population since
the researchers have control over the subgroups to
ensure all of them are represented in the sampling.
 A stratified sample can provide greater precision than
a simple random sample of the same size.
 Because it provides greater precision, a stratified
sample often requires a smaller sample, which saves
money.
 A stratified sample can guard against an
"unrepresentative" sample (e.g., an all-male sample
from a mixed-gender population).
 We can ensure that we obtain sufficient sample points
to support a separate analysis of any subgroup.
STRATIFIED RANDOM SAMPLING
Demerits
 sampling frame of entire population has to be prepared
separately for each stratum
 when examining multiple criteria, stratifying variables
may be related to some, but not to others, further
complicating the design, and potentially reducing the
utility of the strata.
 in some cases (such as designs with a large number of
strata, or those with a specified minimum sample size per
group), stratified sampling can potentially require a larger
sample than would other methods
CLUSTER SAMPLING
 Cluster sampling is an example of 'two-stage
sampling' .
 First stage a sample of areas is chosen;
 Second stage a sample of respondents within those
areas is selected.
 Population divided into clusters of homogeneous units,
usually based on geographical contiguity.
 Sampling units are groups rather than individuals.
 A sample of such clusters is then selected.
 All units from the selected clusters are studied.
CLUSTER SAMPLING
Merits
 Cuts down on the cost of preparing a sampling frame.
 This can reduce travel and other administrative costs.
Demerit
 sampling error is higher for a simple random sample of
same size.
STRATIFIED CLUSTER SAMPLING
 Combines elements of stratification and clustering
 First we define the clusters
 Then we group the clusters into strata of clusters, putting
similar clusters together in a stratum
 Then we randomly pick one (or more) cluster from each of
the strata of clusters
 Then we sample the subjects within the sampled clusters
(either all the subjects, or a simple random sample of
them)
 Advantage: Reduce the error in cluster sampling by
creating strata of clusters
STRATIFICATION v/s CLUSTER
Stratification Clustering
 Divide population into  Divide population into
groups different from comparable groups:
each other: sexes, schools, cities
races, ages  Randomly sample
 Sample randomly from some of the groups
each group  More error compared to
 Less error compared to simple random
simple random  Reduces costs to
 More expensive to sample only some areas
obtain stratification or organizations
information before
sampling
With stratified sampling, the best survey results
occur when elements within strata are internally
homogeneous. However, with cluster sampling,
the best results occur when elements within
clusters are internally heterogeneous
SYSTEMATIC SAMPLING
 Systematic sampling relies on arranging the target
population according to some ordering scheme and
then selecting elements at regular intervals through
that ordered list.
 Systematic sampling involves a random start and then
proceeds with the selection of every kth element from
then onwards. In this case, k=(population size/sample
size).
 It is important that the starting point is not
automatically the first in the list, but is instead
randomly chosen from within the first to the kth
element in the list.
 A simple example would be to select every 10th name
from the telephone directory (an 'every 10th' sample,
also referred to as 'sampling with a skip of 10').
SYSTEMATIC SAMPLING
As described above, systematic sampling is an EPS method,
because all elements have the same probability of selection (in
the example given, one in ten). It is not 'simple random
sampling' because different subsets of the same size have
different selection probabilities - e.g. the set {4,14,24,...,994}
has a one-in-ten probability of selection, but the set
{4,13,24,34,...} has zero probability of selection.
SYSTEMATIC SAMPLING
Merits:
 Sample easy to select
 Suitable sampling frame can be identified easily
 Sample evenly spread over entire reference population
Demerits:
 Sample may be biased if hidden periodicity in
population coincides with that of selection.
 Difficult to assess precision of estimate from one
survey.
MULTISTAGE SAMPLING
 Complex form of cluster sampling in which two or
more levels of units are embedded one in the other.

 First stage, random number of districts chosen in all


states.

 Followed by random number of talukas, villages.

 Then third stage units will be houses.

 All ultimate units (houses, for instance) selected at


last step are surveyed.
MULTISTAGE SAMPLING
 This technique, is essentially the process of taking random
samples of preceding random samples.
 Not as effective as true random sampling, but probably
solves more of the problems inherent to random sampling.
 An effective strategy because it banks on multiple
randomizations. As such, extremely useful.
 Multistage sampling used frequently when a complete list
of all members of the population not exists and is
inappropriate.
 Moreover, by avoiding the use of all sample units in all
selected clusters, multistage sampling avoids the large,
and perhaps unnecessary, costs associated with traditional
cluster sampling.
IDENTIFY WHAT TYPE OF SAMPLING IS
THIS…
1. A student council surveys 100 students by getting
random samples of 25 I year, 25 II year, 25 III year,
and 25 PG
2. An airline company wants to survey its customers
one day, so they randomly select 5 flights that day
and survey every passenger on those flights
3. A teachers puts students' names in a hat and
chooses without looking to get a sample of
students.
4. A principal takes an alphabetized list of student
names and picks a random starting point. Every
20th student is selected to take a survey
5. Each student at a school has a student identification
number. Counselors have a computer generate 50
random identification numbers and those students
are asked to take a survey
6. A principal orders t-shirts and wants to check some
of them to make sure they were printed properly. She
randomly selects 2 of the 10 boxes of shirts and
checks every shirt in those 2 boxes.
7. A school chooses 3 randomly selected athletes from
each of its sports teams to participate in a survey
about athletics at the school.
Answers
1. Stratified Random Sampling
2. Cluster Sampling
3. Simple Random Sampling
4. Systematic Sampling
5. Simple Random Sampling
6. Cluster Sampling
7. Stratified Random Sampling
Non-PROBABILITY
SAMPLING
NON PROBABILITY SAMPLING
 Every element in the universe [sampling frame] does not
have equal probability of being chosen in the sample
 Universe is unidentified
 It involves the selection of elements based on assumptions
regarding the population of interest, which forms the
criteria for selection. Hence, because the selection of
elements is nonrandom, nonprobability sampling not
allows the estimation of sampling errors.
 Only Non- Parametric Tests can be applied.
 Results may not be generalized.
NON PROBABILITY SAMPLING
Non Probability sampling includes:

 Convenience Sampling
 Quota Sampling
 Judgmental Sampling (Purposive Sampling)
 Snowball sampling
CONVENIENCE SAMPLING
 Convenient sample units selected.
 Selected neither by probability nor by judgment.

Merit
 useful in pilot studies.

Demerit
 results usually biased and
 unsatisfactory.
QUOTA SAMPLING
 Most commonly used in non probability sampling.

 The population is first segmented into mutually exclusive


sub-groups, just as in stratified sampling.

 Then judgment used to select subjects or units from each


segment based on a specified proportion.
QUOTA SAMPLING
Merits
 Used when research budget is limited
 Very extensively used/understood
 No need for list of population elements
Disadvantages
 Variability and bias cannot be measured/controlled
 Time Consuming
 Projecting data beyond sample not justified
JUDGMENTAL SAMPLING
 Judgment/Purposive/Deliberate sampling.
 Depends exclusively on the judgment of investigator.
 Sample selected which investigator thinks to be most
typical of the universe.
 This is used primarily when there is a limited number of
people that have expertise in the area being researched
JUDGMENTAL SAMPLING
Merits
 Small no. of sampling units
 Study unknown traits/case sampling
 Urgent public policy & business decisions
Demerits
 Personal prejudice & bias
 No objective way of evaluating reliability of results
SNOWBALL SAMPLING
 A special non probability method used when the
desired sample characteristic is rare.
 The research starts with a key person and introduce
the next one to become a chain
SNOWBALL SAMPLING- STEPS
 Make contact with one or two cases in the population.

 Ask these cases to identify further cases.

 Ask these new cases to identify further new cases.

 Stop when either no new cases are given or the sample is


as large as is manageable
Merit
 access to difficult to reach populations (other
methods may not yield any results).
Demerit
 not representative of the population and will result in
a biased sample as it is self-selecting.
Types of Sampling Errors
Two types of sampling errors
 Biased Errors- Due to selection of sampling
techniques; size of the sample.
 Unbiased Errors / Random sampling errors-
Differences between the members of the population
included or not included.
METHODS OF REDUCING SAMPLING
ERRORS
 Specific problem selection.
 Systematic documentation of related research.
 Effective enumeration.
 Effective pre testing.
 Controlling methodological bias.
 Selection of appropriate sampling techniques

Вам также может понравиться