Академический Документы
Профессиональный Документы
Культура Документы
Measurement
If you cant measure it,
you cant manage it.
Bob Donath,
Consultant
12-2
Primary Scale
Nominal
Ordinal
Interval
Ratio
Types of Scales
Nominal
Nominal
Ordinal
Ordinal
interval
interval
Ratio
Ratio
12-6
Levels of Measurement
Nominal
Nominal
Classification
Classification
Ordinal
Ordinal
interval
interval
Ratio
Ratio
12-7
12-8
Nominal Scales
Mutually exclusive
and collectively
exhaustive
categories
Exhibits the
classification
characteristic only
Levels of Measurement
Nominal
Nominal
Ordinal
Ordinal
Classification
Classification
Classification
Classification
Order
Order
interval
interval
Ratio
Ratio
12-9
12-10
Ordinal Scales
Characteristics of
Levels of Measurement
Nominal
Nominal
Classification
Classification
Ordinal
Ordinal
Classification
Classification
Order
Order
interval
interval
Classification
Classification
Order
Order
Distance
Distance
Ratio
Ratio
12-11
12-12
Interval Scales
Characteristics of
Levels of Measurement
Nominal
Nominal
Classification
Classification
Ordinal
Ordinal
Classification
Classification
Order
Order
interval
interval
Classification
Classification
Order
Order
Distance
Distance
Ratio
Ratio
Classification
Classification
Order
Order
Distance
Distance
Natural
NaturalOrigin
Origin
12-13
12-14
Ratio Scales
Characteristics of
Primary Scales of
Measurement
Scale
Nominal
Ordinal
Interval
Numbers
Assigned
to Runners
Finish
7
Finish
Rank Order
of Winners
Performance
Rating on a
Third
place
Second
place
First
place
8.2
9.1
9.6
15.2
14.1
13.4
0 to 10 Scale
Ratio
Time to
Finish, in
Scale
Scale
Preference
Preference
Ratings
No. Store
Rankings
3 months
1. Parisian
2. Macys
3. Kmart
4. Kohls
5. J.C. Penney
6. Neiman Marcus
7. Marshalls
8. Saks Fifth Avenue
9. Sears
10.Wal-Mart
7
2
8
3
1
5
9
6
4
10
79
25
82
30
10
53
95
61
45
115
1-7
5
7
4
6
7
5
4
5
6
2
11-17
15
17
14
16
17
15
14
15
16
12
Ratio
Scale
$ spent last
0
200
0
100
250
35
0
100
0
10
Primary Scales of
Measurement
intended to be measured
Reliability
The degree to which measures are free from
The Goal
of Measurement Validity
Old Rifle
Low Reliability
(Target A)
New Rifle
High Reliability
(Target B)
New Rifle
Sun glare
Reliable but Not
Valid
(Target C)
Validity
V a lid it y
FAC E O R C O N TEN T
C R IT E R IO N V A L ID IT Y
C O N C U R R EN T
C O N S T R U C T V A L ID IT Y
P R E D IC T IV E
Reliability
R E L IA B IL IT Y
S T A B IL IT Y
TEST R ETEST
IN T E R N A L C O N S IS T E N C Y
E Q U IV A L E N T F O R M S
S P L IT T IN G H A L V E S
Sensitivity
A measurement instruments ability to
Noncomparative
Scales
Comparative
Scales
Paired
Comparison
Rank
Order
Constant
Sum
Q-Sort and
Other
Procedures
Likert
Continuous
Itemized
Rating Scales Rating Scales
Semantic
Differential
Stapel
A Comparison of Scaling
Techniques
Comparative scales involve the direct
Comparative Scales
Advantages
Disadvantages
all respondents.
Easily understood and can be
applied.
Involve fewer theoretical
assumptions.
Tend to reduce halo or carryover
data
Inability to
generalize beyond
the stimulus objects
scaled.
Comparative Scaling
TechniquesA respondent is presented with two objects and
a)
Paired
Comparison
Scaling
asked
to select
one according to some
criterion.
The data obtained are ordinal in nature.
Paired comparison scaling is the most widely-used
required.
Under the assumption of transitivity, it is possible
Recording Form:
J hirmack Finesse
J hirmack
Vidal
Head &
Sassoon Shoulders
0
1
Finesse
1a
Vidal Sassoon
Pert
Number of Times
Preferredb
1
1
Pert
0
0
1
0
1 in a particular box means that the brand in that column was preferred over the brand
in the corresponding row. A 0 means that the row brand was preferred over the column
brand. bThe number of times a brand was preferred is obtained by summing the 1s in
a
each column.
A paired comparison
test
taste
Comparative Scaling
Techniquesb)
Rank
Order
Scaling
ordinal data.
Only (n - 1) scaling decisions need be made in
Rank Order
1. Crest
_________
2. Colgate
_________
3. Aim
_________
4. Gleem
_________
5. Sensodyne
_________
6. Ultra Brite
_________
7. Close Up
_________
8. Pepsodent
_________
9. Plus White
_________
10. Stripe
_________
Comparative Scaling
Techniques c)
Constant Sum Scaling
Respondents allocate a constant sum of units,
such as 100 points to attributes of a product to
reflect their importance.
If an attribute is unimportant, the respondent
Instructions
2
4
9
17
0
5
3
60
100
Segment I
4
17
7
9
19
9
20
15
100
Non-Comparative Scales
Non-comparative Scaling
Techniques
Respondents evaluate only one object at a time,
Version 2
Probably the worst - - - - - - -I - - - - - - - - - - - - - - - - - - - - ----------------------------Probably the best
0 10
20
30
40
50
60
70
80
90
100
Version 3
Very bad
Neither good
Very good
nor bad
Probably the worst - - - - - - -I - - - - - - - - - - - - - - - - - - - - --------------------Probably the
best
0
10 20
30
40
50
60
70
80
90
100
2. b) Itemized Rating
Scales
The respondents are provided with a scale that has a
Likert Scale
The Likert scale requires the respondents to indicate a degree of agreement
or
disagreement with each of a series of statements about the stimulus objects.
Strongly
Disagree
Neither
disagree
agree nor
agree
Agree
Strongly
2X
2X
3X
disagree
Semantic Differential
The Scale
semantic differential is a seven-point rating scale with end
points associated with bipolar labels that have semantic
meaning.
SEARS IS:
Powerful
--:--:--:--:-X-:--:--: Weak
Unreliable
--:--:--:--:--:-X-:--: Reliable
Modern
--:--:--:--:--:--:-X-: Old-fashioned
:---:---:---:---:---:---:---: Calm
3) Uncomfortable
:---:---:---:---:---:---:---: Comfortable
4) Dominating
:---:---:---:---:---:---:---: Submissive
5) Thrifty
:---:---:---:---:---:---:---: Indulgent
6) Pleasant
:---:---:---:---:---:---:---: Unpleasant
7) Contemporary
:---:---:---:---:---:---:---: Obsolete
8) Organized
:---:---:---:---:---:---:---: Unorganized
9) Rational
:---:---:---:---:---:---:---: Emotional
10) Youthful
:---:---:---:---:---:---:---: Mature
Stapel Scale
The Stapel scale is a unipolar rating scale with ten categories
numbered from -5 to +5, without a neutral point (zero). This scale
is usually presented vertically.
SEARS
+5
+5
+4
+4
+3
+3
+2
+2X
+1
+1
HIGH QUALITY
-1
-1
-2
-2
-3
-3
-4X -4
-5
-5
POOR SERVICE
Basic Noncomparative
Scales
Scale
Basic
Characteristics
Examples
Advantages
Disadvantages
Continuous
Rating
Scale
Place a mark on a
continuous line
Reaction to
TV
commercials
Easy to construct
Scoring can be
cumbersome
unless
computerized
Likert Scale
Degrees of
agreement on a 1
(strongly disagree)
to 5 (strongly agree)
scale
Measurement
of attitudes
Easy to construct,
administer, and
understand
More
time -consuming
Semantic
Differential
Brand,
product, and
company
images
Versatile
Controversy as
to whether the
data are interval
Stapel
Scale
Measurement
of attitudes
and images
Easy to construct,
administer over
telephone
Confusing and
difficult to apply
Scale Evaluation
Scale Evaluation
Reliability
Test/
Retest
Alternative
Forms
Validity
Internal
Consistency
Content
Convergent
Criterion
Generalizability
Construct
Discriminant
Nomological
Measurement Accuracy
The true score model provides a framework
for understanding the accuracy of
measurement.
XO = X T + X S + X R
where
XO = the observed score or measurement
XT = the true score of the characteristic
XS = systematic error
XR = random error
Reliability
Reliability can be defined as the extent to which
Reliability
Internal consistency reliability determines the
Validity
The validity of a scale may be defined as the extent
Validity
Construct validity addresses the question of
Relationship Between
Reliability and Validity
Chapter
Seven
Validity
Issues
in
Causal or Experimental
Research Design:
Test
Experimental Design
An experimental design is a set of procedures
specifying:
the test units and how these units are to be divided into
homogeneous subsamples,
Manipulation of Independent
Variable
Classificatory Vs. continuous variables
Experimental and control groups
Treatment levels
More than one independent variable
Experimental Treatments
The alternative manipulations of the
Validity in Experimentation
Internal validity refers to whether the
Extraneous Variables
History refers to specific events that are external
Extraneous Variables
In the interactive testing effect (IT), a prior
Controlling Extraneous
Variables
Randomization refers to the random assignment
of test units to experimental groups by using
random numbers. Treatment conditions are also
randomly assigned to experimental groups.
Matching involves comparing test units on a set
A Classification of Experimental
Designs
Figure 7.1
Experimental Designs
Pre-experimental
True
Experimental
One-Shot Case
Study
Pretest-Posttest
Control Group
Time Series
Randomized
Blocks
Posttest: Only
Control Group
Multiple Time
Series
Latin Square
Static Group
Solomon FourGroup
Quasi
Experimental
Statistical
Factorial
Design
01
treatment X.
A single measurement on the dependent variable is
taken (01).
There is no random assignment of test units.
The one-shot case study is more appropriate for
One-Group Pretest-Posttest
Design
01
02
01
CG: 02
A two-group experimental design.
The experimental group (EG) is exposed to the
the treatment.
Test units are not assigned at random.
The treatment effect would be measured as 01 - 02.
R
R
01
03
X
04
02
control group.
A pretreatment measure is taken on each group.
The treatment effect (TE) is measured as:(02 - 01) - (04 - 03).
Selection bias is eliminated by randomization.
The other extraneous effects are controlled as follows:
02 01= TE + H + MA + MT + IT + I + SR + MO
04 03= H + MA + MT + I + SR + MO
= EV (Extraneous Variables)
The experimental result is obtained by:
(02 - 01) - (04 - 03) = TE + IT
Interactive testing effect is not controlled.
CG :
01
02
TE = 01 - 02
Except for pre-measurement, the implementation
of this design is very similar to that of the pretestposttest control group design.
Quasi-Experimental Designs:
Time Series Design
01 02 03 04 05
X 06 07 08 09 010
X 06 07 08 09 010
CG : 01 02 03 04 05
06 07 08 09 010
Statistical Designs
Statistical designs consist of a series of basic
experiments that allow for statistical control and
analysis of external variables and offer the following
advantages:
The effects of more than one independent variable
can be measured.
Specific extraneous variables can be statistically
controlled.
Economical designs can be formulated when each
test unit is measured more than once.
Randomized Block
Design
Table 7.4
Block
Store
Number Patronage
1
2
3
4
Heavy
Medium
Low
None
Treatment Groups
Commercial Commercial Commercial
A
B
C
A
A
A
A
B
B
B
B
C
C
C
C
High
Medium
B
C
C
B
B
Factorial Design
Is used to measure the effects of two or
conceptualized as a table.
In a two-factor design, each level of one
Factorial Design
Table 7.6
Amount of Humor
Amount of Store
Information
Low
No
Medium
High
Humor
Humor
A
B
Humor
C
Medium
High