MDS Explained: A Guide to Multidimensional Scaling Analysis

Multidimensional
Scaling (MDS)
Angelina Anastasova
Natalia Jaworska
PSY5121 March 18/2008
Multidimensional Scaling (MDS):

What Is It?
Generally regarded as exploratory data analysis (Ding, 2006).

Reduces large amounts of data into easy-to-visualize structures.
Attempts to find structure (visual representation) in a set of
distance measures, e.g. dis/similarities, between objects/cases.
Shows
how variables/objects are related perceptually.
How? By assigning cases to specific locations in space.

Distances between points in space match dis/similarities as closely
as possible:
Similar objects: Close points Dissimilar objects: Far apart points
MDS Example: City Distances

Distances
Matrix:
Symmetric
Cluster
Spatial Map
Dimensions
1: North/South
2: East/West
The Process of MDS: The Data

Data
of MDS: similarities, dissimilarities, distances,

or proximities reflects amount of dis/similarity or
distance between pairs of objects.
Distinction between similarity and dissimilarity data
dependent on type of scale used:
Dissimilarity scale: Low #=high similarity &
High #=high dissimilarity.
Similarity scale: Opposite of dissimilarity.
E.g.
On a scale of 1-9 (1 being the same and 9

completely different) how similar are chocolate bars A
and B? Dissimilarity scale.
SPSS
requires dissimilarity scales.
Data Collection for MDS (1)
Direct/raw data: Proximities values directly obtained

empirical, subjective scaling.
E.g. Rating or ranking dis/similarities (Likert scales).
Indirect/derived data: Computed from other measurements:

correlations or confusion data (based on mistakes) (Davidson, 1983).
from
E.g. Letters of alphabet presented briefly and must be identified. Rarely confused
letters given high dissimilarity values, those that are confused get low values.
Data collection: Pairwise comparison, grouping/sorting tasks, direct

ranking, objective method (e.g. city distances).
Pairwise comparisons: All object pairs randomly presented:
# of pairs = n(n-1)/2, n = # of objects/cases
Can be tedious and inefficient process.
Data Collection for MDS (2)

Facilitation of pairwise comparison task:
1) Incomplete similarity task: random or cyclic
deletion of comparison pairs.
2) Simplification of pair comparisons (binary scale).
3) Choosing grouping/sorting tasks (Tsogo et al., 2000).
Pre-specified # of groups or not specified.
Appropriateness
of a data collection
technique is dependent on stimuli and,
in some cases, hypothesis and theory.
Type of MDS Models (1)

MDS model classified according to:
1) Type of proximities:
Metric/quantitative: Quantitative information/interval data about

objects proximities e.g. city distance.
Non-metric/qualitative: Qualitative information/nominal data
about proximities e.g. rank order.
2) Number of proximity matrices (distance, dis/similarity

matrix).
Proximity matrix is the input for MDS.
The above criteria yield:
1) Classical MDS: One proximity matrix (metric or non-metric).

2) Replicated MDS: Several matrices.
3) Weighted MDS/Individual Difference Scaling: Aggregate
proximities and individual differences in a common MDS space.
Types of MDS (2)
More typical in Social Sciences is the classification of

MDS based on nature of responses:
1) Decompositional MDS: Subjects rate objects on an
overall basis, an impression, without reference to
objective attributes.
Production of a spatial configuration for an individual and
a composite map for group.
2) Compositional MDS: Subjects rate objects
on a variety of specific, pre-specified attributes
(e.g. size).
No maps for individuals, only composite maps.
The MDS Model
Classical MDS uses Euclidean principles to model

data proximities in geometrical space, where distance
(dij) between points i and j is defined as:
xi and xj specify coordinates of points i
and j on dimension a, respectively.
The modeled Euclidean distances are related to the observed

proximities, ij, by some transformation/function (f).
Most MDS models assume that the data have the form:
ij = f(dij)
All MDS algorithms are a variation of the above (Davidson,

1983).
Output of MDS
MDS Map/Perceptual Map/Spatial Representation:

1) Clusters: Groupings in a MDS spatial
representation.
These may represent a domain/subdomain.
2) Dimensions: Hidden structures in data. Ordered

groupings that explain similarity between items.
Axes are meaningless and orientation is arbitrary.
In theory, there is no limit to the number of
dimensions.
In reality, the number of dimensions that can be
perceived and interpreted is limited.
Diagnostics of MDS (1)
MDS attempts to find a spatial configuration X such

that the following is true: f(ij) dij(X)
Stress (Kruskals) function: Measures degree of
correspondence between distances among points on the MDS
map and the matrix input.
Proportion of variance of disparities
not accounted for by the model:
Range 0-1: Smaller stress = better representation.
None-zero stress: Some/all distances in the map are distortions
of the input data.
Rule of thumb: 0.1 is excellent; 0.15 not tolerable.
Diagnostics of MDS (2)
R2 (RSQ): Proportion of variance of the disparities accounted for

by the MDS procedure.
Weirdness Index: Correspondence of subjects map and the

aggregate map outlier identification.
R20.6 is an acceptable fit.
Range 0-1: 0 indicates that subjects weights are proportional to the average
subjects weights; as the subjects score becomes more extreme, index
approaches 1.
Shepard Diagram: Scatterplot of input proximities (X-axis)

against output distances (Y-axis) for every pair of items.
Step-line produced. If map distances fall on the step-line this indicates that
input proximities are perfectly reproduced by the MDS model (dimensional
solution).
Interpretation of Dimensions
Squeezing data into 2-D enables readability but may

not be appropriate: Poor, distorted representation of the
data (high stress).
Scree plot: Stress vs.
number of dimensions.
E.g. cities distance
Primary objective in dimension interpretation: Obtain
best fit with the smallest number of possible
dimensions.
How does one assign meaning to dimensions?
Meaning of Dimensions
Subjective
Procedures:
Labelling the dimensions by
visual inspection, subjective
interpretation, and information
from respondents.
Experts evaluate and identify
the dimensions.
Validating MDS Results

Split-sample
comparison:
Original sample is divided and a correlation between the
variables is conducted.
Multi-sample
comparison:
New sample is collected and a correlation is conducted

between the old and new data.
Comparisons
are done visually or with a simple

correlation of coordinates or variables.
Assessing whether MDS solution
(dimensionality
extraction) changes in a
substantial way.
MDS Caveats
Respondents
probably perceive stimuli differently. In

non-aggregate data, different dimensions may
emerge.
Respondents may attach different levels of
importance to a dimension.
Importance of a dimension may change over time.
Interpretation of dimensions is subjective.
Generally, more than four times as many objects as
dimensions should be compared for the MDS model
to be stable.
Advantages of MDS
An
alternative to the GLM.

Does not require assumptions of linearity,
metricity, or multivariate normality.
Can be used to model nonlinear relationships.
Dimensionality solution can be obtained from
individuals; gives insight into how individuals
differ from aggregate data.
Reveals dimensions without the need for defined
attributes.
Dimensions that emerge from MDS can be
incorporated into regression analysis to assess
their relationship with other variables.
Disadvantages of MDS
Provides
a global measure of dis/similarity but

does not provide much insight into subtleties (Street
et al., 2001).
Increased
dimensionality: Difficult to represent

and decreases intuitive understanding of the data.
As such, the model of the data becomes as
complicated as the data itself.
Determination of meanings of dimensions is
subjective.
A Tiny Break . . .
SPSSing MDS
In the SPSS Data Editor window, click: Analyze >
Scale > Multidimensional Scaling
Select four or more Variables that you want to test.

You may select a single variable for the Individual
Matrices for window (depending on the distances option
selected).
If Data are distances (e.g. cities distances) option is

selected, click on the Shape button to define
characteristic of the dissimilarities/proximity matrices.
If Create distance from data

is selected, click on the
Measure button to control the
computation of dissimilarities,
to transform values, and to
compute distances.
In the Multidimensional Scaling dialog box, click on the

Model button to control the level of measurement,
conditionality, dimensions, and the scaling model.
Click on the Options button to control the

display options, iteration criteria, and
treatment of missing values.
MDS: A Psychological Example

Multidimensional scaling modelling approach to latent profile
analysis in psychological research (Ding, 2006)
Basic premise: Utilize MDS to investigate types or profiles
of people.
Profile: From applied psych where test batteries are used to
extract and construct distinctive features/characteristics in
people.
MDS method was used to:
Derive profiles (dimensions) that could provide information

regarding psychosocial adjustment patterns in adolescents.
Assess if individuals could follow different profile patterns than
those extracted from group data, i.e. deviations from the
derived normative profiles.
Study Details: Methodology
Participants: College students (=23 years, n=208).

Instruments:
Self-Image Questionnaire for Young Adolescents
(SIQYA). Variables:
Body Image (BI), Peer Relationships (PR), Family Relationships

(FR), Mastering & Coping (MC), Vocational-Educational Goals
(VE), and Superior Adjustment (SA)
Three mental health measures of well-being:

Kandel Depression Scale
UCLA Loneliness Scale
Life Satisfaction Scale
Data for MDS
Scored data for MDS profile analysis
Sample data for 14 individuals:
BI=body image, PR=peer relations, FR=family relations, MC=mastery & coping,

VE=vocational & educational goal, SA=superior adjustment, PMI-1=profile match
index for Profile 1, PMI-2=profile match index for Profile 2, LS=life satisfaction,
Dep=depression, PL=psychological loneliness
The Analysis: Step by Step

Step
1: Estimate the number of profiles

(dimensions) from the latent variables.
MDS map
Euclidean distance model
Kruskal's stress = 0.00478
2.0
pr
1.5
Excellent stress value.

RSQ = 0.9998
Configuration derived in 2
dimensions.
1.0
.5
ve
0.0
mc sa
bi
Profile 2
-.5
-1.0
fr
-1.5
-2
Profile 1
-1
Scale values of two MDS profiles (dimensions) in

psychosocial adjustment.
MDS map
Euclidean distance model
2.0
pr
1.5
1.0
.5
ve
0.0
mc sa
bi
-.5
Profile 2
-1.0
fr
-1.5
-2
-1
Profile 1
Normative profiles of
psychosocial adjustments
in young adults.
Each profile represents

prototypical individual.
Step
2: Using the estimated scale values as

independent variables and observed variables
as dependent variables estimate:
Individual profile match index (PMI):

The extent of individual variability along a profile.
Intra-individual variability across profiles.
Fit
index:
The proportion of variance
in the individuals observed
data that can be accounted
for by the profiles.
PMI-1=profile match index for Profile 1, PMI-2=profile

match index for Profile 2, LS=life satisfaction,
Dep=depression, PL=psychological loneliness
Individual Profiles vs. Aggregate

Profile 1
Profile 2
Subject 1
Subject 2
Subject 4
Body Image (BI)
2.28
-0.5
2.82
3.82
5.09
Peer Relations (PR)
0.23
1.49
5.1
5.3
Subject 1
-0.73
0.29
0.94
0.7
-1.2
4.71
4.69
Subject 2
-0.38
0.23
0.99
Mastery & Coping (MC)
-0.25
0.14
4.6
4.9
Subject 4
-0.16
0.24
0.32
Voc-Ed Goals (VE)
-1.49
5.7
5.4
Superior Adjust. (SA)
-0.08
0.08
4.3
4.9
5.5
Family Relations (FR)
PMI-1
PMI-2
FIT
Step 3: Assess the association between profiles

and other factors by regression.
Profile 1: -High scores on Body Image - higher degree of life satisfaction.

-High scores on the Vocational-Educational Goal - higher degree of depression.
Profile 2: -Higher scores on the family relationships profile - higher degree of psychological
loneliness.
Level:
-Average scores of individuals psychosocial adjustment.
-Overall positive psychosocial adjustment scores suggest less depression or
psychological loneliness and higher degree of life satisfaction.
Commentary on MDS Profile

Analysis
Strength
Provides representation of what typical

configurations or profiles of variables exist in the
population and how individuals differ with respect
to these profiles.
Enables
of MDS profile analysis:
identification/analysis of:
Individuals who develop in an idiographic (specific

and subjective) manner; not consistent with
aggregate profiles.
Limitations of MDS Profile

Analysis
MDS profile analysis is exploratory: Determination
of the number of profiles is subjective.
Because of subjectivity involved, the best methods
for model selection should be based on theoretical
grounds.
Interpretation of the statistical significance of the
scale values (i.e. variable parameter estimates) is
somewhat arbitrary. There are no objective criteria for
decision-making regarding which scale values are
salient.
Not know to what degree the profiles obtained from
MDS can be generalized across populations.
Thank You!
Questions?
References
Davidson, M. L. (1983). Multidimensional scaling. New York: J. Wiley

and Sons.
Ding, C. S. (2006). Multidimensional scaling modelling approach to latent
profile analysis in psychological research. International Journal of
Psychology 41 (3), 226-238.
Kruskal, J.B. & Wish M.1978. Multidimensional Scaling. Sage.
Street, H., Sheeran, P., & Orbell, S. (2001). Exploring the relationship
between different psychosocial determinants of depression: a
multidimensional scaling analysis. Journal of Affective Disorders 64, 53
67.
Takane, Y., Young, F.W., & de Leeuw, J. (1977). Nonmetric individual
differences multidimensional scaling: An alternating least squares method
with optimal scaling features, Psychometrika 42 (1), 767.
Young, F.W., Takane, Y., & Lewyckyj, R. (1978). Three notes on ALSCAL,
Psychometrika 43 (3), 433435.
http://www.analytictech.com/borgatti/profit.htm
http://www2.chass.ncsu.edu/garson/pa765/mds.htm
http://www.terry.uga.edu/~pholmes/MARK9650/Classnotes4.pdf

MDS Explained: A Guide to Multidimensional Scaling Analysis

Загружено:

Сведения о документе

Исходное описание:

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

MDS Explained: A Guide to Multidimensional Scaling Analysis

Загружено:

Авторское право:

Доступные форматы

Multidimensional

PSY5121 March 18/2008

Multidimensional Scaling (MDS):

Generally regarded as exploratory data analysis (Ding, 2006).

how variables/objects are related perceptually.

How? By assigning cases to specific locations in space.

MDS Example: City Distances

The Process of MDS: The Data

of MDS: similarities, dissimilarities, distances,

On a scale of 1-9 (1 being the same and 9

requires dissimilarity scales.

Data Collection for MDS (1)

Direct/raw data: Proximities values directly obtained

E.g. Rating or ranking dis/similarities (Likert scales).

Indirect/derived data: Computed from other measurements:

Data collection: Pairwise comparison, grouping/sorting tasks, direct

Data Collection for MDS (2)

Type of MDS Models (1)

Metric/quantitative: Quantitative information/interval data about

2) Number of proximity matrices (distance, dis/similarity

1) Classical MDS: One proximity matrix (metric or non-metric).

Types of MDS (2)

More typical in Social Sciences is the classification of

The MDS Model

Classical MDS uses Euclidean principles to model

The modeled Euclidean distances are related to the observed

All MDS algorithms are a variation of the above (Davidson,

MDS Map/Perceptual Map/Spatial Representation:

These may represent a domain/subdomain.

2) Dimensions: Hidden structures in data. Ordered

Diagnostics of MDS (1)

MDS attempts to find a spatial configuration X such

Diagnostics of MDS (2)

R2 (RSQ): Proportion of variance of the disparities accounted for

Weirdness Index: Correspondence of subjects map and the

R20.6 is an acceptable fit.

Shepard Diagram: Scatterplot of input proximities (X-axis)

Squeezing data into 2-D enables readability but may

Validating MDS Results

Original sample is divided and a correlation between the

New sample is collected and a correlation is conducted

are done visually or with a simple

probably perceive stimuli differently. In

alternative to the GLM.

a global measure of dis/similarity but

dimensionality: Difficult to represent

Select four or more Variables that you want to test.

If Data are distances (e.g. cities distances) option is

If Create distance from data

In the Multidimensional Scaling dialog box, click on the

Click on the Options button to control the

MDS: A Psychological Example

Derive profiles (dimensions) that could provide information

Study Details: Methodology

Participants: College students (=23 years, n=208).

Body Image (BI), Peer Relationships (PR), Family Relationships

Three mental health measures of well-being:

Data for MDS

Scored data for MDS profile analysis

Sample data for 14 individuals:

BI=body image, PR=peer relations, FR=family relations, MC=mastery & coping,

The Analysis: Step by Step