Академический Документы
Профессиональный Документы
Культура Документы
net/publication/45152401
Making the illusory correlation effect appear and then disappear: The effects
of increased learning
CITATIONS READS
14 161
5 authors, including:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Denis Hilton on 20 May 2014.
Making the illusory correlation effect appear and then disappear: The
effects of increased learning
Robin A. Murphya; Stefanie Schmeerb; Frédéric Vallée-Tourangeauc; Esther Mondragónd; Denis Hiltone
a
Department of Experimental Psychology, University of Oxford, Oxford, UK b School of Psychology,
University of Hertfordshire, Hatfield, UK c School of Social Science, Kingston University, Kingston-
upon-Thames, UK d Institute of Behavioural Neuroscience, University College London, London, UK e
Laboratoire Cognition, Langues, Langage, et Ergonomie (CLLE), University of Toulouse II-le Mirail,
Toulouse, France
To cite this Article Murphy, Robin A. , Schmeer, Stefanie , Vallée-Tourangeau, Frédéric , Mondragón, Esther and Hilton,
Denis(2011) 'Making the illusory correlation effect appear and then disappear: The effects of increased learning', The
Quarterly Journal of Experimental Psychology, 64: 1, 24 — 40, First published on: 08 July 2010 (iFirst)
To link to this Article: DOI: 10.1080/17470218.2010.493615
URL: http://dx.doi.org/10.1080/17470218.2010.493615
This article may be used for research, teaching and private study purposes. Any substantial or
systematic reproduction, re-distribution, re-selling, loan or sub-licensing, systematic supply or
distribution in any form to anyone is expressly forbidden.
The publisher does not give any warranty express or implied or make any representation that the contents
will be complete or accurate or up to date. The accuracy of any instructions, formulae and drug doses
should be independently verified with primary sources. The publisher shall not be liable for any loss,
actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directly
or indirectly in connection with or arising out of the use of this material.
THE QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY
2011, 64 (1), 24 –40
Robin A. Murphy
Department of Experimental Psychology, University of Oxford, Oxford, UK
Stefanie Schmeer
School of Psychology, University of Hertfordshire, Hatfield, UK
Frédéric Vallée-Tourangeau
School of Social Science, Kingston University, Kingston-upon-Thames, UK
Esther Mondragón
Institute of Behavioural Neuroscience, University College London, London, UK
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
Denis Hilton
Laboratoire Cognition, Langues, Langage, et Ergonomie (CLLE), University of Toulouse II– le Mirail,
Toulouse, France
The acquisition of a negative evaluation of a fictitious minority social group in spite of the absence of
any objective correlation between group membership and negative behaviours was described by
Hamilton and Gifford (1976) as an instance of an illusory correlation. We studied the acquisition
and attenuation through time of this correlation learning effect. In two experiments we asked for par-
ticipants’ judgements of two fictitious groups using an online version of a group membership belief
paradigm. We tested how judgements of the two groups changed as a function of the amount of train-
ing they received. Results suggest that the perception of the illusory correlation effect is initially
absent, emerges with intermediate amounts of absolute experience, but diminishes and is eliminated
with increased experience. This illusory correlation effect can be considered to reflect incomplete
learning rather than a bias due to information loss in judgements or distinctiveness.
The formation of attitudes or opinions about an It can be of course that these stereotypic associations
individual based solely on that person’s membership reflect actual correlations that exist in the “real”
in a particular group is a form of stereotyping. These world—alternatively, there may not be any objective
attitudes may also be the result of a perceived corre- evidence to support such a belief. For instance, it has
lation or association between an individual’s group been documented that certain minorities may be
membership and various traits and behaviours. considered to be criminal, even though studies of
Correspondence should be addressed to Robin A. Murphy, Experimental Psychology, University of Oxford, South Parks Road,
Oxford, OX1 3UD, UK. E-mail: robin.murphy@psy.ox.ac.uk
This research was supported by Grant R–000 –22–3688 from the Economic and Social Research Council. We would like to
thank Peter Sedlmeier, Eliot Smith, and Eva Walther for helpful comments on an earlier draft of this article.
prison populations show that the minorities in ques- group membership, and their engagement in beha-
tion are not overrepresented in prison populations viours. A member of either a majority or minority
(e.g., Hamilton, 1981). Criminal behaviour is gen- group (A or B) was described as engaging in either
erally infrequent, which led Hamilton and Gifford a desirable (positive) or an undesirable (negative)
(1976) to propose a distinctiveness hypothesis that behaviour. For example, a positive statement
infrequent behaviours are especially likely to be might read “John, a member of Group A, visited
associated to minority groups, as both are likely to a sick friend in hospital”. These sentences were
attract attention during the learning process. They presented one at a time to the participants, who
devised a learning task that showed that, as pre- were expected to develop an overall impression of
dicted, such illusory correlations emerge even the two groups and how likely they were to
when the perceiver has no motivational grounds to engage in different types of behaviour. Positive
form either a positive or a negative stereotype of behaviours were those that people had previously
the minority group in question. rated as being desirable, with negative statements
Although Hamilton and Gifford’s (1976) results being undesirable. The ratio of positive to negative
have been replicated by many (for reviews see behaviour statements (0.69: 0.31) was the same for
Berndsen, Spears, van der Pligt, & McGarty, the two groups. However, participants were
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
2002; McGarty & de la Haye, 1997), the findings exposed to twice as many statements about
sit uneasily with another mass of experimental find- people in Group A than about people in Group
ings in associative learning theory that provide B (i.e., Group A: 18:8; Group B: 9:4).
strong evidence that after sufficient learning people Importantly, given that the overall probability of
are quite accurate judges of the correlation between engaging in positive or negative behaviours was
events in nonsocial domains (e.g., Shanks & the same for both Group A and Group B, the
Dickinson, 1987; Vallée-Tourangeau, Murphy, overall contingency between group membership
Drew, & Baker, 1998b). As we shall show, the key and valence of behaviour was zero. If judgements
prediction made by an associative learning model were based on some unbiased probabilistic judge-
(e.g., Rescorla & Wagner, 1972) is that illusory cor- ment then they should have thought that both
relations should be observed in the early phase of groups were as likely to engage in positive (or
learning, followed by attenuation and elimination negative) behaviours.
of this effect in the final phase of learning. The learning phase was followed by a test of
However, none of the social cognition accounts attitudes and likeability. They found that the min-
based on distinctiveness (Hamilton & Gifford, ority group (i.e., B) was rated more negative than
1976), information loss (Fiedler, 1991), or exemplars the majority group (i.e., A). This was indicated
(Smith, 1991) have been used to make this predic- by more negative trait ratings for members of
tion of sequential appearance followed by disappear- Group B, more attributions of Group B member-
ance of the illusory correlation effect. Indeed, the ship to negative behaviours, and an overestimation
information loss account of Fiedler (1991) and the of the frequency of negative behaviours in Group
exemplar account (Smith, 1991) are sufficiently B (see also Berndsen et al., 2002; Mullen &
well formalized that it is clear that they make pre- Johnson, 1990).
cisely the opposite prediction—increased learning
will lead to increased illusory correlation.
The distinctiveness-based account
Hamilton and Gifford (1976) proposed that the
The illusory correlation effect of stereotypes:
relative infrequency of the minority group’s
Current explanations
engagement in negative behaviours enhances the
In Hamilton and Gifford’s (1976) demonstration distinctiveness of the pairings of the minority
of an illusory correlation (IC) effect, people read (i.e., less frequent) group membership (i.e.,
statements describing fictitious individuals, their Group B) with the less frequent behaviour type
(i.e., negative behaviours). At the time they were difference between the number of presented posi-
somewhat agnostic as to how this enhanced dis- tive and negative behaviours rather than simply
tinctiveness caused the negative judgements. their relative difference.
They suggested that it could be via better encod- Smith (1991, p. 118) explicitly argues that the
ing, easier accessibility, or easier availability (e.g., degree of IC perceived depends on the arithmetic
Tversky & Kahneman, 1973). difference in frequencies of occurrence of the fre-
An extension of Hamilton and Gifford’s (1976) quent and infrequent events in the majority and
distinctiveness account, proposed by McConnell, minority groups. As the arithmetic difference
Sherman, and Hamilton (1994), suggested that increases mathematically with the number of
ultimate distinctiveness, rather than distinctive- trials, this predicts further increase in IC with
ness at the time of encoding, was crucial for the further learning. According to Smith, the IC
occurrence of the IC effect. The notion of ultimate effect observed after 72 trials will therefore be
distinctiveness avoids the problem of determining twice as big as the IC effect observed after 36
what makes something distinctive until all the trials. In turn, Fiedler (1991) suggests that “the
events have been sampled. Once participants illusion should decrease with increasing sample
make their judgement, the relative distinctiveness size” (p. 27) but quickly discards the prediction
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
of the two groups is determined, and the bias is as being predicated on an “impermissible
expressed. McConnell et al. manipulated the application of the Spearman –Brown formula”
serial position of the minority group statements (p. 27) that predicts more representative sampling
relative to each other and relative to the time of as a function of test lengthening (increasing
judgement, while keeping the total number— the number of statements). The application is
that is, their ultimate distinctiveness—constant. impermissible because of a number of auxiliary
Results supported the ultimate distinctiveness assumptions Fiedler deems too compelling to
account. Therefore, as long as some information ignore about information load that taxes working
was more distinctive (based on relatively less memory detrimentally: “Minority behaviour in
exposure) then people would be biased in their a long and demanding [italics added] series is
judgements. assessed less accurately . . . than minority behav-
iour in short and less demanding series” (Fiedler,
The information loss account 1991, p. 28).
Both Fiedler (1991; Fiedler, Russer, & Gramm,
1993) and Smith (1991) have proposed alternative
frequency-based accounts that do not rely on dis-
tinctiveness but rather on memory decay. For Associative learning
them biased judgements result from impaired, In contrast, we argue that a simple associative
not enhanced, processing of the less frequent min- learning mechanism predicts that the illusory cor-
ority group instances. Fiedler (1996, 2000a) argues relation effect is the outcome of preasymptotic or
that both the Fiedler (1991) and Smith (1991) incomplete learning. Such a mechanism predicts
models share the same essential properties, in that (a) learning is faster about more frequent
that the memories for the less frequent behaviour events and slower about less frequent events; (b)
associated with members of the minority group in the early phase of the learning curve, corre-
are particularly likely to be lost from memory com- lations are overestimated; and (c) after a sufficient
pared to the more frequently experienced state- number of trials, learning reaches asymptote
ments associated with the majority group. From because associations have reached maximum
this perspective the relatively well-encoded data strength. Asymptotic learning in simple associative
about the majority group allow the majority models converges on the actual degree of corre-
group to be more highly rated. Fiedler argues lation between two variables (Chapman &
that such judgements are sensitive to the absolute Robbins, 1990).
differences in the perception of control (Msetfi, date cause that is in fact uncorrelated with a
Murphy, Simpson, & Kornbrot, 2005), preference target effect. However, with longer exposure,
learning (Field, 2005), and language acquisition causal ratings decrease and approach zero
(Ellis, 2006) have all been analysed from the (Shanks & Dickinson, 1987, 1988; for analogous
perspective of associative models. results in animals, see Kremer, 1974). By exten-
Much work that explores an associative sion we speculated that the illusory correlation
account of causal learning in humans use simple effect might reflect a preasymptotic phase of
predictive tasks where participants experience learning. Initially participants erroneously per-
the repeated presence (or absence) of a predictor ceive a correlation between group membership
event and the subsequent presence (or absence) and valence of behaviour, but with sufficient
of an outcome (e.g., Vallée-Tourangeau, training the effect might be expected to
Hollingsworth, & Murphy, 1998a). Thus the disappear.
task scenario may involve fictitious patients who To understand the nature of this prediction we
have a virus (or not) and who are diagnosed present a brief overview of one commonly used
with a disease (or not). The correlation between associative model that possesses the basic charac-
the predictor and the outcome can be calculated, teristics necessary to predict the IC effect
and participants’ judgements of the predictor – (Rescorla – Wagner model; Rescorla & Wagner,
outcome relationship are assessed at different 1972; RWM henceforth). We use this model
points during the learning experience. for the purposes of exposition because it is a
Correlational learning typically displays a well-known and representative member of the
learning curve. Thus people typically overesti- class of associative error correction learning
mate the contingency between the predictor and models that contain a delta rule. Any associative
the outcome in the early phases of learning, but model that calculates changes (the delta) in
their judgements slope off to become remarkably association on the basis of an imposed limit on
accurate at the final stage of learning. Despite learning should predict this basic finding (see
some claims that people are poor judges of Sedlmeier, 2006, for a relevant theoretical dem-
contingency (e.g., Baron, 2000; Fiedler, 2000b; onstration and Denrell, 2007, Fiedler, 2000b,
Fiedler, Freytag, & Meiser, 2009; Peterson, Van Rooy, Van Overwalle, Vanhoomissen,
1980; Smedslund, 1963), research has shown Labiouse, & French, 2003, for other forms of
that the human ability to learn online cause – associative models developed to explain diverse
event relationships is both very general and phenomena in social cognition).
The formal properties of an associative learning (whether positive or negative, see Field, 2005, for
model a review of the application of associative theory
In Hamilton and Gifford’s (1976) IC paradigm, to the development of preferences). The valence
the associated events (groups and behavioural of the affective response determines the direction
statements) are similar to the associated events in of the associative change. Positive statements cause
causal learning. Group membership labels are the an increase of the association in a positive direction
stimulus events that become associated with the and negative ones cause an increase in the negative
positive or negative affect induced by the behav- value of the association. On each trial then the
ioural statements. The RWM was originally devel- specific statement is hypothesized to cause a
oped to describe the formation of associations change in the internal association between the
between representations of a conditioned stimulus group label and the affective response or outcome.
(CS, e.g., light) and an unconditioned stimulus The parameters a and b represent the associability
(US, e.g., food). Temporal contiguity between of the labels and the affective outcome; the
the two events strengthens their association (V ), maximum amount of associative strength l was set
but there is a limit to the total strength of any at +1 for positive statements and –1 for negative
association (l), and this limit allows learning statements. In addition to the associative strength
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
models to predict the characteristic learning of the group labels A and B, the model also
curve in which learning gradually reaches an assumes that associations form between the context
asymptote, at which point learning stops. The (X) and the outcome. These contextual cues are par-
change in associative strength on any given trial ticularly important for predicting various cue compe-
(DV ) is a function of the difference between the tition effects in human learning (see Baker, Murphy,
lu˙and the associative strength acquired on pre- Vallée-Tourangeau, & Mehta, 2001), but have no
vious trials. However, the RWM also proposed impact on the predictions that we describe here.
that the associative strength (V ) acquired by a We simulated the illusory correlation effect in
single CS was determined not only by its own pre- the Hamilton and Gifford (1976) paradigm
vious association with the US, but by the associ- using the RWM. We used a particular set of par-
ations acquired
by all other (n) cues present on a ameters and training trials to illustrate the effect,
given trial ( ni¼1V ) but the effect is parameter independent assuming
standard assumptions about learning. In general
n it is the rate of acquisition and elimination of the
DV = ab l − V 1 illusory correlation effect that is influenced by
i=1 changes in these values. The data from the simu-
lations are presented in the left-hand panel of
where a and b represent learning-rate parameters Figure 1. While the two originally neutral group
for the predictor (e.g., CS) and outcome (e.g., US), labels (A for the majority group and B for the min-
respectively, with l representing the limit on the ority group) acquire positive associations, and the
total amount of associative strength. The formula associations are of similar strength by the end of
can be used to calculate predicted changes in training, the model predicts that preasymptotically,
associative strength for a given cue presented on Group A will have a stronger positive association
a given trial. Rather than associating the labels than Group B. Thus, this simple associative learning
with the specific words and actions described in mechanism makes the prediction that the IC effect is
each statement, we propose that statements a result of incomplete learning. Therefore, it is not a
describing social interactions are associated with final outcome of the learning mechanism and should
the affective response elicited by the statements diminish with sufficient training.1 Two other
1
Not shown in Figure 1 are the association strengths for the contextual cues; although included in the simulations, they are not
shown as they have no impact on this prediction, given that the strength of the background does not influence this prediction.
Figure 1. Simulations applying the Rescorla–Wagner associative model (Rescorla & Wagner, 1972) to the three treatments described in
Experiments 1 and 2. The left panel (2:1 positive) represents the treatment described in Experiments 1 and 2. The simulation was run
with twice as many positive as negative statements and twice as many majority as minority group members. This treatment is similar to
the one described by Hamilton and Gifford (1976). The middle panel (5:1 positive) represents simulation data for the treatment
described in Experiment 2, with five times more positive than negative statements and twice as many majority as minority group
members. The right panel (5:1 majority) represents the data from the simulation of a majority group that was five times the size of the
minority group with the ratio of positive to negative statements again 2:1. The associability parameter for the group labels, a, was set at
.2 for both group labels, the associability parameter, b, for outcome presence and absence was set at .1, and the maximum amount of
associative strength, l, was set at 1 and –1 for the positive and negative statements, respectively; with these parameters the model was
run for 1,000 trials.
features of the predictions are worth noting. First, is unclear how this result is predicted by the distinc-
with low levels of experience the model predicts tiveness and frequency approaches (e.g., Hamilton
that the initially accrued associative strengths will & Gifford, 1976). Finally, this type of associative
be similar for both groups, and therefore an illusory learning model contains no episodic memory of
correlation effect is expected to be absent. Second, the events. There are no hidden units or distributed
both groups should become associated with positive representations to deal with as the model represents
affect. Neither of these predictions can be derived the problem in terms of two associations, one for
from an account based on distinctiveness or one each group.
based on memory loss since with low levels of
experience, distinctiveness should be high and
memory loss strongest. In spite of the fact that the EXPERIMENT 1
minority group is rated as less positive than the
majority group, the prediction is that both groups The first experiment was a conceptual replication
will be rated more positively at the end of training of the original IC experiment, which allowed
than at the outset. This prediction reflects the fact learning effects to be evaluated. Participants were
that the model learns that a majority of the thus presented with stimulus materials adapted
statements describing both groups are positive. It from those used by Hamilton and Gifford
(1976), but were exposed to a total of 90 trials (39 that fell within one standard deviation of the
used in the original). Participants were asked to mean were selected for use during the main
report their evaluations of the majority and min- study. A total of 60 positive and 30 negative state-
ority groups at regular intervals during training ments were used.
in order to monitor the development of the IC
effect (in the original procedure, these evaluations Procedure
are only made at the end of the presentation). Over Participants signed a standard consent form, sat in
90 trials, 67% of all the statements describing both front of a desktop computer, and read a series of
majority and minority group members were chosen instructions that informed them of the task
to suggest positive behaviours, with the remaining requirements. Participants were instructed to
33% suggesting negative behaviours (these pro- read sentences that described members of two
portions result in a more balanced contingency fictitious groups labelled Group A and Group
than the 69% and 31% used in Hamilton & B. They were told that they would be required to
Gifford, 1976). Experiment 1 used the same make judgements about the two groups.
letter group labels (A and B) as those used in the Sentences were presented on the monitor one at
original IC demonstration. a time, with no opportunity to return to previously
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
to “extremely like” and 1 referring to “extremely increase across training. All statistical tests used a
dislike”. The order in which the ratings were gath- .05 rejection level except where stated.
ered was counterbalanced. Half the participants The baseline data were analysed separately from
were always asked to give their rating of the the training data. This conservative decision
majority group first while the other half were ensures that any initial differences that may have
asked to judge the minority group first. confounded interpretation of the training data
Although all participants were asked to make jud- were identified. The analysis of variance
gements about both groups, they were not told (ANOVA) of the baseline judgements revealed
which group corresponded to the majority or min- neither an effect of group (majority or minority)
ority groups. nor an effect of question order (majority or min-
ority judged first) nor any interactions. The differ-
ence between ratings of the majority and minority
Results and discussion group was not reliable, F(1, 18) ¼ 1.00, p ¼ .33,
MSE ¼ 0.10, h2 ¼ .053. This finding confirms
The mean ratings of liking in Experiment 1, our that no initial bias in favour of the majority
measure of the participants’ newly formed attitudes Group A or minority Group B was present. A
about the groups, are shown in Figure 2. Each
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
then looked directly at comparing judgements at In addition to the liking ratings, which reflect
each block. As these could be argued to be the participants’ attitudes to the two groups, we
planned comparisons, we corrected the alpha also gathered data that were meant to reflect par-
level (a/number of comparisons) and used a ¼ ticipants’ memory for the frequency of events. It
.01 for each comparison. These individual single is possible, but unlikely, that participants saw so
degree of freedom comparisons found that the many sentences that by the end of our longer train-
relatively large difference shown in Figure 2 on ing procedure they could not distinguish between
Block 3 was consistent with a statistically reliable the two groups, although why more exposure
difference in ratings, F(1, 18) ¼ 13.16, p ¼ .002, should produce weaker memories is unclear.
MSE ¼ 0.372, h2 ¼ .422. None of the other When asked at the end of the experiment to
block comparisons approached the critical F provide frequency estimations of the number of
value, with Blocks 2, 4, and 5, all Fs , 1, and statements for each type of group, participants
Block 1, F(1, 18) ¼ 2.00, p ¼ .174, MSE ¼ recalled the group ratio accurately. They had no
0.313, h2 ¼ .100. Therefore, these results are con- trouble reporting that twice as many sentences
sistent with Hamilton and Gifford’s (1976) orig- were presented referring to the majority group.
inal finding of an illusory correlation effect, but The mean estimated number of positive statements
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
only during the intermediate stages of the learning experienced for the majority and minority groups
procedure at a similar number of trials to the 39 were M ¼ 28.75 (SE ¼ 2.78) and M ¼ 15.75
used by Hamilton and Gifford (especially Blocks (SE ¼ 1.12), respectively, where the actual
2 and 3, corresponding to 36 and 48 exposures, numbers were 40 and 20. Because the variances
respectively). were not homogeneous, we used the nonparametric
In summary, participants reported no initial Wilcoxon Signed Ranks test to analyse these data
preference for either the majority Group A or and found a reliable difference in frequency esti-
the minority Group B, but with intermediate mates, Z ¼ – 3.41, p , .001. Therefore, at the
training the majority group was liked significantly end of training, participants accurately reported
more than the minority group. With extended that they had received approximately twice as
training, however, this difference in turn disap- many positive statements about the majority
peared. It is worth noting from consideration of group than about the minority group, although
Figure 2 that the ratings of the minority group at we note that this may represent a slight underesti-
the end of training were similar to the ratings of mation of the actual number of statements.
the majority group at about half the amount of
exposure. This is suggestive that the lower liking
of the minority group relative to the majority EXPERIMENT 2
group early on in training represents preasymptotic
learning. While the exact number of trials required To test the generality of these new findings, in a
to show the effect will always depend upon the sal- second experiment we tested a prediction that
ience of the stimuli used during learning, it is the illusory correlation effect of minority group
striking that the production of the IC effect in stereotypes will dissipate with extended training
our experiment peaks at Blocks 2 (36 trials) and for two novel situations in which we varied the
3 (48 trials), exactly the range of exposures (36 – ratio of the behavioural statements or the ratio of
48) used in all the studies of the IC effect reviewed the groups or the proportions.
by Mullen and Johnson (1990). This limited range We chose one of the new conditions specifically
of exposures appears to have become standard in to explore an interesting prediction of the associat-
illusory correlation research inspired by the ive model that absolute judgements of likeability
Hamilton and Gifford paradigm and continues of the two groups are related to the ratio of positive
to be used by later researchers (e.g., Sherman statements to negative statements. In Experiment 1,
et al., 2009). 67% of statements were positive; increasing this
statements about the majority group. The effect members). The ratio of positive to negative state-
should first be acquired and then dissipate. ments for this group was the same as that in the
In the second new group the majority to min- control group (2:1). Participants in group 5:1 posi-
ority ratio was increased to 5:1 (84%, 16%). This tive received five times as many positive as negative
group might be expected to produce a stronger (75:15) statements. The ratio of majority to min-
illusory correlation effect or at least one that took ority statements was the same as that in the
longer to overcome with training. The model control group. These two new conditions can
suggests that since they would receive even more therefore be considered as involving either a treat-
relative experience with the majority group, it ment with a large majority or a treatment involving
would take longer to learn about the minority more well-behaved individuals. Group labels were
group (See Figure 1, right panel). Both distinctive- blue and red or orange and purple; within each pair
ness and memory loss theories should predict a the use of a colour to describe the majority group
strengthening of the illusory correlation effect was counterbalanced. We pilot tested using
regardless of the number of trials, since again the colour labels instead of letter labels as used in the
minority group is presented even less frequently original Hamilton and Gifford (1976) design and
than in the original design. A final group involved found that these produced the same pattern of
the 2:1 group that we used in Experiment 1 as a effects as that in Experiment 1. We did not ask
control. the participants for baseline ratings in this exper-
iment given that these did not elicit differential
ratings in our previous work, and we thought it
Method
was worth testing whether the effect would still
Participants and materials dissipate if there was no chance to anchor judge-
A total of 92 university undergraduates took part ments at the outset of the experiment.
in this experiment, each given 3 GBP. Two
changes were made to the experimental materials.
Results and discussion
Groups were labelled with colours rather than the
letters A and B used in Experiment 1, and the As in Experiment 1, initially there was no discrimi-
Likert scale was extended. We used an 11-point nation between the majority and minority groups
likeability rating scale in attempt to provide a in each condition. At an intermediate level of train-
larger range for participants to express their judge- ing a difference emerged, and by the end of training
ments to better capture the effect. no preference for either group was found. The
mean liking ratings for the majority and minority three conditions on the second block of trials. The
groups across five blocks are shown separately for main effect of statement ratio also was reliable,
the three different frequency manipulations in F(2, 86) ¼ 10.35, p ¼ .001, MSE ¼ 3.61, h2 ¼
Figure 3. The learning effect does not seem as .194, but there was no interaction between block
pronounced, but it is important to remember that and statement ratio. The ratio main effect suggests
this is partially because there are no baseline data, that the absolute value of the judgements differed in
so that Block 1 already reflects some learning. the three groups; it was predicted by the RWM that
We replicate the development and then diminution the 5:1 positive ratio group should have elicited
of the illusory correlation effect in all three groups. higher judgements than either other treatment.
Additionally, there was only moderate statistical The statistical analysis confirmed this prediction,
support for any reliable difference in the relative and the observation from Figure 3 demonstrates
size of the majority group preference between the that this was the case. The 5:1 high positive con-
three conditions. However, there was strong dition elicited higher judgements than both the
evidence supporting the associative prediction 5:1 high majority and the 2:1 control condition
that judgements of likeability would be higher using the Bonferroni correction (the mean differ-
(both groups rated as more likeable) by the end of ences were 1.30 and 1.35, p , .001, for both com-
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
training when the ratio of positive to negative parisons, respectively). There was no difference in
statements was 5:1 than in the 2:1 condition, the absolute judgements between the high majority
more well-behaved condition. and the control condition.
An ANOVA was carried out including the four These results confirm the predictions generated
factors statement ratio (2:1 positive, 5:1 positive, 5:1 by an associative learning model for the emergence
majority), group (majority or minority), majority and elimination of the illusory correlation effect.
label (blue/red, orange/purple), and block (1–5). We have shown with this experiment that the
Only two effects were statistically reliable: the inter- acquisition process posited by associative learning
action between group and block, F(4, 320) ¼ 4.01, models is common to other trial sets. We also
p ¼ .003, MSE ¼ 2.37, h2 ¼ .045, and the main found some evidence that this effect is sensitive to
effect of statement ratio, F(2, 86) ¼ 19.94, p ¼ the relative ratio of positive to negative statements,
.001, MSE ¼ 8.85, h2 ¼ .317. The interaction is although the prediction for a stronger effect with
consistent with the observation that the difference the 5:1 majority was not confirmed. This failure
between the majority and minority ratings was not may reflect the failure of the theory or the sensitivity
the same after each block of presentations. of our measures to find a significant effect.
However, the three-way interaction between state-
ment ratio, group, and block that might have
reflected different levels of illusory correlation in GENERAL DISCUSSION
the three conditions was not reliable (F , 1).
Although the 5:1 majority group looks like it may Our results show that the emergence of an illusory
have produced a larger illusory correlation, as was correlation effect depends on the amount of
predicted by the RWM, the statistical evidence experience that the learner has of the (non)covar-
did not support this prediction. We tested the size iation of the target social groups with the frequent
of the difference between majority and minority and infrequent behaviours in question. Thus while
group liking ratings after each block and only we reproduced the illusory correlation effect using
found a statistically reliable difference between the the experimental paradigm of Hamilton and
minority and majority group ratings preasymptoti- Gifford (1976), the (illusory) perception of a cor-
cally, in this case after the second block of trials, relation between the minority group and an
F(1, 86) ¼ 12.56, p ¼ .001, MSE ¼ 3.35, h2 ¼ infrequent behaviour peaked on the third trial
.127. This could be taken as evidence that the illu- block (54 observations) in Experiment 1 and the
sory correlation effect was of a similar strength in all second block (36 observations) in Experiment 2
Figure 3. Mean liking ratings for majority and minority groups in the standard 2:1 positive condition, with a ratio of 2:1 for both majority to
minority and positive to negative statements (left panel), the 5:1 positive condition with a ratio of 5:1 positive to negative behavioural
statements (middle panel), and the 5:1 majority condition with a ratio of 5:1 majority to minority group members (right panel) in
Experiment 2. Average ratings from two blocks (18 trials).
and in both cases declined to be become extin- negativity. They also raise the question as to
guished at the fifth trial block (90 observations). whether similar learning curve effects will emerge
Our results are thus quite consistent with the in related experimental paradigms in which illu-
meta-analysis of Mullen and Johnson (1990), sory correlation has been observed, such as that
which reviewed 24 experiments that all used used by Sherman et al. (2009) where people are
between 36 and 48 observations, which correspond asked to learn the relationships between majority
approximately to our second and third learning and minority groups and two independent dimen-
blocks. However, the emergence of a clear learning sions that vary in positivity and negativity, such as
curve qualifies Mullen and Johnson’s (1990, p. 21) friendliness and intelligence.
assertion that “the basic illusory correlation effect By using predictions derived from a formal
is stronger under . . . larger numbers of exemplars associative learning model we were able to test
in the stimulus array”. While Mullen and the prediction that the effect is a preasymptotic
Johnson’s assertion is correct for the earlier part learning effect that is not simply dependent upon
of the learning curve (Blocks 1 to 3), it is mani- the distinctiveness of the minority group state-
festly incorrect for the later part of the learning ments or memory loss. The results suggest that
curve (Blocks 4 to 5). Our theoretical analysis this is indeed the case, with a significant illusory
and results therefore set boundary conditions for correlation effect favouring the majority group
the emergence (and disappearance) of the illusory only occurring after an intermediate number of
correlation effect in Hamilton and Gifford’s exper- stimulus presentations. In Experiment 1, using
imental paradigm, where people have to learn the Hamilton and Gifford (1976) procedure, we
relationships between groups and behaviours that obtained an illusory correlation effect with inter-
vary along a single dimension of positivity and mediate exposure. The actual amount of exposure
required to produce the effect is not simply deter- exposure, such as Smith’s (1991) memory-trace
mined by the number of trials, but obviously by model and Fiedler’s (1991) information loss
individual differences and salience of sentences, account.
and hence no unqualified conclusion should be
derived about the ideal number of trials required
Advantages of the associative learning
to elicit the effect. As in other domains of learning
approach
this is an empirical question, which depends sig-
nificantly on the exact experimental procedure A primary advantage of a formal associative learn-
used. What learning theory does provide is the ing approach like the RWM is that it can already
generalized learning curve. account for a wide range of phenomena other
Finally, an equally important but often over- than illusory correlation. We hope to have shown
looked aspect of the data is that comparison of how the generally accepted associative learning
responding on the final block of training with model can (a) account for the illusory correlation
the baseline ratings suggested quite clearly that effect in social cognition and (b) make original
judgements became more positive as a direct predictions about the development and extinction
result of experience. In Experiment 2 we also of this effect. In particular, we note that associative
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
showed that the RWM model can be used to gen- models have already had success in explaining
erate specific and novel predictions about the con- human learning on a range of other tasks including
ditions under which the absolute level of likeability correlation learning, blocking, and outcome
can be manipulated by varying the relative pro- density effects (e.g., Baker et al., 1996), and in
portion of positive to negative statements. using them to explain illusory correlation effects
Taken together these findings illustrate the in social stimuli, we are adding explanatory range
value in applying a general learning model to our to a theory that is already parsimonious and
understanding of the processes underlying powerful. Second, the associative learning
human learning of negative beliefs about and atti- approach has prima facie psychological plausibility
tudes to minority groups. They pose a challenge as applied to illusory correlation: Two factors that
to Hamilton and Sherman’s explanation have been identified as central to the learning of
(McConnell, Sherman, & Hamilton, 1994) in illusory correlations by other models, such as dis-
terms of “ultimate distinctiveness”, since it tinctiveness (Hamilton, 1981) and frequency
remains unspecified how this should vary as a (Fiedler, 1991; Smith, 1991), are already incorpor-
function of amount of exposure. Models of a ated in existing learning models (e.g., Rescorla &
more formal nature have been developed to Wagner, 1972). Third, and most important for
account for the illusory correlation phenomenon the experiments described here, existing associat-
(e.g., Fiedler, 1991, 2000b; Smith, 1991). ive learning models can readily predict the illusory
However, none of the existing social cognition correlation effect and naturally generate unique
models of illusory correlation have looked at learn- and novel testable predictions about the develop-
ing curves and the development and disappearance mental nature of the phenomenon.
of the group difference in the illusory correlation Hamilton and Gifford (1976) offered the
paradigm, and still less have they examined the concept of distinctiveness by infrequency as an
effect of increased learning on the illusory corre- account of the illusory correlation effect they
lation effect. The disappearance of the group observed—that is, infrequent events are con-
difference is a research question that has been sidered more memorable and hence become more
neither explored nor modelled in detail, and available when participants recall the covariation
which clearly merits further investigation. information. However, other work indicates a
Furthermore, our data provide explicit and robust more complex picture in which distinctiveness is
disconfirmation of models that predict continuing used to describe salient features possessed by indi-
perception of illusory correlation with increased vidual stimuli rather than within-category stimuli
infrequencies. In these studies expectations of hand we have assumed that the strength of the
associations between paired stimuli (e.g., association between our fictitious group labels
Chapman, 1967; Hamilton & Rose, 1980), the and valence of behaviour is predicted to be zero
negative valence of stimuli (Mullen & Johnson, on the first trial. But depending upon the nature
1990), and own-attitude congruence with some of any prior knowledge it may take longer or less
of the stimuli presented (Spears, van der Pligt, & time to learn to asymptote.
Eiser, 1986) have all enhanced illusory corre- Our overall conclusion is that these data
lations. In the simplest case, when distinctiveness provide new empirical evidence for the associative
is defined purely as the relative within-category learning interpretation of the illusory correlation
infrequency of a particular stimulus, associative effect first suggested by Slugoski, Sarson, and
models (e.g., RWM) do predict preasymptotic for- Krank (1991). However, we do not claim that
mation of the typical illusory correlation effect. associative learning models can explain all findings
Information loss models (e.g., Fiedler, 1996, that have been reported in connection with the
2000a) can make the prediction of the elimination illusory correlation effect or that the effect is
of the illusory correlation effect by recruiting a only determined by such a process (e.g., Sherman
number of auxiliary assumptions (e.g., concerning et al., 2009). The illusory correlation effect
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
stimulus salience; Fiedler, 2000b). However, these clearly is a multiply determined phenomenon
predictions do not emerge from the information that occurs at both the encoding and the judge-
loss account itself, but from the addition of ment stage. For instance, associative models are
additional parameters to the model. It is also not applicable in cases where there is no sequential
important to note that the RWM already incor- learning—that is, where participants are only given
porates weighting parameters that represent summary information about groups (e.g.,
stimulus salience. Thus when distinctiveness is McGarty, Haslam, Turner, & Oakes, 1993).
defined as a stimulus feature, predictions about
speed of learning depend on the link between the
Practical implications for stereotype
learning-rate parameters a and b, which capture
formation and elimination
the salience or associability of the group labels
and statement’s affective value, respectively. The We conclude by considering the implications of
RWM predicts that learning will be enhanced our results for real-world stereotyping phenomena
when more salient stimuli are paired with a in intergroup relations, and in particular for the
group (e.g., Kanouse & Hanson, 1972). contact hypothesis for reducing negative stereo-
However, determining stimulus salience for any typing of outgroups. Our data suggest that nega-
given stimuli set requires a psychophysical study of tive minority stereotypes are not an inevitable
stimulus salience and is likely to depend on the final product of general learning processes. They
specific qualities of the stimuli involved and the also help resolve an apparent contradiction
particular experimental situation. At the moment between the illusory correlation literature and
no such study has been conducted. However, work on the intergroup contact hypothesis
recent work has suggested that other factors are (Allport, 1954; Watson, 1947). This contradiction
also likely to influence learning. For example, pre- exists because current models of IC (Fiedler, 1991;
vious predictiveness of a stimulus may facilitate Smith, 1991) predict that proportionately
learning. LePelley et al. (2010) have shown that increased exposure to both majority and minority
originally neutral stimuli that are predictive of groups should a priori lead to increased prejudice,
unrelated neutral events endow these stimuli not to reduced prejudice. In contrast, the contact
with increased associability consistent with some hypothesis predicts that increased contact
associative accounts. Any experience that increases between groups should lead to less prejudice
attention or prior knowledge about a cue would be (e.g., Brewer & Miller, 1984; Gaertner, Mann,
expected to influence these effects. On the other Murrell, & Dovidio, 1989; Hewstone & Brown,
1986; Pettigrew, 1998; see Brown & Hewstone, Baron, J. (2000). Thinking and deciding (3rd ed.).
2005, for a review), and recent meta-analyses New York: Cambridge University Press.
have found support for this hypothesis (e.g., Berndsen, M., Spears, R., van der Pligt, J., & McGarty,
Pettigrew & Tropp, 2005, 2008). C. (2002). Illusory correlation and stereotype for-
mation: Making sense of group differences and cog-
Allport (1954) suggested that contact would be
nitive biases. In C. McGarty, Y. Yzerbyt, &
beneficial if it occurred in a situation of equal
R. Spears (Eds.), Stereotypes as explanations: The for-
status, involved cooperative interaction in the mation of meaningful beliefs about social groups
pursuit of common goals, allowed the formation (pp. 90–110). Cambridge, UK: Cambridge
of close relationships with outgroup members, University Press.
and was institutionally supported. The associative Brewer, M. B., & Miller, N. (1984). Beyond the contact
learning approach represented by the RWM hypothesis: Theoretical perspectives on desegrega-
would add an important qualifier to these con- tion. In N. Miller & M. B. Brewer (Eds.), Groups
ditions: Contact as such will only be beneficial if in contact: The psychology of desegregation
it is prolonged and/or allows the individual to (pp. 281–302). Orlando, FL: Academic Press.
learn to asymptote the proportion of positive to Brown, R., & Hewstone, M. (2005). An integrative
theory of intergroup contact. In M. P. Zanna
negative attributes in both groups (and if these
(Ed.), Advances in experimental psychology (Vol. 37,
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
Fiedler, K., Russer, S., & Gramm, K. (1993). Illusory McGarty, C., & de la Haye, A. M. (1997). Stereotype
correlations and memory performance. Journal of formation: Beyond illusory correlation. In
Experimental Social Psychology, 29, 111–136. R. Spears, P. J. Oakes, N. Ellemers, & S. A.
Field, A. P. (2005). Learning to like (or dislike): Haslam (Eds.), The social psychology of stereotyping
Associative learning of preferences. In A. J. Wills and group life (pp. 144–170). Oxford, UK:
(Ed.), New directions in human associative learning Blackwell.
(pp. 221–252). Mahwah, NJ: Lawrence Erlbaum McGarty, C., Haslam, S. A., Turner, P. J., & Oakes, P.
Associates. J. (1993). Illusory correlation as accentuation of
Gaertner, S. L., Mann, J. A., Murrell, A., & Dovidio, J. intergroup difference: Evidence for the effect with
(1989). Reducing intergroup bias: The benefits of minimal stimulus information. European Journal of
recategorization. Journal of Personality and Social Social Psychology, 23, 391–410.
Psychology, 57, 239–249. Msetfi, R. M., Murphy, R. A., Simpson, J., & Kornbrot,
Gluck, M. A., & Bower, G. H. (1988). From condition- D. E. (2005). Depressive realism and outcome
ing to category learning: An adaptive network model. density bias in contingency judgements: The effect
Journal of Experimental Psychology: General, 117, of context and the inter-trial interval. Journal of
227–247. Experimental Psychology: General, 134, 10–22.
Hamilton, D. L. (1981). Illusory correlation as a basis Mullen, B., & Johnson, C. (1990). Distinctiveness-
for stereotyping. In D. L. Hamilton (Ed.), based illusory correlations and stereotyping: A
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
Cognitive processes in stereotyping and intergroup meta-analytic integration. British Journal of Social
behavior (pp. 115–144). Hillsdale, NJ: Lawrence Psychology, 29, 11–28.
Erlbaum Associates. Murphy, R. A., & Baker, A. G. (2004). A role for CS–
Hamilton, D. L., & Gifford, R. K. (1976). Illusory cor- US contingency in Pavlovian conditioning. Journal of
relation in intergroup perception: A cognitive basis Experimental Psychology: Animal Behavior Processes,
of stereotypic judgments. Journal of Experimental 30, 229–239.
Social Psychology, 12, 392–407. Murphy, R. A., Vallée-Tourangeau, F., Msetfi, R., &
Hamilton, D. L., & Rose, T. L. (1980). Illusory corre- Baker, A. G. (2005). Signal-outcome contingency,
lation and the maintenance of stereotypic beliefs. contiguity and the depressive realism effect. In A.
Journal of Personality and Social Psychology, 39, Wills (Ed.), New directions in human associative
832–845. learning (pp. 193–219). Hove, UK: Lawrence
Hewstone, M., & Brown, R. (1986). Contact is not Erlbaum Associates.
enough. An intergroup perspective on the “contact Pearce, J. M. (1987). A model for stimulus generaliz-
hypothesis”. In M. Hewstone & R. Brown (Eds.), ation in Pavlovian conditioning. Psychological
Contact and conflict in intergroup encounters Review, 94, 61–73.
(pp. 1–44). Oxford, UK: Blackwell. Peterson, C. R. (1980). Recognition of non-contin-
Kanouse, D. E., & Hanson, L. R. (1972). Negativity in gency. Journal of Personality and Social Psychology,
evaluations. In E. E. Jones, D. E. Kanouse, H. H. 38, 727–734.
Kelley, R. E. Nisbett, S. Valins, & B. Weiner Pettigrew, T. F. (1998). Intergroup contact theory.
(Eds.), Attribution: Perceiving the causes of behavior. Annual Review of Psychology, 49, 65–85.
Morristown, NJ: General Learning Press. Pettigrew, T. F., & Tropp, L. R. (2005). Allport’s inter-
Kremer, E. F. (1974). The truly random control pro- group contact hypothesis: Its history and influence.
cedure: Conditioning to the static cues. Journal of In P. Glick, L. A. Rudman, & J. F. Dovidio
Comparative and Physiological Psychology, 86, 700–707. (Eds.), On the nature of prejudice: Fifty years after
LePelley, M., Reimers, S., Beesley, T., Calvini, G, Allport (pp. 262–277). Malden, MA: Blackwell.
Spears, R., & Murphy, R. A. (2010). Stereotype for- Pettigrew, T. F., & Tropp, L. R. (2008). How does
mation: Biased by association. Journal of intergroup contact reduce prejudice? Meta-analytic
Experimental Psychology: General, 139, 138–161. tests of three mediators. European Journal of Social
McConnell, A. R., Sherman, S. J., & Hamilton, D. L. Psychology, 38, 922–934.
(1994). Illusory correlation in the perception of Rescorla, R. A., & Wagner, A. R. (1972). A theory of
groups: An extension of the distinctiveness-based Pavlovian conditioning: Variations in the effective-
account. Journal of Personality and Social Psychology, ness of reinforcement and non-reinforcement. In
67, 173–185. A. H. Black & W. F. Prokasy (Eds.), Classical
conditioning II: Current theory and research Smedslund, J. (1963). The concept of correlation in
(pp. 64–99). New York: Appleton-Century-Crofts. adults. Scandinavian Journal of Psychology, 4,
Sedlmeier, P. (2006). Intuitive judgments about sample 165–173.
size. In K. Fiedler & P. Juslin (Eds.), Information Smith, E. R. (1991). Illusory correlation in a simulated
sampling and adaptive cognition. Cambridge, UK: exemplar-based memory. Journal of Experimental
Cambridge University Press. Social Psychology, 27, 107–123.
Shanks, D. R. (2007). Associationism and cognition: Spears, R., van der Pligt, J., & Eiser, J. R. (1986).
Human contingency learning at 25. The Quarterly Generalizing the illusory correlation effect.
Journal of Experimental Psychology, 60, 291 –309. Journal of Personality and Social Psychology, 51,
Shanks, D. R., & Dickinson, A. (1987). Associative 1127–1134.
accounts of causality judgments. In G.H. Bower Tversky, A., & Kahneman, D. (1973). Availability: A
(Ed.), Psychology of learning and motivation: heuristic for judging frequency and probability.
Advances in research and theory (Vol. 21, Cognitive Psychology, 4, 207–232.
pp. 229–261). San Diego, CA: Academic Press. Vallée-Tourangeau, F., Hollingsworth, L., & Murphy,
Shanks, D. R., & Dickinson, A. (1988). The role of R. A. (1998a). “Attentional bias” in correlation judg-
selective attribution in causality judgment. In D.J. ments: Smedslund (1963) revisited. Scandinavian
Hilton (Ed.), Contemporary science and natural Journal of Psychology, 39, 221–233.
explanation: Commonsense conceptions of causality Vallée-Tourangeau, F., Murphy, R. A., Drew, S., &
Downloaded By: [Hilton, Denis] At: 18:29 6 January 2011
(pp. 94–126). Brighton, UK: Harvester Press. Baker, A. G. (1998b). Judging the importance of
Sherman, J. W., Kruschke, J. K., Sherman, S. J., Percy, constant and variable candidate causes: A test of
E. J., Petrocelli, J. V., & Conrey, F. R. (2009). the power PC theory. Quarterly Journal of
Attentional processes in stereotype formation: A Experimental Psychology, 51A, 65–84.
common model for category accentuation and illu- Vallée-Tourangeau, F., Payton, T., & Murphy, R. A.
sory correlation. Journal of Personality and Social (2008). The impact of presentation format on
Psychology, 96, 305–323. causal inferences. European Journal of Cognitive
Slugoski, B. R., Sarson, D. A., & Krank, M. D. (1991, Psychology, 20, 177–194.
June). Cognitive load has paradoxical effects on the for- Van Rooy, D., Van Overwalle, F., Vanhoomissen, T.,
mation of illusory correlations, Paper presented at the Labiouse, C., & French, R. M. (2003). A recurrent
4th International Nag’s Head Conference on connectionist model of group biases. Psychological
Stereotypes and Intergroup Relations, Highland Review, 110, 536–563.
Beach, FL. Watson, G. (1947). Action for unity. New York: Harper.