Nagai 2013

Vision Research 79 (2013) 2735
Contents lists available at SciVerse ScienceDirect
Vision Research
journal homepage: www.elsevier.com/locate/visres
Comparing face processing strategies between typically-developed observers

and observers with autism using sub-sampled-pixels presentation in response
classication technique
Masayoshi Nagai a,, Patrick J. Bennett b, M.D. Rutherford b, Carl M. Gaspar c, Takatsune Kumada a,
Allison B. Sekuler b
a
National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba Central 6, 1-1-1 Higashi, Tsukuba, Ibaraki 305-8566, Japan
b
Department of Psychology, Neuroscience & Behaviour, McMaster University, 1280 Main Street West, Hamilton, Ontario, Canada L8S 4K1
c
Center for Cognitive and Brain Disorders, Hangzhou Normal University, Room 909, Yuanli Building, 276 Lishui RD, Gongshu District, Hangzhou 310000, China
a r t i c l e i n f o a b s t r a c t
Article history: In the present study we modied the standard classication image method by subsampling visual stimuli
Received 4 June 2012 to provide us with a technique capable of examining an individuals face-processing strategy in detail
Received in revised form 30 December 2012 with fewer trials. Experiment 1 conrmed that one testing session (1450 trials) was sufcient to produce
Available online 12 January 2013
classication images that were qualitatively similar to those obtained previously with 10,000 trials (Sek-
uler et al., 2004). Experiment 2 used this method to compare classication images obtained from observ-
Keywords: ers with autism spectrum disorders (ASD) and typically-developing (TD) observers. As was found in
Response classication
Experiment 1, classication images obtained from TD observers suggested that they all discriminated
Face discrimination
Autism
faces based on information conveyed by pixels in the eyes/brow region. In contrast, classication images
Strategy obtained from ASD observers suggested that they used different perceptual strategies: three out of ve
ASD observers used a typical strategy of making use of information in the eye/brow region, but two used
an atypical strategy that relied on information in the forehead region. The advantage of using the
response classication technique is that there is no restriction to specic theoretical perspectives or a pri-
ori hypotheses, which enabled us to see unexpected strategies, like ASDs forehead strategy, and thus
showed this technique is particularly useful in the examination of special populations.
2013 Elsevier Ltd. All rights reserved.
1. Introduction identity (Boucher & Lewis, 1992; de Gelder, Vroomen, & van der
Heide, 1991; Klin et al., 1999). The results of these behavioral stud-
Autism spectrum disorders (ASD) are neurodevelopmental con- ies are complimented by neuroimaging evidence showing that
ditions characterized by severe social and communication difcul- people with ASD exhibit less activation in areas typically associ-
ties, as well as by restricted behaviors and interests (American ated with face processing, such as the fusiform gyrus, when view-
Psychiatric Association, 1994; World Health Organization, 1994). ing faces (Dalton et al., 2005; Grelotti et al., 2005; Hubl et al., 2003;
The social decits seen in ASD are universal and arguably the most Kanwisher, McDermott, & Chun, 1997; Pierce et al., 2001; but see
clinically profound and debilitating symptoms (Kanner, 1943; also Pierce et al., 2004).
Wing & Gould, 1979); however, there is increasing evidence that Results from behavioral and eye-tracking experiments also
perceptual decits accompany those social decits. Evidence from point to some qualitative differences in the ways in which ASD
a variety of psychophysical studies suggests that people with ASD and TD observers process faces. Whereas TD individuals rely pri-
do not process faces in the same manner as typically-developing marily on information around the eyes and eyebrows for face iden-
(TD) individuals. Researchers have reported ASD-related difcul- tication (e.g., Sekuler et al., 2004; Vinette, Gosselin, & Schyns,
ties in recognizing facial expressions (Castelli, 2005; Celani, Batac- 2004), it has been suggested that individuals with autism rely less
chi, & Arcidiacono, 1999; Gross, 2004; Hobson, 1986; Hobson, on information in the eye/brow region, and more on information
Ouston, & Lee, 1988; Pelphrey et al., 2002), following eye gaze around the mouth/lower face region (Joseph & Tanaka, 2003;
(Chawarska, Klin, & Volkmar, 2003; Ristic et al., 2005; Swettenham Gross, 2004; Klin et al., 2002; Pelphrey et al., 2002; but see Ruth-
et al., 2003; Volkmar & Mayes, 1990), and in determining facial erford, Clements, & Sekuler, 2007; Rutherford & Towns, 2008;
van der Geest et al., 2002).
Corresponding author. Fax: +81 298616790. However, the distinction between ASD and TD face processing
E-mail address: masayoshi-nagai@aist.go.jp (M. Nagai).
may not be as clear as previously thought. For example, Rutherford
0042-6989/$ - see front matter 2013 Elsevier Ltd. All rights reserved.
http://dx.doi.org/10.1016/j.visres.2013.01.001
28 M. Nagai et al. / Vision Research 79 (2013) 2735
and Towns (2008) found no evidence of less looking time toward ulated, and the typical level of feature manipulation may not be
the eyes, nor greater looking time toward the mouth, for observers high-resolution enough to reveal individual differences in process-
with ASD compared to TD controls when scan paths were recorded ing strategies. Therefore, the classication image technique has an
during emotional face perception. Another recent study (Nihshim- advantage in investigating face processing of individuals with ASD
ura, Rutherford, & Maurer, 2008) examined the composite face ef- who may vary considerably in their face processing strategies
fect (Carey & Diamond, 1994; De Heering, Houthuys, & Rossion, (Rutherford, Clements, & Sekuler, 2007), and who might have atyp-
2007; Young, Hellawell, & Hay, 1987) and the effects of feature, ical face-processing strategies.
spacing, and contour changes on face identication (see Mondloch, In response classication, on each trial, a unique external noise
Le Grand, & Maurer, 2002). Nishimura et al. found that although is added to a stimulus, which an observer must classify (e.g., as
reaction times were slower for people with ASD than for TD control Stimulus A or Stimulus B). On some trials, the observers classica-
subjects, the general pattern of behavior was the same in the two tion will be correct. However, on other trials, the noise may make
groups. Based on these ndings, Nishimura et al. concluded that one stimulus (e.g., Stimulus A) look more like the other stimulus
ASD-related face perception decits may not be as pervasive as (e.g., Stimulus B), leading to incorrect classications. After many
previously thought. Indeed, in an earlier study, Rutherford, Cle- trials, the noise elds presented on each trial are sorted into four
ments, and Sekuler (2007) examined the observers ability to dis- stimulusresponse classes (NAA, NAB, NBA, and NBB). Here, NAB rep-
criminate positional changes around the eyes and mouth on face resents all samples of noise elds where Stimulus A was presented
recognition, and found that, on average, individuals with ASD per- and the observer classied it as Stimulus B. The mean classication
formed slightly worse on eye position discrimination than TD con- image (CI) is calculated as follows:
trols, but that there were two distinct groups of ASD individuals.
One group performed the eye-position discrimination task as well
CI MeanNAA MeanNBA MeanNBB MeanNAB
as TD controls, and the other group was signicantly worse. Nei-
ther ASD group performed differently than TD controls on the The classication image provides information about the correla-
mouth-position discrimination task. tion between the noise contrast at each location in the stimulus
The results from Rutherford, Clements, and Sekuler (2007) are and the observers response to that stimulus. Essentially, the clas-
notable because they suggest that there may be multiple behav- sication image is a map that shows how the contrast levels at var-
ioral clusters within a well-dened group of individuals with ious locations inuence the behavior response; consequently,
ASD (see also Barton et al., 2007), and therefore that averaging re- classication images sometimes are referred to as a behavioral
sults across all ASD participants may lead to results that are not receptive elds (Gold et al., 2000).
representative of individuals. Individual differences in behavior Although response classication was initially developed over
within an ASD group also may help explain some of the variability 40 years ago (Ahumada & Lovell, 1971), the standard classication
in results found across experiments. For example, given the rela- image technique, or variations thereof, recently have become
tively limited sample size in a typical experiment, if different increasingly popular to study a variety of visual phenomena: ver-
experiments include different relative proportions of participants nier acuity (Beard & Ahumada, 1998), face perception/cognition
from face perception-normal and face perception-impaired groups, (e.g., Gold, Sekuler, & Bennett, 2004; Gosselin & Schyns, 2001;
average results would be expected to differ across experiments. Mangini & Biederman, 2004; Schyns, Bonnar, & Gosselin, 2002;
In general, the variability in results across experiments and the Sekuler et al., 2004; Spezio et al., 2007), perceptual organization
presence of large individual differences among individuals with (Gold et al., 2000; Nagai, Bennett, & Sekuler, 2008), attention
ASD suggests that a full understanding of ASD-related face process- (e.g., Eckstein, Shimozaki, & Abby, 2002; Neri & Heeger, 2002; Sol-
ing decits requires a closer examination of data at the level of omon, 2002; Tse, Sheinberg & Logothetis, 2003), perceptual learn-
individuals. The present paper addresses this issue, using the re- ing (Gold, Sekuler, & Bennett, 2004), stereo vision (Gosselin, Bacon,
sponse classication technique (Ahumada & Lovell, 1971; Beard & Mamassian, 2004; Neri, Parker, & Blakemore, 1999), the crowd-
& Ahumada, 1998) to examine the information observers use to ing effect (Nandy & Tjan, 2007), and visual search (Saiki, 2008).
discriminate faces. Response classication can highlight where Although the classication image method is a powerful method
there are commonalities and differences in visual processes for revealing the visual processing strategies of individual observ-
amongst observers. For example, experiments using this technique ers, it has the weakness of often requiring many trials to obtain sta-
have obtained evidence for differential processing strategies across ble data. For example, Sekuler et al. (2004) tested observers in
individuals, even when overall measures of performance (e.g., dis- 10,000 trials per condition to obtain stable classication images
crimination thresholds) do not differ across individuals (e.g., Gold, for 128 128 pixel faces. It has been suggested that a variant of
Sekuler, & Bennett, 2004; Nagai, Bennett, & Sekuler, 2007, 2008). the classication image method, called bubbles, could calculate
An additional benet of the technique is that it does not require individual observers diagnostic images (the equivalent of classi-
an a priori hypothesis about which regions of a stimulus are most cation images in the bubbles method) with fewer trials, but a re-
important to observers. Because the technique estimates the ex- cent study by Caldara and colleagues examining face perception
tent to which of each local pixels contrast contributes to overall in a prosopagnosic patient required approximately 9000 trials to
performance, it can detect strategies that may not have been ex- reach a stable level of performance (Caldara et al., 2005). The bub-
pected before the experiment. This approach may be particularly bles technique has also been applied to face processing in an ASD
useful in examining strategies of face perception in special popula- population but diagnostic images from individual subjects were
tions, for whom little may be known in advance about processing not shown (although they were calculated; Spezio et al., 2007).
strategies. In contrast, more standard face perception experiments Therefore, detailed individual differences among ASD observers
make a priori determinations about the critical regions for discrim- have not yet been shown using the bubbles technique.
ination, manipulating information in particular regions according In the present study we modied the standard classication im-
to the experimenters hypothesis. For example, in different exper- age method to provide us with a technique to examine an individ-
iments, researchers may manipulate both eyes as a group, each eye uals face-processing strategy in detail with fewer trials, which
independently, the eyes and brows together, the nose, and/or the could allow us to apply the technique to special populations.
mouth regions to investigate the relative contribution of each fea- Experiment 1 describes the modied technique, and tests the ap-
ture (e.g., Joseph & Tanaka, 2003; Tanaka & Farah, 1993; Tanaka proach in a group of experienced observers. Experiment 2 then
et al., 1998). However, critical regions for a task may not be manip- uses the modied technique to compare face-processing strategies
M. Nagai et al. / Vision Research 79 (2013) 2735 29
in people with ASD and TD controls, and also investigates individ-

ual differences within each group of participants.
2. Experiment 1: The random sub-sampling method
The number of trials required for stable classication images in-

creases with the number of independent noise elements, so reduc-
ing the number of noise elements in the presented stimuli should
decrease the number of trials required for stable classication
images (Murray, Bennett, & Sekuler, 2002). Nagai, Bennett, and
Sekuler (2008) devised an approach to reduce the number of trials
required. Their Experiment 4 presented large stimuli comprising
552 552 (i.e., a total 304,704) pixels, making it very difcult to
estimate classication images using standard methods. Therefore,
instead of using a full 552 552 noise eld, they grouped pixels
into 6 6 noise regions, so that there were only 92 92 noise ele-
ments added to the unaltered stimulus images. By reducing the
number of noise elements, stable classication images could be ob-
tained in 9600 trials, compared to approximately 300,000 trials
that would be needed using a standard pixel-by-pixel noise pre-
sentation method (Murray, Bennett, & Sekuler, 2002).
Here, we take a slightly different approach, called the random
sub-sampling method. In this method, the resolution of the noise
matches the resolution of the stimulus, but only a random subset
of 25% of the stimulus pixels (+noise) is displayed. By distributing
the positions of these sub-sampled pixels over the entire stimulus,
the appearance is of a low contrast, completely sampled image.
Thus, we can probe the use of information across the entire stim-
ulus with the spatial scale of stimulus and noise equated, but we
should require signicantly fewer trials to obtain stable classica-
tion images. Experiment 1 tests the validity of this method to de-
rive classication images for face discrimination in four TD
observers with past experience in psychophysical experiments.
2.1. Method
2.1.1. Observers
Three observers from McMaster University (Canada) and one
observer from the University of Tsukuba (Japan), participated in
Experiment 1 (ages ranged from 23 to 28 years; mean age = 24.75 -
Fig. 1. Face stimuli used in the present study. (A) Original face stimuli used in
years). All observers had normal or corrected-to-normal Snellen vi- Sekuler et al. (2004). (B). Randomly sub-sampled images of original faces presented
sual acuity for Canadian observers and Landolt C visual acuity for in the actual experiments in the present studies. (C and D) Examples of noise-added
Japanese observer, and were experienced in psychophysical exper- faces images with the random sub-sampling and standard method, respectively.
iments. Observers MAT and m50 were male; observer m50 was
Japanese. Two of the Canadian observers (MAT and SKH) partici- the width subtended 2.41 deg (70 pixels), and each face was cen-
pated previously in the Sekuler et al. (2004) study; these two tered within a 128 128 pixel array (4.41 4.41 deg). In the pres-
observers are labeled in the current experiment as they were in ent study, instead of presenting the entire face and background,
Sekuler et al. to enable comparison across the studies. The remain- one pixel within each 2 2 pixel region of the face was randomly
ing two observers were nave regarding the purpose of the exper- selected for presentation, and the remaining pixels in that 2 2 re-
iment. Observers were paid $10 or 1000 yen for each session. gion were set to zero contrast (Fig. 1B); all pixels within the back-
ground region were also set to zero contrast. The locations of pixels
2.1.2. Apparatus used for the stimulus and noise presentation were xed within an
Stimuli were displayed on a 21-in. AppleVision monitor at observer, but varied across observers. Unique noise elements were
McMaster University and an Eizo T961 monitor at AIST (both moni- generated in each trial and added to the stimulus, with the lumi-
tors had resolution = 640 480 pixels, screen size = 38.0 cm nance of each noise element selected randomly from a Gaussian
28.5 cm, and refresh rate = 67 Hz), controlled by an Apple G3 com- distribution that had a mean of zero and a root mean square con-
puter at McMaster University and an Apple G4 computer at AIST. trast of 0.3. Fig. 1C and D shows an example of noise-added faces
Observers viewed stimuli binocularly from a distance of 100 cm, images with standard method and the random sub-sampling,
with head position stabilized by a chin-and-forehead rest. respectively. The background luminance of CRT monitors ranged
between 22.0 and 24.2 cd/m2 depending on individual monitors,
2.1.3. Stimuli and procedure but it was xed within each observer. One 2-down/1-up staircase
Except for the random sub-sampling method, stimuli and pro- was used to adjust the contrast of face to maintain response accu-
cedure were similar to those in Sekuler et al. (2004). Stimuli were racy at approximately 71% during a testing session.
two male faces from Gold, Bennett, and Sekulers (1999) face set Each trial began with the presentation of a black or white xa-
(Fig. 1A). The height of each face subtended 3.41 deg (99 pixels), tion point at the center of the screen for 1000 ms. Immediately fol-
lowing the xation point, a single stimulus (i.e., one of the two 1993). These distributions were then used to assess the statistical
faces plus noise) was presented for 507 ms. After the stimulus pre- signicance of each pixel in the ltered classication images.
sentation, both faces were presented side by side in 100% contrast Fig. 2B shows the locus of signicant pixels (in red) in the ltered
as full pixels image (i.e., without sub-sampling) until observers classication images for each observer after each session (p < .001).
pressed a key to indicate which face had been presented. Observers These data showed that the observers face-identity discrimina-
were aware that the probability of each face being presented on tions were mainly inuenced by noise in the eye and eyebrow re-
any given trial was 0.5. Auditory feedback indicated whether the gions. Although the number of signicant pixels in classication
response was correct or incorrect, and the xation point was re- images generally increases with the number of sessions, there
drawn 1000 ms after the response to signal the beginning of the were relatively minor changes in the spatial distribution of signif-
next trial. Each session consisted of 1500 trials and lasted approx- icant pixels across the face after the rst few sessions (Fig. 2B).
imately 90 min. The rst 50 trials were considered practice trials, Thus, one or two sessions (i.e., 1450 or 2900 trials) were enough
and only the last 1450 trials of each session were used to estimate to derive stable classication images with the random sub-sam-
the classication images. Each observer participated in four exper- pling method. Furthermore, the classication images are similar
imental sessions on mostly consecutive days. to images estimated from many more trials using standard meth-
ods (cf., Gold, Sekuler, & Bennett, 2004; Sekuler et al., 2004).
2.2. Results and discussion Fig. 3 shows the relationship of root-mean-square signal con-
trast threshold to the normalized cross-correlation between raw
The data analyses were based only on the pixels in which stim- classication images and the ideal template (Sekuler et al., 2004).
ulus and noise were actually presented. Fig. 2 shows raw classica- Here, the ideal template was simply the pixel-by-pixel difference
tion images (2A) and signicant pixels (2B) for each observer based between the two faces being discriminated (i.e., face A minus face
on session 1, sessions 12, sessions 13, and all sessions. The B) for each observer. Note that the cross correlation was calculated
images in Fig. 2 were compressed versions of the sampled faces, only with presented pixels. Consistent with previous studies of
which excluded non-presented pixels, so each panel includes face discrimination that used standard classication image meth-
64 64 pixels of data. The raw classication images reveal high ods (Gold, Sekuler, & Bennett, 2004), we found that the cross cor-
contrast blobs (black or white) around eye(s) and eyebrow(s) areas, relation measure, which is an index of how efciently observers
which generally become clearer as the number of sessions in- used the information on faces, increased across testing sessions
creases (top to bottom rows in Fig. 2A). For example, after the rst for three of the four subjects. Furthermore, a clear association be-
session observer MATs classication image shows one clear black tween threshold and cross-correlation emerged by the end of ses-
blob around the left eyebrow, and after the second session it shows sion 2, and changed only slightly after sessions 3 and 4. Finally, the
the right eyebrow as well. Then, after the third session, white blobs results from observers MAT and SKH were consistent with results
also appear around the both eyes as well as black blobs around the from those reported by Sekuler et al. (2004) using the standard re-
both eyebrows, and they become clearer still after the fourth sponse classication technique: As was reported by Sekuler et al.
session. (2004), in the current study observer MAT had a signicantly high-
Signicance levels were calculated by a permutation test: the er cross-correlation and more signicant pixels than SKH.
responses of each observer were randomly shufed, the classica- Overall, the random sub-sampling method provides results that
tion images were re-calculated for this random permutation of re- are qualitatively similar to those obtained with the standard meth-
sponses, and then ltered by a 10 10 uniform convolution kernel od, including the ability to reveal consistent individual differences
to remove high-frequency noise. This process was repeated 10 in processing strategies and performance. However, the sub-sam-
times to estimate the distribution of 40,960 noise elements lumi- pling method requires many fewer trials than the standard meth-
nance under the null hypothesis of no association between the ob- od, making it more appropriate for use with special populations a
servers response and the elements luminance (Efron & Tibshirani, topic we now turn to in Experiment 2.
1 ses.
2 ses.
3 ses.
4 ses.
MAT f10 SKH m50 MAT f10 SKH m50

(A) Raw classification images (B) Significant pixels in filtered classification
images
Fig. 2. Classication images results in Experiment 1. (A) Raw classication images and (B) signicant pixels (in red color) in ltered classication images against ideal
templates after each of four sessions for experienced TD observers.
entering the study, and one of the authors (MDR) conrmed their
diagnoses via two criteria: (1) the Autism Diagnostic Interview
(ADI-R; Lord, Rutter, & LeCouteur, 1994); and (2) the Autism Diag-
nostic Observation Schedule (ADOS-G; Lord et al., 2000). All
observers were free from other known medical conditions. They
had normal or corrected-to-normal Snellen visual acuity, were
nave regarding the purpose of the experiment, and were paid
$10 for each session.
3.1.2. Apparatus
The apparatus was the same as in that used in McMaster Uni-
versity in Experiment 1.
3.1.3. Stimuli and procedure

The stimuli and procedure were the same as in Experiment 1
Fig. 3. The relationship of log-transformed threshold RMS contrast and the log- except for the number of sessions: observers participated in two
transformed cross-correlation between a raw classication image and ideal
sessions of 1500 trials. As in Experiment 1, only the last 1450 trials
template in Experiment 1. Different symbols represent different experienced TD
observers, and different levels of gray correspond to different number of sessions of each session were used to estimate the classication images.
used to create the classication images. The dashed lines represent regression lines
t to the data; the correlations between thresholds and cross-correlations are
3.2. Results and discussion
indicated next to each line.
Classication images for each observer were analyzed as in

3. Experiment 2: Face discrimination in typically-developed Experiment 1. Fig. 4 shows signicant pixels in the ltered classi-
observers and observers with autism spectrum disorders cation images for each individual. The general distribution of sig-
nicant pixels across the face remained consistent between
The present experiment used the random sub-sampling re- classication images based on one and two experimental sessions,
sponse classication image technique to compare face-processing suggesting that one session was sufcient to produce stable classi-
strategies of ASD and TD observers, and to examine individual dif- cation images. For TD observers, signicant pixels were located
ferences within each group. primarily around the eye and eyebrow regions, except for observer
m43 who showed a clear cluster of signicant pixels around the
forehead as well as the eye/brow region after two sessions. These
3.1. Methods results, obtained from community-based TD observers, are consis-
tent with results obtained from University-based TD observers in
3.1.1. Observers Experiment 1 and previous face discrimination studies using re-
Ten volunteers participated in the experiment: ve high func- sponse classication (Gold, Sekuler, & Bennett, 2004; Sekuler
tioning adolescents and young adults in the ASD group (all male, et al., 2004), eye-tracking (Heisz & Shore, 2008; Klin et al., 2002;
ages ranging from 19 to 24 years, mean age = 21.2 years), and ve Langdell, 1978), and other behavioral methods (Joseph & Tanaka,
age and IQ matched typically-developed young adults from Hamil- 2003; Vinette, Gosselin, & Schyns, 2004). In general, information
ton, Ontario (all male, ages ranging from 19 to 30 years, mean in the eye/brow region was most clearly linked to an observers
age = 23.4 years; see Table 1 for IQ information). All participants discrimination of faces. The modied response classication image
in the ASD group had received clinical diagnoses of autism before approach also was able to detect individual differences in the rela-
tive weighting of pixels within eye/brow regions. For example,
observers m31 and m41 were inuenced by pixels on the left side
Table 1
Age and WAIS-III IQ scores of participants in Experiment 2. Standard deviations are
of faces, m45 and m49 on the right side, and m43 on both sides.
shown in parentheses. Welch Two Sample t tests were used for group comparisons. Additionally, the relative extent to which observers were inu-
enced by pixels in the eye and/or brow regions varied across
Group Age Verbal IQ Performance Full scale IQ
IQ
individuals.
The classication images in Fig. 4b indicate that there were
TD (N = 5) 23.4 (4.4) 101.8 (6.3) 103.6 (9.7) 103.2 (6.6)
ASD (N = 5) 21.2 (1.8) 100.0 (15.1) 100.6 (6.9) 100.0 (9.8)
clearly two qualitatively different face-processing strategies used
t(5.29) = 1.04 t(5.36) = 0.25 t(7.29) = 0.57 t(7.03) = 0.61 by ASD observers. One group was inuenced by pixels in the eye/
p = .34 p = .81 p = .59 p = .56 brow region (m23, z02, and z03), and another group was inu-
enced by pixels in the forehead region (m21 and z04). The forehead
1 ses.
2 ses.
m31 m41 m43 m45 m49 m21 m23 z02 z03 z04
(A) Significant pixels in filtered classification (B) Significant pixels in filtered classification
images for typically-developed observers images for autistic observers
Fig. 4. Classication images results in Experiment 2. Signicant pixels (in red color) in ltered classication images against ideal templates after each of two sessions for (A)
TD and (B) ASD observers.
region was not a typical cue for TD observers in the face discrimi- (see Table 2). These results highlight the importance of determin-
nation task, and carries relatively little discrimination information. ing the individual strategies revealed by classication images,
Thus, consistent with Rutherford, Clements, and Sekuler (2007), which could not be detected by other measures like contrast
the classication images demonstrate that ASD observers adopted threshold.
different strategies to discriminate faces: some observers adopted All ASD observers in the present study also participated in the
a strategy that was similar to the one used by TD observers, but experiments in Rutherford, Clements, and Sekuler (2007), enabling
others used an atypical strategy. us to compare performance across the two tasks. As noted earlier,
Table 2 shows the discrimination threshold and two quantita- Rutherford et al. found no overall difference between the ability of
tive measures of the classication images from each observer: TD and ASD observers to discriminate faces that differed in terms
the normalized cross correlation between an observers classica- of the vertical position of the mouth, but they found that, on aver-
tion image and ideal template, and the number of signicant pixels age, ASD observers were slightly worse than TD subjects at dis-
in the classication image based on two-sessions data. Statistical criminating faces that differed in terms of the horizontal
tests did not show any signicant group differences (threshold, positions of the two eyes. However, this overall group difference
MeanTD = 0.081, MeanASD = 0.099, t(8) = 0.503, p = .63; cross cor- was caused by a subset of ASD observers who performed signi-
relation, MeanTD = 0.093, MeanASD = 0.064, t(8) = 1.564, p = .16; cantly worse than TD subjects, whereas the rest of the ASD observ-
signicant pixels, MeanTD = 85.2, MeanASD = 40.2, t(8) = 1.923, ers performed as well as TD observers. Observers m21, m23, and
p = .091). Overall, there was a signicant correlation between log- z02 from the current experiment belonged to typically-performing
transformed threshold and cross-correlation (r = 0.89, group in Rutherford et al., whereas observers z03 and z04 belonged
t(8) = 5.64, p < .001), and there was no evidence that the associa- to the atypical (impaired) group. In the present study, observers
tion differed between groups (Fig. 5). Interestingly, quantitative m21 and z04 had atypical classication images: they were inu-
measures (signal contrast threshold, number of signicant pixels, enced signicantly by pixels in the forehead region. Hence, there
and cross-correlation between the ideal template and classication was not a one-to-one correspondence between whether an obser-
images) did not predict ASDs two different strategies. One of two ver was classied as atypical in the two studies. This result sug-
observers relying on the forehead (z04) showed the highest thresh- gests that although there is variability in observers ability to
old, the lowest cross-correlation, and the least number of signi- perform face perception tasks, the factors that lead to atypical per-
cant pixels among ASD observers, but the other (m21) showed formance differ across tasks. The two tasks seem to shed light on
the highest cross-correlation and similar levels for threshold and different aspects of face processing, consistent with recent sugges-
number of signicant pixels to observers relying on eye/brows tions that measures related to congural processing may not al-
ways predict performance on face discrimination tasks (e.g.,
Konar, Bennett, & Sekuler, 2010).
Table 2 Previous researchers have suggested that ASD observers rely on
Cross correlations with ideal template, RMS signal contrast threshold, and the number
of signicant pixels, for individuals in Experiment 2.
non-eye information for face identication (Joseph & Tanaka, 2002;
Langdell, 1978; but see Rutherford & Towns, 2008), but those stud-
Group Cross correlation Threshold Signicant pixels ies specically investigated the relative inuences of the eye and
TD mouth region (or, the upper and lower half of faces). Thus, other re-
m31 0.134 0.062 48 gions, like the forehead, were not independently manipulated and
m41 0.114 0.048 115
conclusions about their inuence could not be made. In contrast, as
m43 0.101 0.059 145
m45 0.058 0.086 96 stated previously, because it does not rely on a priori hypotheses
m49 0.033 0.149 22 about the relative importance of one region versus another, the
ASD classication image technique can detect even unexpected strate-
m21 0.088 0.079 45 gies, such as the use of information in the forehead region.
m23 0.059 0.062 58
z02 0.081 0.069 39
z03 0.065 0.064 44
z04 0.026 0.219 15
4. General discussion
The present study used a new technique, the random sub-sam-

pling method, to derive classication images with relatively few
trials. One session was enough to visualize individual face-process-
ing strategies in a range of groups, including experienced TD
observers, naive TD observers, and ASD observers. The results from
TD observers were consistent with results from the traditional re-
sponse classication technique, despite being based only 1450 tri-
als in the current study, as opposed to the 10,000 trials required for
128 128 pixel images (the same image size as in the present
study) in previous studies (e.g., Gold, Sekuler, & Bennett, 2004;
Sekuler et al., 2004). This great reduction in the number of required
trials, without any clear loss in the quality or nature of the results,
provides us to with a method for determining classication images
in a range of special populations, including observers with ASD.
Recently, Wang et al. (2011) developed an adaptive bubbles
method using each observers history of responses, and showed
that they required 50% fewer trials compared to the standard
Fig. 5. The relationship of log-transformed threshold RMS contrast and the log- non-adaptive method to estimate diagnostic features in a facial
transformed cross-correlation between a raw classication image and ideal
template in Experiment 2. The line represents regression line t to the data; the
expression categorization task, but they did not show images for
correlations between thresholds and cross-correlations are indicated next to each diagnostic features for individual subjects, only their average
line. across subjects. Adolphs et al. (2008) used the bubbles method to
investigate face processing in children with ASD, but they also did The current study investigated individual face-processing strat-
not present diagnostic features for individual subjects. Instead, egies for adults with ASD, who could have developed various ways
they calculated quantitative similarity indices in the eye and to compensate for their innate atypical strategies. In other words, it
mouth regions between individual diagnostic features and the cor- is possible that all of the ASD observers used an atypical strategy to
responding base image to evaluate how much information was discriminate faces during their childhood, but that the three ASD
used in each region. When this value is high, it suggests that obser- observers who exhibited a typical strategy in our testing success-
ver highly used information within the relevant region, but the ex- fully learned to discriminate faces based on information in the
tent to which information is used in subareas of that region is not eye/brow region over the course of their lifetimes. Because the ran-
made explicit, and one could have high efciencies in regions that dom-subsampling classication image technique requires rela-
do not provide signicant information. Hence, although this new tively few trials, it may be possible to use it to investigate how
bubbles technique may be useful for studying special populations, face processing develops in ASD children. Measuring the face pro-
it remains unclear whether it has sufcient spatial resolution to re- cessing strategies in children has two advantages: it might be less
veal individual differences with respect to the specic parts of a confounded by strategies that have been deliberately acquired
stimulus on which observers rely. Mangini and Biederman (2004) through years of practice, and it might provide an opportunity to
also devised a variant of the response classication technique that compare performance at different ages. Comparing this develop-
reportedly required only 390 trials to estimate stable classication ment with that of a typical group might shed light on the extent
images. However, the images used by Mangini and Biederman dif- to which acquired heuristics and strategies are important for this
fer considerably from those used in the standard classication im- group of adults. Moreover, developmental data may help to explain
age technique. In the standard technique, a unique external noise the dissociation in subgroup individuals between the present and
mask is added to the image A or B on each trial, and the correct an- Rutherford et al.s studies.
swer (for A or B) is dened. In contrast, in Mangini and Biedermans Although the atypical forehead strategy shown in two of ASD
method, a unique external noise was added to a xed standard face observers was not expected before the experiment, the results of
that was constructed by averaging 200 faces, and on each trial to the response classication technique are not restricted to specic
participants had to respond whether the standard-plus-noise im- theoretical perspectives or a priori hypotheses. This technique is
age looked like Tom Cruise or John Travolta. Hence, the decision re- thus particularly useful in the examination of special populations
lies on a subjective perceptual judgment where correct/incorrect because we cannot assume that face perception strategies are the
responses are not dened, and therefore it is not possible to pro- same as in typical populations. Therefore, our approach is not to
vide observers with response feedback. This feature of the proce- use traditional psychophysical approaches that might rely on
dure may reduce its utility for testing special populations such as assumptions about normal visual processing. Although one might
individuals with ASD. It would be worthwhile to see whether these think that studies of eye movements have the same advantages
different techniques yield similar results in this type of face dis- as the response classication technique, there are several impor-
crimination task, to determine which approach gives the most sta- tant distinctions. First, although it often is assumed that xated
ble results, with the greatest spatial resolution, in the fewest trials. positions are critical for performing the task, this assumption does
Most studies on face processing of ASD observers have focused not always hold true. Less-frequently xated positions, or even
on group differences between TD and ASD populations, rather than non-xated positions, may play critical roles in performing a task
investigating individual differences within the ASD population (Charness et al., 2001), and eye-movements can also be dissociated
(e.g., Boucher & Lewis, 1992; Castelli, 2005; Joseph & Tanaka, from overt and covert attention (Inhoff et al., 1989; Posner, 1980).
2002; Klin et al., 1999; Pelphrey et al., 2002). But recent studies Second, even if observers xated on the critical regions, the exper-
have suggested that there may be substantial variation in face pro- imenters might not always correctly evaluate them. For example,
cessing abilities within a relatively well-dened ASD group (e.g., in Pelphrey et al.s investigation of the effects of ASD on eye move-
Barton et al., 2007; Rutherford, Clements, & Sekuler, 2007). The ments in expression discriminations, xations were categorized
present study investigated individual differences within each pop- into eyes, nose, mouth, and other regions. The experimenters
ulation using the classication image technique. TD observers basi- were interested in the role played by distinct features like the eyes,
cally all showed the identical strategy, with everyone making use nose, and mouth, but not specically in regions such as the fore-
of information within the eyes/brow region. In contrast, ASD head, cheeks, and chin, although ASD observers more frequently
observers were mixed in their perceptual strategies; although xated the other regions than did TD observers. Because other
three ASD observers did appear to use a typical strategy, making eye-movements did not necessarily correspond to specic fea-
use of information in the eye/brow region, two observers used an tures that would have been identied in advance, the authors
atypical strategy, relying more on information concentrated in- did not discuss their potential contribution to perception. Finally,
stead in the forehead region. Critically, the traditional behavioral even if experimenters were to include analyses of all xations,
measure (e.g., discrimination threshold) could not predict these even to non-feature regions of the face, stimuli would need to be
differences in face processing strategies. The existence of different presented for relatively long periods of time to ensure that enough
subgroups within ASD group is consistent with the results of Ruth- eye movements could be detected for a thorough analysis, and pre-
erford, Clements, and Sekuler (2007) and Barton et al. (2007), vious studies have suggested that processing mode might vary as a
although the subgroups seem to be dened by different properties function of duration, so eye-tracking studies cannot provide useful
depending on the task. Thus, one cannot conclude that a given ASD information for very short duration stimuli, which may be better
observer will always show a typical or an atypical face processing indicators, for example, of congural processes (Bruce et al.,
strategy the typicality of the strategy seems to be related to the 1991; Farah, Tanaka, & Drain, 1995; Maurer, Le Grand, & Mondloch,
specic task within face processing, and there is not a one-to-one 2002; Rhodes, Brake, & Atkinson, 1993). In contrast, because clas-
correspondence between face processing tasks including tasks sication images do not depend on eye movements, they can show
thought to tap into congural processing and tasks related to face the contribution of each local pixel even when unexpected regions
identication. Importantly, a good proportion of ASD observers in play a role in processing, and even for relatively short stimulus pre-
the current experiment show normal face processing strategies, sentations. This makes the classication image technique a partic-
consistent with other recent studies (e.g., Nihshimura, Rutherford, ularly useful tool for investigating perceptual processes in special
& Maurer, 2008; Rutherford, Clements, & Sekuler, 2007; Rutherford populations, particularly when the total number of required trials
& Towns, 2008). is reduced, as in the random sub-sampling technique.
Sekuler et al. (2004) used two sets of faces (male faces and fe- de Gelder, B., Vroomen, J., & van der Heide, L. (1991). Face recognition and lip-
reading in autism. European Journal of Cognitive Psychology, 3, 6986.
male faces) and consistently found the eye/brow region strategy
De Heering, A., Houthuys, S., & Rossion, B. (2007). Holistic face processing is mature
with typically-developing observers, which was replicated in this at 4 years of age: Evidence from the composite face effect. Journal of
study with all typically-developing observers and three of observ- experimental Child Psychology, 96, 5770.
ers with autism. In contrast, the forehead strategy was used by two Farah, M. J., Tanaka, J. W., & Drain, H. M. (1995). What causes the face inversion
effect? Journal of Experimental Psychology: Human Perception and Performance,
observers with autism with one set of face stimuli. Future studies 21, 628634.
should address the question of whether a similar atypical strategy Gold, J. M., Bennett, P. J., & Sekuler, A. B. (1999). Identication of band-pass ltered
would be found with different sets of face stimuli. letters and faces by human and ideal observers. Vision Research, 39, 35373560.
Gold, J. M., Sekuler, A. B., & Bennett, P. J. (2004). Characterizing perceptual learning
In the current study, observers saw noisy faces, thus differing with external noise. Cognitive Science, 28, 167207.
from naturalistic face viewing. This kind of face presentations as Gosselin, F., & Schyns, P. G. (2001). Bubbles: A technique to reveal the use of
well as facial-part presentations (only eyes/brows, nose, or mouth; information in recognition tasks. Vision Research, 41, 22612271.
Gross, T. F. (2004). The perception of four basic emotions in human and nonhuman
Joseph & Tanaka, 2002; Tanaka & Farah, 1993) could make observ- faces by children with autism and other developmental disabilities. Journal of
ers focus on more local areas in faces than natural face presenta- Abnormal Child Psychology, 32, 469480.
tion, but in the current study all typically-developing and three Heisz, J. J., & Shore, D. I. (2008). More efcient scanning for familiar faces. Journal of
Vision, 8(1), 110.
observers with autism depended on the eyes/brows (i.e., normal) Hobson, R. P. (1986). The autistic childs appraisal of expressions of emotion. Journal
region in face discrimination task. Thus, noisy and partial presenta- of Child Psychology and Psychiatry and Allied Disciplines, 27, 321342.
tion here does not always force observers to process faces unnatu- Hobson, R. P., Ouston, J., & Lee, A. (1988). Whats in a face? The case of autism. British
Journal of Psychology, 76, 441453.
rally, so this stimulus presentation worked well with the current
Inhoff, A. W., Pollatsek, A., Posner, M. I., & Rayner, K. (1989). Covert attention and
participants. eye movements during reading. The Quarterly Journal of Experimental Psychology
In conclusion, the present study showed that classication im- Section A, 32, 6389.
age with random sub-sampling face presentation worked ne to Joseph, R. M., & Tanaka, J. (2002). Holistic and part-based face recognition in
children with autism. Journal of Child Psychology & Psychiatry, 43(8), 114.
show individuals face-processing strategy in detail with much Klin, A., Jones, W., Schultz, R., Volkmar, F. R., & Cohen, D. J. (2002). Visual xation
fewer trials than with the standard classication image (Experi- pattern during viewing of naturalistic social situations as predictors of social
ment 1). Additionally, the advantage in the application of this tech- competence in individual with autism. Archives of General Psychiatry, 59,
809816.
nique to special populations was conrmed: no restriction to Klin, A., Sparrow, S. S., de Bildt, A., Cicchetti, D. V., Cohen, D. J., & Volkmar, F. R.
specic theoretical perspectives or a priori hypotheses to be tested, (1999). A normed study of face recognition in autism and related disorders.
enabled us to get unexpected strategy like ASDs forehead strategy Journal of Autism and Developmental Disorders, 29, 499508.
Konar, Y., Bennett, P. J., & Sekuler, A. B. (2010). Holistic processing does not predict
(Experiment 2). Lastly, our results showed qualitatively different face identication. Psychological Science, 21, 3843.
strategies within ASD group which was not predicted by other tra- Langdell, T. (1978). Recognition of faces Approach to study of autism. Journal of
ditional psychophysical measures. Child Psychology and Psychiatry and Allied Disciplines, 19. 255-168.
Mangini, M. C., & Biederman, I. (2004). Making the ineffable explicit: Estimating the
information employed for face classication. Cognitive Science, 28, 209226.
Acknowledgments Maurer, D., Le Grand, R., & Mondloch, C. J. (2002). The many faces of congural
processing. Trends in Cognitive Sciences, 6(6), 255260.
Mondloch, C. J., Le Grand, R., & Maurer, D. (2002). Congural face processing
This study was supported by JSPS & CIHR Japan-Canada Joint develops more slowly than featural face processing. Perception, 31, 553566.
Health Research Program to M.N., M.R., P.J.B., and A.B.S., JSPS Murray, R. F., Bennett, P. J., & Sekuler, A. B. (2002). Optimal methods for calculating
Grant-in-Aid for Scientic Research (C) to M.N., Grant-in-Aid for classication images: Weighted sums. Journal of Vision, 2(1). 6, 79104..
Nagai, M., Bennett, P. J., & Sekuler, A. B. (2007). Spatiotemporal templates for
Scientic Research on Innovative Areas, Face perception and rec- detecting orientation-dened targets. Journal of Vision, 7(8), 11, 116.
ognition from MEXT to M.N., NSERC Discovery grants to P.J.B. and Nagai, M., Bennett, P. J., & Sekuler, A. B. (2008). Exploration of vertical bias in
A.B.S., and the Canada Research Chair Program to P.J.B. and A.B.S. perceptual completion of illusory contours: Threshold measures and response
classication. Journal of Vision, 8(7), 25, 117.
and M.D.R. We thank Donna Waxman, Taras Narhiny, and Diana Nandy, A. S., & Tjan, B. S. (2007). The nature of letter crowding as revealed by rst-
Carbone for assistance with the data collection, and Hijiri Ishii and second-order classication images. Journal of Vision, 7(2), 5, 126.
and Masako Nara for cooperation with this project. Nihshimura, M., Rutherford, M. D., & Maurer, D. (2008). Converging evidence of
congural processing of faces in high-functioning adults with autism spectrum
disorders. Visual Cognition, 16, 859891.
References Pelphrey, K. A., Sasson, N. J., Reznick, J. S., Paul, G., Goldman, B. D., & Piven, J. (2002).
Visual scanning of faces in autism. Journal of Autism and Developmental
Disorders, 32, 249261.
Adolphs, R., Spezio, M. L., Parlier, M., & Piven, J. (2008). Distinct face-processing
Posner, M. I. (1980). Orienting of attention. The Quarterly Journal of Experimental
strategies in parents of autistic children. Current Biology, 18, 10901093.
Psychology, 32, 325.
Ahumada, A. J., Jr., & Lovell, J. (1971). Stimulus features in signal detection. Journal of
Rhodes, G., Brake, S., & Atkinson, A. P. (1993). Whats lost in inverted faces?
the Acoustical Society of America, 49, 17511756.
Cognition, 47, 2557.
American Psychiatric Association (1994). Diagnostic and statistical manual of mental
Ristic, J., Mottron, L., Friesen, C. K., Iarocci, G., Burack, J. A., & Kingstone, A. (2005).
disorders-IV. Washington, DC: American Psychiatric Association.
Eyes are special, but not for everyone: The case of autism. Brain Research:
Barton, J. J. S., Hefter, R. L., Cherkasova, M. V., & Manoach, D. S. (2007). Investigations
Cognitive Brain Research, 24, 715718.
of face expertise in the social developmental disorders. Neurology, 69, 860870.
Rutherford, M. D., Clements, K. A., & Sekuler, A. B. (2007). Differences in
Beard, B. L., & Ahumada, A. J. (1998). A technique to extract relevant image features
discrimination of eye and mouth displacement in autism spectrum disorders.
for visual tasks. Proceedings of SPIE, 3299, 7985.
Vision Research, 47, 20992110.
Boucher, J., & Lewis, V. (1992). Unfamiliar face recognition in relatively able autistic
Rutherford, M. D., & Towns, A. M. (2008). Scan path differences and similarities
children. Journal of Child Psychology and Psychiatry, 33, 843859.
during emotion perception in those with and without autism spectrum
Bruce, V., Doyle, T., Dench, N., & Burton, M. (1991). Remembering facial
disorders. Journal of Autism and Developmental Disorders, 38, 13711381.
congurations. Cognition, 38, 109144.
Saiki, J. (2008). Stimulus-driven mechanisms underlying visual search asymmetry
Carey, S., & Diamond, R. (1994). Are faces perceived as congurations more by
revealed by classication image analyses. Journal of Vision, 8(4), 30, 119.
adults than by children? Visual Cognition, 1, 253274.
Schyns, P. G., Bonnar, L., & Gosselin, F. (2002). Show me the features! Understanding
Castelli, F. (2005). Understanding emotions from standardized facial expressions in
recognition from the use of visual information. Psychological Science, 13,
autism and normal development. Autism, 9, 428449.
402409.
Celani, G., Batacchi, M. W., & Arcidiacono, L. (1999). The understanding of the
Sekuler, A. B., Gaspar, C. M., Gold, J. M., & Bennett, P. J. (2004). Inversion leads to
emotional meaning of facial expression in people with autism. Journal of Autism
quantitative, not qualitative, changes in face processing. Current Biology, 14,
and Developmental Disorders, 29, 5766.
391396.
Charness, N., Reingold, E. M., Pomplun, M., & Stampe, D. M. (2001). The perceptual
Swettenham, J., Condie, A., Campbell, R., Milne, E., & Coleman, M. (2003). Does the
aspect of skilled performance in chess: Evidence from eye movements. Memory
perception of moving eyes trigger reexive visual orienting in autism?
& Cognition, 29, 11461152.
Philosophical Transactions of the Royal Society of London: B., 358, 325334.
Chawarska, K., Klin, A., & Volkmar, F. (2003). Automatic attention cueing through
Tanaka, J. W., & Farah, M. J. (1993). Parts and wholes in face recognition. The
eye movement in 2-year-old children with autism. Child Development, 74,
Quarterly Journal of Experimental Psychology Section A, 46, 225245.
11081122.
Tanaka, J. W., Kay, J. B., Grinnell, E., Stanseld, B., & Szechter, L. (1998). Face Volkmar, F., & Mayes, L. (1990). Gaze behavior in autism. Development and
recognition in young children: When the whole is greater than the sum of its Psychopathology, 2, 6169.
parts. Visual Cognition, 5, 479496. Wang, H. F., Friel, N., Gosselin, F., & Schyns, P. G. (2011). Efcient bubbles for visual
van der Geest, J. N., Kemner, C., Verbaten, M. N., & van Engeland, H. (2002). Gaze categorization tasks. Vision Research, 51, 13181323.
behavior of children with pervasive disorder toward human faces: A xation World Health Organization (1994). International classication of diseases (10th ed.).
time study. Journal of Child Psychology and Psychiatry, 43, 669678. Geneva, Switzerland: World Health Organization.
Vinette, C., Gosselin, F., & Schyns, P. G. (2004). Spatio-temporal dynamics of face Young, A. W., Hellawell, D., & Hay, D. C. (1987). Congurational information in face
recognition in a ash: Its in the eyes. Cognitive Science, 28, 289301. perception. Perception, 16, 747759.

Nagai 2013

Загружено:

Сведения о документе

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

Nagai 2013

Загружено:

Авторское право:

Доступные форматы

Vision Research 79 (2013) 2735

Contents lists available at SciVerse ScienceDirect

Comparing face processing strategies between typically-developed observers

in people with ASD and TD controls, and also investigates individ-

2. Experiment 1: The random sub-sampling method

The number of trials required for stable classication images in-

MAT f10 SKH m50 MAT f10 SKH m50

3.1.3. Stimuli and procedure

Classication images for each observer were analyzed as in

The present study used a new technique, the random sub-sam-

Вам также может понравиться