Академический Документы
Профессиональный Документы
Культура Документы
Purpose: This study examined the effects of feedback training, familiarization training,
and no training on nave listeners word identification ( WI) and magnitude estimation
scaling ( MES) judgments of the speech intelligibility of children with severe-toprofound hearing impairments.
Method: Depending on the training group, listeners received a pretest, an immediate
posttest, and /or a delayed posttest.
Results: Results indicated that repeated exposure, with or without training, led to
improved WI scores. Beyond the effects of repeated exposure, listeners WI judgments
of the intelligibility of speech significantly increased immediately after training in
which listeners received feedback regarding the accuracy of their WI responses. The
MES results were less straightforwardlisteners in the feedback group perceived
speech samples as less intelligible after the training, perceptions of speech intelligibility
stayed almost the same for the familiarization training group, and participants in the
control group perceived speech samples as more intelligible at the posttest. For the
training groups that were not pretested, perceptions improved from the immediate to
delayed posttest.
Discussion: Results may have both theoretical and clinical significance, particularly as
they relate to contrasting theories of perceptual learning and the extent to which
listener characteristics may be reflected in intelligibility judgments.
KEY WORDS: speech intelligibility, listener training, children with hearing loss
everal studies have reported that professionals who work with deaf
or hard-of-hearing individuals tend to find their speech more understandable compared with those who are unfamiliar with the
speech of these individuals (Klimacka, Patterson, & Patterson, 2001;
McGarr, 1983; Monsen, 1983). Interestingly, in one study by Monsen
(1983), the difference in scores between experienced and inexperienced
listeners somewhat decreased as inexperienced listeners heard more
examples of hearing-impaired speech. Such a finding suggests that, at
least for some individuals, simply becoming more familiar with difficultto-understand speech may be an important factor in the individual
variance noted among listeners.
Beyond Monsens observation, however, there has been no systematic investigation of the effect of familiarization on listeners intelligibility judgments of the speech of hearing-impaired children. The effect of
familiarizing listeners with particular speakers or particular types
of speakers has been examined for listeners intelligibility judgments of
1114 Journal of Speech, Language, and Hearing Research
1115
1116
Although not explicitly mentioned, speech intelligibility falls within the class of the above-mentioned phenomena. Thus, more intelligible speech can be described
as less difficult to understand, whereas less intelligible
speech can be said to be more difficult to understand
from a listener-centered approach. Therefore, using MES
to measure perceptions of speech intelligibility (or difficulty) in this study was deemed useful and appropriate.
Our decision to use both MES and word identification
(WI) as measures of speech intelligibility was also based
on the premise that many factors affect intelligibility and
contribute to deficits in intelligibility; hence, more than a
single measure is often needed to provide the most complete representation of a speakers speech intelligibility.
As Weismer and Laures (2002) have noted, using identification methods alone may make a study of intelligibility incomplete because these methods focus primarily
on the segmental aspects of the speech signal and may be
less sensitive to the nonsegmental aspects (e.g., voice,
prosody). Including scaling methods that are more sensitive to these aspects may better represent the contribution of these factors to a speakers overall intelligibility.
The purpose of the present study was to examine the
effects of two methods of training (familiarization and
feedback) on nave listeners WI and MES judgments of
the intelligibility of speech of children with severe-toprofound hearing impairments. We also examined the
nature of individual change by type of training. The WI
and MES intelligibility judgments of nave listeners
following familiarization training or response feedback
training were compared to the intelligibility judgments
of other nave listeners who did not receive training.
Three major research questions were addressed:
1.
2.
3.
Journal of Speech, Language, and Hearing Research Vol. 51 11141123 October 2008
Method
Using the Rasch (1960, 1980) analysis, the samples characterized by higher PCC scores and an absence of other
speech deviations were shown to be easier for listeners to
understand compared with those that had lower PCC
scores and different types of speech deviations. The Rasch
analysis also revealed that the range was sufficient for
the sample used in the study.
Participants
The testing procedures were adapted from a previous study (Ellis & Fucci, 1992) and were identical for all
participants. A Marantz portable minidisk recorder
( Model PMD 650; Marantz, Mahwah, NJ) was used to
deliver all of the stimuli. Stimuli were presented at the
1117
Group
A
B
C
D
E
Pre-test
Training
O1
O3
O5
___
___
Familiarization
Feedback
___
Familiarization
Feedback
Immediate
post-test
Delayed
post-test
O7
O9
O2
O4
O6
O8
O10
1118
initial testing, participants in Group A and Group B received training. Familiarization training for participants in Group A consisted of listening to the training
samples while following along with a printed transcript
of what each child was attempting to say. Response
feedback training for participants in Group B involved
listening to the training samples, writing down the words
that were understood, and receiving feedback about
which words were identified correctly. Feedback consisted of providing the participants with their transcripts
on which correctly identified words were highlighted and
informing the participants that these were words they
had correctly identified. Approximately 1 week after the
training was completed, participants in groups A, B, and
C were tested again under the conditions identical to
those during the initial testing, except that each participant responded to a different randomized sequence of
the samples.
Participants in Group D received the same familiarization training as Group A, and participants in Group E
received the same feedback training as Group B; however,
participants in these two groups did not undergo pretesting, and their post-testing occurred immediately after
the training was completed (immediate post-testing)
and again 1 week later (delayed post-testing).
As part of the training, groups A, B, D, and E listened to a different set of eight samples three times each.
These training samples were produced by the same
children who produced the eight test samples. Each
training sample consisted of the first 25 words of each
childs retelling of a different story. Training sessions for
both the familiarization and the feedback conditions
lasted 60 min.
Several factors were considered in determining the
training parameters for this study, particularly the duration of training and the time intervals between testing
and training sessions. Available research on the effects
of familiarization training typically used a rather brief
period of training within a single session (e.g., Tjaden &
Liss, 1995). To control for time on task as a variable separate from type of training, we decided to keep the training relatively brief and limited to a single session for this
preliminary investigation. The decision to limit the period
of training to no more than 60 min was also based on
the researchers desire to limit any influence of listener
fatigue and reduced motivation.
The decision to use two different time intervals between test periods was based on the need to capture the
possible immediate effects of training and maintenance
of change or other effects of training over time. Scheduling no more than a 1-week period between the training
and post-testing was also influenced by the availability of
participants and a concern about possible attrition if a
longer time period was used.
Journal of Speech, Language, and Hearing Research Vol. 51 11141123 October 2008
Previous literature related to familiarization training (e.g., Tjaden & Liss, 1995; Yorkston & Beukelman,
1983) was helpful in guiding our selection of training
materials for the familiarization and feedback conditions as well as procedures for the familiarization condition. Regarding specific procedures developed for the
feedback condition, our choices were motivated primarily by the general literature related to the use of feedback
during skill development, which tends to emphasize
the importance of prompt and reasonably contingent
feedback.
Participants transcripts of the test samples (preand/or post-training) were scored to determine the number of correctly identified words. All correctly identified
words (i.e., content or function words) were equally
weighted in the scoring. Two female graduate students
in speech-language pathology independently scored the
transcripts using a scoring key. The scoring key consisted of the transcripts of the eight samples that represented each childs intended words as determined by
the consensus between an individual familiar with the
children (this individual had elicited the original samples) and the parent of each child.
Prior to scoring the samples, the two graduate students were trained in the scoring procedure by the first
author, and both students achieved at least 90% agreement with the author when scoring transcripts similar
to those that were used in the study. After scoring the
actual samples, the two graduate students conferred to
resolve any discrepancies in scoring. Any discrepancies
in scoring between the two graduate students were resolved by consensus between the two scorers.
Data Analysis
Because of the incomplete data matrix (see Table 1),
the effect of training was determined by conducting two
repeated measures analyses of variance (ANOVAs) on
each dependent measure (i.e., WI and MES)one to test
for change from pre-test to delayed post-test using the
data from groups A, B, and C, and the other to test for
change from immediate post-test to delayed post-test
using the data for groups D and E and pre-test data for
the control group C.
The WI and MES scores used in these analyses were
obtained from the Rasch analysis (Rasch, 1960, 1980)
that transformed the raw percentages correct (the WI
data) and the intelligibility perceptions (the MES data)
into abstract, equal-interval units by log transformations of raw data odds and probabilistic equations (Bond
& Fox, 2001).We chose this approach over computing
total raw scores for the WI measure and over using average scores for the MES measure because we could not
assume that the WI and MES scales were interval. For
example, we could not assume that the difference and
Results
Table 2 summarizes descriptive statistics for the sample on both measures of speech intelligibility. For convenience of interpretation, the MES scores were reported on
the same 0100 scale as the WI scores. However, lower
MES scores indicate more intelligible speech, whereas
higher MES scores indicate less intelligible speech.
WI Results
The WI results of the repeated measures ANOVA for
groups A, B, and C showed a significant main effect for
time, F(1, 30) = 83.64, p < .001. This means that regardless of the type of training, there was a statistically
significant increase in WI scores from the average of 55%
correct at the pre-test to the average of 58% correct at the
delayed post-test (Cohens d = 0.75). Neither the effect for
the type of training nor the Training Time interaction
effect were significant. The three groups were statistically
equivalent at the pre-test, F(2, 30) = 1.05, p > .05.
Similar WI results were obtained for Groups C, D,
and E (i.e., comparing immediate and delayed post-test
data). Although we did not collect the immediate posttest data for Group C, it was established that Group C at
the pre-test was not statistically different from Groups D
and E at immediate post-test, F(2, 30) = 2.61, p > .05, and
therefore, we were able to use Group C pre-test data for
1119
Table 2. Mean scores and standard deviations for the Word Identification (WI) and Magnitude Estimation Scaling (MES) measures of
speech intelligibility as a function of training and time.
WI percentage correct
Pre-test
Immediate
post-test
MES scores
Delayed
post-test
Immediate
post-test
Pre-test
Delayed
post-test
Type of Training
SD
SD
SD
SD
SD
SD
55.3
53.8
56.2
3.7
4.5
3.6
57.2
58.8
3.1
2.5
57.6
57.1
58.3
58.7
60.0
2.6
3.8
2.3
2.5
2.0
54.3
53.7
56.5
3.2
3.7
1.8
54.7
55.0
2.7
2.3
53.8
55.9
53.3
52.7
54.5
4.9
4.3
2.0
4.9
2.3
MES Results
The MES results of the repeated measures ANOVA
test for Groups A, B, and C showed a significant Time of
Testing Type of Training interaction, F(2, 30) = 4.87,
p < .05. This means that depending on the time of assessment, the perceptions of speech intelligibility differed
for different groups (see Figure 1). As seen in Figure 1, the
MES score went up for the feedback Group B, stayed almost the same for the familiarization training Group A
and dropped for the control group C. The three groups did
not differ in their MES scores at the pretest, F(2, 30) = 2.92,
p > .05.
The repeated measures ANOVA results for the MES
data for Groups C, D, and E (i.e., comparing immediate
and delayed post-test data) were similar to the WI results. As with the WI data, we did not collect the immediate post-test MES data for Group C. However, it
was established that Group C at the pretest was not
statistically different from Groups D and E at immediate
Mean difference
SE
2.32
1.85
3.38
3.31
5.11
6.21
1.96
1.46
1.36
1.29
1.55
1.49
.003
.220
.022
.000
.004
.000
Note.
1120
WI = Word Identification.
Journal of Speech, Language, and Hearing Research Vol. 51 11141123 October 2008
Figure 1. Change in MES judgments depending on time of assessment and type of training.
Discussion
This study attempted to answer three major research questions and determine if there was an effect of
training (either immediate or 1 week after the training)
on listeners WI performance and their perceptions of
speech intelligibility, if either of the two types of training
(familiarization and feedback) was more effective than
the other, and whether there were any patterns of individual change specific to the type of training. Overall, the
study found no significant training effect and no trainingspecific patterns of individual change in WI performance
and perceptions of speech intelligibility. However, several
interesting findings merit discussion.
The results of this study indicate that overall, for the
sample of listeners in this study, repeated exposure
with or without trainingleads to improved WI scores
and perception scores as supported by both the group
change and individual change analyses. Such an outcome would be supported by a differentiation theory of
perceptual learning. The standardized effect sizes of
these improvements ranged from medium to high, which
is not surprising, given small standard deviations. However, when these improvements are interpreted using a
simple effect size (i.e., raw difference between the means)
and in the context of the location of the listeners on the
scale of measurement, the clinical significance is limited.
It appears that regardless of the training and exposure,
listeners WI and perception of speech intelligibility significantly improved but still remained in the range of
55%58%. Therefore, from the perspective of clinical significance or social validity of results, it might be argued
that the discrepancy in results for the two measures or
across groups on the same measure is more apparent
than real.
The improvement itself was neither surprising nor
very different from similar findings in the literature (e.g.,
Monsen, 1983; Peng, Spencer, & Tomblin, 2004), although
procedures to control for the differential effect of repeated exposure were included in this study by providing all listeners with three consecutive exposures to each
sample during each test occasion. The decision to use
this common procedure was based on the available research (e.g., Epstein, Giolas, & Owens, 1968) suggesting
that more than three exposures to single word stimuli do
not typically result in significant changes in listeners
WI responses. It is possible that the same procedure may
not function in the same way when used with narrative
speech samples used as stimuli in the study. Further
research is warranted to determine the number of trials
needed for narrative samples in order to control for the
practice effect resulting simply from repeated exposure
1121
1122
References
Beltyukova, S. A., Fox, C. M., Stone, G. E., & Ellis, L.
(2006, April). Understanding the utility of magnitude estimation scaling. Paper presented at the meeting of the American Educational Research Association, San Francisco, CA.
Journal of Speech, Language, and Hearing Research Vol. 51 11141123 October 2008
McGarr, N. (1983). The intelligibility of deaf speech to experienced and inexperienced listeners. Journal of Speech and
Hearing Research, 26, 451458.
Schiavetti, N. (1992). Scaling procedures for the measurement of speech intelligibility. In R. D. Kent ( Ed.), Intelligibility in speech disorders ( pp. 1134). Amsterdam: John
Benjamins.
Weismer, G., & Laures, J. (2002). Direct magnitude estimates of speech intelligibility in dysarthria: Effects of a
chosen standard. Journal of Speech, Language, and Hearing
Research, 45, 421433.
Yorkston, K. M., & Beukelman, D. R. (1983). The influence
of judge familiarization with the speaker on dysarthric
speech intelligibility. In W. Berry ( Ed.), Clinical dysarthria
( pp. 155163). San Diego: College Hill Press.
Yorskston, K. M., & Beukelman, D. R. (1992). Intelligibility measurement as a tool in the clinical management of
dysarthric speakers. In R. D. Kent ( Ed.), Intelligibility in
speech disorders ( pp. 265286). Amsterdam: John Benjamins.
Klimacka, L., Patterson, A., & Patterson, R. (2001). Listening to deaf speech: Does experience count? International
Journal of Language and Communication Disorders, 36,
210215.
DOI: 10.1044/1092-4388(2008/06-0217)
1123