Abstract
This paper describes the transcription process and the development of transcription skills in a research project using recorded spoken interaction as its
main data. The spoken data was transcribed using intonation units, and the
paper traces the development of the first author's skills in identifying such
intonation units. Intertranscriber checks of transcription, involving three researchers, were used to highlight ways in which the identification of intonation units could be improved. Subsequent re-transcription of the data highlighted stretches of talk that included many hesitations, false starts, and
speech used to regulate ongoing spoken interaction. These features were
linked to low levels of intertranscriber agreement. It is argued that the existing literature on intonation units does not address how to best deal with
this quality of spontaneous spoken interaction. The paper concludes with an
agenda that may be used to improve the quality of transcription in similar
research projects, and to develop the transcription skills of the researchers
that are responsible for transcription.
Keywords:
intonation units; transcription; transcription skills; intertranscriber checks; spoken interaction; spontaneous talk.
A main focus of Ochs's argument was to raise awareness about the effects
of various features of re-presenting spoken interaction in the form of transcripts. She discussed at length the significance of page layouts, top-to-bottom and left-to-right biases when reading transcripts, the representation of verbal versus nonverbal features of interaction, and the use of
transcription symbols. She not only made what was previously not-so-obvious obvious, she also built a careful case for treating transcription as
something theoretical in nature.
According to Lapadat and Lindsay (1998: 5), the period following
Ochs's contribution has been characterized by the following progression
of perspectives: the search for [transcription] conventions, acceptance of
a multiplicity of conventions, and [then] abandonment of standardization
in favour of contextualized negotiation of method (cf. also Lapadat and
Lindsay 1999). Standardized notation systems suggested in the literature
are usually designed to facilitate transcription according to a particular
perspective on spoken interaction. For example, conversation analysis
employs the Jeffersonian notation system (cf. Atkinson and Heritage
1984: ix–xvi), which is designed to explore the moment-to-moment
unfolding of turn-taking in interaction. Other systems are more eclectic,
but even so make their orientations explicit. For example, Gumperz and
Berenz (1993: 119) suggest a comprehensive set of conventions designed
to reveal the functioning of communicative signs in the turn-by-turn interpretation of talk. Finally, some systems are designed to account for
more specialized features of spoken interaction, such as Du Bois's (1991)
suggested standard notations for transcribing talk into intonation units,
which have as their theoretical basis the connecting of mental processing
and speech production (cf. also Du Bois et al. 1993). This latter notation
system will be discussed in detail later in this paper, as the transcription in
the current research in part draws on Du Bois's notation system.
The argument for standardized notation systems was followed by debate about the usefulness of such standardization (Lapadat and Lindsay
1998). This debate has included attempts at empirically deriving agreed
upon notation systems (e.g., Dressler and Kreuz 2000), suggestions for
general design principles to base transcription conventions on (Du Bois
1991; Edwards 1993), and arguments against standardization; for example, Muller and Damico (2002: 303) suggest that because transcription
involves recurrent interpretive cycles that filter, shape, and even recreate
data, there is no such thing as the complete transcript, and Cook
(1990) argues that one cannot claim objectivity when trying to re-present
Lapadat (2000: 216) also suggests that a researcher should keep an audit
trail of decision points while transcribing, in the end using this record to
produce a code book of what was transcribed, how, and why. Again,
these are useful comments indicating the importance of approaching transcription in a systematic manner, but they do not address in any empirical
manner the process of actually doing transcription.
A more notable exception to the general picture is a small set of recent
studies that have explored the experiences of researchers doing transcription work. Gregory et al. (1997: 295) comment on the progressively more
common practice of hiring transcribers, made possible by the increasing
funding available to qualitative research, and how, in their field of health
research, this creates a need for examining the emotional laboring and
work worlds of transcribers. In a series of articles, Tilley (cf. Tilley
2003a, 2003b; Tilley and Powick 2002) has explored, through interviews
and narrative accounts, the particular experiences of transcribers who
themselves are not researchers on a particular project, and the challenges
this creates within the research projects. Finally, Bird (2005: 246) provides a personal narrative of her own initial and growing relationship
with the process of transcription across a series of transcription tasks
she undertook. These studies provide holistic accounts of transcribers' experiences in qualitative research projects. They do not, however, address
the particular challenges involved in doing the narrow types of transcription often required in discourse analysis or applied linguistics research projects. The studies, nevertheless, suggest that doing transcription
is a complex process, and that to uncover the real complexity of this process may well require further direct exploration.
Associated with the scarcity of studies on the process of doing transcription is the observation, made by several authors, that the researchers
who actually do transcription are often postgraduate students or research
assistants (e.g., Lapadat 2000; Tilley 2003a, 2003b). With this realization
must come the recognition that these research assistants are developing
their transcription skills concurrently with the activity of preparing transcripts for a project (Bird 2005; Tilley 2003a, 2003b; Tilley and Powick
2002). Conspicuous in this regard is Lapadat and Lindsay's (1998: 21) observation that transcription seldom appears as a topic of consideration in
the education of future researchers and practitioners who will be employing transcription in their work. McLellan et al. (2003: 73) come fairly
close to the mark when they point out that training data managers, transcribers and proofreaders [tasks often done by research assistants] is
highly variable given the research structure, the setting, the type and
volume of data collected, the data produced, and the analytic approach
taken. This highlights a number of implications that need careful consideration in research projects employing research assistants to do transcription. For example, how much time it takes to prepare quality transcriptions may need to be balanced against the time needed to develop
transcription skills, and the involvement of more senior researchers in
the development of the transcription skills of the research assistants needs
to be made explicit.
The present paper addresses these gaps in the literature, on the one
hand documenting the process involved in the transcription of spoken interaction using intonation units in an applied linguistics research project,
and on the other hand describing the gradual development of the transcription skills of the researcher with primary responsibility for preparing
the transcripts for the project. To achieve this, we next provide an introduction to the project from which this paper emerged, take a closer look
at the literature on intonation units and the transcription of these units,
and then outline our own view of what constitutes quality transcription.
This is followed by a detailed description of the transcription process we
engaged in and the intertranscriber checks we conducted. The final part
of the paper is an in-depth exploration of one researcher's developing
skills in transcribing spoken interaction using intonation units, employing
as evidence both the products and the processes of transcription in the
research project. We conclude the paper by presenting an agenda for the
development of transcription skills and quality of transcription that
emerged from our close engagement with the transcription process in our
project work.
3.
The project team, i.e., the first and second authors of this paper, agreed at
an early stage to transcribe the conciliation talk using intonation units.
We were attracted to intonation units because there is evidence that these
units can account for the inherently dynamic interplay between speaking
and thinking (Chafe 1994, 1996), thereby responding to the research aim,
which was to explore the dynamics of metaphor use in talking and thinking in a context of post-conflict reconciliation.
Our use of the intonation unit (henceforth IU or IUs) is based primarily on Wallace Chafe's (1994, 1996) extensive work on naturally occurring
language data from a range of discourse contexts and languages, building
a case for a relationship between consciousness and language. In particular, Chafe (1996: 39) suggests that consciousness is a process in which
remembering, imagining, evaluating, and speaking come together to produce what we know as thought and language.2 One of the constant properties of consciousness, according to Chafe, is that it has a focus and a
periphery, much like human focal and peripheral vision. Moreover, focus
is a dynamic property, as there is a restless movement from one focus to
the next; consciousness does not stand still (1996: 38). In terms of speaking, the restless movement, or dynamic nature, of consciousness manifests
itself in speaking in the form of IUs (Chafe 1994, 1996). Chafe (1996: 40)
underlines the dynamic nature of consciousness, and the dynamic relationship between consciousness and speaking, by pointing out that IUs
(and consciousness) are produced in a series of brief spurts, typically between one and two seconds long.
This coupled relationship between consciousness and IUs, produced
in spurts, is supported by a number of arguments for the IU as a
cognitive unit (cf. Park 2002), as well as arguments for how the IU,
and intonation more generally, does work in spoken interaction (cf.
Couper-Kuhlen 2001; Wennerstrom 2001). One argument is based on
the observation that IUs, on average between 4 and 5 words long (Chafe
1994: 65; Crystal 1969: 256), are smaller than clauses. For this reason,
IUs may be constrained by something other than grammatical structure.
Cognitive constraints, such as how much information can be active in
consciousness at one time, might be such an alternative explanation
(Chafe 1994). On the other hand, some IUs are clause length. Since
clauses often encode propositions, it may be argued that IUs themselves
are vehicles for basic cognitive processes of information storage and
discourse processing (Park 2002: 639). However, this observation may
equally well be explained as a confluence between grammatical and intonation structure in discourse processing (cf. Ford and Thompson 1996).
pitch (in musical terms, each key on a piano represents a different pitch),
(ii) changes in duration or tempo (manifesting itself as shortening and
lengthening of syllables and words), (iii) changes in intensity or loudness
(including stress and/or accents), (iv) alterations between vocalization and
silence (i.e., pausing), (v) changes in voice quality (e.g., creaky voice),
and (vi) changes in speaker turn. Chafe's account thus produces six characteristics of prototypical IUs:
1. Pitch: usually includes a resetting of the pitch baseline (as in a step-up or step-down in the pitch level) and a recognizable final pitch
contour (e.g., falling or rising);
2. Duration: usually includes increased tempo at the beginning (as in
a shortening of syllables and/or words), and then a gradual slowing
down toward the end (as in a lengthening of syllables and/or words);
3. Intensity: usually includes one or more syllables and/or words spoken
more loudly;
4. Pausing: is often preceded or followed by pausing (but may also contain pauses within its boundaries);
5. Voice quality: sometimes begins or ends with a creaky voice or
whispering;
6. Speaker turn: may sometimes be associated with a change of speaker.
then, are (i) an intonation group must contain at least one stressed syllable, and (ii) there must be a pitch movement to or from at least one accented syllable.
Another potential ambiguity is that pausing, anacrusis, and syllable
lengthening may all, according to Cruttenden, happen in the middle of
intonation groups, especially in the case of hesitation. To distinguish
between intonation group boundaries and hesitation, Cruttenden (1997:
35) suggests the following heuristic: if the features of pause, and/or anacrusis, and/or syllable lengthening divide an utterance into two part-utterances either one of which does not have the minimum internal structure of an intonation group, then any combination of these features is
taken as a hesitation rather than a boundary between intonation groups.
The later parts of this paper will show how this added detail, provided by
Cruttenden's external and internal criteria, helped us make sense of IUs
in our own data.
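Cruttenden's heuristic can be read as a simple decision rule. The sketch below is our own illustration rather than part of Cruttenden's account: the `Part` record and its two boolean fields are hypothetical stand-ins for a transcriber's auditory judgments about each part-utterance.

```python
from dataclasses import dataclass


@dataclass
class Part:
    """One part-utterance on either side of a candidate break (hypothetical record)."""
    has_stressed_syllable: bool  # internal criterion (i)
    has_pitch_movement: bool     # internal criterion (ii): movement to or from an accented syllable


def has_minimum_structure(part: Part) -> bool:
    # A part can stand as an intonation group only if both internal criteria hold.
    return part.has_stressed_syllable and part.has_pitch_movement


def classify_break(before: Part, after: Part) -> str:
    """Treat a pause, anacrusis, or lengthening cue as a boundary or as a hesitation."""
    if has_minimum_structure(before) and has_minimum_structure(after):
        return "boundary"
    # Either part lacks the minimum internal structure, so the cue is a hesitation.
    return "hesitation"


print(classify_break(Part(True, True), Part(True, True)))    # boundary
print(classify_break(Part(True, True), Part(False, False)))  # hesitation
```

The rule deliberately leaves the hard part, hearing stress and pitch movement, to the transcriber; it only makes explicit how those judgments combine.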
A final point we would like to make about the identification of IUs is
related to the distinction between prepared and spontaneous talk, introduced earlier in this paper. There is a question about the extent to which
IUs, with complete intonation contours, as described by, e.g., Chafe's
(1994) characteristics listed above, reflect spoken interaction adequately.
Cruttenden (1986: 36) points out that in the case of spontaneous speaking
any clear and obvious division into intonation-groups is not so apparent
because of the broken nature of much spontaneous speech, including as
it does hesitation, repetitions, false starts, incomplete sentences and
sentences involving a grammatical caesura in their middle. Wichmann
(2000: 21) also observes that spontaneous speech will to a greater or
lesser extent display syntactic and prosodic disfluencies: hesitations,
repetitions, incomplete utterances. In sum, the extent to which IUs are
complete will vary a great deal when speakers have to deal with the
real-time dynamics of spontaneous speech.
The research by Lindsay and O'Connell (1995), reviewed above, suggests that hesitation phenomena and sentence fragments can cause particular difficulties in transcription of spoken discourse. Even so, there is little
information in the transcription literature on how to deal with this often
broken, or fragmentary, nature of spontaneous speech. The literature
does mention the possibility of incomplete intonation contours, called
fragmented (Chafe 1994) or truncated (Du Bois et al. 1993) IUs. However, there is little detail on how such fragmented or truncated IUs are
recognized, or how the presence of these affects the identification of more
complete intonation contours. As we will see, the presence of such truncated IUs was one of the features of our data that caused most problems
in the project transcription process.
5.
Transcriber                                          Version
First transcriber                                    Transcript 1
Second transcriber (experienced, but not with IUs)   Transcript 2
Third transcriber (experienced with IUs)             Transcript 3
First transcriber                                    Enhanced transcript
The second and third transcribers prepared their transcripts under somewhat different conditions. They were given a verbatim transcription of the
five-minute segment of spoken interaction, presented as a continuous
block of text as per Extract (2).
(2)
Consistent with the aims of the intertranscriber checks, and working with
a digitized audio file, the task of the second and third transcribers was to
segment the continuous block of text into IUs. Although a somewhat different task, we felt that this approach was justified, especially considering
previous research that has established high agreement between experienced transcribers' identification of words and sounds only (Roberts and
Robinson 2004). The second and third transcribers were free to change
the words in the text they received according to their own hearing.
It was understood, by all the researchers involved, that the project was
working with Chafe's (1994) notion of IUs. For this reason, no formal
definition of IUs was provided, and the intertranscriber checks were
based on rules as published rather than rules as agreed through discussion (cf. Oelschlaeger and Thorne 1999). Just as the first transcriber did,
the second and third transcribers also indicated the transitional continuity
of each unit's intonation contour. Transitional continuity is defined as
the degree of continuity that occurs at the transition point between one
intonation unit and the next (Du Bois et al. 1993: 53). We will discuss
the marking of transitional continuity in greater detail later in the paper.
At this point, it is sufficient to note that there are three types of transitional continuity: final, continuing, and appeal. If an IU did not have an
identifiable transitional continuity, it was marked as truncated (see discussion of truncated IUs below). The specific instructions for the intertranscriber checks were:
Insert line breaks corresponding to intonation unit boundaries (press the RETURN key at the end of each intonation unit you identify);
Mark the end of each intonation unit for transitional continuity. Please use:
a period for final
a comma for continuing
a question mark for appeals
Alternatively, mark intonation units as truncated. Please use --
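The instructions above amount to a small mapping from end-of-line marks to types of transitional continuity. As a minimal sketch, assuming a transcript stored as plain text with one IU per line, the mapping could be read back as follows. The function names `continuity` and `segment` are ours, and the "unmarked" fallback is an assumption for lines that carry none of the four marks.

```python
# Marks follow the instructions given to the transcribers; everything else is illustrative.
CONTINUITY_MARKS = {".": "final", ",": "continuing", "?": "appeal"}


def continuity(iu_line: str) -> str:
    """Return the transitional continuity marked at the end of one IU line."""
    text = iu_line.rstrip()
    if text.endswith("--"):
        return "truncated"
    if text and text[-1] in CONTINUITY_MARKS:
        return CONTINUITY_MARKS[text[-1]]
    return "unmarked"  # assumption: no recognized mark at the end of the line


def segment(transcript: str) -> list[tuple[str, str]]:
    """Split a one-IU-per-line transcript into (IU text, continuity) pairs."""
    return [(line.strip(), continuity(line))
            for line in transcript.splitlines() if line.strip()]


sample = "er I'm aware that's like er --\nit's a part of your sort of spiritual make up to,\n. . to confront . . the situation."
for iu, kind in segment(sample):
    print(f"{kind:10} {iu}")
```

Only the last character (or a trailing `--`) is inspected, so pause dots at the start or in the middle of an IU do not affect the result.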
The second transcriber additionally recorded pauses and marked prominent words. The second transcriber listened to the audio file using the
same VoiceWalker software as the first transcriber. The third transcriber,
being a Mac user, commented that "there is no Mac version of VoiceWalker, so I'm just rewinding on Quick Time."
7. Intertranscriber checks
We struggled for some time to find a meaningful way to compare the different transcript versions. On the face of it, what we were doing seemed
like a straightforward inter-rater check that could be reported in terms
of levels of agreement and would tell us something about the reliability
of Transcript 1 prepared by the first transcriber. This would be commensurate with a positivist notion of transcription, where talk is an observable behavior that can be transcribed completely and accurately (Lapadat
2000). There are, in the literature, some precedents for such inter-rater
checks, some of which have developed quite sophisticated statistical measures (e.g., Roberts and Robinson 2004).
Table 2.
                                    Transcript 1   Transcript 2   Transcript 3
IUs identified                      159            186            218
IUs identical as in Transcript 1                   93             98
Table 2 shows a simple count of IUs identified by each of the transcribers, as well as how many IUs were transcribed as identical. The results show that the two intertranscribers identified more IUs overall, 186
and 218 IUs respectively, as compared to the first transcriber's 159 IUs.
This result was not very encouraging. Moreover, of the 186 IUs that the
second transcriber identified, only 93 IUs were identical in the first researcher's transcript. Table 2 also shows that the numbers are similar
when the first and third transcribers' performances are compared.
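To put the Table 2 counts on a common footing, they can be expressed as percentages. The short calculation below is arithmetic on the reported counts only; the percentages themselves are derived here and are not reported in the paper, and the helper name `share` is ours.

```python
# Counts taken from Table 2; the derived percentages are not reported in the paper.
ius_identified = {"Transcript 1": 159, "Transcript 2": 186, "Transcript 3": 218}
identical_to_t1 = {"Transcript 2": 93, "Transcript 3": 98}


def share(part: int, whole: int) -> float:
    """Percentage of `whole` made up by `part`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


for name, identical in identical_to_t1.items():
    of_own = share(identical, ius_identified[name])
    of_t1 = share(identical, ius_identified["Transcript 1"])
    print(f"{name}: {identical} identical IUs = {of_own}% of its own IUs, "
          f"{of_t1}% of the IUs in Transcript 1")
```

On these figures, exact matches cover 50.0% of the second transcriber's IUs and 45.0% of the third transcriber's, or 58.5% and 61.6% of the first transcriber's 159 IUs.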
Not only were these results discouraging, we also felt that this positivist
view did not fit the purpose of our intertranscriber checks, which was to
improve the quality of transcription in the conciliation talk project. That
is, we felt that any comparisons between the transcripts should facilitate
the development of the first researcher's transcription skills.
Extracts (3) and (4) contain parallel representations of a segment of
talk from Transcripts 1 and 2 and Transcripts 1 and 3, respectively. The
annotation in these parallel transcripts illustrates the approach we finally
used in order to produce a more meaningful comparison of the three transcripts. The lines in these parallel transcriptions are numbered to facilitate
line by line comparisons of IUs identified by the three transcribers. The
additional annotation, in the right-hand column in Extracts (3) and (4),
records different types of agreement, using Transcript 1 as the point of
reference. We recorded the following types of agreement:
identical IUs;
IUs with a common initial boundary;
IUs with a common final boundary;
talk identified as two or more IUs by the intertranscribers, but that
has the same overall initial and final boundaries as a single IU identified by the first transcriber.
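The four types of agreement listed above can be stated operationally. The following is a minimal sketch under our own assumptions, not the procedure the project team used (the comparison in the paper was done by hand on parallel transcripts): each IU is represented as a span of word positions in the shared verbatim text, and the toy spans are invented for illustration.

```python
Span = tuple[int, int]  # (index of first word, index after last word) in the verbatim text


def agreement_type(ref: Span, other: list[Span]) -> str:
    """Classify one Transcript 1 IU against another transcriber's IUs."""
    start, end = ref
    if ref in other:
        return "identical"
    # IUs of the other transcriber that lie wholly inside the reference IU.
    inside = sorted(s for s in other if start <= s[0] and s[1] <= end)
    tiles = (
        len(inside) >= 2
        and inside[0][0] == start
        and inside[-1][1] == end
        and all(a[1] == b[0] for a, b in zip(inside, inside[1:]))
    )
    if tiles:
        return f"one IU transcribed as {len(inside)} IUs"
    if any(s[0] == start for s in other):
        return "same initial boundary"
    if any(s[1] == end for s in other):
        return "same final boundary"
    return "no agreement"


# Invented toy data: four IUs in Transcript 1, six in a second transcript.
transcript_1 = [(0, 5), (5, 9), (9, 14), (14, 20)]
transcript_2 = [(0, 5), (5, 7), (7, 9), (9, 12), (12, 15), (15, 20)]
for iu in transcript_1:
    print(iu, agreement_type(iu, transcript_2))
```

Checking for an exact split before checking single boundaries mirrors the order of the list: a reference IU that is fully tiled by shorter IUs shares both of its outer boundaries, so it should not be counted again under the boundary-only categories.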
(3) [Parallel representation of Transcripts 1 and 2 with a "Type of agreement" column. Only fragments of the extract survive:]
er I'm aware that's like er --
it's a part of your sort of spiritual make up to,
. . to confront . . the situation.
. . er and er . . move on from it.
(4) [Parallel representation of Transcripts 1 and 3 with a "Type of agreement" column. Beyond the lines shown under (3), only the annotations "identical" and "One IU transcribed as two IUs" survive.]
A comparison of the talk represented in Extracts (3) and (4), using the
above types of agreement, showed that the first and second transcribers
agreed fully on six IUs (lines 4, 5, 9, 12, 13, and 14; marked "identical" in
Extract [3]). The first and third transcribers agreed on five IUs (lines 1, 3,
4, 9, and 13; marked "identical" in Extract [4]). Equally interesting from
our perspective of developing transcription skills was that the first and
second transcribers agreed on the initial boundary in line 1 (marked
"same initial boundary" in Extract [3]), and on the final boundary in line
3 (marked "same final boundary" in Extract [3]). A similar pattern is evident between the first and third transcribers for lines 10 and 12 (see Extract [4]). Moreover, the single IUs identified by the first transcriber in
lines 7 and 10 are transcribed as two IUs by the second transcriber (while
retaining the same overall initial and final boundaries as the single IUs
identified by the first transcriber; see Extract [3]). The same relationship
holds between the first and the third transcribers for lines 5, 7, and 14.
That is, the first transcriber has transcribed these lines as single IUs,
whereas the third transcriber has identified two IUs (but retaining the
same overall initial and final boundaries as the single IUs identified by
the first transcriber; see Extract [4]). There were also cases in the data
where the second or third transcribers identified three or more IUs with
the same overall initial and final boundaries as a single IU identified by
the first transcriber. The parallel representation of transcriptions in Extract (5) illustrates such an instance.
(5)
Transcript 1
Transcript 3
so it was partly on on
. . er . . a political thing,
Table 3.
Count   Type of agreement
93      Instances when the first and second transcribers agreed on the identification of an IU
21      Instances when the same talk was identified as one IU by the first transcriber, but as two IUs by the second transcriber
4       Instances when the same talk was identified as one IU by the first transcriber, but as three IUs by the second transcriber
1       Instances when the same talk was identified as one IU by the first transcriber, but as four IUs by the second transcriber
17      Instances where both the first and the second transcriber agreed on the initial boundary of an IU
11      Instances where both the first and the second transcriber agreed on the final boundary of an IU
7       IUs uniquely identified by the second transcriber
Table 4.
Count   Type of agreement
98      Instances when the first and third transcribers agreed on the identification of an IU
33      Instances when the same talk was identified as one IU by the first transcriber, but as two IUs by the third transcriber
6       Instances when the same talk was identified as one IU by the first transcriber, but as three IUs by the third transcriber
2       Instances when the same talk was identified as one IU by the first transcriber, but as four IUs by the third transcriber
1       Instances when the same talk was identified as one IU by the first transcriber, but as five IUs by the third transcriber
9       Instances where both the first and the third transcriber agreed on the initial boundary of an IU
9       Instances where both the first and the third transcriber agreed on the final boundary of an IU
5       IUs uniquely identified by the third transcriber
heard only one, and one instance where the second transcriber identified
as many as four IUs corresponding to the one that the first transcriber
identified. Finally, there were 17 instances where the first and second
transcribers identified the same initial boundary and 11 instances where
they identified the same final boundary (see Extracts [3] and [4] for illustration). The final row in Table 3 records the number of IUs uniquely
identified by the second transcriber. Table 4 shows the same comparison,
but now between the first and the third transcribers.
We found this more subtle analysis more productive for our purposes
than a straightforward inter-rater check. For one, it begins to explain
why the first transcriber identified only 159 IUs, as compared to the second transcriber's 186 IUs and the third transcriber's 218 IUs. Although
the absolute level of agreement reported in Table 2, counting only IUs
fully agreed upon, appears low, the first transcriber's identification of
boundaries between IUs appeared promising. That is, the more subtle
analysis shows that the real problem, as it were, was that the first transcriber did not identify enough, or all, boundaries between IUs, thereby
identifying fewer IUs overall. This was productive for the purpose of developing transcription skills, as it indicated a specific area that the first
transcriber could pay attention to, namely improving his identification of
boundaries between IUs.
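One way to check that the finer categories account for the differing totals is to add up the rows of Tables 3 and 4 from the intertranscriber's side. The accounting below is our reading of the tables, not a calculation stated in the paper: it assumes the rows are mutually exclusive, that an instance of one IU heard as k IUs contributes k IUs, and that every other row contributes one IU per instance.

```python
def intertranscriber_total(identical, splits, initial_only, final_only, unique):
    """Sum the IUs attributed to an intertranscriber under the assumptions above.

    `splits` maps k to the number of Transcript 1 IUs that were heard as k IUs.
    """
    from_splits = sum(k * count for k, count in splits.items())
    return identical + from_splits + initial_only + final_only + unique


# Row values copied from Table 3 (second transcriber) and Table 4 (third transcriber).
table_3_total = intertranscriber_total(93, {2: 21, 3: 4, 4: 1}, 17, 11, 7)
table_4_total = intertranscriber_total(98, {2: 33, 3: 6, 4: 2, 5: 1}, 9, 9, 5)
print(table_3_total, table_4_total)  # 186 218
```

Under this reading the rows sum to 186 and 218, the totals reported in Table 2 for Transcripts 2 and 3, which is consistent with the point that much of the gap comes from boundaries the first transcriber did not mark.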
8.
In order to improve his skills in recognizing IU boundaries, the first researcher again consulted Chafe's six characteristics of IUs, reviewed in an
earlier part of this paper. Chafe (1994: 59) uses the following typical IU
to illustrate four out of his six characteristics:
. . and so the hall is re`al long%.
. . .(.36) [the next intonation unit]
Voice quality: The end of the last word of the IU (long) is spoken with a
creaky voice (marked by the percent sign).
A similarly rigorous identification of IUs in the project data encountered
some difficulties. The first transcriber initially found Chafe's use of a period to mark sentence-final falling pitch, a comma to mark contours that
are not sentence-final, and a question mark to represent an appeal, potentially confusing. Du Bois et al. (1993) was helpful here. They also use
period, comma, and question mark to distinguish intonation contours as
being either final, continuing, or an appeal. According to Du Bois et
al. (1993: 53), this transitional continuity of intonation contours will have
various realizations, one of which is the final pitch movement of IUs. In
the above example, from Chafe, the falling pitch contour at the end of the
IU is associated with a final intonation contour. In sum, the first researcher resolved to re-learn the use of these familiar symbols, the punctuation marks, to describe the intonation contours of IUs, and he used final pitch movement of IUs as a heuristic to aid the identification of these
intonation contours. At the same time, the identification of final pitch
movement turned out to be helpful in the identification of IU boundaries.
The first transcriber encountered two additional difficulties. Firstly, so-called changes in voice quality at the beginning and end of IUs (e.g., Chafe's creaky voice) were difficult to recognize. Secondly, distinctly discernible declining patterns of pitch level were not found within or across the
identied IUs. We suspect that these may be rather subtle characteristics
of IUs, and that extensive experience may be needed before one may be
able to use these productively for identifying IU boundaries. We did,
however, notice that IU boundaries were sometimes associated with a
slight re-setting of the pitch baseline; i.e., IUs sometimes started with a
slightly lower or higher pitch than the preceding talk. In addition, we also
added prominence as a characteristic of IUs in our data. That is, we often
saw that IUs in our project data contained at least one prominent syllable
or word.
The careful re-transcription of the five-minute sample segment, allowing for the above subtleties, resulted in the following characteristics of
IUs identified in the project data:
1. Pitch: Beginning with a re-setting of the pitch baseline and having a
recognizable final pitch movement
2. Duration: Beginning with a shortening of syllables/words (i.e., anacrusis) and ending with lengthened syllables/words
3. Prominence: Including at least one clearly prominent word, achieved
with altered intensity, pitch, or a combination of both
4. Pausing: Being preceded and followed by a pause
Note that these characteristics include elements not only from Chafe
(1994). We also include one of Cruttenden's (1986, 1997) internal criteria;
i.e., that there should be at least one prominent word or syllable in an IU.
We accept, however, that the way we recognize prominence is somewhat
different from the pitch movement to or from an accented syllable that
Cruttenden uses as an internal criterion for identifying intonation groups.
An important tool helping the first transcriber to arrive at, and practice
the use of, the above criteria in his identification of IUs, was what we call
an enhanced transcription. The conventions used in this enhanced transcription build on Chafe's (1994) narrow transcription of IUs, as illustrated above; include elements from Du Bois et al. (1993); and add our
own notation where necessary. Example (a) below illustrates the enhanced transcription conventions, and thereby also the four criteria we
used to identify IUs in the data. The enhanced conventions are, then, a
way to make transparent the decision-making involved in the identification of IUs in the five-minute sample segment of the conciliation talk. By
contrast, the conventions used in the final transcription of the full data set
were guided by what we needed to re-present for our exploration of metaphor dynamics. Hence, in the later full transcription we used the more
economical conventions illustrated by Example (b).
a. . . .(2.0) _back in the ^moment\
b. . . .(2.0) back in the moment.
In Example (a), there is a two-second pause at the beginning of the IU
(marked by the three dots and parentheses indicating the length of the
pause). At the beginning of the IU there is a resetting of the pitch baseline
(marked by the underscore). The first three words of the IU are accelerated (anacrusis; marked by the smaller font size). The first syllable of the
word "moment" is prominent (marked by the caret). The first and second
vowels of the word "moment" are both lengthened (marked by the equal
signs). Finally, the IU ends with falling final pitch (marked by the backward slash). In Extract (6) the conventions are applied to a longer stretch
of talk from the first researcher's enhanced transcription of the five-minute sample segment of conciliation talk. The line numbering in this
extract, as well as in subsequent extracts, corresponds to the line numbering in the enhanced transcript (see Appendix 2 for an illustrative
segment).
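The plain-text symbols of the enhanced conventions lend themselves to a simple reading routine. The sketch below is illustrative only and rests on assumptions we should flag: the function name `describe` and the dictionary keys are ours, reading a forward slash as rising final pitch is an assumption (the description above only defines the backward slash as falling), and font size and equal signs are not handled.

```python
import re

# Three dots (possibly spaced) followed by the pause length in seconds, e.g. ". . .(2.0)".
PAUSE = re.compile(r"(?:\.\s*){3}\((\d+(?:\.\d+)?)\)")


def describe(iu: str) -> dict:
    """Summarize the enhanced-transcription symbols found in one IU line."""
    text = iu.strip()
    pause = PAUSE.search(text)
    # Backward slash = falling final pitch; forward slash = rising is our assumption.
    final_pitch = {"\\": "falling", "/": "rising"}.get(text[-1:])
    prominent = [w.strip("_/\\").replace("^", "")
                 for w in re.findall(r"\S*\^\S*", text)]  # words carrying a caret
    return {
        "pause_seconds": float(pause.group(1)) if pause else None,
        "pitch_reset": "_" in text,       # underscore marks a resetting of the pitch baseline
        "prominent_words": prominent,
        "final_pitch": final_pitch,
    }


example_a = ". . .(2.0) _back in the ^moment\\"
print(describe(example_a))
```

For Example (a) this yields a 2.0 second pause, a pitch reset, "moment" as the prominent word, and falling final pitch, matching the prose description of the example.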
(6) (Illustration of conventions used in the enhanced transcription)
39 Pat: and you ^know I'm er --
40      . . .(1.0) er I'm ^aware that's like er --
41      . . it's a part of your sort of spiritual ^make up to\
        . . .(2.0) to con^front/
        [hmh]
        . . .(1.0) _[the] situ^ation/
        . . . _and er --
        . . . move ^on from it\
        . . .(1.0) but a^gain/ -_er
        . . .(3.0) _^dealing /-_you know\
        like . . ^having to handle that\
        . . . _you know\
        and er
        . . .(1.0) _or the . . e^normity of it\
[Parallel extract covering lines 146-158 of the enhanced transcript. Only fragments survive:]
_that ^loss
you know,
I felt it.
Extract (8) shows a stretch of talk that was more fluent, in the sense that
it includes fewer truncated IUs (only one in Extract [8]: "when you come
--" in line 19) and fewer regulatory IUs (only one clear example in Extract
[8]: "I mean" in line 13). Such more fluent stretches of talk were also
common in the five-minute sample segment.
(8) (Enhanced transcription of a more fluent stretch of talk)
8  Pat: there's ^no right
9       ...(1.0) ^for ^me
10      ...(2.0) _to sit here and be for^given\
11      ...(2.0) if you understand me
12 Jo:  hmh
13 Pat: _I mean
14      in a ^sense there's the political thing
15      I knew what I was ^doing\
16      .. and I would even de^fend actions I've taken
17      _etcetera
18      _^but
19      _when you come --
20      when it comes ^down to it\
21      .. I am ^sitting with somebody who's affected by\
22      ... _^my actions
The more fluent stretches of talk, as in Extract (8), were also different in
that they showed high levels of agreement in the earlier intertranscriber
checks, and the transcription of these stretches of talk did not change
much between the first researcher's initial transcription and the enhanced
transcription. The parallel representation in Extract (9) illustrates how the
transcription of the more fluent stretch of talk in Extract (8) did not
change much between the initial transcription (Transcript 1) and the enhanced transcription; only two IUs, in lines 14 and 22, are unique to the
enhanced transcription.
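[Extract (9): parallel representation of Transcript 1 and the enhanced transcript, lines 8-22. Recoverable Transcript 1 lines: 8 "there is no right,"; 9 ".. for me,"; 10 ".. to sit here and be forgiven." Recoverable enhanced-transcript line: 22 "... _^my actions".]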
It seems, then, that stretches of talk that included many truncated and
regulatory IUs were particularly challenging for the first transcriber. Successfully dealing with these stretches of talk was therefore instrumental
for the development of his transcription skills. By contrast, the transcription of stretches of talk with fewer truncated and regulatory IUs was associated with higher levels of intertranscriber agreement, and changed
much less across the different stages of transcription.
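The contrast between less and more fluent stretches can be made concrete by counting truncated and regulatory IUs. The following is a minimal sketch, assuming that IUs are held as plain-text lines, that truncated IUs end in "--", and that regulatory IUs can be approximated by a short list of expressions. The list `REGULATORY` and the function names are hypothetical, and deciding whether an IU is regulatory remains an analyst's judgment rather than a lookup.

```python
import re

# Assumption: an illustrative list only; a real project would decide
# case by case which IUs regulate the ongoing interaction.
REGULATORY = {"you know", "i mean"}


def normalise(iu):
    """Lower-case an IU and drop pause, pitch, and prominence marks."""
    letters_only = re.sub(r"[^a-z' ]", " ", iu.lower())
    return " ".join(letters_only.split())


def fluency_profile(ius):
    """Count truncated and regulatory IUs in a stretch of talk."""
    truncated = [iu.rstrip().endswith("--") for iu in ius]
    regulatory = [normalise(iu) in REGULATORY for iu in ius]
    total = len(ius)
    # An IU that is both truncated and regulatory is counted once in the share.
    flagged = sum(1 for t, r in zip(truncated, regulatory) if t or r)
    return {
        "ius": total,
        "truncated": sum(truncated),
        "regulatory": sum(regulatory),
        "share": flagged / total if total else 0.0,
    }


# Lines 49-52 of the enhanced transcription in Extract (6).
less_fluent = fluency_profile([
    "...(3.0) _^dealing/ --",
    "_you know\\",
    "like .. ^having to handle that\\",
    "... _you know\\",
])
```

A high share flags a stretch that, on the account given here, is likely to need closer attention during transcription.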
The final stage, then, was the transcription of the full data set, putting
the skills that the first transcriber had developed to work. This transcription did not use the enhanced transcription conventions. Rather, the project now reverted to a simpler set of conventions agreed as sufficient
for the purposes of the research project. Extract (10) illustrates the conventions used in this final transcription.
(10)
39 Pat: and you know I'm er --
40      ...(1.0) er I'm aware that's like er --
41      .. it's a part of your sort of spiritual make up to,
42      ...(2.0) to confront.
43 Jo:  [hmh]
44 Pat: ...(1.0) [the] situation.
45      ... and er --
46      ... move on from it.
47      ...(1.0) but again --
48      er,
49      ...(3.0) dealing --
50      you know,
51      like .. having to handle that.
52      ... you know,
53      and er,
54      ...(1.0) or the .. enormity of it.
9. Conclusion
In this paper, we have highlighted issues around the development of transcription skills, in particular the problems of working with spontaneous
talk, such as that found in spoken interaction on emotionally charged
topics. Our position is that the process of transcription is key to rigorous
research involving spoken interaction data, and the present paper offers,
we believe, a unique description of the challenges of the transcription process in one particular project, as well as the ways that these challenges
were overcome. In the following, we summarize this process of developing transcription skills in our research project. This summary may, at the
same time, be seen as a set of recommendations, or an agenda, that can
inform the transcription process of researchers engaged in other similar
projects working with spoken discourse data.
Our starting point was the decision to transcribe the conciliation talk
using IUs. Working toward this aim, we trialed our transcription using
a five-minute sample segment of talk. The first transcriber began by transcribing the segment based on definitions of IUs available in the literature. Next, we involved two more experienced intertranscribers in the first
transcriber's learning process. Following this, the first transcriber made
his decisions transparent using our enhanced transcription conventions.
Finally, we developed project-specific conventions, criteria, and procedures for the transcription of the full data set.
The first step, then, was to prepare an initial transcript of the five-minute sample segment of conciliation talk based primarily on the characteristics of IUs outlined by Chafe (1994), but also consulting Cruttenden (1986, 1997). However, as Chafe (1994: 62) points out, and we ourselves discovered at the end of this first step, skill in the transcription of
IUs requires both instruction and practice, and takes time and effort to
develop. In particular, the later intertranscriber checks, as well as the subsequent externalizing of the transcriber's decisions, showed that the criteria for identifying IUs were only partially internalized by the first transcriber in this first step of the transcription process.
Next, we had two additional researchers transcribe our five-minute
sample segment of conciliation talk. Both of these transcribers brought
unique and valuable experience to the task. Although we originally set
out to compare the three resulting transcriptions, we found this to be a
less fruitful exercise for the development of the first transcriber's skills.
Rather, anticipating the necessarily more autonomous task of transcribing the full data set, the first transcriber worked through the two additional transcriptions now available to him, noted both differences and
similarities and what their nature was, and
then reflected on whether these indicated areas where his own transcription might change and/or improve. For example, the detailed intertranscriber analysis showed that the IU boundaries he had marked were correctly identified,
but that a number of potential IU boundaries had been missed. With this
insight, and re-engaging with the literature on IUs, the five-minute sample
segment was re-transcribed. In sum, this exercise may be described as listening, noticing, and reflecting using already prepared transcripts, in this
case the transcripts of two experienced researchers. The exercise, then,
involved learning "with" rather than "from" the transcripts of the more
experienced researchers. More generally, we believe that intertranscriber
checks should be geared toward improving the quality of transcription,
rather than acting as a simple objective measure validating the reliability
of transcription. This does not mean that we dismiss the need for reliability across transcribers. Rather, our experience tells us that such reliability
can only follow from a more general concern with the quality of transcription within a project.
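The kind of comparison described above, in which shared, missed, and additional IU boundaries are noted, can be sketched in code. The following Python sketch is an illustration only and not the procedure used in the project: it assumes that two transcriptions contain the same sequence of words, with prosodic marks removed, and it locates each IU boundary by the number of words that precede it. The function names and the toy word sequences are hypothetical.

```python
def boundaries(ius):
    """Word offsets at which one IU ends and the next begins.

    The offset after the last IU is dropped, because every transcription
    of the same talk ends at the same point.
    """
    offsets, total = set(), 0
    for iu in ius:
        total += len(iu.split())
        offsets.add(total)
    offsets.discard(total)
    return offsets


def compare(first, reference):
    """Sort IU boundaries into shared, missed, and extra for the first transcriber."""
    a, b = boundaries(first), boundaries(reference)
    return {
        "shared": sorted(a & b),   # marked in both transcriptions
        "missed": sorted(b - a),   # marked only in the reference transcription
        "extra": sorted(a - b),    # marked only by the first transcriber
    }


# Toy data, invented for illustration: letters stand in for words.
first_transcriber = ["a b c", "d e", "f g h i"]
experienced_transcriber = ["a b c", "d e", "f g", "h i"]
result = compare(first_transcriber, experienced_transcriber)
```

In the toy data every boundary marked by the first transcriber is shared, while one boundary in the reference transcription is missed. A listing of this kind supports reflection on specific stretches of talk rather than serving as a single reliability score.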
In the following step of the transcription process, the first transcriber
used the enhanced transcription conventions (see Extract [6]). This exercise was instrumental in making transparent the decision-making process
involved in the identification of IUs. It also acted to highlight particular
challenges of transcribing IUs with consistency. For example, the first
transcriber found that paying equal attention to the beginnings and ends
of IUs made the identification of IUs more consistent. This also highlighted
characteristics of IUs, such as anacrusis and re-setting of the pitch baseline at the beginning of IUs, as well as syllable lengthening and terminal
pitch contours at the end of IUs. Importantly, this equal focus on both
the beginnings and ends of IUs was subtly different from the concern
with "boundaries" between intonation units, which had been the focus
of the first transcriber at the earlier intertranscriber stage. Making the
decision-making processes more transparent also highlighted that identifying IUs was less problematic in the case of more fluent stretches of
talk, and more problematic in the case of less fluent stretches of talk.
From this, we became aware that the existing literature provides insufficient detail on how to deal with hesitations, false starts, and talk used to
regulate ongoing spoken interaction, as well as the effect of these truncated intonation contours on the transcription of co-present more complete intonation contours. Finally, making the transcription decisions
transparent using our enhanced conventions helped ensure the development of transcription skills to a level where, when the first transcriber
went on to transcribe the full data set, he was working not at the limits
of his skill level, but within his skill level.
Finally, the development of project-specific conventions, criteria, and
procedures was an ongoing aim, affecting all stages of the transcription
process. At the beginning of the project, we decided to transcribe the conciliation talk using IUs. This was based on our intuitions that such a transcription would facilitate the investigation of the dynamics of metaphor
use in the conciliation talk. In later stages, more fine-grained decisions
were made about what features of the spoken discourse to represent in
the transcripts. These decisions were not taken in isolation from the first
transcriber's gradually developing skills. Rather, his developing skills in
the transcription of IUs acted at times to create affordances, i.e., opening
up new possibilities for what could be included in the final project transcription, and at times as a constraint, i.e., what the first transcriber could
do within his skill level limited what could be included in the final project
transcription.
One final activity that may usefully be added to this agenda for developing transcription skills is the possibility of learning from other transcribers' experiences. This may involve talking to other researchers with
similar responsibilities within a research project, or with more experienced researchers who in the past have had such transcription responsibilities. It may also include interacting with the emerging research methods
literature that describes the experiences of transcribers (e.g., Gregory et al.
1997; Bird 2005; Tilley 2003a, 2003b; Tilley and Powick 2002). The present paper is a further contribution to this emerging research methods
literature.
Convention               Description
,                        Continuing
.                        Final
?                        Appeal
/                        Rising
\                        Falling
_ (at the end of IU)     Level
_ (at the start of IU)   Re-setting of the pitch baseline
-                        Truncated word
--                       Truncated intonation unit
^                        Prominent syllable or word
and so the               Small print (accelerated speech, anacrusis)
=                        Equal sign (lengthening)
..                       Short pause (<0.4 s)
...                      Medium pause (0.4 to 0.75 s)
...(2.0)                 Longer pause (length in seconds indicated in parentheses)
[ ]                      Overlapping speech
<X X>                    Uncertain hearing
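The three pause conventions in the table above amount to a simple rule. The following is a minimal sketch, assuming that pause durations are measured in seconds and that longer pauses are reported to one decimal place, as in the extracts. The function name `pause_mark` is hypothetical.

```python
def pause_mark(seconds):
    """Return the transcription mark for a pause of the given length."""
    if seconds < 0:
        raise ValueError("pause length cannot be negative")
    if seconds < 0.4:
        return ".."                    # short pause
    if seconds <= 0.75:
        return "..."                   # medium pause
    return f"...({seconds:.1f})"       # longer pause, length in seconds


marks = [pause_mark(s) for s in (0.2, 0.5, 2)]
```

A rule of this kind only formats a measured duration; hearing or measuring the pause in the first place remains the transcriber's task.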
[Appendix 2: illustrative segment of the enhanced transcript, lines 1-37 (speakers Pat and Jo). The transcript text is not recoverable.]
Notes
1. The project, "Using visual display to investigate the dynamics of metaphor in conciliation talk," was supported by the UK's Arts and Humanities Research Board under its
Innovation Award scheme. We acknowledge that support, and also thank the participants in the talk for giving permission to use the data.
2. This view of consciousness does not necessarily dismiss a role for unconscious mental
processes. In fact, Chafe (1996: 39) concedes that both the content . . . flow and management of consciousness is probably in large part unconsciously determined. Our assessment is that the exact division of labor, between conscious and unconscious processes,
does not impact upon the process of transcribing intonation units.
3. Pierrehumbert and Hirschberg (1990) suggest that intonational phrases are made up of
one or more intermediate units. These intermediate units are roughly similar to what
we in this paper call intonation units.
References
Atkinson, J. M. and Heritage, J. (1984). Transcript notation. In Structures of Social Action: Studies in Conversation Analysis, J. M. Atkinson and J. Heritage (eds.), ix-xvi. Cambridge: Cambridge University Press.
Bird, C. M. (2005). How I stopped dreading and learned to love transcription. Qualitative Inquiry 11 (2): 226-248.
Cameron, L. (2003). Metaphor in Educational Discourse. London: Continuum.
Cameron, L. J. (forthcoming). Patterns of metaphor use in reconciliation talk. Discourse & Society.
Cameron, L. J. and Stelma, J. H. (2004). Metaphor clusters in discourse. Journal of Applied Linguistics 1 (2): 107-136.
Chafe, W. (1993). Prosodic and functional units of language. In Talking Data: Transcription and Coding in Discourse Research, J. A. Edwards and M. D. Lampert (eds.), 33-43. Hillsdale, NJ: Lawrence Erlbaum.
Chafe, W. (1994). Discourse, Consciousness, and Time: The Flow and Displacement of Conscious Experience in Speaking and Writing. Chicago: Chicago University Press.
Chafe, W. (1996). How consciousness shapes language. Pragmatics and Cognition 4 (1): 35-54.
Cook, G. (1990). Transcribing infinity: Problems of context presentation. Journal of Pragmatics 14: 1-24.
Couper-Kuhlen, E. (2001). Intonation and discourse: Current views from within. In The Handbook of Discourse Analysis, D. Schiffrin, D. Tannen, and H. E. Hamilton (eds.), 13-34. Malden, MA: Blackwell.
Cruttenden, A. (1986). Intonation. Cambridge: Cambridge University Press.
Cruttenden, A. (1997). Intonation, 2nd ed. Cambridge: Cambridge University Press.
Crystal, D. (1969). Prosodic Systems and Intonation in English. London: Cambridge University Press.
Dressler, R. A. and Kreuz, R. J. (2000). Transcribing oral discourse: A survey and a model system. Discourse Processes 29 (1): 25-36.
Du Bois, J. W. (1991). Transcription design principles for spoken discourse research. Pragmatics 1 (1): 71-106.
Du Bois, J. W. (2000). VoiceWalker: A discourse transcription utility. URL: <http://www.linguistics.ucsb.edu/resources/computing/download/documentation.htm> [accessed 27 May 2005].
Du Bois, J. W., Schuetze-Coburn, S., Cumming, S., and Paolino, D. (1993). Outline of discourse transcription. In Talking Data: Transcription and Coding in Discourse Research, J. A. Edwards and M. D. Lampert (eds.), 45-89. Hillsdale, NJ: Lawrence Erlbaum.
Edwards, J. A. (1993). Principles and contrasting systems of discourse transcription. In Talking Data: Transcription and Coding in Discourse Research, J. A. Edwards and M. D. Lampert (eds.), 3-31. Hillsdale, NJ: Lawrence Erlbaum.
Ford, C. E. and Thompson, S. A. (1996). Interactional units in conversation: Syntactic, intonational, and pragmatic resources for the management of turns. In Interaction and Grammar, E. Ochs, E. A. Schegloff, and S. A. Thompson (eds.), 134-184. Cambridge: Cambridge University Press.
Green, J., Franquiz, M., and Dixon, C. (1997). The myth of the objective transcript: Transcribing as a situated act. TESOL Quarterly 31 (1): 172-176.
Gregory, D., Russell, C. K., and Phillips, L. R. (1997). Beyond textual perfection: Transcribers as vulnerable persons. Qualitative Health Research 7 (2): 294-300.
Gumperz, J. J. and Berenz, N. (1993). Transcribing conversational exchanges. In Talking Data: Transcription and Coding in Discourse Research, J. A. Edwards and M. D. Lampert (eds.), 91-121. Hillsdale, NJ: Lawrence Erlbaum.
Halliday, M. A. K. (1967). Intonation and Grammar in British English. The Hague: Mouton.
Kreckel, M. (1981). Communicative Acts and Shared Knowledge in Natural Discourse. London: Academic Press.
Lapadat, J. C. (2000). Problematizing transcription: Purpose, paradigm and quality. International Journal of Social Research Methodology 3 (3): 203-219.
Lapadat, J. C. and Lindsay, A. C. (1998). Examining transcription: A theory-laden methodology. Paper presented at the Annual Meeting of the American Educational Research Association, San Diego, CA, 13-17 April (Eric Document Number ED 419 821).
Lapadat, J. C. and Lindsay, A. C. (1999). Transcription in research and practice: From standardization of technique to interpretive positionings. Qualitative Inquiry 5 (1): 64-86.
Lindsay, J. and O'Connell, D. C. (1995). How do transcribers deal with audio recordings of spoken discourse. Journal of Psycholinguistic Research 24 (2): 101-115.
McLellan, E., MacQueen, K. M., and Neidig, J. L. (2003). Beyond the qualitative interview: Data preparation and transcription. Field Methods 15 (1): 63-84.
Müller, N. and Damico, J. S. (2002). A transcription toolkit: Theoretical and clinical considerations. Clinical Linguistics & Phonetics 16 (5): 299-316.
Ochs, E. (1979). Transcription as theory. In Developmental Pragmatics, E. Ochs and B. B. Schieffelin (eds.), 43-72. New York: Academic Press.
O'Connell, D. C. and Kowal, S. (1994). The transcriber as language user. In The Dynamics of Language Processes: Essays in Honor of Hans W. Dechert, B. Guillermo (ed.), 119-142. Tübingen: Gunter Narr.
O'Connell, D. C. and Kowal, S. (1999). Transcription and the issue of standardization. Journal of Psycholinguistic Research 28 (2): 103-120.
Oelschlaeger, M. L. and Thorne, J. C. (1999). Application of the correct information unit analysis to the naturally occurring conversation of a person with aphasia. Journal of Speech, Language, and Hearing Research 42: 636-648.
Park, J. S.-Y. (2002). Cognitive and interactional motivations for the intonation unit. Studies in Language 26 (3): 637-680.
Pierrehumbert, J. and Hirschberg, J. (1990). The meaning of intonational contours in the interpretation of discourse. In Intentions in Communication, P. R. Cohen, J. Morgan, and M. E. Pollack (eds.), 271-311. Cambridge, MA: MIT Press.
Roberts, F. and Robinson, J. D. (2004). Interobserver agreement on first-stage conversation analytic transcription. Human Communication Research 30 (3): 376-410.
Romero, C., O'Connell, D. C., and Kowal, S. (2002). Notation systems for transcription: An empirical investigation. Journal of Psycholinguistic Research 31 (6): 619-631.
Schuetze-Coburn, S., Shapley, M., and Weber, E. G. (1991). Units of intonation in discourse: A comparison of acoustic and auditory analyses. Language and Speech 34: 207-234.
Sealey, A. and Carter, B. (2004). Applied Linguistics as a Social Science. London: Continuum.
Tilley, S. A. (2003a). Challenging research practices: Turning a critical lens on the work of transcription. Qualitative Inquiry 9 (5): 750-773.
Tilley, S. A. (2003b). Transcription work: Learning through coparticipation in research practices. Qualitative Studies in Education 16 (6): 835-851.
Tilley, S. A. and Powick, K. D. (2002). Distanced data: Transcribing other people's research tapes. Canadian Journal of Education 27 (2/3): 291-310.
Wennerstrom, A. (2001). The Music of Everyday Speech: Prosody and Discourse Analysis. Oxford: Oxford University Press.
Wichmann, A. (2000). Intonation in Text and Discourse: Beginnings, Middles and Ends. Harlow, Essex: Pearson Education/Longman.
Yule, G. (1980). Speakers' topics and major paratones. Lingua 52: 33-47.
Juurd H. Stelma received a Ph.D. in Education from the University of Leeds in 2003 and is
currently a Lecturer in TESOL in the School of Education at the University of Manchester.
His main research interest is the development of methodology for exploring the dynamical
nature of language in use. Address for correspondence: School of Education, Humanities
Devas Street Building, University of Manchester, Oxford Road, Manchester, M13 9PL,
UK <Juup.Stelma@manchester.ac.uk>.
Lynne J. Cameron is Professor of Applied Linguistics in the Centre for Language and Communication at the Open University. Her research seeks to understand how language is used
in building understanding between people, particularly through metaphor. She has published
widely on the use of metaphor in different settings, including the co-edited Researching and
Applying Metaphor (1999, with Graham Low) and her book Metaphor in Educational Discourse (2003). Address for correspondence: Centre for Language and Communication, Faculty of Education and Language Studies, The Open University, Milton Keynes, MK7 6AA,
UK <L.J.Cameron@open.ac.uk>.