Comprehension
Ark Verma
To begin with…
• So, if you are asked to name the picture of a rabbit & the
target phonemes are /r/ or /b/, you should press a button as
quickly as possible & you should refrain from responding if
the target phoneme to monitor is /k/.
• People can do this phoneme - monitoring task very accurately,
and they respond a little bit faster if the target phoneme
comes from the beginning of the word than if the target
phoneme comes from the middle or the end of the word
(Wheeldon & Levelt, 1995).
• To summarise, how the WEAVER++ model works,
• speech errors
• speech errors are not random. for e.g. slips of the tongue
occur in systematic patterns, & those patterns can be
related back to aspects of the speech production process.
• Subjects might be asked to say, bid meek, bud muck & big
men, all of which have a /b/ in the first position in the first
syllable and an /m/ in the first position in the second syllable.
• About 10% of the time, subjects make an error and say bad
mack or bad back .
• Sound exchange errors almost always occur when sounds are
exchanged between words in the same phrase, & the vast
majority involve movement of only a single phoneme from
each word (Nooteboom, 1969).
• this frame consists of a set of slots, & each slot is labeled for
the kind of word that has to appear there (e.g. noun, verb,
adjective etc.).
• How does one activate the sounds that make up the word?
• Early studies in picture recognition & picture naming showed
that people activate different concepts at about the same
speed, but concepts that were used less frequently in speech
or writing , took longer response times (Oldfield & Wingfield,
1965).
• in one task they simply stated whether they had seen the
object before (recognition test)
• Do you activate just the concept you need, right when you
need it? or
• in the former case, both fig & beet are words; in the latter
neither hig nor borse are words.
• Interactive spreading activation accounts explain lexical bias
effect by appealing to feed - forward and feedback links
between lemmas and phonological output mechanisms.
• In this kind of model, phonological activation begins as soon as
lemmas begin to be activated, but before a final candidate has
been chosen. As individual phonemes begin to be activated,
they send feedback to the lemmas that they are connected to,
increasing the activation of the lemmas.
• It is less likely that a non - word error will result, because any set
of phonemes that would lead to a non - word being produced
will not enjoy any reinforcing activation from the lemma level.
• Infact, there are patients who have difficulty with just some
types of words. Some patients have difficulties with content
words (semantically rich words like cat, table, & Hannibal
Lecter) but little difficulty with function words (semantically
light grammatical markers like the, of, and, was)
• Lemma theory would explain a patient like that by proposing
that lemmas for content words are selectively damaged, while
lemmas representing the function words intact.
• When the same patient was asked to write the name of the
picture, she wrote forks.
• the articulators include the lips, the tongue tip, the tongue
body, the velum (the part of soft palate towards the back of the
mouth), & the glottis (a structure in your throat that houses the
vocal folds).
• So, by knowing what the gestures are, you can tell what the
set of words was that produced that set of gestures.
• For e.g. the “core” part of the gesture to produce either “di” or
“du” sounds is tapping the tip of your tongue against the back
of your teeth (or your alveolar ridge).
• So, differences between two acoustic signals will not cause you to
perceive two different phonemes as long as the gestures that created
those two different acoustic signals are the same.
• Motor theory also seeks to explain how a person can perceive
an acoustic stimulus as a phoneme in one context but a s
chirp or a buzz in another context.
• next, they edited their stimuli to create separate signals for the
transition and the rest of the syllable, which they called the
base.
• they played the two parts of the stimulus over headphones, with
the transition going in one ear & the base going in one ear.
• the question was, how would people perceived the stimulus?
• the difference between /b/ & /p/ is that the /b/ is voiced while the /p/ is
not.
• other than voicing the two phonemes are essentially identical; in that
they are both labial plosives, meaning that we make these sounds by
closing our lips & allowing air pressure to build up behind our lip dam
and then releasing the pressure suddenly, creating a burst of air that
rushes out of the mouth.
• the difference between the two phonemes has to do with the timing of
the burst and the vocal fold vibrations that create voicing.
• for the /b/ sound, the vocal folds begin vibrating while your lips are
closed or just after; but for the /p/ sound, there is a delay between
the burst and the point in time when the vocal folds begin to vibrate.
This gap is the voice onset time.
• the VOT is a variable that can take any value whatsoever, so it
is called a continuous variable. but even though not can vary
continuously in this way, we do not perceive much of that
variation. for e.g. we can not greatly hear the difference
between a bot of 2ms and 7ms or between 7 ms & 15ms.
• However, the part of the brain of the macaques that have the
mirror neurons (area F5) is similar to the Broca’s area in the
human brain.
• The parts of the brain that control articulation are the motor
cortex in the frontal lobes of the brain & the adjacent premotor
cortex when we perceive speech.
• Why?
• Studies involving the production bite - block vowels also show that
very different gestures can lead to the same or nearly the same set
of phonemes.
• The motor theory can account for this set of findings in one of two
ways:
• for e.g. some studies have looked at the way people and
animals respond to voicing contrasts (the difference
between unvoiced consonants like /p/ and voiced
consonants like /b/).
• the voicing boundary for people & quail sits right at the same point.
• if vocal fold vibration starts within 20ms of the burst, we perceive the
phoneme as voiced; but if there is more than a 20ms gap between
the burst & the vocal fold vibration, we perceived an unvoiced stop.