ch5.3 (Vocoders)

Vocoders
1
The Channel Vocoder (analyzer):
 The channel vocoder employs a bank of
bandpass filters,
 Each having a bandwidth between 100 HZ and 300
HZ.
 Typically, 16-20 linear phase FIR filter are used.
 The output of each filter is rectified and lowpass
filtered.
 The bandwidth of the lowpass filter is selected to
match the time variations in the characteristics of the
vocal tract.
 For measurement of the spectral magnitudes, a
voicing detector and a pitch estimator are
included in the speech analysis.
2
The Channel Vocoder (analyzer block diagram):
Bandpass Lowpass A/D
Rectifier
Filter Filter Converter
Bandpass Lowpass A/D

Rectifier
Encoder
Filter Filter Converter To
S(n)
Channel
Voicing
detector
Pitch
detector
3
The Channel Vocoder (synthesizer):
 At the receiver the signal samples are passed through
D/A converters.
 The outputs of the D/As are multiplied by the voiced

or unvoiced signal sources.
 The resulting signal are passed through bandpass

filters.
 The outputs of the bandpass filters are summed to

form the synthesized speech signal.
4
The Channel Vocoder (synthesizer block diagram):
D/A Bandpass
Converter Filter
Output
∑ speech
D/A Bandpass
Converter Filter
Decoder
From
Channel
Voicing
Information Switch
Random
Pitch Pulse Noise
period generator generator
5
The Phase Vocoder :
 The phase vocoder is similar to the
channel vocoder.
 However, instead of estimating the pitch,

the phase vocoder estimates the phase
derivative at the output of each filter.
 By coding and transmitting the phase

derivative, this vocoder destroys the
phase information .
6
The Phase Vocoder (analyzer block diagram):
cos ωk n Short-term
magnitude
ak n
Lowpass
cos ω n sin ωk n
Decimator
k
Filter
Differentiator Compute
Short-term
Encoder
S(n) To
Magnitude
And Channel
Phase
Differentiator Derivative
Lowpass
cos ω n Decimator
Filter k
bk n
Short-term phase
sin ωk n derivative
7
The Phase Vocoder
(synthesizer block diagram, kth channel):
Decimate
Short-term
amplitude
cos ωk n
Decoder
From
Channel
Cos Interpolator
Integrator ∑
Decimate Interpolator
Sin
Short-term
Phase sin ωk n
derivative
8
The Formant Vocoder :
 The formant vocoder can be viewed as a
type of channel vocoder that estimate the
first three or four formants in a segment of
speech.
 It is this information plus the pitch period

that is encoded and transmitted to the
receiver.
9
The Formant Vocoder :
 Example of formant:
 (a) : The spectrogram of the utterance “day one”
showing the pitch and the harmonic structure of
speech.
 (b) : A zoomed spectrogram of the fundamental and
the second harmonic.
(a) (b)
10
The Formant Vocoder (analyzer block diagram):
F3
F3 B3
F2
F2 B2
Input
Speech F1
F1 B1
Pitch V/U
And
V/U F0
Decoder
Fk :The frequency of the kth formant

Bk :The bandwidth of the kth formant
11
The Formant Vocoder ( synthesizer block diagram) :
F3
F3
B3
F2
F2 ∑
B2
F1
F1
B1
V/U Excitation
F0 Signal
12
Linear Predictive Coding :
 The objective of LP analysis is to estimate
parameters of an all-pole model of the vocal
tract.
 Several methods have been devised for

generating the excitation sequence for speech
synthesizes.
 LPC-type of speech analysis and synthesis are

differ primarily in the type of excitation signal that
is generated for speech synthesis.
13
LPC 10 :
 This methods is called LPC-10 because of 10
coefficient are typically employed.
 LPC-10 partitions the speech into the 180

sample frame.
 Pitch and voicing decision are determined by

using the AMDF and zero crossing measures.
14
Residual Excited LP Vocoder :
 Speech quality in speech quality can be improved at
the expense of a higher bit rate by computing and
transmitting a residual error, as done in the case of
DPCM.
 One method is that the LPC model and excitation

parameters are estimated from a frame of speech.
15
Residual Excited LP Vocoder :
 The speech is synthesized at the transmitter and
subtracted from the original speech signal to
form the residual error.
 The residual error is quantized, coded, and

transmitted to the receiver
 At the receiver the signal is synthesized by

adding the residual error to the signal generated
from the model.
16
RELP Block Diagram :
S(n) Buffer
And ∑
window
LP
Parameters
LP
Encoder
analysis To
Excitation Channel
parameters
LP
Synthesis
model
17
Code Excited LP :
 CELP is an analysis-by-synthesis method
in which the excitation sequence is
selected from a codebook of zero-mean
Gaussian sequence.
 The bit rate of the CELP is 4800 bps.
18
CELP (analysis-by-synthesis coder) :
Speech samples
Buffer and Side

LP
LP analysis information
Gain parameters
Gaussian Pitch Spectral +
Excitation Synthesis
Envelope ∑
(LP) -
codebook filter Synthesis filter
Perceptual
Weighting
Filter W(z)
Compute Index of
Energy of Error
Excitation
(square and sum)
sequence
19
CELP (synthesizer) :
From Buffer Gaussian Pitch LP

decoder And Excitation Synthesis Synthesis
Channel controller codebook filter
filter
LP parameters,
gain and pitch estimate
updates
20
Vector Sum Excited LP :
 The VSELP coder and decoder basically differ in
method by which the excitation sequence is
formed.
 In next block diagram of the VSELP, there are

three excitation source.
 One excitation is obtained from the pitch period

state.
 The other two excitation source are obtained

from two codebook.
21
Vector Sum Excited LP :
 The bit rate of the VSELP is about 8000 bps.
 Bit allocations for 8000-bps VSELP
Parameters Bits/5-ms Frame Bits/20ms

10 LPC coefficients - 38
Average speech energy - 5
Excitation codewords
from two VSELP
codebooks 14 56
Gain parameters 8 32
Lag of pitch filter 7 28
Total 29 159
22
VSELP Decoder :
Long-term
Filter state
θ0
Spectral
Pitch envelop
Codebook Spectral Synthetic
∑ synthesis (LP)
1 synthesis post filter Speech
filter
filter
β1
Codebook
2
β2
23

ch5.3 (Vocoders)

Загружено:

Сведения о документе

Исходное описание:

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

ch5.3 (Vocoders)

Загружено:

Авторское право:

Доступные форматы

Vocoders

Bandpass Lowpass A/D

 The outputs of the D/As are multiplied by the voiced

 The resulting signal are passed through bandpass

 The outputs of the bandpass filters are summed to

 However, instead of estimating the pitch,

 By coding and transmitting the phase

 It is this information plus the pitch period

Fk :The frequency of the kth formant

 Several methods have been devised for

 LPC-type of speech analysis and synthesis are

 LPC-10 partitions the speech into the 180

 Pitch and voicing decision are determined by

 One method is that the LPC model and excitation

 The residual error is quantized, coded, and

 At the receiver the signal is synthesized by

 The bit rate of the CELP is 4800 bps.

Buffer and Side

From Buffer Gaussian Pitch LP

 In next block diagram of the VSELP, there are

 One excitation is obtained from the pitch period

 The other two excitation source are obtained

Parameters Bits/5-ms Frame Bits/20ms

Вам также может понравиться