Вы находитесь на странице: 1из 23

Vocoders

1
The Channel Vocoder (analyzer):
 The channel vocoder employs a bank of
bandpass filters,
 Each having a bandwidth between 100 HZ and 300
HZ.
 Typically, 16-20 linear phase FIR filter are used.
 The output of each filter is rectified and lowpass
filtered.
 The bandwidth of the lowpass filter is selected to
match the time variations in the characteristics of the
vocal tract.
 For measurement of the spectral magnitudes, a
voicing detector and a pitch estimator are
included in the speech analysis.

2
The Channel Vocoder (analyzer block diagram):
Bandpass Lowpass A/D
Rectifier
Filter Filter Converter

Bandpass Lowpass A/D


Rectifier

Encoder
Filter Filter Converter To
S(n)
Channel

Voicing
detector

Pitch
detector
3
The Channel Vocoder (synthesizer):
 At the receiver the signal samples are passed through
D/A converters.

 The outputs of the D/As are multiplied by the voiced


or unvoiced signal sources.

 The resulting signal are passed through bandpass


filters.

 The outputs of the bandpass filters are summed to


form the synthesized speech signal.

4
The Channel Vocoder (synthesizer block diagram):
D/A Bandpass
Converter Filter
Output
∑ speech

D/A Bandpass
Converter Filter
Decoder

From
Channel

Voicing
Information Switch

Random
Pitch Pulse Noise
period generator generator

5
The Phase Vocoder :
 The phase vocoder is similar to the
channel vocoder.

 However, instead of estimating the pitch,


the phase vocoder estimates the phase
derivative at the output of each filter.

 By coding and transmitting the phase


derivative, this vocoder destroys the
phase information .
6
The Phase Vocoder (analyzer block diagram):
cos ωk n Short-term
magnitude
ak n
Lowpass
cos ω n sin ωk n
Decimator
k

Filter
Differentiator Compute
Short-term

Encoder
S(n) To
Magnitude
And Channel
Phase
Differentiator Derivative
Lowpass
cos ω n Decimator
Filter k

bk n
Short-term phase
sin ωk n derivative

7
The Phase Vocoder
(synthesizer block diagram, kth channel):
Decimate
Short-term
amplitude

cos ωk n
Decoder

From
Channel
Cos Interpolator

Integrator ∑
Decimate Interpolator
Sin
Short-term
Phase sin ωk n
derivative

8
The Formant Vocoder :
 The formant vocoder can be viewed as a
type of channel vocoder that estimate the
first three or four formants in a segment of
speech.

 It is this information plus the pitch period


that is encoded and transmitted to the
receiver.

9
The Formant Vocoder :
 Example of formant:
 (a) : The spectrogram of the utterance “day one”
showing the pitch and the harmonic structure of
speech.
 (b) : A zoomed spectrogram of the fundamental and
the second harmonic.

(a) (b)

10
The Formant Vocoder (analyzer block diagram):
F3
F3 B3

F2
F2 B2
Input
Speech F1
F1 B1

Pitch V/U
And
V/U F0
Decoder

Fk :The frequency of the kth formant


Bk :The bandwidth of the kth formant
11
The Formant Vocoder ( synthesizer block diagram) :

F3
F3
B3

F2
F2 ∑
B2

F1
F1
B1

V/U Excitation
F0 Signal

12
Linear Predictive Coding :
 The objective of LP analysis is to estimate
parameters of an all-pole model of the vocal
tract.

 Several methods have been devised for


generating the excitation sequence for speech
synthesizes.

 LPC-type of speech analysis and synthesis are


differ primarily in the type of excitation signal that
is generated for speech synthesis.
13
LPC 10 :
 This methods is called LPC-10 because of 10
coefficient are typically employed.

 LPC-10 partitions the speech into the 180


sample frame.

 Pitch and voicing decision are determined by


using the AMDF and zero crossing measures.

14
Residual Excited LP Vocoder :
 Speech quality in speech quality can be improved at
the expense of a higher bit rate by computing and
transmitting a residual error, as done in the case of
DPCM.

 One method is that the LPC model and excitation


parameters are estimated from a frame of speech.

15
Residual Excited LP Vocoder :
 The speech is synthesized at the transmitter and
subtracted from the original speech signal to
form the residual error.

 The residual error is quantized, coded, and


transmitted to the receiver

 At the receiver the signal is synthesized by


adding the residual error to the signal generated
from the model.
16
RELP Block Diagram :
S(n) Buffer
And ∑
window

LP
Parameters
LP

Encoder
analysis To
Excitation Channel
parameters

LP
Synthesis
model

17
Code Excited LP :
 CELP is an analysis-by-synthesis method
in which the excitation sequence is
selected from a codebook of zero-mean
Gaussian sequence.

 The bit rate of the CELP is 4800 bps.

18
CELP (analysis-by-synthesis coder) :
Speech samples

Buffer and Side


LP
LP analysis information
Gain parameters
Gaussian Pitch Spectral +
Excitation Synthesis
Envelope ∑
(LP) -
codebook filter Synthesis filter

Perceptual
Weighting
Filter W(z)

Compute Index of
Energy of Error
Excitation
(square and sum)
sequence
19
CELP (synthesizer) :

From Buffer Gaussian Pitch LP


decoder And Excitation Synthesis Synthesis
Channel controller codebook filter
filter

LP parameters,
gain and pitch estimate
updates

20
Vector Sum Excited LP :
 The VSELP coder and decoder basically differ in
method by which the excitation sequence is
formed.

 In next block diagram of the VSELP, there are


three excitation source.

 One excitation is obtained from the pitch period


state.

 The other two excitation source are obtained


from two codebook.
21
Vector Sum Excited LP :
 The bit rate of the VSELP is about 8000 bps.
 Bit allocations for 8000-bps VSELP

Parameters Bits/5-ms Frame Bits/20ms


10 LPC coefficients - 38
Average speech energy - 5
Excitation codewords
from two VSELP
codebooks 14 56
Gain parameters 8 32
Lag of pitch filter 7 28
Total 29 159

22
VSELP Decoder :

Long-term
Filter state

θ0
Spectral
Pitch envelop
Codebook Spectral Synthetic
∑ synthesis (LP)
1 synthesis post filter Speech
filter
filter
β1

Codebook
2

β2

23

Вам также может понравиться