Вы находитесь на странице: 1из 15

Chapter 8 Speech Coding

To compress more speech channels within a given bandwidth, researchers are continuously in search of speech coders that will provide toll quality speech at lower bit rates.

More specifically, in wireless communications, the goal of all speech coding systems is to
* transmit speech with the highest possible quality using the least possible channel capacity. * consume little power when implemented * maintain certain levels of complexity to reduce the processing delay and cost of implementation.

A balance needs to be struck between the coder bit-rate efficiency and algorithmic complexity.

The hierarchy of speech coders

Vocoders
* using a priori knowledge about the signal to be coded, and for this reason, they are, in general, signal specific. * can achieve very high economy in transmission bit rate, but more complex.

8.6 Vocoders Analyze the voice signal at transmitter

Transmit parameters derived from the analysis


Then synthesis the voice at the receiver using those parameters. All vocoder systems try to model the speech generation process as a dynamic system and attempt to quantify certain physical constraints of the system.
5

These physical constraints are then used to provide a description the speech signal. In general much more complex than the waveform coders and achieve very high, and economy in transmission bit rate.

The most popular among the vocoding schemes is the linear predictive coder (LPC). The other schemes include:
* The channel vocoder * Format vocoder * Cepstrum voceder * Voice excited vocoder

8.7 Linear Predictive Coders

LPC Vocoders
* The time domain class of vocoders

* Attempts to extract the significant features of speech from the time waveform
* Computationally intensive, but by far the most popular among the class of low bit rate vocoders * Possible to transmit good quality voice at 4.8 kbps.
8

* It models the vocal tract as an all pole linear filter with a transfer function

* The coefficients of the all pole filter are obtained in the time domain using linear prediction techniques.
* The prediction principles used are similar to those in ADPCM coders. * However, instead of transmitting quantized values of the error signal, the LPC system transmits only selected characteristics of the error signal, including the gain factor, pitch information, and the voiced/unvoiced decision information.

10

* Various LPC schemes differ in the way they recreate the error signal (excitation) at the receiver.

11

* The first one shows the most popular means. It uses two sources at the receiver, one of white noise and the other with a series of pulses at the current pitch rate. * Multi-pulse excited LPC (MPE-LPC) * Code excited LPC (CELP)

12

+ Computationally intensive, advances in DSP and VLSI technology have made real-time implementation of CELP codes possible. + The CDMA (IS-95) uses a variable rate CELP codec at 1.2 to 14.4 Kbps.
13

* Residual Excited LPC (RELP)

+ After estimating the model parameters (LP coefficients or related parameters) and excitation parameters (voiced/unvoiced decision, pitch, gain) from a speech frame, the speech is synthesized at the transmitter and subtracted from the original speech to form a residual signal.
+ The residual signal is quantized, coded and transmitted to the receiver along with the LPC model parameters. + At the receiver, the residual error signal is added to improve the quality of the synthesized speech.

14

15

Вам также может понравиться