Академический Документы
Профессиональный Документы
Культура Документы
M.Bouzid, M.Boudraa
I.
INTRODUCTION
Original
speech
Coding
Packetization
IP network
Synthetic
speech
Decoding
De-packetization
IV. PACKETIZATION
MELP 2.4
kbps
MELP1.2
kbps
Fn
Fn+1
Fn+2
Fn+3
Table I: Bit allocation encoder MELP 2.4 kbps and 1.2 kbps
Bit Rate
Sampling frequency
Size of frame
Bit Rates of frame
8 kHz
180 samples (22.5 ms)
44,44 frames/seconde
Mode of voicing
N/V
VVV
10 LSFs
Pitch
10 Fourier amplitudes
5 Bands of voicing
2 Gains
25
7
8
4
8
25
7
8
43
12
8
6
10
Flag
Protection
Synchronisation
1
1
0
13
1
1
0
1
54 bits
54*44,44= 2400 bps
8 kHz
3*180 samples (67.5 ms)
14.8148 frames/seconde
UVV
UUV
VUV
VVU
UVU
VUU
43
39
43
12
12
12
8
8
8
4
4
2
10
10
10
1
2
1
1
6
1
81bits
81*14.8148 = 1200 bps
1
4
1
UUU
27
12
0
0
10
0
31
1
V. PACKET RECOVERY
Figure 3 shows how our MDC system allows recovering lost
packets. Using this scheme, three successive lost packets can
be easily recovered. Even the fourth frame can be retrieved by
adopting a method of extrapolation [2]. From left to right on
the same figure, the respective cases of loss of a single packet,
2 packets, 3 packets and finally 4 packets are shown.
Explanation:
The first case corresponds to a single packet loss (F2 frame).
The current packet is lost and the MELP 2.4 is unable to
provide us with a good quality of the speech. The system falls
back to the previous received packet containing the
information the lost current frame. This frame is then
reconstructed with a coarse quality using the 81 bits
corresponding to the 1.2 MELP coder; i.e. to the three frames
F2, F3 and F4.
22.5 ms
F1
Original speech
F2
F3
F4
F5
F6
F7
F8
F9
F10
2.4 kbps
2.4 kbps
2.4 kbps
2.4 kbps
1.2 kbps
1.2 kbps
1.2 kbps
1.2 kbps
F2 F3 F4
Received
F3 F4 F5
F4 F5 F6
F5 F6 F7
Lost
Received
Lost
F6 F7 F8
Lost
F7 F8 F9
Received
F8 F9 F10
Lost
F1
F2
F3
F4
F5
F6
F7
2.4 kbps
1.2 kbps
2.4 kbps
1.2 kbps
1.2 kbps
2.4 kbps
1.2 kbps
F10
F11
F12
F14
F14
F8
1.2 kbps
2.4 kbps
2.4 kbps
1.2 kbps
1.2 kbps
F11F12 F13
Received
Lost
F15
F9 F10 F11
F10
2.4 kbps
F12 F13F14
Lost
F11
1.2 kbps
F13F14 F15
F14F15 F16
Lost
Lost
F12
F13
1.2 kbps
F15F16 F17
F16F17 F18
Lost
Received
F14
F15
2.4 kbps
1.2 kbps
Extrapolation
Synthetic speech
Fig. 3: Process of recovery of lost packets based on the proposed MDC
F10F11 F12
F11F12 F13
Lost
Received
F9
F10
1.2 kbps
2.4 kbps
A. EVALUATION CORPUS
To assess and validate our method, we used a
multilingual corpus combining Arabic, French and
English. The first record is composed of Arabic
sentences phonetically balanced [7] developed in our
laboratory. This corpus contains a total of 60
sentences, 10 sentences spoken by 3 male and 3 female
speakers. For French and English, we used the known
phrases phonetically balanced, la bise et le soleil
and The wind and sun .
B. EVALUATION OF CODERS
The evaluation of the performance of the two MELP
coders implemented separately were designed using
the Recommendation P.862 of the ITU-T (International
Telecommunication Union) [8] called PESQ
(Perceptual Evaluation of Speech Quality). This
method describes an objective method for predicting
the subjective quality for telephony and for voice
coders. It is intended to evaluate the influence of
factors such as packet loss, the variable delay and
distortion due to channel errors that is poorly evaluated
by conventional methods. The PESQ is designed to
compare a reference version (original) to that obtained
by synthesizing this reference or after transmission and
have been adversely affected. The results are shown in
Table 2.
Coder
Reference signal
Table II: Results of objective tests of two MELP coders
Synthetic signal
corrected by MDC
Evaluation
Signal with loss
PESQ
3.20
2.71
Original Signal
PESQ Score
Fig. 4: Results of objective tests of two MELP coders
5
4.5
4
3.5
2.5
2
1.5
1
Original signal
MELP
MELP with MDC
G.729
0.5
10
15
20
25
30
Amplitude
Original signal
0.5
0
-0.5
1.02
1.04
1.06
1.08
1.1
1.12
1.14
1.16
1.18
1.2
4
x 10
0.5
0
-0.5
1
1.02
1.04
1.06
1.08
1.1
1.12
1.14
1.16
1.18
1.2
4
x 10
1.14
1.16
1.18
1.2
4
x 10
PESQ
0.5
0
-0.5
1.02
1.04
1.06
1.08
1.1
1.12
Number of samples
Fig. 6: Sample output showing the correction made by the MDC
Amplitude
Original signal
0.5
0
-0.5
8000
8200
8400
8600
8800
9000
9200
9400
9600
9800
10000
9400
9600
9800
10000
9600
9800
10000
Amplitude
8200
8400
8600
8800
9000
9200
Amplitude
8200
8400
8600
8800
Amplitude
Original signal
0.5
0
-0.5
-1
8200
8400
8600
880
9000
9200
940
9600
9800
9400
9600
9800
10000
9400
9600
9800
10000
Amplitude
8400
8600
8800
9000
9200
Amplitude
8200
8400
8600
8800
9000
9200
Number of samples
Fig. 8: Sample output showing the correction made by the MDC
VIII.
CONCLUSION
REFERENCES
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[8]
[9]