Вы находитесь на странице: 1из 5

2008 International Conference on Signals, Circuits and Systems

Speech Denoising by Adaptive Weighted Average


Filtering in the EMD framework
Kais KHALDI Abdel-Ouahab BOUDRAA
and Monia TURKI-HADJ ALOUANE IRENav, Ecole Navale/ E3 I2 (EA3876), ENSIETA
Unité Signaux et Systèmes, ENIT Groupe ASM, Lanvéoc Poulmic
BP 37, Le Belvedre 1002 Tunis, Tunisia BP600, 29240 Brest−Armées, France
Email: kais.khaldi@gmail.com, m.turki@enit.rnu.tn Email:boudra@ecole-navale.fr

Abstract—This paper introduces a new speech enhancement only signals corrupted by additive white Gaussian noise are
method, which combines Adaptive Center Weighted Average considered.
(ACWA) filter with Empirical Mode Decomposition (EMD). Both In this paper, an adaptive denoising scheme associating EMD
ACWA and EMD operate in the time domain. The ACWA filter
is advantageous as it operates adaptively in the time domain with the ACWA filter is proposed. The ACWA filter [12]
and does not require the stationarity and the whiteness of the and other correlated versions are basically used in image
signals. Thanks to the data driven decomposition of the EMD, enhancement domain [13]. This filter operates adaptively in
the application of the ACWA filter on the IMFs gives better the time domain what fits in the EMD framework, and it does
results than the ACWA filtering of the noisy signal. The proposed not require the stationarity of the signals and the whiteness
EMD-ACWA denoising method is applied to noisy speech signal
with different noise levels and the results are compared to those of the noise. The effectiveness of the ACWA filter can be
obtained by two different denoising methods: wavelet thresholds improved when it is associated to the EMD decomposition.
and ACWA filtering. A significant superiority of the EMD-ACWA Indeed, the IMFs are less noisier than the noisy speech. The
method over the two others is shown in white noisy contexts as proposed denoising method benefits from the advantages of
well as in correlated noisy ones. the EMD and the attractive properties of the ACWA filter,
which is adaptive and easy to implement, for obtaining good
I. I NTRODUCTION
performance in the presence of white as well as colored noises.
Recently, a new temporal signal decomposition method,
II. EMD ALGORITHM
called Empirical Mode Decomposition (EMD), has been in-
troduced by Huang et al. [1] for analyzing data from non- The EMD decomposes a given signal x(t) into a series of
stationary and nonlinear processes. The major advantage of IMFs through an iterative process called sifting; each one with
the EMD is that the basis functions are derived from the signal a distinct time scale [1]. The decomposition is based on the
itself. Hence, the analysis is adaptive in contrast to traditional local time scale of x(t), and yields adaptive basis functions.
methods such as wavelets where the basis functions are fixed. The EMD can be seen as a type of wavelet decomposition
The EMD has received more attention in terms of applications whose subbands are built up as needful to separate the different
[2]-[3], interpretation [4]-[5], and improvement [6]-[7]. The components of x(t). Each IMF replaces the signals detail,
major advantage of the EMD is that the basis functions are at a certain scale or frequency band [4]. The EMD picks
derived from the signal itself. The EMD is also used in out the highest frequency oscillation that remains in x(t). By
speech denoising [8]. In fact, speech signal noise reduction definition, an IMF satisfies two conditions :
is a well known problem in signal processing. Particularly, 1) the number of extrema and the number of zeros cross-
linear methods such as the Wiener filtering [9], are largely ings may differ by no more than one.
used, because linear filters are easy to implement and to 2) the average value of the envelope defined by the local
design. However, these methods are not effective when the maxima, and the envelope defined by the local minima,
noise estimation is not possible or when the noise is colored. is zero.
To overcome these difficulties, nonlinear methods have been Thus, locally, each IMF contains lower frequency oscillations
proposed and especially those based on Wavelet thresholding than the just extracted one. The EMD does not use a
[10]-[11]. A limit of the wavelet approach is that the basis pre-determined filter or a wavelet function, and is a fully
functions are fixed, and thus do not necessarily match all real data-driven method [1]. To be successfully decomposed
signals. into IMFs, the signal x(t) must have at least two extrema,
To overcome the drawbacks of the wavelet method, two one minimum and one maximum. The sifting involves the
strategies for noise reduction have been proposed in [8]: EMD following steps :
associated with filtering is efficient for relatively low noise
level and when associated with thresholding is attractive in Step 1: Fix the threshold  and set j ← 1 (j th IMF)
particular for relatively high noise level. However, in [8], Step 2: rj−1 (t) ← x(t) (residual)

978-1-4244-2628-7/08/$25.00 ©2008 IEEE -1-


2008 International Conference on Signals, Circuits and Systems

IM F1 -
Step 3: Extract the j th IMF : ACWA filter
(a) : hj,i−1 (t) ← rj−1 (t) ,i ← 1 ( i number of sifts)
IM F2
(b) : Extract local maxima/minima of hj,i−1 (t) -
ACWA filter
(c) : Compute upper and lower envelopes
Uj,i−1 (t) and Lj,i−1 (t) by interpolating,using cubic spline, IM F3
- ACWA filter
respectively local maxima and minima of hj,i−1 (t)
(d) : Compute the mean of the envelopes :
μj,i−1 (t) =(Uj,i−1 (t) + Lj,i−1 (t))/2
U

(e) : Update : hj,i (t) := hj,i−1 (t) − μj,i−1 (t), i := i + 1 R^
y(t) - EMD + - x̃(t)
(f) : Calculate the stopping criterion : 3
T 2 
|hj, i − 1(t) − hj, i(t)| IM FC
SD(i) = -
t=1
(hj, i − 1(t))2 ACWA filter
(g) : Repeat Steps (b)-(f) until SD(i)<  and then put
IMFj (t) ← hj,i (t) (j th IMF)
Step 4: Update residual : rj (t) := rj−1 (t) − IMFj (t). Residual
Step 5: Repeat Step 3 with j := j + 1 until the number of
extrema in rj (t) is ≤ 2.
Fig. 1. Denoising scheme.
Where T is x(t) time duration. The sifting is repeated several
times (i), in order to get h true IMF that fulfills the conditions
(1) and (2). The result of the sifting is that x(t) will be where IMFj is a noisy version of the data fj .
decomposed into a sum of C IMFs and a residual rC (t) such An estimation f˜j (t) of fj (t) based on the noisy observation
as the following: IMFj (t) is given by
C
 f˜j (t) = Γ[IMFj (t)], (5)
x(t) = IMFj (t) + rC (t) (1)
j=1 where Γ[IMFj (t)] is a temporal processing using ACWA filter.
C value is determined automatically using SD (Step 3(f)). The Finally, the estimated signal, x̃(t), is given by :
sifting has two effects : (a) it eliminates riding waves, and C

(b) it smoothes uneven amplitudes. To guarantee that IMF x̃(t) = f˜j (t) + rC (t) (6)
components retain enough physical sense of both amplitude j=1
and frequency modulation, we have to determine SD value for The denoising of the IMF by the ACWA filter is given as
the sifting. This is accomplished by limiting the size of the follows [12]
standard deviation SD, computed from the two consecutive 
sifting results. Usually, SD (or ) is set between 0.2 and 0.3 ˜ Fmean + Kj (IM Fj (t) − Fmean ), if Fvar ≥ σj2
fj (t) =
[1]. Fmean , otherwise
(7)
III. T HE EMD-ACWA DENOISING APPROACH σj2
Kj = (1 − ), (8)
Fvar
The proposed denoising method is shown in figure 1.
The noisy signal y(t) described by an additive model is given where Fmean and Fvar denote respectively the average and
by : the variance of the IMF computed over a sliding window of
length L, and σj2 designates the variance of noise contained
in the IMF indexed by j.
y(t) = x(t) + b(t), (2)

where x(t) corresponds to the clean speech signal and b(t) The noise level σj is estimated as in [14],[15], [16] as
denotes the noise signal. following:
σ˜j = 1.4826 × Median {|IMFj (t) − Median {IMFj (t)} |} .
The noisy signal is decomposed into a sum of IMFs as follows:
(9)
C
 Classically the ACWA filter has been used in image en-
y(t) = IMFj (t) + rC (t). (3) hancement applications. It can be also interesting and effective
j=1 in the context of audio signal enhancement. As shown by
The extracted IMFs include the noise since each IMF, indexed (7) this filter operates in the time domain what corresponds
by j, can be approximated as follows: well to the EMD framework. In contrast to the classical filters
such as Wiener filter, all the parameters are computed in time
IMFj (t) = fj (t) + bj (t), (4) domain and hence transformation to frequency domain is not

-2-
2008 International Conference on Signals, Circuits and Systems

needed. Besides, as the signal is enhanced sample by sample, Besides, a significant SNR improvement, varying from 4.2
the hypothesis of signals stationarity and noise whiteness are dB to 17.4 dB, is achieved by the EMD-ACWA method. In
relaxed. In particular, this filter can perform in general noisy fact, even for very low SNR values, we can still observe the
contexts: white as well as colored noise, high as well as low effectiveness of the proposed method in removing the noise
noise level. components as the gain in SNR can go up to 14 dB.
Note that when listening to the enhanced speechs, the EMD-
IV. E XPERIMENTAL RESULTS ACWA produces lower residual noise, noticeably less speech
The proposed noise reduction method is tested on a speech distortion compared to the wavelet (db8) method and ACWA
signal corrupted by different noises whose levels are fixed filter.
through the input Signal to Noise Ratio (SNR),
V. C ONCLUSION

T
2
(x(t)) In this paper, a new speech enhancement method to effec-
t=1
SNRin = 10 log10 , (10) tively remove the noise components is presented. We have

T
(y(t) − x(t))2 combined two powerful adaptive methods: the EMD and the
t=1 ACWA filtering. Obtained results for speech signal contami-
where x(t) and y(t) are respectively the clean and the noisy nated with different noises with different SNR values ranging
signals. The results obtained by the proposed method are from -10 dB to 10 dB, showed that the proposed method
compared to the wavelet approach (Daubechies 8) and ACWA performs better than the the wavelet approach and the ACWA
filter. We use the ACWA filter as comparison method because filter. In addition, the reported results demonstrated that the
it gives better results than the MMSE filter [17]. As an EMD-ACWA denoising method is effective for noise removal
objective criterion to evaluate the performance of the denoising and confirmed that it is a very attractive method to use in
method, we use the output Signal to Noise Ratio: general noisy contexts.

T
R EFERENCES
(x̃(t))2
t=1
SNRout = 10 log10 , (11) [1] N.E. Huang and al. The empirical mode decomposition and Hilbert

T
spectrum for nonlinear and non-stationary time series analysis. Proc.
(x(t) − x̃(t))2 Royal Society, 454(1971):903–995, 1998.
t=1
[2] F. Salzenstein A.O. Boudraa, J.C. Cexus and L. Guillon. If estima-
where x̃(t) is the reconstructed signal. tion using empirical mode decomposition and nonlinear teager energy
operator. Proc. IEEE ISCCSP, pages 45–48, Hammamet, 2004.
[3] A.O. Boudraa S. Benramdane, J.C. Cexus and J.A. Astolfi. Transient tur-
We take as example two speech signals ”a” and ”b”. These bulent pressure signal processing using empirical mode decomposition.
signals are corrupted by a colored noise ”f16” with SNR value Proc. Physics in Signal and Image Processing, Mhoulouse, 2007.
[4] P. Flandrin, G. Rilling, and P. Goncalves. Empirical mode decomposition
fixed to -2dB. The original signals and their corresponding as a filter bank. IEEE Sig. Proc. Lett., 11(2):112–114, 2004.
noisy versions are depicted in figure 2. The size L of the [5] Z. Wu and N.E. Huang. A study of the characteristics of white noise
ACWA filter window is fixed to 511. This choice is justified using the empirical mode decomposition method. Proc. Roy. Soc.
London A, 460:1597–1611, 2004.
by the results shown in figure 3 where are displayed the [6] B. Weng and K.E. Barner. Optimal and bidirectional optimal empirical
variations of the SNRout versus L for two values of SNRin : -2 mode decomposition. Proc. IEEE ICASSP, 3:1501–1504, Toulouse,
dB and 0 dB. Figure 3, shows that for L = 511 the SNRout 2007.
[7] R. Deering and J.F. Kaiser. The use of a masking signal to im-
remains almost constant. prove empirical mode decomposition. Proc. IEEE ICASSP, 4:485–488,
The denoised versions of signals ”a” and ”b” obtained by the Philadelphia, 2005.
EMD-ACWA, the wavelet thresholds (db8), and the ACWA [8] K. Khaldi, A.O. Boudraa, A. Bouchiki, M. Turki-Hadj Alouane, and
E. Samba Diop. Speech signal noise reduction by EMD. In Proc. IEEE
filter, are shown respectively in figures 4 and 5. The SNRin is ISCCSP, Malta, March 2008.
fixed to -2dB. In fact, we choose db8 with a hard threshold as [9] J.G. Proakis and D.G. Manolakis. Digital Signal Processing: Principles,
a tool of comparison, because it gives good results compared Algorithms, and Applications, volume 1. Prentice-Hall, 3rd edition,
1996.
with others wavelets. [10] D.L. Donoho. De-noising by soft-thresholding. IEEE Trans. Inform.
A careful comparative examination of the signals of figures 4 Theory, 41(3):613–627, 1995.
and 5, shows that the EMD-ACWA performs better than [11] D.L. Donoho and I.M. Johnstone. Ideal spatial adaptation via wavelet
the wavelet (db8) and ACWA-filter in terms of noise reduction. shrinkage. Biometrica, 81:425–455, 1994.
[12] J.S. Lee. Digital image enhancement and noise filtering by using local
statistics. IEEE Trans. Pattern Anal. Mach. Intell., 2(4):165–168, March
Figures 6, 7 and 8 show the variations of the SNRout versus 1980.
the SNRin relating to the denoising signal ”a” when corrupted [13] Masayuki Meguro, Akira Taguchi, and Nozomu Hamada. Data-
dependent weighted average filterig for image sequence restoration.
respectively by a white Gaussian noise, the colored f16 noise Electroics and Communications in Japan, Part III: Fundamental Elec-
and the colored factory noise. These results demonstrate the tronics Science, 84(4):1–10, 2000.
effectiveness of the proposed method. Indeed, the improve- [14] A.O. Boudraa and J.C. Cexus. Denoising via empirical mode decom-
position. In Proc. IEEE ISCCSP, Marrakech, Morocco, 2006.
ment in SNR provided by the EMD-ACWA is much higher [15] A.O. Boudraa, J.C. Cexus, and Z. Saidi. EMD-based signal noise
than those given by the wavelet method and the ACWA filter. reduction. Int. J. Sig. Process., 1(1):33–37, 2004. ISSN: 1304-4494.

-3-
2008 International Conference on Signals, Circuits and Systems

[16] William H. Press, S.A. Teukolsky, W.T. Vetterling, and B.P. Flannery.
Numerical Recipes in C: The Art of Scientific Computing, volume 1. 1

EMD−ACWA
Cambridge University Press, 2nd edition, 1992.
[17] M. Turki-Hadj Alouane K. Khaldi and A.O. Boudraa. Voiced speech 0
enhancement based on adaptive filtering of selected intrinsic mode func-
tions. Advances in Adaptive Data Analysis (AADA), 2008 (submitted). −1
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
4
x 10
1

Wavelet (db8)
a
Amplitude

1
0 0
−1
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 −1
4
x 10 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
b 4
Amplitude

1 x 10
0 1

ACWA filter
−1
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 0
4
noisy a x 10
Amplitude

1 −1
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
0
Time 4
x 10
−1
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
4
noisy b x 10
Amplitude

1 Fig. 4. Denoised version of the signal ”a” obtained by the EMD-ACWA,


0 the Wavelet (db8) and the ACWA filter (f16 noise with SNR =-2dB).
−1
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
Time 4
x 10
1
EMD−ACWA
0

Fig. 2. The original signals (”a” and ”b”) and their noisy versions (f16 noise −1
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
with SNR =-2dB). 4
x 10
1
Wavelet (db8)

11 −1
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
4
x 10
10.5 1
ACWA filter

0
10
−1
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
9.5 Time 4
x 10
SNR gain [dB]

8.5
Fig. 5. Denoised version of the signal ”b” obtained by the EMD-ACWA,
the Wavelet (db8) and the ACWA filter (f16 noise with SNR =-2dB).
8

7.5
For initial SNR = −2 dB
For initial SNR = 0 dB 18
7
EMD−ACWA
6.5 16 Wavelet (Daubechies 8)
ACWA filter
6 14
0 200 400 600 800 1000 1200
(L) Size of the window ACWA filter
12
SNR output [dB]

Fig. 3. The variation of the SNRout relating to the noisy signal ”a” versus L 10
the size of the ACWA filter window (f16 noise with SNR=-2 db ad SNR=0
db). 8

0
−10 −5 0 5 10
Initial SNR [dB]

Fig. 6. Variation of the SNRout versus the SNRin relating to the denoising
of the signal ”a” corrupted by a white Gaussian noise.

-4-
2008 International Conference on Signals, Circuits and Systems

18
EMD−ACWA
16 Wavelet (Daubechies 8)
ACWA filter
14

12
SNR output [dB]

10

0
−10 −5 0 5 10
Initial SNR [dB]

Fig. 7. Variation of the SNRout versus the SNRin relating to the denoising
of the signal ”a” corrupted by the f16 noise.

18
EMD−ACWA
16 Wavelet(Daubechies 8)
ACWA filter
14

12
SNR output [dB]

10

0
−10 −5 0 5 10
Initial SNR [dB]

Fig. 8. Variation of the SNRout versus the SNRin relating to the denoising
of the signal ”a” corrupted by the factory noise.

-5-

Вам также может понравиться