Академический Документы
Профессиональный Документы
Культура Документы
AbstractIn this paper, we propose a low complex architectural design for adaptive decision feedback equalizer (ADFE).
For this we recast the ADFE equations using distributed arithmetic (DA), which enables the implementation of ADFE without
multipliers. The design is based on the distributed arithmetic
based formulation of it. It is further shown that high order
filters, which are required to implement high speed ADFEs can be
realized using only look-up-tables (LUTs) and shift-accumulate
operations. A novel approach was proposed to replace feed
forward and feedback filters of ADFE with a single DA unit.
By proper initialization, it is also shown that a low complexity
ADFE architecture can be obtained.
Index TermsInter Symbol Interference (ISI), Adaptive Decision Feedback Equalizer (ADFE), Distributed Arithmetic (DA),
Least Significant Bit (LSB), Most Significant Bit (MSB), slicer,
quantization.
I. I NTRODUCTION
c 2013 IEEE
978-1-4673-5157-7/13/$31.00
24 LU T
e(n)
x(n)
FFF
Address
bits
sq (n)
s(n)
v(n)
Fig. 1.
x
(n)-
a3 a2 a1 a0
Slicer
FBF
b0,j
b1,j
b2,j
b3,j
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
Z 1
v(n)
II. BACKGROUND
DA is a bit-by-bit serial operation [11] that computes the
inner product of two vectors in a fixed number of clock cycles
regardless of the length of the vectors. In other words, it
implements a series of sum-of-product operations (or MAC
operation) regardless of the number of products to be summed
up. The bit-serial operation from the sum-of-product operation
of vectors can be obtained as follows.
The sum-of-product of two length- vectors c and x is
given as
=
(1)
=0
, 2
=1
(2)
0
0
0
0
1
1
1
1
0
0
0
0
1
1
1
1
0
0
1
1
0
0
1
1
0
0
1
1
0
0
1
1
Entries
0
c0
c1
c1 + c0
c2
c2 + c0
c2 + c1
c2 + c1 + c0
c3
c3 + c0
c3 + c1
c3 + c1 + c0
c3 + c2
c3 + c2 + c0
c3 + c2 + c1
c3 + c2 + c1 + c0
0
1
0
1
0
1
0
1
0
1
0
1
0
1
0
1
sign bit
control
+/
Accumulating
register
21
Shifter
[ 1
]
,1 +
=0
Defining
[ 1
=1
{
=
=0 ,
1
=0 1
]
, 2
(3)
=0
=
0
=1
Then
=
(4)
=0
1
=0
()
= 0, 1, . . . , 2 1
(5)
()
=0
()
(6)
22 LU T
a1 a0
Address b0,j
bits
b1,j
0
0
1
1
0
1
0
1
Entries
0
c0
c1
c1 + c0
Now,
sign bit
control
+
+/
1
Accumulating
register
() =
Address b2,j
bits
b3,j
0
0
1
1
0
1
0
1
Entries
0
c2
c3
c3 + c2
()( )
=0
1
22 LU T
a1 a0
Fig. 3. Architecture for computing the inner product of two 4-length vectors
using DA ROM decompostion.
=0
()( )
=0
()( ) +
=0
1
Shifter
{ ()}( )
=0
()( ) +
()( )
=0
()( )
=0
Therefore,
() = w z
(11)
(12)
(13)
Hence,
() =
( )
(14)
=0
[ 1
=0
() = [()]
(7)
() =
() ()
(8)
() =
() =
=0
()( )
(9)
,0 +
{ 1
=1
=0
}
,
(15)
It can be observed that (15) is similar to (3) and hence the FFF
and FBF can be combined and implemented using a single DA
processing unit by proper initialization of FFF and FBF coefficients. Choosing the LMS criteria inorder to update the coefficients of FFF and FBF, the weight update equations would
be
, ( + 1) = , () + ()( )
(16)
, ( + 1) = , () + ()( )
(17)
where and are the step sizes of FFF and FBF respectively, () is the error signal which is given as () =
() ().
Replacing , ( + 1) by its negative number
()( )
=0
() = ( 1)
(10)
, ( + 1) = , () + ()( )
(18)
Buers
2Nb LU T
FBF
Co-ecients
2Nf LU T
Input
Samples
Accum
Buers
2Nf LU T
FFF
Co-ecients
2Nb LU T
Decision
Outputs
Slicer
Equalized
Output
Fig. 5.
Fig. 4.
, , ( + 1) =
=0
, , ()
=0
(19)
+ ()
, ( )
=0
1
=0
, , ( + 1) =
() =
+ ()
=1
(21)
, , ()
,0 +
1
1
, 2
=1
=0
=0
+ 1
1
+
1
,0 +
, 2
=0
(20)
, ( )
=0
TABLE I
C OMPARISON OF HARDWARE COMPLEXITY.
Implementations
Number of
adders
Number of
multipliers
Number of
21
MUXs
2+2 +
1
2+2 1
2() /2
Proposed
2 + 2
2() /2
2
2
[11] S. White, Applications of distributed arithmetic to digital signal processing: a tutorial review, IEEE ASSP Mag., vol. 6, no. 3, pp. 4 19,
Jul. 1989.
[12] D. Allred, H. Yoo, V. Krishnan, W. Huang, and D. Anderson, LMS
adaptive filters using distributed arithmetic for high throughput, IEEE
Trans. Circuits Syst. I, Reg. Papers, vol. 52, no. 7, pp. 1327 1337, Jul.
2005.