Академический Документы
Профессиональный Документы
Культура Документы
by
in
I. INTRODUCTION X4
C
FA
S C
FA
S C
FA
S C
FA
S
P2
Y3
Multipliers are one of the most important arithmetic units X3 X2 X1 X0
P3
multipliers is key to satisfying the overall power budget of X3 X2 X1 X0 Y4
and Noaks [2], De Mori [3], and Guilt [4] for positive
C S C S C S C S
P5
numbers, and Baugh and Wooley [5] and Hwang [6] for P9 P8 P7 P6
numbers in two’s complement form. References [7-9] give a Figure 1(a). Conventional Unsigned Array Multiplier
good insight into the problem and design optimizations at all
the hierarchy levels. Y0
X4 X1 X0
In this paper, we focus on power reduction for both
X3 X2
P2
The basic process of binary array multiplication involves X3 X2 X1 X0
Y3
P3
multiplier. NOR gates are used instead of AND in X0 Y4
X3 X2 X1
P4
From (1), it is clear that if NOR gates are used, the inputs 1
1
have to be complimented.
While it takes 6 transistors to build AND/OR gate, only 4 C
HA
S C
FA
S C
FA
S C
FA
S C
FA
S
Authorized licensed use limited to: INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY. Downloaded on March 25,2010 at 05:31:40 EDT from IEEE Xplore. Restrictions apply.
y m-1 . . . y4 y3 y2 y1 y0
x n-1 . . . x3 x2 x1 x0
1 x 0 y m-2 . . . x 0y4 x 0y3 x 0y 2 x 0y 1 x 0y0
x 1 y m-2 . . . x 1y4 x 1y3 x 1y2 x 1y 1 x 1y 0
x 2 y m-2 . . . x 2y4 x 2y3 x 2y2 x 2y1 x 2y 0
. .
. .
. .
x n-1y m-1 0 x n-2y m-2 . . . x n-2y 2 x n-2y 1 x n-2y 0
(x n-1y m-2 )’ (x n-1y m-3 )’ . . . (x n-1y 2)’ (x n-1 y 1)’ (x n-1 y 0)’
1 (x n-2y m-1 )’ (x n-3y m-1 )’ . . . (x 0ym-1)’ 1
P m+n-1 P m+n-2 P m+n-3 P m+n-4 . . . P m-1 . . P n+1 Pn P n-1 . . . P3 P2 P1 P0
Figure 2. Tabular form for modified Baugh-Wooley two’s complement signed multiplier
⎛ m−2
⎞⎛ n −2
⎞
Y 1'
P0
= ⎜ − ym−1 2 m−1 + ∑ yi 2i ⎟⎜ − xn−1 2 n−1 + ∑ xi 2i ⎟
X3 ' X2 ' X1 ' X0 '
HA HA HA HA
⎝ i =0 ⎠⎝ i =0 ⎠
X4 ' C S C S C S C S
'
P2 gates to generate partial products. The tabular form of
' ' ' ' Y3
X3 X2 X1 X0
modified Baugh-Wooley multiplier is shown in Fig. 2. Its
architecture is shown in Fig. 1(b).
Now we see, out of m ∗ n partial products, majority of the
FA FA FA FA
X4 ' C S C S C S C S
P3
X3 ' X2 ' X1 ' X0 '
Y4 '
bits (m ∗ n − (m + n − 2)) are generated as a result of AND
FA FA FA FA
operation while only a few (m + n − 2) are a result of
X4 ' C S C S C S C S
the proposed multiplier, all the AND gates are replaced with
NOR gates.
FA FA FA HA
C S C S C S C S
P5
According to DeMorgan’s Law,
P9 P8 P7 P6
A.B = (A’+B’)’. (2)
Figure 3. Proposed unsigned array multiplier This makes it necessary to use inverted inputs (addition of
(m + n) inverters) and convert the remaining NAND gates
III. PARALLEL TWO’S COMPLEMENT SIGNED MULTIPLIER
to OR gates, as (A.B)’ = (A’+B’).
For m ∗ n parallel two’s compliment signed multiplication, These changes reduce (m ∗ n − 2 * (m + n − 2) − ( m + n))
m-bit multiplier Yv is represented as: inverters in the proposed multiplier. (As an example, for a
m−2
16x16 multiplier, 164 inverters are reduced.)
Yv = − ym−1 2 m−1 + ∑ yi 2i Thus, in comparison to the modified Baugh-Wooley
i =0
multiplier (Fig. 1(b)), area is reduced. The probability of
and n-bit multiplicand Xv is represented as:
120
Authorized licensed use limited to: INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY. Downloaded on March 25,2010 at 05:31:40 EDT from IEEE Xplore. Restrictions apply.
getting a 1 in both NOR and AND gate is same (i.e. 1/4) and Sign Extension is done to 16 bits,
that for NAND and OR is also same (i.e. 3/4). Thus the Multiplicand, 5 = 0000 0000 0000 0101
switching activity in the proposed multiplier remains the
same. Since less number of transistors are used, power
dissipation in the proposed multiplier is bound to decrease. Input
|n|
However due to the extra inverters added to obtain
complemented inputs, there is extra 1T delay. Sign extension to m bits n=+ve
However, if the inputs are not in two’s complement form, n=‐ve
further modifications can be done to produce the
m inverters
complemented inputs as explained below.
Instead of generating inverted bits simply by complementing X3' X2' X1' X0' Y1' P0
P2
if the input number is positive, bits are complemented. If it is X3' X2' X1' X0' Y3'
P3
X3' X2' X1' X0' Y4'
The process is shown in Fig. 4(b) and illustrated in example
given below. The tabular form representation shown in Fig. FA FA FA FA
2, now looks like as shown in Fig. 6. The block diagram is X4' C S C S C S C S
P4
shown in Fig. 5. 1 1
121
Authorized licensed use limited to: INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY. Downloaded on March 25,2010 at 05:31:40 EDT from IEEE Xplore. Restrictions apply.
ym-1 . . . . y4 y3 y2 y1 y0
xn-1 . . . x4 x3 x2 x1 x0
Inversion of bits by process described in Fig. 5
(ym-1)’ . . . (y4)’ (y3)’ (y2)’ (y1)’ (y0)’
(xn-1)’ . . . (x3)’ (x2)’ (x1)’ (x0)’
1 (x0’+ym-2’)’ . . . (x0’+y3’)’ (x0’+y2’)’ (x0’+y1’)’ (x0’+y0’)’
(x1’+ym-2’)’ . . . (x1’+y3’)’ (x1’+y2’)’ (x1’+y1’)’ (x1’+y0’)’
(x2’+ym-2’)’ . . . (x2’+y3’)’ (x2’+y2’)’ (x2’+y1’)’ (x2’+y0’)’
. .
. .
. .
(xn-1’+ym-1’)’ 0 (xn-2’+ym-2’)’ . . . (xn-2’+y1’)’ (xn-2’+y0’)’
xn-1’+ym-2’ xn-1’+ym-3’ . . . xn-1’+y1’ xn-1’+y0’
1 xn-2’+ym-1’ xn-3’+ym-1’ . . x0’+ym-1’ 1
Pm+n-1 Pm+n-2 Pm+n-3 Pm+n-4 . . Pm-1 . . Pn Pn-1 . . . P3 P2 P1 P0
Figure 6. Tabular form for proposed two’s complement signed array multiplier
Multiplier, 3 = 0000 0000 0000 0011 TABLE I. Power and Delay comparison of conventional and proposed
Multiplier is positive so the bits are inverted to give 1111 multipliers
1111 1111 1010.
Multiplicand is negative, so (216 - 1) is added to 0000 0000
0000 0011 to give the complement of two’s complement form Unsigned array multiplier
for -3 i.e. 0000 0000 0000 0010.
Conventional Proposed Proposed/Conventional
Proof for addition of (2m-1): Power (in Watt) 7.2177E-04 6.6482E-04 0.92
Delay (in ns) 1.015 0.968 0.954
As mentioned earlier, required input for proposed multiplier is
complement of two’s complement form of input. Steps Two’s complement signed array multiplier
involved in obtaining two’s complement form of a number
(sign extended to m bits) are: Modified Proposed/Modified
Proposed
1) Complement the number. Baugh Wooley Baugh Wooley
2) Addition of 1. Power (in Watt) 5.6533E-04 5.5366E-04 0.979
Delay (in ns) 1.1478 0.98 0.854
As we know that, taking two’s complement of a number twice
gives the same number.
Let A is the given m-bit sign extended number and B is its V. CONCLUSION
two’s complement representation. B’ is required number, In this paper, a new approach for the design of parallel
then array multipliers has been suggested. AND gates in the
A → A’→ (A’+1) = B existing designs have been replaced with NOR gates. Where
B → B’→ (B’+1) = A the numbers are not in two’s complement form then they are
B’ = A – 1 = A + (2m – 1) inverted and given as input. Results of the simulation clearly
show that the proposed multiplier architecture performs better
Thus for a negative number, calculation of its two’s than the existing modified Baugh-Wooley multiplier.
complement form and its complement are taken care of
simultaneously avoiding the need to calculate its two’s REFERENCES
complement form. This not only improves time delay but
also power dissipation is reduced. [1] J. Hoffman, G. Lacaze, and P. Csillag, “Iterative Logical
Network for Parallel Multiplication,” Electronics Letters, vol. 4,
p. 178, 1968.
IV. SIMULATION DETAILS AND RESULTS [2] P. Burton and D.R. Noaks, “High-Speed Iterative Multiplier,”
The analysis has been carried out on the proposed Electronics Letters, vol. 4, p. 262, 1968.
[3] R. De Mori, “Suggestion for an IC Fast Parallel Multiplier,”
multipliers by performing simulations on HSpice and
Electronics Letters, vol. 5, pp. 50-51, Feb. 1969.
compared with the existing multipliers. Simulations are [4] H. Guilt, “Fully Iterative Fast Array for Binary Multiplication,”
performed for 16x16 bit multipliers at 1.2V and at a Electronics Letters, vol. 5, p. 263, 1969.
frequency of 50 MHz. Results shown in the Table I are for [5] R. Baugh and B.A. Wooley, “A Two’s Complement Parallel
the particular inputs 1010101010101010x Array Multiplication Algorithm,” IEEE Trans. Computers, vol.
01010010101010101. Similar results can also be obtained for 22, no. 12, pp. 1,045-1,059, Dec. 1973.
other inputs.
122
Authorized licensed use limited to: INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY. Downloaded on March 25,2010 at 05:31:40 EDT from IEEE Xplore. Restrictions apply.
[6] K. Hwang, “Global and Modular Two’s Complement Array
Multipliers,” IEEE Trans. Computers, vol. 28, no. 4, pp. 300-
306, Apr. 1979.
[7] Wayne Wolf, (2002). Modern VLSI Design: System-On-Chip
Design. 3rd Edition, Prentice Hall, Upper Saddle River, N.J.
[8] M.S.Elrabaa, I.S. Abu-Khater, M.I. Elmasry, “Advanced Low-
PowerDigital Circuits Techniques”, Kluwer Academic Publ.,
1997.
[9] J.M.Rabaey, A.Chandrakasan, and B.Nicolic, “Digital
Integrated Circuits”, (2nd Edition) Prentice Hall, 2002.
123
Authorized licensed use limited to: INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY. Downloaded on March 25,2010 at 05:31:40 EDT from IEEE Xplore. Restrictions apply.