Академический Документы
Профессиональный Документы
Культура Документы
Abstract
In this work, Shannon’s and Mixed Shannon’s logics have been proposed. The simulation results show that the proposed
techniques offer low power, high speed and with high performance than the existing designs CMOS and CPL. Non-Restoring
and Restoring array divider circuits have been designed using adder cell and subtractor cell respectively. The proposed
Shannon adder cell consists of 16 transistors and mixed Shannon based adder cell consists of 12 transistors compared to CPL-
28 transistors and CMOS-28 transistors. The two different (Non-Restoring and Restoring array) 7x4 bit divider circuits have
been simulated by using Microwind 3.1 VLSI CAD tool. Various parameters such as propagation delay, power dissipation,
PDP have been determined from array dividers layout of feature size 65nm technology. The divider circuits have been
analyzed using BSIM 4 parameter analyzer. The proposed Shannon adder based Non-Restoring array divider circuit has a
reduced power dissipation of 82.77%, a reduced propagation delay of 44.12% and a reduced PDP of 90.37% compared with
CMOS based array divider circuits due to lower critical path in the proposed adder cell. Similarly, Restoring array divider
circuit has a reduced power dissipation of 66.09%, a reduced propagation delay of 28.88% and a reduced PDP of 75.98%
compared with CMOS based array divider circuits.
Keywords—Array divider; CMOS; CPL; PDP; Restoring; Subtractor cell; Shannon adder cell
1.Introduction
The power consumed in high performance microprocessors has increased to levels that impose a fundamental limitation to
increasing performance and functionality [1]–[3]. If the current trend in increasing power continues, high performance
microprocessors will soon consume thousands of watts. The power density of a high performance microprocessor will exceed the
power density levels encountered in typical rocket nozzles within the next decade [2].
The generation, distribution, and dissipation of power are at the forefront of current problems faced by the integrated circuit
industry [1]–[5]. The application of aggressive circuit design techniques which only focus on enhancing circuit speed without
considering power is no longer an acceptable approach in most high complexity digital systems. Dynamic switching power, the
dominant component of the total power consumed in current CMOS technologies, is quadratically reduced by lowering the supply
voltage. Lowering the supply voltage, however, degrades circuit speed due to reduced transistor currents. Threshold voltages are
scaled to reduce the degradation in speed caused by supply voltage scaling while maintaining the dynamic power consumption
within acceptable levels [1]–[5]. At reduced threshold voltages, however, subthreshold leakage currents increase exponentially.
Energy efficient circuit techniques aimed at lowering leakage currents are, therefore, highly desirable. Domino logic circuit
techniques are extensively applied in high performance microprocessors due to the superior speed and area characteristics of
domino CMOS circuits as compared to static CMOS circuits [7]–[8]. However, deep sub micrometer (DSM) domino logic
circuits utilizing low power supply and threshold voltages have decreased noise margins [9]–[11]. As on-chip noise becomes
more severe with technology scaling and increasing operating frequencies, error free operation of domino logic circuits has
become a major challenge [9], [10], [11].
The focus of this paper is to implement various Reduced-swing domino logic circuit techniques which offer better speed,
energy-efficiency and noise immunity in DSM technology. The organization of the paper is as follows. A brief review of the
sources of power dissipation in CMOS circuits is provided in Section II. In Section III various Reduced-swing techniques in
domino logic circuits for power reduction are proposed. In Section IV simulation and implementation results are presented.
Finally, conclusions are presented in Section V.
Similarly the function of the PDN is to connect the output to VSS when the output is meant to be 0. Static logic retains its
output level as long as the power is applied. Generally outputs are generated in response to input voltage levels after a certain
time delay. In static logic design, any combinational design will possess an equal number of PMOS and NMOS transistors.
The disadvantage with static gates is that it occupies more space since it uses equal number of PMOS and NMOS transistors,
more power dissipation and less speed.
2.2.Dynamic CMOS Logic:
The gates whose output levels remain valid only for certain period of time are called as dynamic logic gates. This period is
called clock period.
In conventional CMOS logic circuits, the required logic function is implemented twice, both in a PUN and a PDN. To
increase speed, in dynamic logic, the PUN is normally replaced by a single transistor that is controlled by a clock signal. An
additional transistor (footer transistor) is cascaded with the remaining logic block to avoid contention. Therefore it consists of
N+2 number of transistors( N NMOS transistors in PDN, 1 PMOS and 1 NMOS to which the clock signal is applied). Dynamic
circuits relay on the temporary storage signal values on the capacitance of high impedance nodes.
Dynamic circuit takes a sequence of precharge and conditional evaluation phases to realize the logic functions. Once the
output of the dynamic gate is discharged, it cannot be charged again until the next precharge operation.
3.5.Array Dividers:
Digital dividers generally can be categorized as employing arithmetic operations to execute a division operation. Arithmetic
dividers receive an input that combines the numerator and denominator. Look-up table implementations often require large look-
up tables to be accurate for high speed division, which is generally requiring significant processing time and chip space [18].
Many look-up table implementations also require multiple iterations to improve accuracy, which increases latency associated with
the division operation. One consideration in the design of digital dividers is the throughput of the division process [19]. The
throughput of the process determines the rate at which a new value can be initiated into the process and also refers as the initiation
interval. The throughput of the division process generally depends on the desired precision and the algorithm used. Another
consideration is the processing time required to perform the digital division operation, which corresponds to the amount of time
required to perform the division process and referred as latency [19, 20]. In many conventional digital divider designs, the latency
determines the overall speed of the division process. As a result, most calculation type dividers typically provide only a few bits
of precision for real-time operation.
There are two kinds of array divider such as Non-Restoring Array Divider (NRAD) and Restoring Array Divider (RAD) [20].
The Non-Restoring array divider guesses the quotient at each stage and when it is wrong it will not correct the remainder in this
stage, instead of that it would continue to go to next stage. It has some extra remainder correction circuit after the last stage to
correct the last remainder output by the divider. The Non-Restoring divider is much more efficient and faster than regular
Restoring array divider. Since the array divider has many stages, it can be efficiently pipelined [14]. More importantly, the Non-
Restoring divider uses a very regular structure and each cell only needs to connect to the nearest neighbor cells, which makes it
very efficient for VLSI design. The Restoring array divider circuit has controlled subtractor cell which is used as the cell
component of the divider circuit [20]. Conventional two’s complement binary subtractor is implemented using carry-propagate
subtractor cell. This carry-propagate subtractor cell propagates the carry signal from the least significant bit to the most
significant bit position.
The proposed adder cell is used in the Non-restoring array divider (NRAD) and Restoring array divider (RAD) circuits. The
Non- Restoring and Restoring array dividers may use in DSP application circuits [20]. By using BSIM 4 parameters, the power
dissipation, propagation delay, area of the 7x4 bit Non-restoring and Restoring array divider circuits have been analyzed and
implemented by CPL, CMOS, Mixed Shannon and proposed Shannon. The parameters such as power dissipation, propagation
delay, power-delay product of the divider circuit have been calculated from layout simulations.
where A-Dividend,
D-Divisor,
Q-Quotient,
R- Reminder.
where A-Dividend,
D-Divisor,
Q-Quotient,
R- Reminder.
POWER DELAY
TECHNIQUES POWER(µ Watts) DELAY(ns) PRODUCT (*10-15
Watt-Sec)
In part-1, the simulation results of MCIT AND gate, OR gate and 2:1 Mux circuit for 65 nm technology have been obtained.
The Designed MCIT logic gates have beeen compared with conventional CMOS AND, OR and 2:1 Mux circuits. From the
simulated results shown in Table1, 2 and 3, it has been observed that MCIT based circuits offer better performance. For example,
for OR gate, MCIT based circuit has a reduced power dissipation of 48.15%, a reduced propagation delay of 47.82% & a reduced
PDP of 72.95% than conventional CMOS OR gate. The simulation results are carried out by using Dsch3.1 and Microwind3.1
CAD tool.
6.References:
[1] S. Borkar, “Obeying moore’s law beyond 0.18 micron,” in Proc. IEEE Int. ASIC/SOC Conf., Sept.2000, pp. 26–31.
[2] R. Ronen et al., “Coming challenges in microarchitecture and architecture,”Proc. IEEE, vol. 89, pp. 325–339, Mar. 2001.
[3] M. T. Bohr, “Nanotechnology goals and Challenges for electronic applications,”IEEE Trans.Nanotechnol., vol. 1, pp. 56–
62,Mar. 2002.
[4] D. J. Frank et al., “Device scaling limits of Si MOSFET’s and their application dependencies,” Proc. IEEE, vol. 89, pp. 259–
288, Mar. 2001.
[5] R. K. Krishnamurty, A. Alvandpour, V. De, and S. Borkar, “High-performance and low-power challenges for sub-70 nm
microprocessor circuits,”in Proc. IEEE Custom Integrated Circuits Conf., May 2002, pp.125–128.
[6]. S. Mutoh et al., “1-V power supply high-speed Digital circuit technology with multithreshold- voltage CMOS,” IEEE J.
Solid-State Circuits, vol.30, pp. 847–854, Aug. 1995.
[7]. V. Kursun and E. G. Friedman, “Domino logic with dynamic body Biased keeper,” in Proc. Eur. Solid-State Circuits Conf.,
Sept. 2002, pp.675–678.
[8] “Variable threshold voltage keeper for contention reduction in dynamic circuits,” in Proc. IEEE Int. ASIC/SOC Conf., Sept.
2002, pp.314–318.
[9] S. Borkar, .Low Power Design Challenges for the Decade,. Proceedings of the IEEE/ACM Design Automation Conference,
pp. 293-296, June 2001.
[10] P. Srivastava, A. Pua, and L. Welch, .Issues in the Design of Domino Logic Circuits,. Proceedings of the IEEE Great Lakes
Symposium on VLSI, pp. 108-112, February 1998.
[11] G. Balamurugan and N. R. Shanbhag, .Energy- efficient Dynamic Circuit Design in the Presence Crosstal Noise,.
Proceedings of the IEEE International Symposium on Low Power Electronics and Design, pp. 24-29, August 1999.
[12] Dimitrios S, Christian P, Costas GE. CMOS circuits for low power. European low-power initiative for electronic system
design Kluwer academic publishers.
[13] G Yanga ,Z Wang and S Kang, ”Leakage Proof Domino Cicuit Design for DeepSub-100nm Technologies”, Proceedings of
IEEE International Conference on VLSI Design, .pp.222-227, January 2004.
[14] IEEE Journal Of Solid-State Circuits, Vol. 32, No. 7, July 1997low-Power Logic Styles: …..Cmos Versus Pass-Transistor
Logic Reto Zimmermann And Wolfgang Fichtner.
Appendix
SIMULATED WAVEFORMS FROM MICROWIND 3.1