Вы находитесь на странице: 1из 9

IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 15, NO.

12, DECEMBER 2007 1311

PMOS-Only Sleep Switch Dual-Threshold Voltage


Domino Logic in Sub-65-nm CMOS Technologies
Zhiyu Liu, Student Member, IEEE, and Volkan Kursun, Member, IEEE

Abstract—A circuit technique is proposed in this paper for si-


multaneously reducing the subthreshold and gate oxide leakage
power consumption in domino logic circuits. Only p–channel sleep
transistors and a dual-threshold voltage CMOS technology are uti-
lized to place an idle domino logic circuit into a low leakage state.
Sleep transistors are added to the dynamic nodes in order to re-
duce the subthreshold leakage current by strongly turning off all of
the high-threshold voltage transistors. Similarly, the sleep switches
added to the output nodes suppress the voltages across the gate in-
sulating layers of the transistors in the fan-out gates, thereby mini-
mizing the gate tunneling current. The proposed circuit technique
lowers the total leakage power by up to 77% and 97% as compared
to the standard dual-threshold voltage domino logic circuits at the
high and low die temperatures, respectively. Similarly, a 22% to
44% reduction in the total leakage power is observed as compared
to a previously published sleep switch scheme in a 45-nm CMOS
technology. The energy overhead of the circuit technique is low,
justifying the activation of the proposed sleep scheme by providing
a net savings in total energy consumption during short idle periods. Fig. 1. Comparison of the maximum gate oxide leakage current of an nMOS
transistor (V = V =V =V ) in two successive CMOS technology
Index Terms—Dynamic CMOS, electron tunneling, gate oxide generations.
tunneling, hole leakage, low-leakage sleep mode, multithreshold
voltage, subthreshold leakage current.

gate dielectric layer. The variation of the with the supply


I. INTRODUCTION voltage in two successive CMOS technologies (45 and
EATURE size scaling in MOSFETs requires reducing the 65 nm) is shown in Fig. 1. While advancing from the 65-nm
F supply and threshold voltages. The lowering of threshold
voltages leads to an exponential increase in the subthreshold
technology node to the 45-nm technology node, the reduction
of by 3 increases the by up to 14.9 times depending
leakage current. Several circuit techniques based on multiple on the voltage difference across the gate oxide, as illustrated in
threshold voltage (multiple ) CMOS technologies are de- Fig. 1.
scribed in the literature for reducing the subthreshold leakage The is in the range of 12–16 in the current CMOS
current [1]–[6], [15], [16], [22]. The effect of these multiple technologies [7], [8], [10]. Such a thin leads to a signifi-
CMOS circuit techniques on the gate oxide leakage current cant gate tunneling current. The variation of the gate oxide and
characteristics, however, has not been explored until recently. subthreshold leakage currents of an nMOS transistor with the
In the CMOS technologies with a gate insulator thicker supply voltage at two different die temperatures is shown in
than 20 , the gate oxide leakage current is orders Fig. 2, assuming a 45-nm CMOS technology. At 110 C, the
of magnitude smaller than the subthreshold leakage current is 6.7 times higher than the gate oxide leakage cur-
[8], [10]. The has, therefore, typically been rent (operating at the nominal supply voltage 0.8 V) as illus-
trated in Fig. 2. Alternatively, at the room temperature, the
ignored in the previous CMOS technologies. is caused by
the direct tunneling of the electrons and holes through the insu- is 2.5 times higher than the subthreshold leakage current. Due
lating gate dielectric layer. The tunneling probability of carriers to the aggressive scaling of , the gate dielectric tunneling has
increases with the scaling of the gate oxide thickness in become a primary leakage mechanism. Particularly at the low
die temperatures during long idle periods, most of the power
each new technology generation. The tunneling probability also
has a strong dependence on the voltage difference across the consumption of a CMOS circuit could occur due to gate oxide
leakage. New circuit techniques aimed at simultaneously re-
ducing the subthreshold and gate oxide leakage currents are,
Manuscript received August 11, 2005; revised May 28, 2006. This work
therefore, highly desirable.
was supported by a Grant from the Wisconsin Alumni Research Foundation A circuit technique for lowering the gate oxide leakage cur-
(WARF). rent in domino logic circuits is presented in [11]. Using P-type
The authors are with the Department of Electrical and Computer Engi- (predischarge) domino is proposed in order to exploit the lower
neering, University of Wisconsin–Madison, Madison, WI 53706-1691 USA
(e-mail: zhiyuliu@wisc.edu). gate oxide leakage current characteristics of the pMOS tran-
Digital Object Identifier 10.1109/TVLSI.2007.903947 sistors as compared to the nMOS transistors. The transistors
1063-8210/$25.00 © 2007 IEEE
1312 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 15, NO. 12, DECEMBER 2007

Fig. 3. Maximum gate oxide and subthreshold leakage current states in nMOS
and pMOS transistors. (a) Maximum gate oxide leakage current state. (b) Max-
imum subthreshold leakage current state.

Fig. 2. Comparison of the subthreshold and gate oxide leakage currents of an


nMOS transistor for various supply voltages at two different die temperatures.
I :V = 0 and V = V .I :V = V = V =
V .

in a p-type domino circuit, however, must be sized larger for


achieving similar speed as compared to an n-type domino cir-
cuit. Although this technique can be effective for reducing the
leakage current, the area and active power consumption are sig-
nificantly increased.
In this paper, a new circuit technique with lower overheads
of delay and energy is proposed to reduce both the subthreshold
and gate oxide leakage currents in domino logic circuits. pMOS
sleep transistors are utilized along with a dual threshold voltage Fig. 4. Comparison of the gate oxide leakage current produced by the same
(dual- ) CMOS technology to ensure that an idle domino cir- sized nMOS and pMOS transistors for various voltages applied across the gate
insulator in a 45-nm CMOS technology. V = V =V =V .
cuit is placed into a low leakage state. The proposed technique
reduces the total leakage power by up to 77% and 96.9% as com-
pared to the standard dual- domino logic circuits at 110 C
and 25 C, respectively. from the gate terminal to the conducting channel is shared
The paper is organized as follows. The leakage current char- between the source and drain terminals [14]. and are the
acteristics of the domino gates are described in Section II. The edge tunneling currents from the gate terminal to the source and
new circuit technique to reduce the total leakage power con- drain terminals through the gate-to-source and gate-to-drain
sumption is presented in Section III. The simulation results are overlap areas, respectively. is typically several orders of
given in Section IV. Some conclusions are offered in Section V. magnitude smaller than the other three components of the gate
tunneling current [9]. The highest gate oxide leakage current is
observed when a transistor operates in the active region with the
II. LEAKAGE CURRENT CHARACTERISTICS OF
maximum voltage difference across the gate-to-source and the
DYNAMIC CMOS CIRCUITS
gate-to-drain terminals, as illustrated in Fig. 3(a). Alternatively,
The leakage current characteristics of dynamic CMOS cir- the highest subthreshold leakage current is observed when a
cuits are explored in this section. The subthreshold and gate cutoff transistor is biased with the maximum voltage difference
oxide leakage currents produced by the nMOS and pMOS between the source and drain terminals, as depicted in Fig. 3(b).
transistors are described in Section II-A. The leakage current In a technology utilizing silicon-dioxide as the gate dielec-
characteristics of the previously published sleep switch dual- tric material, the probability of hole tunneling is much smaller
domino logic circuit techniques in the literature are discussed than the probability of electron tunneling through the gate oxide
in Sections II-B and II-C. (discussed in Appendix I). The for a pMOS device is,
therefore, significantly lower as compared to an nMOS device
A. Comparison of Leakage Currents in n-Channel and with the same and the same voltage difference across the
p-Channel Devices gate insulator. A comparison of the gate oxide leakage currents
The subthreshold and gate oxide leakage currents pro- produced by the n-channel and p-channel devices with similar
duced by the nMOS and pMOS transistors are illustrated in physical dimensions (width, length, and ) in a 45-nm CMOS
Fig. 3. has four components, as shown in Fig. 3(a): technology is shown in Fig. 4. The produced by an nMOS
gate-to-channel tunneling current , gate-to-drain tunneling transistor is up to 40 times (depending on the voltage difference
current , gate-to-source tunneling current , and across the gate oxide) higher as compared to a pMOS transistor,
gate-to-body tunneling current . The tunneling current as illustrated in Fig. 4.
LIU AND KURSUN: PMOS-ONLY SLEEP SWITCH DUAL-THRESHOLD VOLTAGE DOMINO LOGIC 1313

Fig. 6. Two cascaded dual-V domino OR gates in the sleep mode. The dom-
inant gate oxide leakage current conduction paths in the sleep mode are illus-
trated with arrows. H: high. L: low. High-V transistors are represented with a
thick line in the channel region.

Fig. 5. Two-input standard dual-V domino OR gate. High-V transistors are


represented with a thick line in the channel region.

B. Dual- Domino Logic


Employing dual- transistors for subthreshold leakage cur-
rent reduction in domino logic circuits was first proposed by
Kao [3]. A two-input standard dual- domino OR gate is shown Fig. 7. Two cascaded two-input dual-V domino OR gates with high-V sleep
in Fig. 5. The critical signal transitions that determine the delay transistors. The dominant gate oxide leakage current conduction paths in the
sleep mode are illustrated with arrows. H: high. L: low. High-V transistors are
of a domino logic circuit occur along the evaluation path. In a represented with a thick line in the channel region.
dual- domino circuit, therefore, all of the transistors that can
be activated during the evaluation phase have a low- . Alter-
natively, the precharge phase transitions are not critical for the circuit have been proposed in [2]–[4]. The high output of an
performance of a domino logic circuit. Therefore, those transis- idle domino gate, however, places the fan-out domino circuits
tors that are active during the precharge phase have a high- into the highest gate oxide leakage current state, as illustrated
[22]. in Fig. 6. The techniques proposed in [2]–[4], [15], and [16],
Provided that all of the high- transistors are cutoff in a therefore, increase the gate oxide leakage current while re-
dual- domino logic circuit, the subthreshold leakage current ducing the subthreshold leakage current. In the sub-65-nm
is significantly reduced as compared to a low- circuit [5]. CMOS technologies, the significant increase in the gate oxide
The clock is gated high, turning off the high- pull-up tran- leakage current could negate the subthreshold leakage current
sistor when a domino logic circuit is idle. In a standard dual- reduction provided by these techniques, thereby increasing the
domino logic circuit, the modes of the remaining high- tran- total leakage power consumption in the sleep mode.
sistors (other than the pull-up transistors) are determined by the
input vectors applied after the clock is gated high [15]. C. NMOS Sleep Switch Dual- Domino Logic
A sleep switch technique that places an idle domino logic cir- In addition to setting the dynamic node voltage low for re-
cuit into a low subthreshold leakage current state, regardless of ducing the subthreshold leakage current [15], the output node of
the input vectors, is presented in [15]. A high- nMOS switch a domino logic circuit should also be placed into a low voltage
is added to the dynamic node of a domino gate as shown in state in order to suppress the gate oxide leakage current into
Fig. 6. The operation of this transistor is controlled by a sep- the fan-out gates. A technique to force both the dynamic and
arate sleep signal. During the standby mode of operation, the output nodes of a domino logic circuit into a low voltage state
clock signal is gated high, turning off the high- pull-up tran- in the standby mode is proposed in [17]. Two high- nMOS
sistor of each domino gate. The sleep signal transitions high, sleep transistors N1 and N2 are placed at the dynamic and output
turning on the sleep switch. The dynamic node of the domino nodes, respectively, as illustrated in Fig. 7.
gate is discharged through the sleep switch and the output node In the standby mode, the clock is gated high. The sleep signal
transitions to high. Following the low-to-high transition of the is set high turning on N1 and N2. The dynamic and output nodes
output of a sleep switch dual- domino gate, the subsequent are discharged through N1 and N2, respectively. P3 is cutoff
gates (fed by the noninverting signals) also evaluate and dis- to avoid a static dc current path through P4 and N2. After the
charge in a domino fashion. After the node voltages settle to a dynamic and output nodes are discharged, the two nMOS sleep
steady state, all of the high- transistors are cutoff, thereby re- transistors (N1 and N2) are both in the maximum gate oxide
ducing the subthreshold leakage current [15]. leakage current state [see Fig. 3(a)]. The sleep transistors (N1,
Similar subthreshold leakage current reduction techniques N2, and P3) are required within every domino gate in a dynamic
based on discharging and charging the dynamic and output circuit designed with the technique presented in [17]. The gate
nodes, respectively, of all of the domino gates in a dynamic oxide leakage current overhead of the nMOS sleep transistors,
1314 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 15, NO. 12, DECEMBER 2007

Fig. 9. Two-input dual-V domino OR gate with high-V pMOS sleep transis-
Fig. 8. Two-input dual-V domino OR gate with low-V (P1 and P2) and tors. High-V transistors are represented with a thick line in the channel region.
high-V (P3) pMOS sleep transistors. High-V transistors are represented with
a thick line in the channel region.
leakage current. Similarly, the voltages across the gate insu-
lating layers of most of the transistors are suppressed, thereby
therefore, imposes a serious limitation to the leakage current lowering the gate oxide leakage current.
reduction that can be provided by this technique. Provided that a dual- CMOS technology is employed, the
noise immunity of a domino logic circuit is weakened due to the
III. PMOS-ONLY SLEEP SWITCH DUAL- DOMINO LOGIC high- keeper transistor [5], [15]. Using low- sleep transis-
A new circuit technique with enhanced effectiveness to si- tors could further increase the noise vulnerability of the domino
multaneously reduce the subthreshold and gate oxide leakage circuits since the sleep switches can be turned on by coupling
currents in domino logic circuits is proposed in this paper. Only noise to the sleep signal line. An alternative pMOS-only sleep
p-type sleep transistors are employed in order to reduce the gate switch technique based on high- sleep transistors is also pro-
oxide leakage current overhead of the new sleep switch cir- posed in this paper. Three high- pMOS sleep transistors (P3,
cuit technique. Since the gate tunneling current produced by a P5, and P6) are employed with the second proposed technique,
pMOS transistor is much smaller than an nMOS transistor, the as illustrated in Fig. 9. The current produced by a high- tran-
pMOS-only sleep switch circuit technique offers a significant sistor is smaller as compared to a low- transistor with sim-
reduction in the total leakage current as compared to the previ- ilar physical dimensions. Employing high- sleep transistors,
ously published schemes. therefore, enhances the noise immunity as compared to the first
The proposed circuit technique is illustrated in Fig. 8. Two proposed technique. Due to the reduced current provided by
low- pMOS sleep transistors P1 and P2 are added to the dy- the high- sleep transistors, however, the discharging speed
namic and output nodes, respectively. Provided that the dynamic of the dynamic and output nodes is reduced while entering the
node is discharged in the sleep mode, the pMOS transistor (P4) sleep mode. The size of the high- sleep transistors should be
in the output inverter is turned on. The output inverter and P2 increased in order to maintain a similar sleep delay as com-
produce a static dc current if P4 is directly connected to .A pared to the first proposed technique. The active power con-
high- pMOS sleep transistor (P3) is employed in series with sumption of the second proposed technique is, therefore, higher
P4 in order to eliminate the static dc current path through P4 as compared to the first proposed technique (for example, for
and P2 and to suppress the subthreshold leakage current pro- an eight-input domino OR gate, the active mode power consump-
duced by the output inverter in the sleep mode. tion is increased by 2.3% as compared to the first proposed tech-
In the active mode, the sleep signal is set high. P1 and P2 nique while maintaining a similar sleep delay).
are cutoff and P3 (driven by the inverted sleep signal) is turned
on. The proposed domino circuit operates similar to a stan- IV. SIMULATION RESULTS
dard domino gate. In the standby mode, the clock is gated high, BSIM4 device models are used in this paper for an accu-
turning off the high- pull-up transistor. The sleep signal is set rate estimation of the gate oxide leakage current [14]. The fol-
low, turning on P1 and P2. P3 is cutoff by the inverted sleep lowing circuits are simulated in a 45-nm CMOS technology
signal. The dynamic node is discharged to a voltage level equal ( 0.22 V, 0.35 V,
to the threshold voltage of a low- pMOS transistor and 0.8 V): cascaded multistage two-input domino AND
through P1. gates (AND2), cascaded multistage two-input, four-input, and
The dynamic and output nodes are eventually discharged to eight-input domino OR gates (OR2, OR4, and OR8, respectively),
a steady-state voltage less than (after P1 and P2 are and a 16-bit domino multiplexer (MUX16). All of the circuits
cutoff) by the high subthreshold leakage currents of the low- (other than MUX16) are composed of three stages. Each gate
transistors in the pull-down network and the output inverter and drives a fan-out of four. The domino gates in the first stage are
the gate-oxide leakage current into the fan-out gates. After the footed while the domino gates in the second and third stages are
node voltages settle to a steady state, all of the high- transis- footless. All of the circuits are designed with the following three
tors are strongly cutoff, significantly reducing the subthreshold techniques: standard dual- domino (dual- ), the technique
LIU AND KURSUN: PMOS-ONLY SLEEP SWITCH DUAL-THRESHOLD VOLTAGE DOMINO LOGIC 1315

TABLE I
STEADY-STATE NODE VOLTAGES (MILLIVOLTS) WITH
THE DUAL-V -LK TECHNIQUE

dual- -YWK techniques become smaller since the parasitic


capacitance introduced by the sleep transistors become less
important as compared to the parasitic capacitance of the
pull-down network transistors.
Fig. 10. Comparison of the active mode power consumption of the three
domino circuit techniques. The active mode power consumption is normalized
The dual- -LK technique increases the active power con-
to the power consumption of the standard dual-V technique for each circuit. sumption by 1.9% (MUX16) to 10.7% (OR2) as compared to
the dual- -YWK technique. This difference is primarily due to
the higher parasitic capacitance of the pMOS sleep transistors
presented in [17] (dual- -YWK), and the circuit technique pro- (P1 and P2 in Fig. 8) as compared to the nMOS sleep transistors
posed in this paper as illustrated in Fig. 8 (dual- -LK). The (N1 and N2 in Fig. 7).
sleep mode data are measured at 25 C and 110 C assuming
long and short idle periods, respectively. Alternatively, the ac- B. Sleep Mode Leakage Power Consumption at
tive mode data are measured at a worst case temperature of High Temperature
110 C. A 3-GHz clock is applied to the circuits. To have a rea- In the dual- -LK circuits, the pMOS sleep transistors are
sonable comparison, the circuits are sized to have a similar worst not capable of directly discharging the dynamic and output
case propagation delay with each technique. nodes to zero volts ( rise problem). The high subthreshold
leakage currents of the low- transistors in the pull-down net-
A. Area Overhead and Active Mode Power work and the output inverter and the gate-oxide leakage current
The layouts of the circuits are drawn assuming MOSIS deep into the fan-out gates continue to discharge the dynamic and
submicrometer design rules [21]. As shown in Fig. 8, a high- output nodes after P1 and P2 are turned off. The steady-state
pMOS transistor (P3) is placed in series with the low- pMOS dynamic and output node voltages in the dual- -LK domino
transistor of the output inverter in the dual- -LK circuits. Phys- circuits are both below , as listed in Table I.
ical size of pMOS transistors in the output inverters of dual- The subthreshold leakage current produced by a domino logic
-LK circuits must be increased to provide an evaluation delay circuit strongly depends on the dynamic and output node volt-
similar to the standard dual- circuits. Furthermore, two extra ages [5]. Two input conditions are simulated to evaluate the
pMOS sleep transistors (P1 and P2) are added to the dynamic leakage current in the sleep mode with the standard dual-
and output nodes. The areas of the proposed pMOS sleep switch technique. The first condition assumes that all of the inputs ap-
circuits are, therefore, increased by up to 82% (OR2) as com- plied to the first stage gates are low (high dynamic node voltage
pared to the standard dual- circuits based on the layout area state). The second condition assumes that all of the inputs ap-
comparison. plied to the first stage gates are high (low dynamic node voltage
The active power consumption of the domino circuits is state). The total leakage power consumption characteristics of
shown in Fig. 10. In the dual- -YWK and dual- -LK circuits, the domino circuits with the three techniques at 110 C are
two pMOS transistors are placed in series in the pull-up path of shown in Fig. 11. The leakage power reduction offered by the
the output inverter. The driving capability of the output inverter proposed circuit technique is listed in Table II.
is, therefore, degraded. Since P3 has a high- , the pull-up The dual- -LK technique reduces the total leakage power
strength of the output inverter is further reduced. Increasing by 81.9% to 97.3% as compared to the standard dual- circuits
the physical size of the dual- -LK and dual- -YWK circuits driven with low inputs, as listed in Table II. For a higher fan-in,
in order to provide a propagation delay similar to the standard the leakage power reduction provided by the dual- -LK tech-
dual- circuits increases the parasitic capacitance. Further- nique is enhanced (OR2: 87.6% versus OR8: 95.5%).
more, the parasitic capacitance at the dynamic and output For a dual- CMOS technology to be effective for reducing
nodes is increased due to the sleep transistors. The dual- -LK the subthreshold leakage current, the inputs to a standard
technique therefore increases the active mode power con- dual- domino gate must be maintained high during the sleep
sumption by 6% (MUX16) to 21% (AND2) as compared to mode. Maintaining the inputs high ensures that the subthreshold
the standard dual- domino logic circuit technique. Similarly, leakage current is produced by the high- transistors. A high
the dual- -YWK technique increases the active mode power input vector, therefore, reduces the leakage power consumption
consumption by 3% (OR8) to 14% (AND2) as compared to by 57.1% (AND2) to 88.6% (OR4) as compared to a low input
the standard dual- domino circuits. For a higher fan-in, the vector. The leakage reduction provided by the high input vector
active power consumption overheads of the dual- -LK and is smaller for AND2 as compared to the other circuits since
1316 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 15, NO. 12, DECEMBER 2007

Fig. 11. Comparison of the leakage power consumption of the domino circuits
with the three circuit techniques at 110 C. The leakage power is normalized to Fig. 12. Comparison of the total leakage power consumption of the domino
the leakage power of the standard dual-V technique with low inputs for each circuits with the three circuit techniques at 25 C. The leakage power is nor-
circuit. malized to the leakage power of the standard dual-V technique with low inputs
for each circuit.

TABLE II
LEAKAGE POWER REDUCTION PROVIDED BY THE DUAL-V -LK TECHNIQUE TABLE III
AS COMPARED TO THE DUAL-V -YWK AND STANDARD DUAL-V TOTAL LEAKAGE POWER REDUCTION PROVIDED BY THE DUAL-V -LK
TECHNIQUES AT 110 C AS COMPARED TO THE DUAL-V -YWK AND STANDARD DUAL-V
TECHNIQUES AT 25 C

the subthreshold leakage current produced by the pull-down


network of AND2 is reduced by the stack effect when the inputs
are maintained low. ature are shown in Fig. 12. The leakage power reduction of-
A high input vector places the transistors in the pull-down net- fered by the proposed circuit technique is listed in Table III. The
work of a standard dual- domino logic gate into the highest dual- -LK technique reduces the total leakage power by 87.7%
gate oxide leakage current state. Since the gate oxide leakage to 96.4% as compared to the standard dual- circuits driven
current of a low- nMOS transistor is significantly higher than with low inputs, as listed in Table III. For a higher fan-in, the
the subthreshold leakage current of a high- transistor (see leakage power savings provided by the dual- -LK technique is
Appendix II), the gate tunneling current produced by the pull- increased (OR2: 91.6% versus OR8: 96.4%).
down network dominates the total leakage power consumption The gate oxide leakage current produced by the nMOS
of a standard dual- domino gate when the inputs are main- transistors is the dominant source of the total leakage power
tained high. The dual- -LK technique suppresses both the sub- consumption at the room temperature (see Appendix II). A high
threshold and the gate oxide leakage by discharging the dynamic input vector places the transistors in the pull-down network into
and output nodes. The dual- -LK technique thereby reduces the maximum gate oxide leakage current state. The leakage
the total leakage power by 45.6% to 77.0% as compared to the power reduction provided by the proposed circuit technique is,
standard dual- circuits with high inputs, as listed in Table II. therefore, higher when the standard dual- domino circuits
The sleep switches at the dynamic and output nodes are are driven with high inputs. The dual- -LK technique reduces
sized much smaller than the precharge transistor, pull-down the total leakage power by 90.2% to 96.9% as compared to the
network transistors, and the transistors in the output inverter in standard dual- circuits with high inputs, as listed in Table III.
the dual- -YWK and dual- -LK circuits. Particularly for the The standard dual- circuits driven by high inputs consume
wide fan-in gates, the gate oxide leakage current of the sleep more power than the standard dual- circuits driven by low
switches is a relatively small portion of the total leakage current inputs in the sleep mode, as illustrated in Fig. 12. This result
produced at a high die temperature. The difference between indicates a dramatic change in the node voltage dependent
the total leakage power consumption of the dual- -YWK and leakage power characteristics of the standard domino logic
dual- -LK techniques is within 10.7% (OR4) at 110 C. circuits fabricated in technologies subject to significant gate di-
electric tunneling current [18]–[20]. The subthreshold leakage
C. Sleep Mode Leakage Power Consumption at Room currents produced by AND2 and MUX16 are reduced by the
Temperature stack effect when the inputs are low [15]. Therefore, the shift
The total leakage power consumption characteristics of the in the leakage power characteristics is more significant in the
domino circuits with the three techniques at the room temper- gates with stacked pull-down transistors.
LIU AND KURSUN: PMOS-ONLY SLEEP SWITCH DUAL-THRESHOLD VOLTAGE DOMINO LOGIC 1317

An nMOS sleep transistor can produce a significantly higher


gate oxide leakage current as compared to a pMOS transistor
(see Appendix I). The high- nMOS sleep switches proposed in
[17] cause a significant increase in the total leakage current of an
idle dual- domino gate. The highest gate oxide leakage current
produced a pMOS transistor is more than an order of magnitude
smaller than the highest gate oxide leakage current produced by
an nMOS transistor. Furthermore, there is zero voltage differ-
ence across the gate insulator layers of P1 and P2 (see Fig. 8)
throughout the idle mode with the dual- -LK technique
. The proposed pMOS-only sleep switch technique,
therefore, effectively eliminates the gate oxide leakage current
overhead introduced by the sleep transistors. The dual- -LK
technique reduces the total leakage power by 22.2% to 43.8% as
compared to the dual- -YWK technique, as listed in Table III. Fig. 13. Cumulative standby energy dissipation of the dual-V -LK and stan-
dard dual-V circuits for two different input vectors. The step change of the
dual-V -LK energy characteristics at the first nanosecond is the energy over-
D. Energy Overhead at High Temperature head for implementing a sleep mode with the dual-V -LK technique.
The energy overhead of a dual- -LK circuit for imple-
menting a low leakage sleep mode is discussed in this section.
The circuit is assumed to be operating at a worst case tem- of the dynamic nodes in the standard dual- circuits are dis-
perature of 110 C before entering the idle mode. The energy charged during the sleep mode. Alternatively, all of the dynamic
overhead of the proposed circuit technique is low, providing nodes in a pMOS-only sleep switch domino logic circuit are dis-
a net savings in total energy consumption shortly after the charged during the sleep mode, independent of the input vec-
beginning of the idle mode. The energy overhead data are tors. The activation energy required by the sleep switch circuit
measured at this worst case high die temperature assuming the technique is, therefore, higher than the standard dual- circuit
junction temperature does not significantly change during the technique. In order to justify the proposed sleep switch circuit
short duration of time at the beginning of the idle mode. technique to force a circuit into a low leakage state, the total en-
When a sleep switch domino logic circuit is idle, the clock is ergy consumed to enter and leave the sleep mode must be less
gated high. The sleep signal should be applied after the low-to- than the total savings in standby leakage energy.
high edge of the clock signal propagates to the gates in the last The cumulative energy dissipated in the standby mode by
stage of a clock-delayed domino logic circuit. Activating the the standard dual- and the dual- -LK circuits is shown in
sleep switches after the low-to-high transition of the clock en- Fig. 13. The leakage energy per cycle is assumed to be con-
sures that no short-circuit power is consumed while entering the stant. The cumulative energy of a standard dual- domino cir-
sleep mode. The dynamic and output nodes in all of the domino cuit is only affected by the subthreshold and gate oxide leakage
gates are discharged through the sleep transistors. After the node currents during the standby mode. Alternatively, both the cumu-
voltages settle, all of the high- transistors are strongly cut off, lative leakage energy and the energy overhead of entering and
minimizing the subthreshold leakage current. Similarly, the gate leaving the sleep mode are included in the energy characteris-
oxide leakage current is minimized by discharging the output tics of the dual- -LK circuit. The total energy overhead of the
nodes. After the clock is gated and the sleep switches are acti- dual- -LK circuit technique is independent of the duration of
vated, it takes 2.4 to 7.5 ns (depending upon the circuit type) to the idle mode (assuming the duration of idle mode is longer than
place the circuits into a low leakage state. the time needed to enter the low leakage sleep mode, 3.7 ns for
Before the end of an idle mode, the sleep signal transitions the dual- -LK circuit under consideration). The total energy
high, cutting off the sleep switches attached to the dynamic and overhead of the proposed technique is included as an energy
output nodes while activating the sleep transistors in the output step in the first nanosecond of the standby mode (see Fig. 13).
inverters. Disabling the dynamic and output node sleep transis- Similar to the dual- energy characteristics, after entering the
tors before activating the clock is necessary to avoid short-cir- sleep mode, the dual- -LK circuit energy consumption is due
cuit current while leaving the idle mode. The clock is reactivated to only the leakage currents. Since the standby leakage energy of
and all of the dynamic nodes are recharged to activate (wake-up) dual- -LK is significantly lower than the standard dual- cir-
a sleeping domino circuit. The reactivation time is equal to the cuit, the dual- -LK energy characteristics have a much smaller
time it takes to precharge a domino circuit. slope as compared to the energy characteristics of the standard
The energy overhead of a three-stage sleep switch circuit domino circuit (see Fig. 13). A specific amount of time in the
composed of four-input domino OR gates with a fan-out of four idle mode, also dependent upon the input vectors, is necessary
is evaluated in this section. Changing the mode of operation for the cumulative leakage energy of a standard dual- circuit
of the sleep transistors to place a dual- domino logic circuit to exceed the cumulative energy of a dual- -LK circuit.
into standby mode requires a specific amount of energy. Ad- The intersection of the dual- -LK and standard dual- cu-
ditional energy is dissipated at the end of an idle period while mulative energy characteristics are evaluated to determine the
precharging the dynamic nodes in order to reactivate a domino necessary minimum duration of the sleep mode such that the
logic circuit. Depending upon the input vectors, some or none pMOS-only sleep switch circuit technique offers a net savings
1318 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 15, NO. 12, DECEMBER 2007

in energy as compared to a standard dual- circuit. As illus-


trated in Fig. 13, the cumulative standby energy of the standard
dual- and dual- -LK circuits exhibit different behavior de-
pending upon the input vectors. The leakage current produced
by the standard dual- circuit is smaller for a high input vector
as compared to a low input vector (see Fig. 11). Alternatively,
the leakage current of the dual- -LK is virtually independent of
the input vector. However, depending upon the input vector, the
relative energy overhead of the dual- -LK scheme changes. For
low inputs, none of the dynamic nodes of the standard dual-
circuit are discharged during the standby mode. Alternatively,
all of the dynamic nodes are discharged in the dual- -LK. The
Fig. 14. Energy band diagrams of pMOS and nMOS transistors under inversion
relative energy overhead of the dual- -LK circuit technique re- bias. (a) Energy band diagram of a pMOS transistor. (b) Energy band diagram of
quired to charge the dynamic nodes to reactivate the circuit (to an nMOS transistor. E : Bottom of the conduction band. E : Top of the valence
transition from the standby mode to the active mode) is, there- band. E : Fermi energy level.
fore, higher for a low input vector. As shown in Fig. 13, a min-
imum duration of 8.9 and 100 ns is required in the idle mode
for the dual- -LK circuit technique to provide a net savings in electron tunneling from the conduction band (ECB) of the sil-
energy as compared to the standard dual- circuit with the low icon substrate is the dominant tunneling mechanism in an nMOS
and high input vectors, respectively. transistor. The HVB of the polysilicon is negligible due to the
smaller number of holes as compared to the electrons in the gate
of an nMOS device. The energy barrier heights for the HVB and
V. CONCLUSION
ECB are 4.5 and 3.1 eV, respectively, as illustrated in Fig. 14
In sub-65-nm CMOS technologies, both the subthreshold [12], [13]. The probability of hole tunneling is, therefore, much
and gate dielectric leakage currents need to be suppressed for smaller than the probability of electron tunneling through the
reducing the standby power consumption. A circuit technique gate oxide. The for a pMOS device is significantly lower
based on pMOS-only sleep transistors and a dual- CMOS as compared to an nMOS device with similar physical dimen-
technology is presented in this paper for simultaneously re- sions (width, length, and ) and voltage difference across the
ducing the subthreshold and gate oxide leakage currents in gate insulator. The produced by a low- nMOS transistor
domino logic circuits. is 47 times and 30 times higher than the produced by a
Both the dynamic and output nodes in a domino logic cir- low- pMOS transistor at 110 C and 25 C, respectively (see
cuit are discharged through pMOS sleep transistors in the idle Appendix II). Replacing the nMOS sleep transistors with pMOS
mode. Placing the dynamic node into a low voltage state reduces sleep transistors is, therefore, an effective method for reducing
the subthreshold leakage current by strongly turning off all of the gate oxide leakage current of an idle sleep switch dual-
the high threshold voltage transistors. Furthermore, placing the domino gate.
output node into a low voltage state suppresses the gate dielec-
tric tunneling currents into the fan-out gates. The proposed cir- APPENDIX II
cuit technique also exploits the intrinsically high subthreshold
and gate oxide leakage currents in scaled nanometer CMOS A comparison of the normalized subthreshold and gate oxide
leakage currents produced by the low threshold voltage (low- )
technologies for placing an idle domino logic circuit into a min-
and high threshold voltage (high- ) transistors in a dual-
imum leakage state.
CMOS technology is listed in Table IV. The gate oxide leakage
The circuit technique reduces the leakage power by up to
77.0% to 96.9% as compared to the standard dual- domino cir- current of a low- nMOS transistor is 4.7 times and 159.1
cuits in the sleep mode at the high and low die temperatures, re- times higher than the subthreshold leakage current of a high-
spectively. Furthermore, by employing pMOS-only sleep tran- pMOS transistor at the high and low die temperatures, respec-
sistors, the presented circuit technique reduces the total leakage tively. The difference between the subthreshold leakage currents
power by up to 43.8% as compared to a previously published of the high- nMOS and pMOS transistors is less than 20%. As
sleep technique based on nMOS sleep transistors. The energy discussed in Sections II-B and II-C, maintaining inputs high is
overhead of the circuit technique is low, providing a net energy preferable to reduce the subthreshold leakage current in an idle
savings during idle periods as short as 8.9 ns. dual- domino gate. When the inputs are maintained high, the
majority of gate tunneling current in an idle wide fan-in dual-
domino gate is produced by the low- pull-down network tran-
APPENDIX I sistors while the subthreshold leakage currents are produced
In a technology with silicon dioxide as the gate dielectric ma- by the high- transistors. The gate oxide leakage current of a
terial, the tunneling current of a pMOS device is primarily due low- nMOS transistor is much higher than the subthreshold
to the hole tunneling from the valence band (HVB) in the silicon leakage currents of the high- transistors at both high and low
substrate. The electron tunneling from the polysilicon conduc- temperatures. The gate tunneling, therefore, becomes the dom-
tion band is negligible since the electron concentration in the inant leakage mechanism in an idle dual- domino gate when
p+ polysilicon gate is low in a pMOS device. Alternatively, the the inputs are high. When the gate-oxide tunneling phenomenon
LIU AND KURSUN: PMOS-ONLY SLEEP SWITCH DUAL-THRESHOLD VOLTAGE DOMINO LOGIC 1319

TABLE IV [13] Y. C. Yeo, Q. Lu, W. C. Lee, T.-J. King, C. Hu, X. Wang, X. Guo,
NORMALIZED SUBTHRESHOLD AND GATE OXIDE LEAKAGE CURRENTS OF THE and T. P. Ma, “Direct tunneling gate leakage current in transistors with
LOW-V AND HIGH-V TRANSISTORS AT TWO DIFFERENT DIE TEMPERATURES ultrathin silicon nitride gate dielectric,” IEEE Electron Device Lett.,
vol. 21, no. 11, pp. 540–542, Nov. 2000.
[14] “Berkeley predictive technology model (BPTM),” Univ. Cali-
fornia, Berkeley, 2007. [Online]. Available: http://www.de-
vice.eecs.berkeley.edu/~ptm/download.html
[15] V. Kursun and E. G. Friedman, “Sleep switch dual threshold voltage
domino logic with reduced standby leakage current,” IEEE Trans. Very
Large Scale Integr. (VLSI) Syst., vol. 12, no. 5, pp. 485–496, May 2004.
[16] S. Heo and K. Asanovic, “Leakage-biased dynamic fine-grain leakage
reduction,” in Proc. IEEE Int. Symp. VLSI Circuits, 2002, pp. 316–319.
Transistor width = 1 m. Transistor length = 45 nm. Low-V = 0.22 V. [17] G. Yang, Z. Wang, and S. Kang, “Leakage-proof domino circuit design
High-V = 0.35 V. V = 0.8 V. I : V = 0 and V = V . for deep sub-100 nm technologies,” in Proc. IEEE Int. Conf. VLSI Des.,
I : V = V = V = V . For each temperature, the currents 2004, pp. 222–227.
[18] Z. Liu and V. Kursun, “Temperature dependent leakage power char-
are normalized to the subthreshold leakage current produced by the high-V
acteristics of dynamic circuits in sub-65 nm CMOS technologies,” in
pMOS transistor.
Proc. IEEE Int. Midw. Symp. Circuits Syst., 2005, pp. 551–554.
[19] Z. Liu and V. Kursun, “Leakage power characteristics of dynamic cir-
cuits in nanometer CMOS technologies,” IEEE Trans. Circuits Syst. II,
is significant, a small size active nMOS sleep switch can pro- Exp. Briefs, vol. 53, no. 8, pp. 692–696, Aug. 2006.
duce a considerably higher leakage current as compared to a [20] Z. Liu and V. Kursun, “Leakage biased pMOS sleep switch dynamic
circuits,” IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 53, no. 10, pp.
larger cutoff high- pMOS transistor. For example, the 1093–1097, Oct. 2006.
produced by an active 0.1 m wide high- nMOS sleep tran- [21] MOSIS, Marina del Rey, CA, “The MOSIS service,” (2007).
sistor is equal to the produced by a cutoff 12.4 m [Online]. Available: http://www.mosis.org/Technical/Design-
rules/scmos/scmos-main.htm
wide high- pMOS pull-up transistor in a dual- domino gate [22] V. Kursun and E. G. Friedman, Multi-Voltage CMOS Circuit Design.
at room temperature. Hoboken, NJ: Wiley, 2006.

Zhiyu Liu (S’05) received the B.S. and M.S. degrees


REFERENCES in engineering physics from Tsinghua University,
Beijing, China, in 2000. He is currently pursuing
[1] S. Shigematsu, S. Mutoh, Y. Matsuya, and J. Yamada, “A 1 V the Ph.D. degree in electrical and computer engi-
high-speed MTCMOS circuit scheme for power-down applications,” neering from the University of Wisconsin-Madison,
in Proc. IEEE Int. Symp. VLSI Circuits, 1995, pp. 125–126. Madison, under the supervision of Prof. Kursun.
[2] J. T. Kao and A. P. Chandrakasan, “Dual-threshold voltage techniques His research interests include low-power and high-
for low-power digital circuits,” IEEE J. Solid-State Circuits, vol. 35, performance integrated circuit design and emerging
no. 7, pp. 1009–1018, Jul. 2000. CMOS technologies.
[3] J. Kao, “Dual threshold voltage domino logic,” in Proc. Eur. Solid-
State Circuits Conf., 1999, pp. 118–121.
[4] M. W. Allam, M. H. Anis, and M. I. Elmasry, “High-speed dynamic
logic styles for scaled-down CMOS and MTCMOS technologies,”
in Proc. IEEE/ACM Int. Symp. Low Power Electron. Des., 2000, pp.
145–160. Volkan Kursun (S’01–M’04) received the B.S.
[5] V. Kursun and E. G. Friedman, “Node voltage dependent sub- degree in electrical and electronics engineering
threshold leakage current characteristics of dynamic circuits,” in Proc. from the Middle East Technical University, Ankara,
IEEE/ACM Int. Symp. Quality Electron. Des., 2004, pp. 104–109. Turkey, in 1999, and the M.S. and Ph.D. degrees
[6] V. Kursun and E. G. Friedman, “Domino logic with variable threshold in electrical and computer engineering from the
voltage keeper,” IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. University of Rochester, Rochester, NY, in 2001 and
11, no. 6, pp. 1080–1093, Dec. 2003. 2004, respectively.
[7] “International technology roadmap for semiconductors,” 2001. [On- In 2004, he became an Assistant Professor with the
line]. Available: http:www//public.itrs.net Department of Electrical and Computer Engineering,
[8] T. Ghani, K. Mistry, P. Packan, S. Thompson, M. Stettler, S. Tyagi, and University of Wisconsin-Madison, Madison. In 2000,
M. Bohr, “Scaling challenges and device design requirements for high he was with Xerox Corporation, Webster, NY, where
performance sub-50 nm gate length planar CMOS transistors,” in Proc. he performed research on mixed-signal thermal inkjet integrated circuits. During
IEEE Int. Symp. VLSI Technol., 2000, pp. 174–175. the summers of 2001 and 2002, he was with Intel Microprocessor Research Lab-
[9] H. Sasaki, M. Ono, T. Yoshitomi, T. Ohguro, S. Nakamura, M. Saito, oratories, Hillsboro, OR, where he was responsible for the modeling and design
and H. Iwai, “1.5 nm direct-tunneling gate oxide Si MOSFETs,” IEEE of high frequency monolithic power supplies. His current research interests in-
Trans. Electron Devices, vol. 43, no. 8, pp. 1233–1242, Aug. 1996. clude low-voltage, low-power, and high-performance integrated circuit design,
[10] T. Inukai, M. Takamiya, K. Nose, H. Kawaguchi, T. Hiramoto, and T. modeling of semiconductor devices, and emerging integrated circuit technolo-
Sakurai, “Boosted Gate MOS (BGMOS): Device/circuit cooperation gies. He has more than 60 publications and four issued and three pending patents
scheme to achieve leakage-free gigascale integration,” in Proc. IEEE in the areas of high-performance integrated circuits and emerging semicon-
Int. Custom Integr. Circuits Conf., 2000, pp. 409–412. ductor technologies. He is the author of the book Multi-Voltage CMOS Circuit
[11] F. Hamzaoglu and M. R. Stan, “Circuit level techniques to control gate Design (Wiley, 2006).
leakage for sub-100 nm CMOS,” in Proc. IEEE/ACM Int. Symp. Low Dr. Kursun is a member of the technical program and organizing committees
Power Electron. Des., 2002, pp. 60–63. of a number of IEEE and ACM conferences and serves on the editorial boards
[12] W.-C. Lee and C. Hu, “Modeling gate and substrate currents due to of the IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI)
conduction- and valence-band electron and hole tunneling,” in Proc. SYSTEMS, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS—II: EXPRESS
IEEE Int. Symp. VLSI Technol., 2000, pp. 198–199. BRIEFS, and the Journal of Circuits, Systems, and Computers.

Вам также может понравиться