Вы находитесь на странице: 1из 9

2292 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSI: REGULAR PAPERS, VOL. 59, NO.

10, OCTOBER 2012

A Simple Circuit Approach to Reduce Delay


Variations in Domino Logic Gates
Gaetano Palumbo, Fellow, IEEE, Melita Pennisi, Member, IEEE, and Massimo Alioto, Senior Member, IEEE

AbstractIn this paper, a simple approach to reduce delay


variations in domino logic gates is proposed. Previous analysis
by the same authors showed that delay variations in domino
logic are mainly due to the feedback loop implemented by the
keeper transistor and the output inverter gate. Accordingly, the
proposed strategy aims at reducing the loop gain associated with
this feedback loop, and hence its impact on delay variations. In
particular, a simple modified keeper is proposed to reduce the
loop gain while keeping the same silicon area, noise margin, and
nominal performance. The resulting delay variations associated
with keeper insertion are shown to be lowered by approximately
50%. The proposed approach is assessed by means of simulations
in 65-nm and 90-nm commercial CMOS technologies.
Index TermsIntradie variations, process variations, timing
Fig. 1. Schematic of a Domino logic gate.
modeling, variability, VLSI.

the power supply and the dynamic node , when the latter has
to be kept high [10], [11].
I. INTRODUCTION
In nanometer technologies, intradie process variations deter-
mine large delay variations that are a timing overhead that limits
the performance improvements potentially offered by Domino

V ARIABILITY issues pose a major challenge in


nanometer integrated circuits (ICs) [1][3]. Indeed, espe-
cially in high-performance applications, the large delay/power
logic [15][23]. Since intradie process variations are expected
to rapidly increase in the next technology generations, and con-
sidering that they are very difficult to compensate with adaptive
deviations due to process and environmental variations make it schemes, they are regarded as the most critical source of vari-
difficult to meet the tight bounds imposed by performance and ability and a major limit to performance in nanometer technolo-
consumption requirements, thereby degrading the yield [4][9]. gies [21].
In performance-critical applications, Domino logic is widely Recently, the effect of process variations on the delay of dy-
employed since it has a lower delay at the cost of a reduced noise namic logic was investigated at the circuit level of abstraction
immunity, compared with static CMOS logic [10], [11]. The by the same authors [24]. The analysis showed that the delay
speed advantage of Domino logic is obtained thanks to the more variability of Domino logic gates is typically doubled compared
compact circuit topology (the pull-up network is much simpler to that of the static logic counterparts. In the same paper it was
than CMOS logic). This speed advantage becomes more pro- shown that [24]:
nounced when considering wide fan-in gates. As an example, the variability of the delay at the dynamic node (i.e., from
fast OR gates and MUXes with wide fan-in are typically used the input to node in Fig. 1) is almost the same as the
to realize high-performance register files [12][14]. overall delay variability (i.e., from the input to node OUT
To counteract the noise immunity and signal integrity degra- in Fig. 1);
dation observed in Domino logic, the dynamic node is typi- variations in the keeper transistor and the precharge tran-
cally connected to a keeper PMOS transistor ( in Fig. 1). sistor do not significantly contribute to the delay variability
During the evaluation phase, provides a static path between at the dynamic node .
Thus, the only remaining cause of variability increase compared
to static CMOS logic is the positive feedback loop that is imple-
Manuscript received February 18, 2011; revised September 05, 2011; ac-
mented by the keeper transistor and the output inverter in Fig. 1
cepted January 15, 2012. Date of publication May 04, 2012; date of current
version September 25, 2012. This paper was recommended by Associate Editor [24]. Furthermore, the delay variability degradation associated
S. Cotofana. with the feedback loop tends to get worse in more advanced
G. Palumbo and M. Pennisi are with the DIEES (Dipartimento di Ingegneria
technologies (see Section II for the details). Hence, it must be
Elettrica, Elettronica e dei Sistemi), Universit di Catania, I-95125, Catania,
Italy (e-mail: gpalumbo@diees.unict.it; mpennisi@diees.unict.it). necessarily mitigated to avoid unmanageable delay variations in
M. Alioto is with the DII (Dipartimento di Ingegneria dellInformazione), current and next technology generations.
Universit di Siena, I-53100, Siena, Italy, and also with the EECS Department,
In regard to the trade-off between noise margin and perfor-
the University of Michigan, Ann Arbor, MI 48109 USA (e-mail: malioto@dii.
unisi.it; alioto@umich.edu). mance, a strong keeper improves the former but degrades the
Digital Object Identifier 10.1109/TCSI.2012.2189046 latter, due to the stronger current contention with pull-down

1549-8328/$31.00 2012 IEEE


PALUMBO et al.: A SIMPLE CIRCUIT APPROACH TO REDUCE DELAY VARIATIONS IN DOMINO LOGIC GATES 2293

network. Clearly, this trade-off becomes hard to manage in the


presence of variations, hence various circuit solutions have been
proposed to limit their impact [14], [25][28]. More specifi-
cally, approaches in [14], [25] and [26] temporarily reduce the
noise margin for a portion of the clock cycle. The approach in
[27] proposes a replica bias technique that can counteract varia-
tions that are fully correlated in both NMOS and PMOS transis-
tors (i.e., layout-dependent variations), whereas it is ineffective
against random variability components (e.g., random dopant
Fig. 2. Test circuit to evaluate delay variability for the traditional Domino.
fluctuations) or variations that affect NMOS and PMOS tran-
sistors in different ways. The scheme proposed in [28] compen-
sates only fully correlated variations through a keeper with digi- of determined by the PDN in Fig. 1, thereby increasing
tally programmable strength, which suffers from heavy loading the gate delay [3]. This is the well-known trade-off between
of the dynamic node and its large overhead is justified only if noise immunity and speed in Domino logic, which basically
this approach is adopted in large structures like register files. depends on the ratio between the saturation current of
All the above approaches are based on the simple principle the keeper and that of the PDN. To avoid an excessive delay
that delay variations can be reduced by appropriately tuning the increase, must be kept lower than unity, and typically
keeper strength. On the other hand, in this paper a completely ranges from 0.1 to 0.5 [29] (i.e., the keeper saturation current is
different principle is exploited to reduce delay variations in from 10% to 50% of the on current of the PDN).
Domino circuits. Rather, this work aims to reduce the delay The output inverter must be high skewed to minimize the
sensitivity to process variations by reducing the loop gain, as delay, typically with the PMOS transistor sized four times wider
justified by the above considerations on the positive feedback than the NMOS [30]. On the other hand, the size of the eval-
loop implemented by the keeper [24]. In particular, a simple uation transistor is chosen according to the trade-off between
keeper topology that replaces the standard PMOS keeper is the clock load specification (i.e., a size smaller than PDN tran-
adopted to reduce delay variations while keeping the same sistors) and the speed requirement, and is reasonably sized 1.5
silicon area, delay and noise margin. Also, in the proposed times larger than the equivalent width of the PDN [30].
technique, the noise margin is constantly kept to the targeted
B. Analysis of the Impact of Intradie Variations and Discussion
value without any temporary degradation, in contrast to [14],
on the Role of the Positive Feedback Loop
[25], [26]. Interestingly, thanks to the different principle and
goal, the proposed technique can be mixed with any of the Let us analyze the impact of the intradie variations on the
alternative techniques in [14], [25][28] by simply replacing speed of Domino gates with different fan-in (i.e., an inverter, a
the PMOS keeper with the proposed structure. Hence, the NAND, and a NOR with fan-in equal to [2], [3], [4] and [2],
proposed technique can actually be complementary to such [3], [4], [8], [16], [32], respectively) designed by using a 65-nm
techniques, rather than being purely a competitor. CMOS technology. Moreover, some results by using a 90-nm
The paper is organized as follows. In Section II, a detailed technology are reported in Appendix I to enable comparison
analysis of delay variation in a Domino logic gate is carried between different technologies.
out. The proposed keeper topology to reduce delay variations To correctly evaluate the delay variations, an appropriate sim-
is discussed in Section III along with design guidelines. Monte ulation setup is adopted. In particular, the circuit shown in Fig. 2
Carlo simulations in 65-nm and 90-nm CMOS technology are is used, where the circuit under test is driven by a realistic input
reported in Section IV to validate the proposed approach. Con- waveform generated by an equal gate, although the latter is not
clusions of the work are summarized in Section V. Two appen- subject to any process variation (so that the input rise time of
dices are added to improve the readability of the paper. the circuit under test is not subject to significant variations).
The circuit under test is loaded by a capacitance that is set
to some multiple of the input capacitance of a min-
II. ANALYSIS OF DELAY VARIATIONS IN DOMINO LOGIC imum-sized inverter ( quantifies the load indepen-
dently of technology). In particular, results equal to
A. Assumptions and Constraints on the Transistor Sizes 410 aF in 65-nm technology.
The generic Domino logic gate in Fig. 1 consists of an NMOS The results, obtained with 2000 runs1 of Monte Carlo simu-
pull-down network (PDN) inserted between the precharge tran- lations in 65-nm CMOS are summarized in Tables IIII, which
sistor and the evaluation transistor . The dynamic respectively report the mean value , standard deviation
node at the output of the PDN drives the static output inverter. and the resulting variability of the delay. In these tables,
At the beginning of the evaluation phase (i.e., when goes the load capacitance is set to and , the
high), node is kept at the voltage by the keeper tran- equivalent aspect ratio of the PDN is set to the realistic value
sistor . When a transient noise is observed at node , the /60 nm, and is minimum sized. Also,
keeper rejects it by bringing the voltage of back to [3]. the keeper is sized to achieve a of 0.1 (a wider
With regard to the size of the keeper, the adoption of a range for will be considered in Section IV), and the
stronger keeper obviously permits a better noise immunity. On 1This number of runs ensures a lower than 4% inaccuracy in the standard
the other hand, a stronger keeper also opposes to transitions deviation estimate.
2294 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSI: REGULAR PAPERS, VOL. 59, NO. 10, OCTOBER 2012

TABLE I TABLE III


DELAY MEAN VALUE IN DOMINO LOGIC WITH/WITHOUT STANDARD KEEPER DELAY MEAN VALUE, STANDARD DEVIATION AND VARIABILITY IN
AND KEEPER IN FIG. 4 (65-nm TECHNOLOGY, IN DOMINO GATES WITH STANDARD KEEPER IN FIG. 1 (65-nm TECHNOLOGY,
BRACKETS: PERCENTAGE INCREASE W.R.T. THE CASE WITHOUT KEEPER) )

decrease when increasing the fan-in, as shown in Table III. Fur-


thermore, the delay variability decreases when increasing ,
due to the increase in the mean value (the standard deviation
increases only slightly). As expected, the delay variability also
increases when considering a more advanced technology, as is
confirmed by the comparison with results in 90-nm technology
summarized in Appendix I.
As clarified by the authors in [24], the above delay varia-
tions are due to two main factors: one is the process variability
TABLE II within the PDN in Fig. 1, the other is related to the presence
DELAY STANDARD DEVIATION IN DOMINO LOGIC WITH/WITHOUT STANDARD of the positive feedback loop implemented by the keeper and
KEEPER AND KEEPER IN FIG. 4 (65-nm TECHNOLOGY, IN the inverter gate. In particular, the feedback loop in Fig. 1 is
BRACKETS: PERCENTAGE INCREASE W.R.T. THE CASE WITHOUT KEEPER)
responsible for an increase in the delay variations even when
transistors in the loop (i.e., keeper and inverter transistors) are
not subject to variations. To quantitatively evaluate the impact
of the feedback loop on the delay variations, let us analyze the
results in Tables I, II, where the mean value and the standard
deviation of the delay are reported for the above considered
logic gates with and without the keeper (i.e., with/without the
feedback loop). In these tables, data in bold font and brackets
refer to the percentage increase with respect to the case without
keeper. From Table I, the keeper insertion determines a mod-
erate increase in the mean value of the gate delay, compared
to the case without keeper, due to the current contention with
the pull-down network. On the other hand, from Table II, the
keeper insertion determines a strong increase in the delay stan-
dard deviation , compared to the case without keeper.2
From the above considerations, it is clear that the impact
of the feedback loop on the delay variability is significant
in sub-100 nm technologies, and also tends to increase in
down-scaled technologies. In turn, this means that the speed
degradation of Domino due to delay variations tends to grow
faster than static CMOS logic (whose delay variations are basi-
cally the same as Domino logic variations due to the only PDN,
without the effect of keeper [24]). Hence, the feedback loop in
Domino gates tends to reduce their speed improvement over
high-skewed static inverter is sized by using the logical effort static logic, thereby making Domino logic less advantageous in
approach for each load condition. down-scaled technologies.
From inspection of Tables I, II, the mean value tends to in- 2In Table III, the reduced variability in logic gates with higher fan-in is easily
crease faster than the standard deviation when increasing the understood by considering that a larger number of stacked transistors permits to
fan-in at a given load. Thus, the delay variability tends to average out the transistor variations [18], [21].
PALUMBO et al.: A SIMPLE CIRCUIT APPROACH TO REDUCE DELAY VARIATIONS IN DOMINO LOGIC GATES 2295

above. Equivalently, becomes rather sensitive to varia-


tions when , for a given change in the input voltage. In
turn, the current determines the voltage drop at the dy-
namic node and hence the delay at the same node. Hence, when
the delay becomes rather sensitive to process variations.
On the other hand, when as in the case of Domino logic
without keeper, the delay variability is reduced and becomes
equal to that of static CMOS gates.
Summarizing, the Domino delay variability increase due to
the presence of the keeper (i.e., feedback loop) can be lowered
by reducing the loop gain. In the following section, a circuit ap-
proach to reduce the loop gain without impacting other param-
eters of interest is discussed.

III. A SIMPLE CIRCUIT APPROACH TO REDUCE DELAY


Fig. 3. Simplified schematic of a Domino logic gate. VARIATIONS IN DOMINO LOGIC GATES
From (1), the magnitude of the loop gain can be reduced
C. Circuit Analysis of the Positive Feedback Loop by reducing either the voltage gain of the inverter , or the
impedance or the keeper transconductance .
Let us analyze the small-signal loop gain of the circuit in In the following, we discuss an approach that aims at reducing
Fig. 1, whose simplified scheme is reported in Fig. 3. The PDN the latter.
in Fig. 3 (including the evaluation transistor) is represented with
an independent current source , whereas the keeper is A. Keeper Topology to Reduce the Impact of Process
represented as a dependent current source , which is Variations
controlled by the output small-signal voltage . In turn, the
small-signal component of current can be written A simple approach to modify the keeper and reduce its small-
as , being signal transconductance while keeping the same
the keeper transconductance. The resulting small-signal voltage strength is shown in Fig. 4. As no change is made in the strength,
at the dynamic node can be written as , being the circuit in Fig. 4 does not have any penalty in terms of noise
the impedance at the dynamic node (which is determined margin and speed, compared to the standard keeper in Fig. 1. In
by transistor parasitics), and the current detail, the improved keeper is realized by splitting the original
flowing through . For the case of interest, the transistor keeper transistor into two transistors and
voltage at node decreases from the precharged value , , where only is driven by the inverter output. In-
hence . stead, transistor works as an equivalent resistance
From inspection of Fig. 1, the feedback associated with the
keeper transistor has a noninverting loop gain given by
(2)
(1)
that represents a source degeneration for the transistor .
where is the voltage gain of the inverter, which is obvi- Hence, naming the transconductance of , the
ously lower than one3 at the beginning of the evaluation phase effective transconductance of the keeper is immediately found
(i.e., when the voltage at node is ). At the same time, to be
the keeper transconductance at the beginning of the evaluation
phase is small as well, as the keeper operates in the linear region. (3)
Due to the low value of and of the gain
, the loop gain is positive but certainly lower than one at the
beginning of the evaluation phase. From basic circuit theory, a Apparently, compared to the standard keeper topology in Fig. 1
feedback circuit with a loop gain has a closed-loop gain sen- without , and hence the loop gain is reduced by a
sitivity to variations equal to [31], [32]. Hence, the factor of , as desired.
lower-than-one and positive feedback loop gain in Domino
gates increases the gain sensitivity to variations when , B. Transistor Sizing Strategy
compared to the case without keeper (i.e., in open loop con- From a design perspective, the above technique makes sense
figuration). In particular, let us consider the closed-loop gain only if the loop gain reduction is obtained by keeping the same
between the input voltage and the current , which be- noise margin as the gate with the original keeper, since it is a crit-
comes very sensitive to variations when , as discussed ical specification in Domino logic gates. To this aim, we have to
3 becomes rather high only around bias points where the voltage of size and so that they provide the same DC current
node are close to the inverter logic threshold. as the original keeper (i.e., they have to set an equal
2296 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSI: REGULAR PAPERS, VOL. 59, NO. 10, OCTOBER 2012

TABLE IV
DELAY VARIABILITY IN DOMINO GATES WITH KEEPER IN FIG. 4
(65-nm TECHNOLOGY, )

Fig. 4. Keeper topology to reduce delay variations.


First, we considered Domino logic gates with
assigned to 0.1, which is in the low side of the typical
ratio). Since these transistors are in series, this condition is met
range and leads to a noise margin ranging from 300 to 350
by setting the aspect ratio of and according to
mV for all logic gates and the standard keeper in Fig. 1.
As an example, a noise margin of 340 mV for the inverter
(4) and 300 mV for the NAND3 gate were obtained with
the appropriate sizes of the keeper in Fig. 4 that leads to
Since the keeper usually has a rather small driving capability, its (i.e., ,
transistor width is customarily set to the minimum value for the keeper in Fig. 4,
allowed by the technology, whereas its length is and for the standard keeper
greater than minimum. Thus, in the gate with modified keeper, in Fig. 1). As expected from the considerations in Section III,
transistors and are sized with the nominal noise margin was confirmed to be consistently the
and . same for both types of keeper with a difference of just a few
Now, we still have another degree of freedom in the percentage points (the small difference is due to the approxi-
and sizes that can be used to keep the loop gain in mations that are customarily introduced in the transistor sizes).
(1) as low as possible, or equivalently to keep Interestingly, the keeper in Fig. 4 was also found to keep the
as high as possible from (3). Since is propor- same noise margin variability as the traditional keeper, for the
tional to , we have to set4 and reasons discussed in Appendix II. This means that the proposed
. This sizing criterion combined with (4) keeper does not introduce any penalty in terms of noise margin,
permits to reduce the loop gain while maintaining the same even when variations are accounted for. Also, this ensures that
ratio, thereby avoiding any penalty in terms of noise all comparisons presented in the following are fairly performed
margin and performance. at same noise margin, even in the presence of variations.
It is worth noting that the considered keeper is very similar Monte Carlo simulations with 2000 samples were carried
to the keeper based on a single transistor, and its layout is very out to evaluate the mean value and the standard deviation of
simple, since it is the series of two transistors with minimum the delay for the proposed keeper, whose numerical values are
width. As the overall length is the same as the reported in Tables III. The variability resulting from these
channel length of the original keeper, the increment in the simulations is reported in Table IV. Monte Carlo simulations
silicon area is negligible. accounted for the variation in the threshold voltage, mobility,
gate oxide thickness, source/drain resistance and channel di-
mensions.
IV. DESIGN CONSIDERATIONS, SIMULATIONS, AND VALIDATION From the simulation results in Table II, the mean value of the
To assess the reduction in the delay variations offered by the delay is basically the same for both the standard keeper in
above discussed keeper topology, Domino logic gates were sim- Fig. 1 and the proposed keeper in Fig. 4 (again, the small dif-
ulated according to the test circuit in Fig. 2 and under the same ference is due to the usual approximations in transistor sizes).
conditions as in Section II for the 65-nm technology (results on This means that the proposed keeper does not introduce any sig-
the most critical gates for the 90-nm CMOS technologies are re- nificant speed penalty at nominal conditions, as expected. The
ported in Table VII of Appendix I). The same sizes were kept for same observation was confirmed to hold for the noise margin
all transistors except for the keeper, which was sized according variability, which was found to be the same for both keepers, as
to the design strategy in Section III. expected from the considerations in Appendix II.
In regard to the delay variability, the delay standard devi-
4This clearly holds in practical cases where . In the ation and variability with the proposed keeper (see Table IV)
uncommon case where , it is easily found that we
must set , and are consistently lower than those with the standard keeper (see
in order to satisfy (4) and maximize . Table III) for all logic gates, as expected. This is especially true
PALUMBO et al.: A SIMPLE CIRCUIT APPROACH TO REDUCE DELAY VARIATIONS IN DOMINO LOGIC GATES 2297

Fig. 5. Percentage increase of delay standard deviation of standard/proposed


keeper w.r.t. case with no keeper vs. ( , 65-nm tech-
nology): (a) inverter and NAND; (b) NOR. Fig. 6. Percentage increase of delay standard deviation of standard/proposed
keeper w.r.t. case with no keeper vs. ( , 65-nm
technology): (a) inverter and NAND; (b) NOR.
in low fan-in gates and in the case of low capacitive load. This is
clear from the data in brackets in Table II, which shows that the
delay variability increase due to the insertion of the proposed gates when a stronger keeper (i.e., higher ) is adopted,
keeper is almost half that obtained with the standard keeper regardless of the adopted keeper. This is because a stronger
in Fig. 1, compared to the case without keeper. This means keeper has a greater small-signal transconductance, hence the
that the keeper topology in Fig. 4 under the sizing strategy in loop gain increases and hence delay variations are more pro-
Section III-B is effective in reducing the delay variability in- nounced (as clarified in Section III). Again, the proposed keeper
crease associated with the feedback loop. Similar results are re- offers a considerable delay variability reduction over the stan-
ported in Appendix I under a 90-nm technology. dard keeper, regardless of the value of . More quantita-
Although the above results were obtained assuming tively, the delay variability increase due to the insertion of the
, they hold also for different values of . proposed keeper was found again to be typically 50% lower than
To show the influence of a different (i.e., keeper that obtained with the standard keeper in Fig. 1. This confirms
strenght), the same Domino logic gates were also designed that the considerable advantages of the proposed keeper are kept
and simulated with widely ranging from 0.1 to 0.5. In regardless of the adopted value of .
regard to the mean value of the delay, a higher value of The improvement in delay standard deviation that is enabled
leads to a higher delay because of the increased current con- by the proposed keeper is clearly maintained in practical circuits
tention with the pull-down network. For example, all the gates built with cascaded gates. Indeed, the delay standard deviation
loaded with were found to be slowed of an -stage path approximately scales proportionally to
down by 1 ps (4 ps) and 2 ps (8 ps) for and [4], [21]. As an example, Fig. 7 shows the percentage standard
, compared to the case with . As deviation increase with respect to the case with no keeper in
already observed for , simulations confirmed a sample 6-stage path. The results for both the standard and
that the proposed keeper does not introduce any speed and the proposed keeper are shown for the output of each stage. As
noise margin penalty at nominal conditions, regardless of the expected, the proposed keeper significantly reduces (by 35%
adopted . on average) the standard deviation increase due to the keeper
In regard to the delay standard deviation, its percentage in- insertion.
crease due to the insertion of the keeper with respect to the
case without keeper is plotted in Fig. 5 (Fig. 6) versus V. CONCLUSIONS
for . As a first comment As was recently shown by the same authors, Domino logic
on Figs. 56, the delay variation always increases for all logic gates suffer from significant increase higher delay variability,
2298 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSI: REGULAR PAPERS, VOL. 59, NO. 10, OCTOBER 2012

TABLE V
DELAY MEAN VALUE, STANDARD DEVIATION, AND VARIABILITY IN
DOMINO GATES WITH STANDARD KEEPER IN FIG. 1 (90-nm TECHNOLOGY,
)

TABLE VI
DELAY MEAN VALUE, STANDARD DEVIATION, AND VARIABILITY IN DOMINO
GATES WITH KEEPER IN FIG. 4 (90-nm TECHNOLOGY, )

Fig. 7. Percentage increase of delay standard deviation of standard/proposed


keeper w.r.t. case with no keeper at the output of the various stages in the de-
picted circuit.

compared to static CMOS logic [24]. This is due to the pres-


ence of the keeper transistor and the associated positive feed-
back loop. This delay variability increase becomes more critical
moving toward more advanced technology generations.
In this paper, the loop gain was found to be the critical pa- TABLE VII
rameter that is responsible for this delay variability degrada- DELAY STANDARD DEVIATION IN DOMINO LOGIC WITH/WITHOUT
STANDARD KEEPER IN FIG. 1 AND KEEPER IN FIG. 4 (90-nm TECHNOLOGY,
tion. A keeper topology to reduce the loop gain was introduced )
along with a detailed transistor sizing strategy. The latter per-
mits to minimize the loop gain (and hence delay variations)
while keeping the same keeper strength, and hence the same
noise margin and delay. The resulting area overhead is also in-
significant.
Extensive Monte Carlo simulations in commercial 65-nm and
90-nm CMOS showed that the proposed keeper topology halves
the delay variability increase associated with the keeper inser-
tion. Since Domino logic without keeper essentially experiences
the same delay variability as static CMOS logic, this also means
that the proposed keeper reduces by 50% the delay variability
degradation of Domino logic with respect to static CMOS logic
(i.e., it partially fills the variability gap between the two logic
styles). At the same time, the proposed keeper guarantees the the same simulation setup previously described for the 65-nm
same nominal noise margin and its variability as well, hence the technology.
reduced delay variability is not obtained by penalizing the noise The equivalent aspect ratio of the PDN for all gates is set to
margin. Also, the proposed keeper keeps the same dynamic be- the realistic value , and again the abso-
havior (delay mean value) and silicon area as the Domino logic lute sizes for the NAND2 and NAND3 are multiplied by 2 and 3,
with standard keeper. Due to the increasing importance of vari- compared to the inverter, is minimum sized and the keeper
ability at each new technology generation, the proposed keeper is sized to achieve a of 0.1. Moreover, the high-
is a useful tool to keep variations under control at no penalty in skew static inverter is sized with the NMOS equal to 0.5 /0.1
terms of nominal performance, area, and noise margin. and 2.8 /0.1 for the and load,
respectively. The noise margin for Domino logic with 90-nm
APPENDIX I technology is around 350 mV for the inverter and linearly re-
RESULTS ON A 90-nm TECHNOLOGY duces to 310 mV for the NAND3 gate, which is very close to
In this appendix, some results are reported for the most crit- the design examples that we presented above in 65-nm CMOS.
ical gates realized with a 90-nm technology, for which the input The transistor sizes of the proposed keeper to achieve an equiv-
capacitance of a minimum-sized inverter was found alent aspect ratio of were found
to be 520 aF. Table V and Table VI summarize the mean , to be and
standard deviation and the variability of the delay . Comparison on standard deviation delay be-
obtained with 2000 runs of Monte Carlo simulations for a load tween traditional Domino gates and the proposed topology are
capacitance equal to and , and by using summarized in Table VII.
PALUMBO et al.: A SIMPLE CIRCUIT APPROACH TO REDUCE DELAY VARIATIONS IN DOMINO LOGIC GATES 2299

TABLE VIII [6] M. Alioto and G. Palumbo, Impact of supply voltage variations on full
STRENGTH VARIABILITY IN PROPOSED AND TRADITIONAL KEEPER IN A NOR3 adder delay: Analysis and comparison, IEEE Trans. Very Large Scale
GATE (65-nm TECHNOLOGY) Integr. (VLSI) Syst., vol. 14, no. 12, pp. 13221335, Dec. 2006.
[7] K. Bernstein, D. Frank, A. Gattiker, W. Haensch, B. Ji, S. Nassif, E.
Nowak, D. Pearson, and N. Rohrer, High-performance CMOS vari-
ability in the 65-nm regime and beyond, IBM J. Res. Develop., vol.
50, no. 4.5, pp. 433449, Jul. 2006.
[8] S. Bhunia, S. Mukhopadhyay, and K. Roy, Process variations and
process-tolerant design, in Proc. Int. Conf. VLSI Design, Jan. 2007,
pp. 699704.
[9] S. Ghosh and K. Roy, Parameter variation tolerance and error re-
siliency: New design paradigm for the nanoscale era, Proc. IEEE, vol.
98, no. 10, pp. 17181751, Oct. 2010.
[10] R. Sung and D. Elliott, Clock-logic Domino circuits for high-speed
APPENDIX II and energy-efficient microprocessor pipelines, IEEE Trans. Circuits
Syst. II, Exp. Briefs, vol. 54, no. 5, pp. 460464, May 2007.
NOISE MARGIN VARIABILITY FOR THE PROPOSED KEEPER [11] F. Frustaci, P. Corsonello, S. Perri, and G. Cocorullo, High-perfor-
mance noise-tolerant circuit techniques for CMOS dynamic logic, IET
Monte Carlo simulations revealed that the noise margin under Circuits Devices Syst., vol. 2, no. 6, pp. 537548, Jun. 2008.
the proposed keeper also has the same variability as the tra- [12] W. Hwang, R. V. Jooshu, and W. H. Henkels, A 500 MHz, 32-word
ditional one, as well as the same nominal value. As the noise 64-bit, eight-port self resetting CMOS register file, IEEE J. Solid-
State Circuits, vol. 34, no. 1, pp. 5667, Jan. 1999.
margin is set by the keeper strength in a given logic gate, this [13] R. K. Krishnamurthy, A. Alvandpour, G. Balamurugan, N. Shanbhag,
means that the proposed and the traditional keeper have the K. Soumyanath, and S. Borkar, A 130 nm 6 Ghz 256 32 bit leakage-
same strength variability (other than the same nominal value, tolerant register file, IEEE J. Solid-State Circuits, vol. 37, no. 5, pp.
624632, May 2002.
by design). This was confirmed by extensive Monte Carlo sim- [14] A. Alvandpour, R. K. Krishnamurthy, K. Soumyanath, and S. Y.
ulations. As an example, Table VIII reports the variability of the Borkar, A sub-130-nm conditional keeper technique, IEEE J.
keeper strength in a NOR3 gate with nominal equal to Solid-State Circuits, vol. 37, no. 5, pp. 633638, May 2002.
[15] M. Alioto and G. Palumbo, Analysis and comparison on full adder
0.1 and 0.5. In this table, the keeper strength has been evaluated block in sub-micron technology, IEEE Trans. Very Large Scale Integr.
as the on-current of the keeper when with source-gate voltage (VLSI) Syst., vol. 10, no. 6, pp. 806823, Dec. 2002.
equal to (i.e., in Figs. 1 and 4) and source-drain [16] M. Alioto, E. Consoli, and G. Palumbo, General strategies to design
nanometer flip-flops in the energy-delay space, IEEE Trans. Circuits
voltage equal to 50 mV (i.e., ). Appar- Syst. I, Reg. Papers, vol. 57, no. 7, pp. 15831596, Jul. 2010.
ently, the strength variability of the proposed keeper matches [17] J.-J. Liou, A. Krstic, Y.-M. Jiang, and K.-T. Cheng, Modeling, testing,
very well that of the traditional keeper. The same results were and analysis for delay defects and noise effects in deep submicron de-
vices, IEEE Trans. Comput.-Aided Design Integr. Circuits Syst., vol.
found to hold for all logic gates in all conditions and for dif- 22, no. 22, pp. 756769, Jun. 2003.
ferent nominal values of . [18] A. Srivastava, D. Sylvester, and D. Blaauw, Statistical Analysis and
The fact that the proposed keeper has the same strength (and Optimization for VLSI: Timing and Power. New York: Springer,
2005.
hence noise margin) variability as the traditional one can be [19] Y. Cao and L. T. Clark, Mapping statistical process variations toward
intuitively grasped through the following circuit considerations. circuit performance variability: An analytical modeling approach,
On one hand, the presence of the minimum-sized transistor IEEE Trans. Comput.-Aided Design Integr. Circuits Syst., vol. 26, no.
10, pp. 18661873, Oct. 2007.
in Fig. 4 tends to increase the keeper strength variability [20] V. Gupta and G. Rincn-Mora, Achieving less than 2% mis-
because of its small size. On the other hand, this larger vari- match with minimum channel-length CMOS devices, IEEE Trans.
ability of is compensated by the lower variability of the Circuits Syst. II, Exp. Briefs, vol. 54, no. 3, pp. 232236, Mar. 2007.
[21] M. Orshansky, S. Nassif, and D. Boning, Design for Manufacturability
stacked transistor , since the latter is larger and also tends and Statistical Design. New York: Springer, 2008.
to dominate the overall resistance (since it has ). [22] H. Nho, S. Yoon, S. Wong, and S. Jung, Numerical estimation of yield
Apparently, the two effects tend to compensate each other, in sub-100-nm SRAM design using Monte Carlo simulation, IEEE
Trans. Circuits Syst. II, Exp. Briefs, vol. 56, no. 4, pp. 295299, Apr.
thereby justifying why the noise margin variability under the 2009.
proposed keeper was found to be consistently the same as the [23] V. Agarwal, J. Sun, and J. Wang, Delay uncertainty reduction by gate
traditional one in all conditions. splitting, IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 55, no. 9, pp.
907911, Sep. 2008.
[24] M. Alioto, G. Palumbo, and M. Pennisi, Understanding the effect of
REFERENCES variations on the delay of static and domino logic, IEEE Trans. Very
[1] J. A. Croon, W. Sansen, and H. E. Maes, Matching Properties of Deep Large Scale Integr. (VLSI) Syst., vol. 18, no. 5, pp. 697710, May 2010.
Sub-Micron MOS Transistors. New York: Springer, 2005. [25] M. Anis, M. Allam, and M. Elmasry, Energy-efficient noise-tolerant
[2] A. Srivastava, D. Sylvester, and D. Blaauw, Statistical Analysis and dynamic styles for scaled-down CMOS and MTCMOS technologies,
Optimization for VLSI: Timing and Power. New York: Springer, IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 10, no. 2, pp.
2005. 7178, Apr. 2002.
[3] K. Bernstein et al., High Speed CMOS Design Styles. Boston, MA: [26] V. Kursun and E. Friedman, Domino logic with variable threshold
Kluwer Academic, 1999. voltage keeper, IEEE Trans. Very Large Scale Integr. (VLSI) Syst.,
[4] M. Eisele, J. Berthold, D. Schmitt-Landsiedel, and R. Mahnkopf, The vol. 11, no. 6, pp. 10801093, Dec. 2003.
impact of intra-die device parameter variations on path delays and on [27] H. Dadgour and K. Banerjee, A novel variation-tolerant keeper ar-
the design for yield of low voltage digital circuits, IEEE Trans. Very chitecture for high-performance low-power wide fan-in dynamic OR
Large Scale Integr. (VLSI) Syst., vol. 5, no. 4, pp. 360368, Dec. 1997. gates, IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 18, no.
[5] J.-J. Liou, A. Krstic, Y.-M. Jiang, and K.-T. Cheng, Modeling, testing, 11, pp. 15671577, Nov. 2010.
and analysis for delay defects and noise effects in deep submicron de- [28] C. Kim, S. Hsu, R. Krishnamurthy, S. Borkar, and K. Roy, Self cal-
vices, IEEE Trans. Comput.-Aided Design Integr. Circuits Syst., vol. ibrating circuit design for variation tolerant VLSI systems, in Proc.
22, no. 22, pp. 756769, Jun. 2003. On-Line Testing Symp., 2005, pp. 100105.
2300 IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSI: REGULAR PAPERS, VOL. 59, NO. 10, OCTOBER 2012

[29] A. Kabbani and A. J. Al-Khalili, A technique for dynamic CMOS Melita Pennisi (M07) was born in Catania, Italy, in
noise immunity evaluation, IEEE Trans. Circuits Syst. I, Fundam. 1980. She received the Laurea degree in electronics
Theory Appl., vol. 50, no. 1, pp. 7488, Jan. 2003. engineering and the Ph.D. degree in electronics
[30] I. Sutherland, B. Sproull, and D. Harris, Logical Effort. Designing Fast and automatic engineering from the University of
CMOS Circuits. San Mateo, CA: Morgan Kaufmann, 1999. Catania, in 2004 and 2008, respectively.
[31] J. Millman and A. Grabel, Microelectronics (Second Edition). New Since 2008, she has worked with the DIEES
York: McGraw-Hill, 1987. (Dipartimento di Ingegneria Elettrica Elettronica
[32] G. Palumbo and S. Pennisi, Feedback Ampilfiers: Theory and De- e dei Sistemi) of the University of Catania as a
sign. Boston, MA: Kluwer Academic, 2002. Researcher. She is coauthor of more than 15 pub-
lications on international journals and conference
proceedings. Her primary research interests include
the modeling and the optimized design of CMOS high-performance, analysis
of analog nonlinear circuits, behavioral modeling of complex mixed-signal
circuits, and design/modeling for variability-tolerant and low-leakage VLSI
circuits.

Massimo Alioto (M01SM07) was born in


Brescia, Italy, in 1972. He received the Laurea
degree in electronics engineering and the Ph.D.
degree in electrical engineering from the University
Gaetano Palumbo (M91SM98F07) was born in of Catania, Italy, in 1997 and 2001, respectively.
Catania, Italy, in 1964. He received the Laurea de- In 2002, he joined the Department of Information
gree in electrical engineering and a Ph.D. degree from Engineering of the University of Siena as a Research
the University of Catania, Italy, in 1988 and 1993, re- Associate and in the same year as an Assistant
spectively. Professor. In 2005 he was appointed Associate Pro-
Since 1993 he has conducted courses on electronic fessor of Electronics. In the summer of 2007, he was
devices, electronics for digital systems, and basic a Visiting Professor at EPFL-Lausanne, Switzerland.
electronics. In 1994 he joined the DEES (Dipar- In 20092011, he held a Visiting Professor position at BWRC-UCBerkeley,
timento Elettrico Elettronico e Sistemistico), now investigating on next-generation ultra-low power circuits and wireless nodes.
DIEES (Dipartimento di Ingegneria Elettrica Elet- In 20112012, he is also Visiting Professor at University of Michigan, in-
tronica e dei Sistemi), at the University of Catania vestigating on active techniques for resiliency in near-threshold processors,
as a Researcher, subsequently becoming Associate Professor in 1998. Since error-aware VLSI design for wide energy scalability, ultra-low power circuits,
2000 he has been a full Professor in the same department. His primary research and energy scavenging. He has authored or coauthored 170 publications on
interest has been analog circuits with particular emphasis on feedback circuits, journals (60, mostly IEEE Transactions) and conference proceedings. Two of
compensation techniques, current-mode approach, low-voltage circuits. Then, them are among the most downloaded TVLSI papers in 2007 (respectively
his research has also embraced digital circuits with emphasis on bipolar and 10th and 13th). He is coauthor of the book Model and Design of Bipolar and
MOS current-mode digital circuits, adiabatic circuits, and high-performance MOS Current-Mode Logic: CML, ECL and SCL Digital Circuits (Springer,
building blocks focused on achieving optimum speed within the constraint of 2005). His primary research interests include ultra-low power VLSI circuits
low power operation. In all these fields he is developing some the research and wireless nodes, near-threshold circuits for green computing, error-aware
activities in collaboration with STMicroelectronics of Catania. He is the coau- and widely energy-scalable VLSI circuits, circuit techniques for emerging
thor of three books, CMOS Current Amplifiers, Feedback Amplifiers: Theory technologies. He is the director of the Electronics Lab at University of Siena
and Design, and Model and Design of Bipolar and MOS Current-Mode Logic (site of Arezzo).
(CML, ECL and SCL Digital Circuits), all by Kluwer Academic Publishers, Prof. Alioto is a member of the HiPEAC Network of Excellence. He is
in 1999, 2001, and 2005, respectively, and a textbook on electronic devices in the Chair of the VLSI Systems and Applications Technical Committee of
2005. He is the author of about 380 scientific papers on referred international the IEEE Circuits and Systems Society, for which he was also Distinguished
journals (more than 150) and in conferences. Moreover, he is coauthor of Lecturer in 20092010 and member of the DLP Coordinating Committee in
several patents. 20112012. He serves or has served as a Track Chair in a number of confer-
Dr. Palumbo he served as an Associate Editor of the IEEE TRANSACTIONS ences (ISCAS, ICCD, ICECS, APCCAD, ICM). He was Technical Program
ON CIRCUITS AND SYSTEMS PART I from June 1999 to the end of 2001 and Chair of the ICM 2010 and NEWCAS 2012 conferences. He serves as Associate
from 2004 to 2005 for the topic Analog Circuits and Filters and Digital Cir- Editor of the IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION
cuits and Systems, respectively. From 2006 to 2007 he served as an Associate (VLSI) SYSTEMS, the ACM Transactions on Design Automation of Electronic
Editor of the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS PART II. From Systems, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSPART I, the
2008 to 2011 he served as an Associate Editor of the IEEE TRANSACTIONS ON Microelectronics Journal, IntegrationThe VLSI Journal, the Journal of
CIRCUITS AND SYSTEMS PART I. In 2005 he was one of the 12 panelists in the Circuits, Systems, and Computers, the Journal of Low Power Electronics, as
scientific-disciplinare area 09industrial and information engineering of the well as the Journal of Low Power Electronics and Applications. He is or was
CIVR (Committee for Evaluation of Italian Research), which has the aim to also Guest Editor of the Special Issue Ultra-low voltage circuits and systems
evaluate the Italian research in the above area for the period 20012003. In 2003 of the IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMSPART II, as well as
he received the Darlington Award. Since 2011 he is a member of the Board of of the Special Issue Advances in oscillator analysis and design of the Journal
Governors of the IEEE CAS Society. of Circuits, Systems, and Computers.

Вам также может понравиться