Вы находитесь на странице: 1из 7

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/3452301

XNOR-based double-edge-triggered flip-flop for two-phase pipelines

Article  in  Circuits and Systems II: Express Briefs, IEEE Transactions on · March 2006
DOI: 10.1109/TCSII.2005.855734 · Source: IEEE Xplore

CITATIONS READS

16 354

4 authors, including:

Shing Tenqchen Wu-Shiung Feng


Telecommunication Laboratories, Chunghwa Telecom Co., Ltd. Chang Gung University
47 PUBLICATIONS   70 CITATIONS    191 PUBLICATIONS   1,372 CITATIONS   

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Intelligent Transportation Control View project

All content following this page was uploaded by Shing Tenqchen on 09 January 2014.

The user has requested enhancement of the downloaded file.


An XNOR-based Double-edge-triggered Flip-Flop for Two-phase Pipelines
Ying-Haw SHU, Shing TENQCHEN, Ming-Chang SUN
Department of Electrical Engineering, National Taiwan University
Taipei, Taiwan, R.O.C.
And
Wu-Shiung FENG
Department of Electronics Engineering, Chang Gung University
Taoyuan, Taiwan, R.O.C.

ABSTRACT asynchronous modules may have their own dedicated power


systems and different operating frequencies to reduce excess
Asynchronous modules operating under two-phase micro- power consumption. Furthermore, a wrapped asynchronous
pipeline methodology are faster than those in common four-phase module with its own built-in power system and operating clock
control schemes because the double-edge-triggered flip-flops in will make IP-reuse easier and more reliable [3].
the two-phase control systems capture data twice within each
clock cycle. This paper proposes a newly designed double-edge- Pipelined control systems divide tasks into steps to be
triggered flip-flop based on an alternative XNOR gate. By performed by sequences of functional units, similar to an
utilizing the sensitivity to the driving capacity of the previous assembly line in a factory. The pipelined system allows all the
stage, we use this simplified XNOR gate as a pulse-generator. A separate steps to be processed in parallel. Thus, the pipeline
modified transparent latch following the pulse-generator acts as concept is widely implemented in current VLSI systems to get
an XNOR-based double-edge-triggered flip-flop, which is 42% greater throughputs, and is the basic concept used in constructing
faster and more reliable than a conventional double-edge- entire systems of independent asynchronous modules in GALS
triggered flip-flop under HSPICE simulation. We also show that applications. Since there is no clock to fire the independent
our proposed double-edge-triggered flip-flop is not affected by modules in an asynchronous system, handshaking signals—
input-data logic-levels at various supply voltages. We “request” and “acknowledge”—are used to indicate when data is
implemented the XNOR-based double-edge-triggered flip-flop in ready and when data is captured. The two common asynchronous
a two-phase-pipeline system, and verified it with HSPICE handshaking protocols are shown in Fig. 1.
simulation in the TSMC 0.25um CMOS process.

Keywords: double edge-triggered flip-flop, two-phase, pipeline,


XNOR.

INSTRUCTION
It is well known [1-3] that over the past two decades,
researches into asynchronous circuits have revealed the
possibility they can accomplish better average-case performance
than other types by removing the unnecessary overhead of the
globally synchronizing clock signal. The non-synchronizing (a) (b)
property of asynchronous circuits presents new attractions as an
alternative to synchronous circuits for certain applications. Figure 1. Pipelined protocols: (a) two-phase, (b) four-phase.
Circuit designers may investigate asynchronous solutions for
certain local function blocks in which various computing results
reveal large timing divergences, or, for certain applications in The two-phase handshaking protocol uses a transition on
local pipeline systems where timing considerations are also the request signal to signify “data ready to send”, as shown in
severely restricted. The Intel research group experimentally Figure 1(a). The receiver then uses a transition on the
implemented its asynchronous instruction-length decoder design, acknowledge signal to indicate capture completion. “Two-phase”
which outperformed the original synchronous circuits in the refers to rising and falling transitions, thus the two-phase
Pentium II by threefold, and which is famous in CPU protocol is also called “transition signaling”.
development history as RAPPID (Revolving Asynchronous
Pentium Processor Instruction Decoder) [1]. The four-phase handshaking protocol treats only rising
The concept of globally asynchronous locally synchronous transitions on the request signal as “data ready”, and only rising
(GALS) circuit extends the application of asynchronous circuits transitions on the acknowledge signal as “data captured”. Falling
to VLSI systems [2]. Advocators promote implementing an entire control signal transitions in the four-phase protocol are used to
VLSI system with isolated asynchronous modules, which are pre-charge the computing module, or to indicate that the
self-controlled by local internal clocks, and communicate with asynchronous module ready for the next computation. Normally,
other modules via a data-driven controller. Their asynchronous the four-phase protocol is easy to implement in GALS systems or
handshaking protocol not only keeps system stability robust and modular synchronous systems since most latches and decision
free from clock-skew problems, but also brings many potential cells used in asynchronous systems are level-sensitive, and there
advantages. Since the asynchronous modules are isolated by their is an extra period to reset/pre-charge modules. Thus, the implied
own data-driven controllers with various clock timings, wrapped timing concern in the four-phase protocol is not so severe. By

79
contrast, the two-phase protocol may be more efficient since it
eliminates the unnecessary control overhead of waiting for signal
transitions on the control bus, particularly for those asynchronous
modules without large timing variations between common and
rare cases. Of course, the key components in two-phase pipeline
systems are the latches and flip-flops, which capture data on
rising and falling control signal transitions.

This paper is organized as follows. In section II, we present


concept and concerns regarding the topic of implementing a
double edge-triggered flip-flop using an alternative XNOR/XOR Figure 2. Functional diagram of the MOS-style double-edge-
gate. Section III discusses those added asynchronous components triggered flip-flop.
used to implement our simulated asynchronous pipelined system.
We provide some simulation results and explanations in section We chose to use a pulse generator to trigger a transparent
IV. Section V is our brief conclusion. latch twice per control signal cycle. The pulse generator contains
only two kinds of common logic gates, XNOR and inverter. A
XNOR GATE AND DOUBLE-EDGE- functional diagram of the proposed design is shown in Fig. 3.
TRIGGERED D FLIP-FLOP Theoretically, the inverter provides a delayed complementary
control signal, and the XNOR logic provides signal pulses on the
The better performance achieved by two-phase control systems rising and falling transitions of the control signal. There are two
in simple asynchronous modules is due to their two data-captures practical problems with such a design. The first is that the XNOR
per cycle of control-signal transitions. The direct practical gate is constructed from transmission paths or lots of
application is to use a newly defined interface to make a conventional gates, that means it must intrinsically have either
conventional transparent latch active on the rising and falling greater power dissipation or more propagation delays since the
edges of the control signal. That is because conventional signals pass through three logic stages. The other problem is due
transparent latches are level-sensitive, and such transparent to the inverter cell shown in Fig. 3. It is usually constructed of
latches determine their status, opaque (blocking) or transparent many inverters in odd stages, since those inverters must provide
(transmissive), according to the logic state of the control signal. longer delays to have wide enough pulses for the following
Therefore, the AMULET group (merged into the advanced transparent latch to capture data. Therefore, the XNOR may be a
processor technology [APT] group) proposed a two-to-four- direct solution to making conventional level-sensitive latches fit
phase interface to enable level-sensitive latches to work in two- two-phase pipelined controllers, but it is seldom discussed in the
phase control schemes [4]. Of course, another direct approach for past.
two-phase control systems is to design a truly new double-edge-
triggered D flip-flop (DETDFF) [5-8] in which the D flip-flop
(signal latch) used in a two-phase micro-pipeline scheme is
active during low-to-high and high-to-low transitions on the
control wire. One good demonstration depicted by K. Y. Yun is
the pseudo-static DETDFF [7] constructed from a pair of
complementary edge-triggered flip-flops. Each edge-triggered
flip-flop is made up of three NAND or NOR gates in series. Its
capture mechanism stores the logic state of transferred data in the
previous control state, then captures and passes the stored data to
the output through the last NAND gates at the next control signal Figure 3. Functional diagram of the XNOR-based double-edge-
transition. To avoid having the stored data influenced by clock triggered flip-flop.
transitions, a pair of weak inverters maintains the data until the
feedback signal from the new output state arrives. But the The transmission paths, or so-called “capture-pass logics”,
resulting trade-off is slower response times. perform the XNOR logic used in our design [9], as shown in Fig.
4. It works well, with greater speed and lower power dissipation
Another study also based on storing input data uses the than the conventional capture-pass adders described in [10-11].
parasitic capacitance of MOSFETs. Input selection-pairs There are two inherent problems with this proposed XOR/XNOR
controlled by control (clock) signals are constructed from cell. The first problem is its poor driving capacity. Input signals
transmission paths, unlike [7], which used inverters with enable- just pass through transmission paths to the output ports, which
switches. The inverters used in this DETDFF are N-type means they must directly drive the input capacitors of the next
structures with active P-MOSFET (MOS-style) clocked inverters stage via the transmission paths. The Mp3 and Mn3 transistors
[8]; the functional diagram is shown in Fig. 2. There are two provide additional current paths for the XOR port when both
obvious drawbacks in this simpler and faster structure. First, the inputs are logical-high (A=B=’1’), and for the XNOR port when
width-to-length ratios of N-MOSFETs must be much larger than both inputs are logical-low (A=B=’0’). However, this transistor
those of P-MOSFETs in order to have low enough logic level and pair also causes the other problem. The logic-state patch may
larger enough parasitic capacitance to reliably store data. This provide an undesired current path from Vdd to the XNOR, or
implies that the transistor area of the MOS-style DET-FF cannot from the XOR to ground if an internal latching problem occur
be smaller than those of other solutions despite being constructed (XNOR=”1”, XOR=”0”). Thus, the promised better performance
from only 16 transistors. The next is that control signals (clock only happens in small-scale circuits. In our application, we apply
and reverse clock) must be flawlessly matched. Otherwise, the the sensitivity of the driving capacity to validate variances in the
next input data and the slower responding weak inverter may input clock pair. We give the input inverter of the postponed
affect the stored data, and a sudden spike caused by the previous clock a longer channel-length, and that allows this proposed
data may appear in the output. XNOR to generate wide enough pulses at the control signal

80
transitions, as shown in Fig. 5. Actually, the width-to-length ratio also connected the input line to the Mp1 and Mn2 of the selector
of the input inverter determines whether our XNOR based shown in Fig. 6(b) so the Mp1 and Mn2 gate and channel
DETDFF can work normally, and how low its supply voltage can capacitors are pre-charged prior to the control signal because the
be. timing requirement of asynchronous logic is that input data be
ready for next stage when the control signal arrives.

(a)
Figure 4. Alternative XOR/XNOR based on the capture pass
(transmission path) concept.

Figure 5. Pulse generator used in XNOR-based DET-FF. (b)

In deciding on whether to get complementary pulses from a Figure 6. Transparent latch: (a) functional diagram, (b) transistor-
single output of the XNOR gate with an added inverter or from a level.
pair of differential outputs from XOR and XNOR, we choose the
single output and removed Mn3. Thus, we cut off an undesired
current path, and made the width-to-length ratio of the input TWO-PHASE PIPELINE SYSTEM
inverter fit our design criterion, as mentioned above. We also
added inverters as output buffers to get better driving capacity. The simplified system
The remaining issue was the mismatch between the “clk” and the
“Iclk”, since the “Iclk” signal follows the “clk” signal. Noting the The two-phase asynchronous control system used in our
result discussed in [12], we enlarged the width-to-length ratio of simulation was modified from Sutherland’s micro-pipeline,
the “Iclk” inverter, which reduced the mismatch between the which is well discussed in [7, 14], and shown in Fig. 7. Only five
clocks due to different driving capacities and different loads. logic components are used in this system, C-element, delay
We also needed a transparent latch with small input capacitance. buffer, inverter, XNOR gate, and transparent latch. The XNOR
The latch used in our proposed DET flip-flop is a common gate and transparent latch work as double edge-triggered flip-
transparent latch made up of a latching inverter pair and an input flops (DET-FF), as discussed above. In our simulation, we treated
selector (multiplexer), as shown in Fig. 6(a). Normally, the time the logic cells between stages as a simple signal path, which
required to break the latch status between the inverter pair means the entire system worked as a simple FIFO (first-in-first-
dominates the entire transient time of a transparent latch. To out) operation. In practical applications, those logic cells may be
avoid reaction from a weak inverter, the channel resistance of the asynchronous modules or computing cells.
input selector must be much smaller than that of the inverter. In The transition of the “request” signal from the left stage means
our design, we set the width-to-length ratio of the weak inverter that the left-stage data is ready, and will enable the C-element to
to one-fifth of the input-selection gates. However, as noted in fire the DETDFF to capture the data after the acknowledge signal
[12], the larger gate and diffusion capacitances incurred by from the right C-element arrives. In the meantime, the firing
longer channel lengths postpone responses. Thus, we modified signal sent from this C-element resets the left-stage C-element to
our transparent latch by implementing active loads on the weak wait for the next “request” signal. This firing signal is also the
inverter to reduce propagation delays during transitions. This “request” signal for the right C-element after passing the delay
active-load concept is also discussed in [12-13] as “skinny buffer. Obviously, there may be racing problems or
current limit”, and “pre-charging method”. That is the pre- synchronizing failures in such a simple two-phase control system.
charged active load provides the major channel resistance, and If the delay time provided by the delay buffer is not long enough
permits the smallest portion of the original weak inverter to to allow the logic cells to reflect the new data before enabling the
perform the defined latch function, as shown in Fig. 6(b). We transparent latch of the next stage, the next stage may capture

81
unpredictable logic states, even the previous logic state. falling times of a simple inverter. Because the originally
designed width-to-length ratio of the MOS-style DET-FF [8]
could not work normally in our simulative CMOS process, we
doubled the NMOS width-to-length ratio to allow a low enough
logic-low for its weak inverters.

Logic Cell

Figure 8. Delay buffer cell from Fig. 7.


Figure 7. Two-phase pipelined control system used in the
simulation.

Delay buffer and C-element


The delay buffer used in our simulation was a timing X Rising delay of XNOR based + Falling delay of XNOR based
controller that not only determined how much time a single □ Rising delay of conventional ◇ Falling delay of conventional
computing process took, but also whether there was enough time △ Rising delay of MOS-style ﹡ Falling delay of MOS-style
for the logic cells in the asynchronous modules to finish their
computation before the next latch became active. Thus, the
timing requirement of this delay buffer had to satisfy the
equations presented in [7]—in simplified form:
td  tck Q  tlogic  t R A' (1)
where td, tckQ, tlogic, and tRA’ are the response times of
the delay buffer, transparent latch, logic cell, and C-element,
respectively.
The simple signal delay used in our simulation is shown in Fig. 8.
The added capacitor, C1, adjusts the possible delay time, and the
last two inverters in the diagram are used to shape the output
waveform.
The C-element used in this simulation is a symmetric design
discussed in [11, 15]. Most of the C-element is a set of transistors
called a keeper, which provides feedback from the output port to X Power dissipation of XNOR based
lock the output state when the values of the inputs are unmatched. □ Power dissipation of conventional
Of course, such keeper provides an additional current path, which
△ Power dissipation of MOS-style
may conflict with the input signal. In other words, it will take
more time and energy to break the latch status when new
matched signals appear at the inputs. The keeper of the
symmetric C-element used in our simulation provides other
signal paths for inputs only when the logic states of the inputs are
unmatched. When new matched signals appear at the inputs,
those signal paths will be useless. Thus, this symmetric C-
element provides the best power-delay performance. Another
issue concerning C-elements is whether there are inputs that may
change twice in succession, or a JOIN [16] or RENDEZVOUR
[17]. These were not a problem in our simulation system. As (b)
mentioned above, the state period of the system clock must be Figure 9. Simulation results for various W/L ratios: (a)
longer than the computing period of the entire isolated propagation delay, (b) power dissipation.
asynchronous module. Otherwise, the latch may capture the
wrong data and sustain a system failure. Our simulation yielded some attractive phenomena. First,
we found that the response speed of the pseudo-static DET-FF [7]
SIMULATION RESULTS varies in accordance with the input data and control because the
current data may be pre-stored at either the first stage or the
We performed HSPICE simulation using the fast-mode second stage, and timing differences may result from the signal
model of TSMC 0.25um CMOS process. All the N-MOSFETs propagation path length and the cutting-in voltage of the clock
used in this simulation were of the same transistor size, except signal. Furthermore, this pseudo-static DET-FF will take more
for those used in the output inverters and those critical time to transfer the correct signal if the first stage or the second
MOSFETs discussed in Section II. We adjusted the PMOS stage needs to break the locked status of its weak inverters. A
width-to-length ratio to two-and-a-half times the length of the similar weak-inverter postponement problem also happens with
NMOS ratio based on the simulation results for the rising and the MOS-style DET-FF [8]. The only difference between these

82
two phenomena is that channel charge-release plays a major part
in the MOS-style DET-FF since the pre-stored data relies on the
input capacitance of the main inverter. Because we consider the
worst-case control overhead as the timing assessment in an
asynchronous control system, the power-delay performance
shown in Fig. 10(c) is based on the worst case for each DET-FF.
We must especially emphasize that the propagation delay shown
in our simulation results is the time delay from the control signal
transition to the output data signal transition, and “power
dissipation” refers to the power consumption of all control X Power dissipation of XNOR based
circuits, including the input and output buffers, because the □ Power dissipation of conventional
driving current for the transmission paths comes from the output
△ Power dissipation of MOS-style
of the previous stage.

First, we compared the propagation delay of our XNOR-


based DET-FF, for various transistor sizes, with those of two
other DET-FFs. The simulation results shown in Fig. 9 clearly (b)
demonstrate that the MOS-style DET-FF had the best speed
performance for various transistor sizes at a cost of over four
times the power dissipation of the other solutions. This was due
to the active-load drive scheme used, and the current path hidden
in its decision stage. Although the pseudo-static DET-FF has
more transistors, there are more transmission paths and more
transition stages in our XNOR based DET-FF. Thus, we
predicted the conventional DET-FF should be faster at a cost of
more power dissipation and larger size. The simulation results
shown in Fig. 9 reveal that our XNOR-based DET-FF had a 42%
reduction in delay and a 12% expansion in power, compared with
X Power–delay of XNOR based
the pseudo-static DET-FF. That was because not all the
□ Power–delay of conventional
transistors in the pseudo-static DET-FF work at the clock
transition, and the data signal has to pass through 3 switched- △ Power–delay of MOS-style

inverter stages to the final output inverter. Basically, expanding


the transistor size (width) made no evident difference in response
speed for all simulated DET-FF solutions.
(c)
We also considered the impact of supply-voltage variations, Figure 10. Simulation results for various supply voltages: (a)
and show those simulation results in Fig. 10. All the DET-FF propagation delay, (b) power dissipation, and (c) power-delay
solutions functioned properly when the supply voltage was
performance.
varied from 1.8V to 3.5V, although this 0.25um CMOS model is
at optimum at 2.5V. The MOS-style DET-FF still had the best In pipelined simulation, we did not insert any asynchronous
speed performance, even though its power dissipation was modules or computing cells in the “logic cell” between the two
frightful when the supply voltage was increased. Our XNOR double-edge-triggered D flip-flops. Actually, we directly
based DET-FF still showed a significant reduction in propagation connected those DET-DFFs with C-elements, and triggered the
delay and power dissipation similar to the pseudo-static DET-FF. first C-element with a simple clock. Thus, the whole pipelined
Our XNOR-based DET-FF clearly had the best power-delay system functioned just like an asynchronous FIFO. Also, the
performance, as shown in Fig. 10(c). delay-buffers delay times determined the actual propagation
delay between the stages. The waveforms delivered by all the
DET-FF solutions are shown in Fig. 11. Again, the 0.8ns delay
X Rising delay of XNOR based
+ Falling delay of XNOR based
between neighboring stages is due just to the design timing of the
□ Rising delay of conventional
delay-buffer. In a pipelined control system, the propagation delay
◇ Falling delay of conventional
of any internal cell is included in the entire control timing, as
△ Rising delay of MOS-style discussed in [7]. Thus, the real timing consideration, as shown in
﹡ Falling delay of MOS-style Fig. 11, is the rising/falling time of the output waveform, which
limits the enable timing of the next stage. Some significant
results were found in simulating the pipelined system. First, it is
clear that only our XNOR-based DET-FF had symmetrical
“high” and “low” periods. That was due to the unequal rising and
falling times of the other two solutions. Second, obvious spikes
were found in the output waveform of the MOS-style DET-FF
because the pre-stored data in the parasitic capacitance could be
affected by new data and the unmatched reverse clock. Since it is
almost impossible in real VLSI implementations to have a
perfectly matched clock pair, the reverse clock in our simulated
(a) MOS-style DET-FF was transferred from the C-element enable
signal. And the last is that the MOS-style DET-FF took longer to
pull up the output because of the active P-MOSFET used in the

83
decision stage. Even the double-size decision-N-MOSFET took a [5] M. Afghahi and J. Yuan, “Double edge-triggered d-flip-flops
long time to drop the input of the output stage. for high-speed CMOS circuits,” IEEE Journal of Solid-State
Circuits, 26(8), August 1991.
3rd stage of [6] R. Hossain, L. Wronski and A. Albicki, “Low power design
XNOR based
using double edge-triggered flip-flops,” IEEE transaction on
4th stage of
VLSI system, vol. 2, No. 2, pp.261-265, June 1994.
XNOR based [7] K. Y. Yun, P. A. Beerel and J. Arceo, “High-performance two-
phase micropipeline building blocks: double edge-triggered
3rd stage of latches and burst-mode select and toggle circuits,” IEE
conventional
Proceeding, Circuits, Devices and Systems, 143(5):282-288,
4th stage of October 1996.
conventional [8] Pradeep Varma, B. S. Panwar, Ashutosh Chakraborty, and
Dheeraj Kapoor, “A MOS approach to CMOS DET flips-flop
3rd stage of design,” IEEE Transactions on Circuits and Systems I, vol. 49,
MOS-style
No7. July 2002.
4th stage of [9] J. M. Wang, S. C. Fang and W. S. Feng, “New efficient
MOS-style designs for XOR and XNOR functions on the transistor level,”
IEEE Journal of Solid-State Circuits, vol. 29, pp. 780-786,
July 1994.
Figure 11. Output waveforms from the DET-FFs in pipeline [10] Nan Zhuang and Haomin Wu, “A new design of the CMOS
simulation. full adder,” IEEE Journal of Solid-State Circuits, vol. 27, No.
5, pp.840-844, May 1992
CONCLUSION [11] Abdellatif Bellaouar and Mohamed I. Elmasry, “Low-power
The concept of wrapped asynchronous modules opens a new digital VLSI design: Circuits and Systems”, Kluwer Reading,
possibility of using various power systems and operating Kluwer Academic Publishers, MA: 1995.
frequencies in isolated computing modules in system-on-chip [12] Maitham Shames, Jo C. Ebergen, and Mohamed I. Elmasry,
(SOC) and other single-chip applications. The two-phase “Modeling and Comparing CMOS Implementations of the C-
pipelined control system removes the unnecessary overhead of Element”, IEEE Trans. on VLSI System, Vol. 6, No. 4, Dec.
waiting for signal transitions on the control bus. The key 1998.
component in an efficient two-phase pipelined system is a [13] Marc Renaudin, Bachar El Hassan, and Alain Guyot, “A
specified edge-triggered flip-flop that can latch data on the rising new Asynchronous Pipeline Scheme: Application to the
and falling control signal transitions. We have presented a Design of Self-Timed Ring Divider,” IEEE Journal of Solid-
different approach to implementing XNOR-based double-edge- State Circuits, Vol. 31, No. 7, pp.1001~1013, July 1996.
triggered flip-flops. We re-used an alternative capture-pass [14] Ivan E. Sutherland, “Micropipelines,” Communications of
XOR/XNOR gate, and took advantage of its potential weakness, ACM, 32(6): 720-738, June 1989.
sensitivity to the driving capacity of the previous stage, to [15] K. V. Berkel, “Beware the isochronic fork,” Integration, The
generate a pair of stable clock pulses for a simple transparent VLSI J. vol. 13, pp. 103-128, June 1992.
latch. Our approach allows use of the good property of level- [16] J. C. Ebergen, J. Segers and I. Benko, “Parallel progress and
sensitive latches from other studies to be reproduced in a two- asynchronous circuit design,” in Asynchronous Digital Circuit
phase pipelined control system. Simulation results show that the Design, New York: Springer-Verlag, 1995, pp. 51-103.
proposed XNOR-based double-edge-triggered flip-flop is faster [17] W. A. Clark, “Macromodular computer systems,” in Proc.
than the pseudo-static DET-FF and more power saving. Although AFIPS Conf. 1967 Spring Joint Comput. Conf., Atlantic City,
the MOS-style DET-FF demonstrated significant speed NJ. vol. 30, pp. 335-336.
performance, the trade-off for the active load and simple latching
mechanism is more power dissipation and a disappointing output
spike. Generally, our proposed XNOR-based DET-FF had stable
performance, even when the system power was reduced to 1.8V,
72% of rated supply voltage. And this reliable performance was
also evident in the two-phase pipelined system simulation.

REFERENCES
[1] S. Rotem, K. Stevens, R. Ginosar, P. Beerel, C. Myers, K.
Yun, R. Kol, C. Dike, M. Roncken, and B. Agapiev, “ RAPPID:
An asynchronous instruction length decoder”, in Proc. Int.
Symp. Advanced Research in Asynchronous Circuits and
Systems, Apr. 1999, pp.60-70.
[2] D. M. Chapiro, “Globally-asynchronous locally-synchronous
systems”, Ph. D. dissertation, Stanford University. Oct. 1984.
[3] J. Carlsson, W. Li, T. Njolstad, K. Palmkvist, L. Wanhammar,
and S. Zhuang, “A Modular Asynchronous Wrapper”, National
Conf. Radio Science (RVK), Stockholm, Sweden, Jane. 10-13,
2002.
[4] Paul Day and J. Viv Woods, “Investigation into Micropipeline
latch design styles,” IEEE Transactions on VLSI Systems,
V3(2):264-272, June 1995.

84

View publication stats

Вам также может понравиться