Вы находитесь на странице: 1из 131

UNIT II

COMBINATIONAL LOGIC CIRCUITS


Combinational vs. Sequential Logic

In Out
Combinational Combinational
In Logic Out Logic
Circuit Circuit

State

Combinational Sequential

Output = f(In) Output = f(In, Previous In)


Static Complementary CMOS
 Pull-up network (PUN) and pull-down network (PDN)
VDD
PMOS transistors only
In1
pull-up: make a connection from VDD to F
In2 PUN
when F(In1,In2,…InN) = 1
InN
F(In1,In2,…InN)
In1
pull-down: make a connection from F to
In2 PDN
GND when F(In1,In2,…InN) = 0
InN
NMOS transistors only

PUN and PDN are dual logic networks


NMOS Transistors
in Series/Parallel Connection
Transistors can be thought as a switch controlled by its gate signal
NMOS switch closes when switch control input is high
A B

X Y Y = X if A and B

X B Y = X if A OR B
Y

NMOS Transistors pass a “strong” 0 but a “weak” 1


PMOS Transistors
in Series/Parallel Connection
PMOS switch closes when switch control input is low

A B

X Y Y = X if A AND B = A + B

X B Y = X if A OR B = AB
Y

PMOS Transistors pass a “strong” 1 but a “weak” 0


Threshold Drops

VDD VDD
PUN
S D
VDD

D 0  VDD S 0  VDD - VTn


VGS
CL CL

PDN VDD  0 VDD  |VTp|


VGS
D CL S CL
VDD

S D
Complementary CMOS Logic Style
Example Gate: NAND
Example Gate: NOR
Complex CMOS Gate

B
A
C

D
OUT = D + A • (B + C)

A
D
B C
Constructing a Complex Gate
VDD VDD

C
SN1 F SN4 A
F
SN2 B
A A
D D SN3

B C B C D

(a) pull-down network (b) Deriving the pull-up network A


hierarchically by identifying
D
sub-nets
B C

(c) complete gate


Elmore Delay
• ON transistors look like resistors
• Pullup or pulldown network modeled as RC
ladder
• Elmore delay of RC ladder
R1 R2 R3 RN

C1 C2 C3 CN

t pd  
nodes i
Ri to sourceCi

 R1C1   R1  R2  C2  ...   R1  R2  ...  RN  CN


Example: 2-input NAND
• Estimate rising and falling propagation delays
of a 2-input NAND driving h identical gates.
2 2 Y
A 2 6C 4hC

B 2x 2C
h copies

R
Y
(6+4h)C
t pdr 
Example: 2-input NAND
• Estimate rising and falling propagation delays
of a 2-input NAND driving h identical gates.
2 2 Y
A 2 6C 4hC

B 2x 2C h copies

R
Y
(6+4h)C
t pdr  6  4h  RC
Estimate the worst case falling propagation delays of
a 2-input NAND driving h identical gates

The worst case occurs when the node x is already


charged up to nearly Vdd through the top nMOS

2 2 Y Suppose A = 1, B = 0, then

A 2 6C 4hC Y = 1, node X is nearly VDD

B 2x 2C Now change inputs to A=B=1


both node Y and node X need
to discharge
Example: 2-input NAND
• Estimate rising and falling propagation delays
of a 2-input NAND driving h identical gates.
2 2 Y
A 2 6C 4hC

B 2x 2C h copies

t pdf 
x R/2 Y
R/2 2C (6+4h)C
Example: 2-input NAND
• Estimate rising and falling propagation delays
of a 2-input NAND driving h identical gates.
2 2 Y
A 2 6C 4hC

B 2x 2C h copies

x R/2 Y
t pdf   2C   R2    6  4h  C   R2  R2 
  7  4h  RC
R/2 2C (6+4h)C
Delay Components
• Delay has two parts
– Parasitic delay, gate driving its own internal
diffusion capacitance
• 6 or 7 RC
• Independent of load
– Effort delay, depends on the ration of external load
capacitance to input capacitance,
– Effort delay changes with transistor width
• Proportional to load capacitance
• Logical effort and Electrical effort
Contamination Delay
• Best-case (contamination) delay can be
substantially less than propagation delay.
• Ex: If both inputs fall simultaneously, the
output should be pulled up in half the time
2 2 Y
A 2 6C 4hC

B 2x 2C tcdr = (R/2)(6+4h)C

tcdr   3  2h  RC
R R
Y
(6+4h)C
CIRCUIT FAMILIES

• Static CMOS
• Ratioed Circuits
• Cascode Voltage Switch Logic (CVSL)
• Dynamic Circuits
• Pass-transistor Circuits
Static CMOS Circuits
Static CMOS
Circuits
 In Static CMOS circuits with n inputs, 2n transistors are
needed.
 nMOS block is a dual of the pMOS block.
 What ever is in series in nMOS, appears in parallel in
pMOS and vice versa.
 CMOS gates consume power only during the transition of
inputs.
Static complementary gate
structure

Pull-up and pull-down networks


VDD

pull-up
network
inputs out
Pull-down
network
VSS
Pull-up/pull-down network
design

 Pull-up and pull-down networks are duals.

 To design one gate, first design one network, then compute


dual to get other network.
Inverter
CMOS Logic Style Construction
Example Gate: NAND
BUBBLE PUSHING
Compound Gates - AOI/OAI
gates

 AOI = and/or/invert; OAI = or/and/invert.


 Implement larger functions.
 Pull-up and pull-down networks are compact: smaller
area, higher speed than NAND/NOR network equivalents.
AOI
example

invert

or

and
AOI CMOS Gate

• AOI complex CMOS gate can be used to directly implement


a sum-of-products Boolean function
• The pull-down N-tree can be implemented as follows:
– Product terms yield series-connected NMOS transistors
– Sums are denoted by parallel-connected legs
– The complete function must be an inverted
representation
• The pull-up P-tree is derived as the dual of the N-tree
OAI CMOS Gate

• An Or-And-Invert (OAI) CMOS gate is similar to the AOI gate except that it is an
implementation of product-of-sums realization of a function
• The N-tree is implemented as follows:
– Each product term is a set of parallel transistors for each input in the term
– All product terms (parallel groups) are put in series
– The complete function is again assumed to be an inverted representation
• The P-tree can be implemented as the dual of the N-tree
• Note: AO and OA gates (non-inverted function representation) can be
implemented directly on the P-tree if inverted inputs are available
Properties of CMOS Gates

High noise margins:


VOH and VOL are at VDD and GND, respectively.
No static power consumption:
There never exists a direct path between VDD and
VSS (GND) in steady-state mode.
Comparable rise and fall times:
(under the appropriate scaling conditions)
Ratioed Circuits

 Pseudo-nMOS Circuits
 Ganged CMOS
 Source-Follower Pull-up Logic (SFPL)
Pseudo-nMOS Circuits
• Adding a single pFET to otherwise nFET-only
circuit produces a logic family that is called
pseudo-nMOS
– Less transistor than CMOS
– For N inputs, only requires (N+1) FETs
– Pull-up device: pFET is biased active since
the grounded gate gives VSGp = VDD
Figure 1 General structure of
– Pull-down device: nFET logic array acts as a a pseudo-nMOS logic gate
large switch between the output f and
ground
– However, since the pFET is always biased
on, VOL can never achieve the ideal value
of 0 V
• A simple inverter using pseudo-nMOS as
Figure 2 Figure 2 Pseudo-nMOS inverter
Ganged CMOS (Symmetric Circuits)

A B C

 Inverters ganged together to perform a function.


 NOR gate ; Z = A+B+C
Source-Follower Pull-up Logic (SFPL)
• SFPL is a variation on pseudo-NMOS
whereby the load device is an N pull-
down transistor and N source-follower
pull-ups are used on the inputs.
– N pull-up transistors can be small
limiting input capacitance
– N transistors are also duplicated as pull-
down devices in order to improve the
fall time
– Rise time is determined by the P1
inverter pull-up transistor when all
inputs are low
• SFPL is useful for high fan-in NOR logic
gates

R. W. Knepper
SC571, page 5-25
Cascode Voltage Switch Logic (CVSL)

• Differential type of logic circuit where both true and


complement inputs are required.
• N pull down tree are the dual of each other.
• P pull-up devices are cross-coupled to latch output..
• Both true and complement outputs are obtained.
Basic Structure of CVSL

Q Q Q Q

a
a
b
...
...

b
a b c
c
Dynamic CMOS Logic

 Logic function is implemented by the PDN only


No. of transistors is N+2
Smaller in area than static CMOS
 Full swing outputs (VOL = gnd and VOH = VDD)
 Non-ratioed
 Faster switching speed
 Power dissipation should be better
 Needs precharge clock.
Dual-Rail Domino
• Domino only performs noninverting functions:
– AND, OR but not NAND, NOR, or XOR
• Dual-rail domino solves this problem
– Takes true and complementary inputs
– Produces true and complementary outputs

sig_h sig_l Meaning Y_l  Y_h


0 0 Precharged
0 1 ‘0’ inputs
f f
1 0 ‘1’
1 1 invalid 
Example: AND/NAND

• Given A_h, A_l, B_h, B_l


• Compute Y_h = A * B, Y_l = ~(A * B)
Example: AND/NAND
• Given A_h, A_l, B_h, B_l
• Compute Y_h = A * B, Y_l = ~(A * B)
• Pulldown networks are conduction complements

Y_l  Y_h
= A*B A_h = A*B
A_l B_l B_h


Example: XOR/XNOR
• Sometimes possible to share transistors

Y_l  Y_h
= A xnor B A_h A_l A_l A_h = A xor B
B_l B_h


TRANSMISSION GATES

• NMOS pass transistor passes a strong 0 and a weak 1.


• PMOS pass transistor passes a strong 1 and a weak 0.
• Combine the two to make a CMOS pass gate which will
pass a strong 0 and a strong 1.
TRANSMISSION GATE
PROBLEMS WITH TRANSMISSION GATES

 No isolation between the input and output.


 Output progressively deteriorates as it passes through various
stages.

However designs get simplified.


Multiplexer
XOR gate
Transmission Gates
• N-Channel MOS Transistors pass a 0 better than a 1

• P-Channel MOS Transistors pass a 1 better than a 0

• This is the reason that N-Channel transistors are used in the pull-down
network and P-Channel in the pull-up network of a CMOS gate.
Otherwise the noise margin would be significantly reduced.
Transmission Gates
• Pass transistors produce degraded outputs
• Transmission gates pass both 0 and 1 well
Input Output
g = 0, gb = 1 g = 1, gb = 0
g
a b 0 strong 0
a b g = 1, gb = 0 g = 1, gb = 0
a b 1 strong 1
gb

g g g
a b a b a b symbols
gb gb gb
Transmission Gates
• Implementing XOR gates
– With NAND gates and inverters:

– With transmission gates:

• Why would one of these circuits be preferable to the other?


Transmission Gates
• Implementing a multiplexer with transmission gates:

– When S = 0, input X1 is connected to the output Y


– When S = 1, input X2 is connected to the output Y
Pass Transistors
• Transistors can be used as switches
g g=0 Input g = 1 Output
s d 0 strong 0
s d
g=1 g=1
s d 1 degraded 1

g g=0 Input Output


g=0
s d 0 degraded 0
s d
g=1
g=0
s d strong 1
Pass Transistor
• Pass-transistor circuits are formed by dropping the PMOS
transistors and using only NMOS pass transistors

• In this case, CMOS inverters (or other means) must be used


periodically to recover the full VDD level since the NMOS
pass transistors will provide a VOH of VDD – VTn in some
cases

• The pass transistor circuit requires complementary inputs


and generates complementary outputs to pass on to the next
stage
Pass Transistor
• This figure shows a simple
XNOR implementation
using pass transistors:

• If A is high, B is passed
through the gate to the
output

• If A is low, -B is passed
through the gate to the
output
Pass Transistor
• At right,
– (a) is a 2-input NAND
pass transistor circuit
– (b) is a 2-input NOR pass
transistor circuit

• Each circuit requires 8


transistors, double that
required using conventional
CMOS realizations
Pass Transistor
• Pass-transistor logic gate can implement Boolean functions
NOR, XOR, NAND, AND, and OR depending upon the P1-P4
inputs, as shown below.
– P1,P2,P3,P4 = 0,0,0,1 gives F(A,B) = NOR
– P1,P2,P3,P4 = 0,1,1,0 gives F(A,B) = XOR
– P1,P2,P3,P4 = 0,1,1,1 gives F(A,B) = NAND
– P1,P2,P3,P4 = 1,0,0,0 gives F(A,B) = AND Circuit can be
– P1,P2,P3,P4 = 1,1,1,0 gives F(A,B) = OR operated with
clocked P pull-up
device or inverter-
based latch
Pass Transistor Logic
Families
 Complementary Pass Transistor Logic
 Double Pass Transistor Logic
Complementary Pass-Transistor Logic (CPL)

Pass Variables

Inputs

Control f f
Variables

F F
Basic logic functions in CPL
A B B A A B A A A A A B

B B

B B

A B B A A B
A B B A A C B C
B
C
B
C
CPL Logic
A A
A A
B
n1 n2

B B
n3 n4

B
C

Q Qb C
S S (a) (b)
XOR gate S S
Sum circuit

CPL provides an efficient implementation of XOR function


Full Adder Design III
• Complementary Pass Transistor Logic (CPL)
– Slightly faster, but more area
B

B C B C
A

B C B C
S Cout
A
B C B C

A
B C B C
S Cout

B
A

B
Double Pass-Transistor Logic (DPL):
VDD AND/NAND
A B B A
A B
B B

A A

O
O

A B A B A B A B XOR/XNOR

B A
A B
A B
B A
A B

O
O
Double Pass-Transistor Logic (DPL):
A A

A A

n1 p2 n1 p2
B
B
p1 n2 p1 n2

B C

Q Qb
C

O O
S S
(a) XOR (b)
One bit full-adder:
Sum circuit
Double Pass-Transistor Logic (DPL):
AND/NAND
DPL Full Adder
Vcc

A
A C C
B Vcc
B S

Vcc

A S
A
Multiplexer Buffer
B
B
The critical path traverses two transistors only
(not counting the buffer)
OR/NOR
Dynamic CMOS
• In static circuits at every point in time (except when
switching) the output is connected to either GND or
VDD via a low resistance path.
– fan-in of n requires 2n (n N-type + n P-type)
devices

• Dynamic circuits rely on the temporary storage of


signal values on the capacitance of high impedance
nodes.
– requires on n + 2 (n+1 N-type + 1 P-type)
transistors
Dynamic Gate
off
Clk Mp Clk Mp on
1
Out Out
In1 CL ((AB)+C)
A
In2 PDN
C
In3
B
Clk Me
off
Clk Me on

Two phase operation


Precharge (Clk = 0)
Evaluate (Clk = 1)
Conditions on Output
• Once the output of a dynamic gate is discharged, it
cannot be charged again until the next precharge
operation.
• Inputs to the gate can make at most one transition
during evaluation.

• Output can be in the high impedance state during


and after evaluation (PDN off), state is stored on CL
Properties of Dynamic Gates
• Logic function is implemented by the PDN only
– number of transistors is N + 2 (versus 2N for static complementary CMOS)
• Full swing outputs (VOL = GND and VOH = VDD)
• Non-ratioed - sizing of the devices does not affect the
logic levels
• Faster switching speeds
– reduced load capacitance due to lower input capacitance
(Cin)
– reduced load capacitance due to smaller output loading (Cout)
– no Isc, so all the current provided by PDN goes into
discharging CL
Properties of Dynamic Gates
• Overall power dissipation usually higher than static
CMOS
– no static current path ever exists between VDD and
GND (including Psc)
– no glitching
– higher transition probabilities
– extra load on Clk
• PDN starts to work as soon as the input signals exceed
VTn, so VM, VIH and VIL equal to VTn
– low noise margin (NML)
• Needs a precharge/evaluate clock
Dynamic Logic
• Dynamic gates uses a clocked pMOS pullup
• Two modes: precharge and evaluate
2 2/3  1
A Y Y Y
1 A 4/3 A 1

Static Pseudo-nMOS Dynamic

 Precharge Evaluate Precharge

Y
The Foot
• What if pulldown network is ON during
precharge?
• Use series evaluation transistor to prevent
fight.
 
precharge transistor
 Y Y
Y inputs inputs
A f f
foot

footed unfooted
Logical Effort
Inverter NAND2 NOR2

 1
Y
 1  1
A 2
unfooted Y Y
A 1 B 2 A 1 B 1
gd = 1/3 gd = 2/3 gd = 1/3
pd = 2/3 pd = 3/3 pd = 3/3

 1
Y
 1  1
A 3
Y Y
footed A 2 B 3 A 2 B 2
gd = 2/3 gd = 3/3 gd = 2/3
2 pd = 3/3 3 pd = 4/3 2 pd = 5/3
Issues in Dynamic Design 1: Charge Leakage

CLK
Clk Mp
Out

A CL

VOut Evaluate
Clk Me
Precharge

Leakage sources

Dominant component is subthreshold current


Solution to Charge Leakage
Keeper

Clk Mp Mkp

A Out
CL
B

Clk Me

Same approach as level restorer for pass-transistor logic


Issues in Dynamic Design 2: Charge Sharing

Charge stored originally on CL


Clk Mp is redistributed (shared) over
Out CL and CA leading to reduced
A CL robustness
B=0 CA

Clk Me CB
Charge Sharing Example

Clk
Out
A A CL=50fF

Ca=15fF B B B !B Cb=15fF

Cc=15fF C C Cd=10fF

Clk
Charge Sharing
V DD
VDD case 1) if Vout < VTn

Clk Mp
 Mp
L out  
Out C V = C V t + C V – V V 
L DD a DD Tn X
Out
or
CL
A Ma CL Ca
A Ma V out = Vout  t  – V DD = – --------  V DD – V Tn  V X  
XX CL

= M CCa a
BB 00 Mbb case 2) if Vout > VTn

 Ca 
CCb b Vout = –V DD  ---------------------- 
Clk  M
Mee C +C
 a L
Solution to Charge Redistribution

Clk Mp Mkp Clk


Out
A

Clk Me

Precharge internal nodes using a clock-driven transistor (at the cost of


increased area and power)
Issues in Dynamic Design 3: Backgate Coupling

Clk Mp Out1 =1
Out2 =0
A=0 In
CL1 CL2

B=0

Clk Me

Dynamic NAND Static NAND


Backgate Coupling Effect

2
Out1
1 Clk

0 Out2
In

-1
0 2 Time, ns 4 6
Issues in Dynamic Design 4: Clock Feedthrough

Coupling between Out and Clk


Clk Mp input of the precharge device
Out due to the gate to drain
A CL capacitance.
So voltage of Out can rise
B
above VDD. The fast rising (and
Clk Me falling edges) of the clock
couple to Out.
Clock Feedthrough
Clock feedthrough
Clk
Out 2.5
In1
In2 1.5

In3 In &
0.5 Clk
In4 Out
Clk -0.5
0 0.5 Time, ns 1
Clock feedthrough
Other Effects
• Capacitive coupling
• Substrate coupling
• Minority charge injection
• Supply noise (ground bounce)
Cascading Dynamic Gates
V

Clk Clk Clk


Mp Mp
Out2
Out1
In
In

Clk Clk VTn


Me Me Out1

V
Out2

Only 0  1 transitions allowed at inputs!


Monotonicity
• Dynamic gates require monotonically rising
inputs during evaluation 
– 0 -> 0 A

– 0 -> 1
violates monotonicity
– 1 -> 1 during evaluation
A
– But not 1 -> 0
 Precharge Evaluate Precharge

Output should rise but does not


Monotonicity Woes
• But dynamic gates produce
monotonically falling
outputs during evaluation
• Illegal for one dynamic gate
to drive another!
A
A == 11

  Precharge Evaluate Precharge


 Y  Precharge Evaluate Precharge
A
X Y
X X
A
X X monotonically falls during evaluation
Y

Y Y should rise but cannot


Domino Logic

Clk Mp Clk Mp Mkp


11
Out1 Out2
10
00
In1 01
In2 PDN In4 PDN
In3 In5

Clk Me Clk Me
Domino Gates
• Follow dynamic stage with inverting static gate
– Dynamic / static pair is called domino gate
– Produces monotonic outputs
domino AND
 Precharge Evaluate Precharge

W X Y Z W

A
X
B C
 Y

dynamic static
Z
NAND inverter

 
 
A W X A X
H Y =
B H Z B Z
C C
Domino Optimizations
• Each domino gate triggers next one, like a string of dominos
toppling over
• Gates evaluate sequentially but precharge in parallel
• Thus evaluation is more critical than precharge
• HI-skewed static stages can perform logic

S0 S1 S2 S3
D0 D1 D2 D3
Y
H

S4 S5 S6 S7
D4 D5 D6 D7
Dual-Rail Domino
• Domino only performs noninverting functions:
– AND, OR but not NAND, NOR, or XOR
• Dual-rail domino solves this problem
– Takes true and complementary inputs
– Produces
sig_h sig_ltrue and complementary outputs
Meaning
0 0 Precharged Y_l  Y_h

inputs
0 1 ‘0’ f f

1 0 ‘1’ 

1 1 invalid
Example: AND/NAND
• Given A_h, A_l, B_h, B_l
• Compute Y_h = AB, Y_l = AB
• Pulldown networks are conduction
complements
Y_l  Y_h
= A*B A_h = A*B
A_l B_l B_h


Example: XOR/XNOR
• Sometimes possible to share transistors

Y_l  Y_h
= A xnor B A_h A_l A_l A_h = A xor B
B_l B_h


np-CMOS
NORA Logic
NP Domino
Zipper CMOS
• The NP-Domino or NORA logic is very
susceptible to noise and leakage.
• Zipper Domino has the same structure, but
the precharge transistors are left slightly ON
during evaluation.
Leakage
• Dynamic node floats high during evaluation
– Transistors are leaky (IOFF  0)
– Dynamic value will leak away over time
– Formerly miliseconds, now nanoseconds
• Use keeper to hold dynamic node
weak keeper
– Must be weakenough
1 k not to fight evaluation
X
H Y
A 2
2
Charge Sharing
• Dynamic gates suffer from charge sharing



Y A
A x CY
Y
B=0 Cx Charge sharing noise

CY
Vx  VY  VDD
C x  CY
Secondary Precharge
• Solution: add secondary precharge transistors
– Typically need to precharge every other node
• Big load capacitance CY helps as well
secondary
 precharge
Y transistor
A x
B
Noise Sensitivity
• Dynamic gates are very sensitive to noise
– Inputs: VIH  Vtn
– Outputs: floating output susceptible noise
• Noise sources
– Capacitive crosstalk
– Charge sharing
– Power supply noise
– Feedthrough noise
– And more!
Power
• Domino gates have high activity factors
– Output evaluates and precharges
• If output probability = 0.5, a = 0.5
– Output rises and falls on half the cycles
– Clocked transistors have a = 1
– For a 4 input NAND, aCMOS = 3/16, aDynamic = 1/4
• Leads to very high power consumption
• However, glitching does not occur in dynamic
logic.
• The load capacitances are lower.
MODL
• It is often necessary to compute multiple
functions where one is a subfunction of the
other or shares a subfunction.
• One very typical example is the carry in
addition:
c1  g1  p1c 0
c 2  g2  p2 g1  p1c 0 

 
c 3  g3  p3 g2  p2 g1  p1c 0 

  
c 4  g4  p4 g3  p3 g2  p2 g1  p1c 0 
MODL Carry Chains
MODL
• Beware of sneak paths.
• Certain inputs must be mutually exclusive.
Domino Summary
• Domino logic is attractive for high-speed circuits
– 1.3 – 2x faster than static CMOS
– But many challenges:
• Monotonicity, leakage, charge sharing, noise
• Widely used in high-performance
microprocessors in 1990s when speed was king
• Largely displaced by static CMOS now that power
is the limiter
• Still used in memories for area efficiency
POWER DISSIPATION
Power is drawn from a voltage source attached to
the VDD pin(s) of a chip.

Instantaneous Power: P(t )  iDD (t )VDD


T T
E   P(t )dt   iDD (t )VDDdt
Energy: 0 0

E 1 T
Pavg    iDD (t )VDD dt
Average Power: T T 0
Overview of Power Dissipation

 Ptotal = Pdynamic+Pstatic
 Power Consumption (Pdynamic)
Dynamic power Consumption Pdynamic = Pswitching + Pshortcircuit
Switching load capacitances
Short-circuit current
– Charging and discharging capacitors
 Short Circuit Power Consumption (Pshort-circuit)
– Short circuit path between supply rails during switching
Power Dissipation Sources

Static power: Pstatic = (Isub + Igate + Ijunct + Icontention)VDD

Subthreshold leakage
Gate leakage
Junction leakage
Contention current
Dynamic Power
 Dynamic power is required to charge and discharge load
capacitances when transistors switch.
 One cycle involves a rising and falling output.
 On rising output, charge Q = CVDD is required
 On falling output, charge is dumped to GND
 This repeats Tfsw times Vdd

over an interval of T
Vin Vout

CL
fsw
Dynamic Power
T
1
Pdynamic   iDD (t )VDD dt
T 0
T
VDD

T 0 iDD (t )dt
VDD
VDD iDD(t)
 TfswCVDD 
T
 CVDD 2 f sw
C
fsw
Dynamic Power
 Suppose the system clock frequency = f
 Let fsw = af, where a = activity factor
 If the signal is a clock, a = 1
 If the signal switches once per cycle, a = ½
 Dynamic gates:
 Switch either 0 or 2 times per cycle, a = ½
 Static gates:
 Depends on design, but typically a = 0.1

 Dynamic power: Pdynamic  aCVDD 2 f


Dynamic Power
 Pdynamic = Energy/per-transition  Transition rate
= CLVDD2 f0→1
= CL VDD2 P0→1 f
= Ceff VDD2 f
 Ceff = effective capacitance = CL P0→1
 Power dissipation is data dependent
– Function of Switching Activity
 Activity Factor (P0→1)
– Clock signal: P0→1(clk) = 1
– Data signal: P0→1(data) < 0.5
Short Circuit Current
 When transistors switch, both nMOS and pMOS networks
may be momentarily ON at once
Vdd
 Leads to a blip of “short circuit” current.
 ~ 15% of dynamic power Vin Vout
– ~85% to charge capacitance CL CL
 NMOS and PMOS on
– Both transistors in saturation
 Long rise / fall times
– Slow input transition
– Increase short circuit current

Make input signal transitions fast to save power!


VDD
Short Circuit Current
Because of finite slope of input signal, there is a
ISC≈0 period when both PMOS and NMOS device are “on”
and create a path from supply to ground

Vin Vout E / E
CL
8 W/L|P = 7.2 mm/1.2mm VDD = 5 V
7 W/L|N = 2.4 mm/1.2mm
Large capacitive load 6
5
VDD 4
3
ISC≈IMAX 2 VDD = 3.3 V
1
Vin Vout 0 r
CL 0 1 2 3 4 5
The power dissipation due to short circuit
currents is minimized by matching the rise/fall
Small capacitive load times of the input and output signals.
Dynamic Power Reduction

Pswitching  a CVDD f
2

 Try to minimize:
– Activity factor
– Capacitance
– Supply voltage
– Frequency
Voltage Scaling

Dual voltage supply


 Internal voltage
– Reduced internal voltage 1.2V
• For low power operation
 External voltage
– Compatible IO voltage 3.3V
• To interface other ICs
Capacitance Minimization

– Gate capacitance
– Fewer stages of logic
– Small gate sizes
 Wire capacitance
– Good floorplanning to keep communicating blocks close
to each other
– Drive long wires with inverters or buffers rather than
complex gates
Clock Gating

 The best way to reduce the activity is to turn off the clock
to registers in unused blocks
– Saves clock activity (a = 1)
– Eliminates all switching activity in the block
– Requires determining if block will be used
Voltage / Frequency

 Run each block at the lowest possible voltage and frequency


that meets performance requirements
 Voltage Domains
– Provide separate supplies to different blocks
– Level converters required when crossing
from low to high VDD domains

 Dynamic Voltage Scaling


– Adjust VDD and f according to
workload
Static power Dissipation

Power dissipation occurring when device is in standby mode


As technology scales this becomes significant
Leakage power dissipation
Components:
Reverse biased p-n junction
Sub threshold leakage
DIBL leakage
Channel punch through
GIDL Leakage
Narrow width effect
Oxide leakage
Hot carrier tunneling effect
Source of Leakage Current
Leakage
 Sub-threshold current
Vdd
– Transistor conducts below Vt
– For sub-micron relevant
• VDD / Vt ratio smaller
Vout
• Can dominate power consumption! Drain
• Especially in idle mode. junction
leakage
Charge nodes fully to VDD! Sub-
threshold
Discharge nodes completely to GND! current
 Drain leakage current
– Reverse biased junction diodes
Static Power

 Static power is consumed even when chip is quiescent.


– Ratioed circuits burn power in fight between ON
transistors
– Leakage draws power from nominally OFF devices
Vgs Vt
 Vds

I ds  I ds 0e nvT
1  e
vT


 

Vt  Vt 0  Vds    s  Vsb  s 
Subthreshold current

Sub-threshold current increases exponentially


 Subthreshold current can be reduced by increasing Vt

Selective application of multiple threshold


(low-Vt transistors on critical paths, high Vt transistors on
other paths)
Control Vt through the body voltage
Sub-threshold current decreases in long channel
transistors and increases in short channel
Sub-threshold Leakage
Component
Gate Leakage

 Extremely strong function of tox and Vgs


– Negligible for older processes
– Approaches subthreshold leakage at 65 nm and below
in some processes
 An order of magnitude less for pMOS than nMOS
 Control leakage in the process using tox > 10.5 Å
– High-k gate dielectrics help
– Some processes provide multiple tox
• e.g. thicker oxide for 3.3 V I/O transistors
 Control leakage in circuits by limiting VDD
Junction Leakage

 From reverse-biased p-n junctions


– Between diffusion and substrate or well
 Ordinary diode leakage is negligible
 Band-to-band tunneling (BTBT) can be significant
– Especially in high-Vt transistors where other leakage is
small
– Worst at Vdb = VDD
 Gate-induced drain leakage (GIDL) exacerbates
– Worst for Vgd = -VDD (or more negative)
RATIOED CIRCUIT

 Pseudo-NMOS logic style PMOS as resistor


– PDN as static CMOS logic
 Static current
– When output low
 Power consumption
– Even without switching activity
Static power Reduction

Reduce static power


•Selectively use ratioed circuits
•Selectively use low Vt devices
•Leakage reduction:
stacked devices, body bias, low temperature

Вам также может понравиться