Timing Design in Digital Systems: Dr. Paul D. Franzon

ECE 520 Class Notes
Timing Design in Digital Systems

Dr. Paul D. Franzon
Outline
1. Timing design in Synchronous (clocked) Logic
z Min/Max timing with flip-flops
z Latch-based design
3. Timing Issues in CMOS circuits
4. Timing verification and SDF
References
1. Smith and Franzon, Chapters 11 and 13.
2. Smith, Sections 2.4, 13.6, 13.7, 16.4.1
3. H.B. Bakoglu, ‘Circuits, Interconnections and Packing for VLSI’,
Addison-Wesley, Ch. 8.
Attachments : Need sample library sheets.
©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 1

ECE 520 Class Notes
General Approach to Timing Design

z In general, all signals start and end in registers every clock period
D-Flip-flops
Combinational Logic
z In general, there is only one clock (in any one sub-block of the chip) and
each flip-flop is clocked every cycle
z If a chip needs multiple clocks, they should share a common root clock (e.g.
divide a master clock) and be fed to different sub-designs within the chip

ECE 520 Class Notes
Clock Level Timing

z Example (Revision):
In[0] Out[0]
In[1] Out[1]
In[2] Out[2]
In[3]
clock
Clock
In A 2 c 6
Out 3 1 4 0 4

ECE 520 Class Notes
Critical Path
z Thus, the clock speed is determined by the slowest feasible path between
registers in the design
z Often referred to as “the critical path”
Critical path is longer

with increased logic depth
(# gates in series)

ECE 520 Class Notes
Clock Distribution
z The goal of clock tree is for the clock to arrive at every leaf node at the same time:
z Usually designed after synthesis: Matched buffers; matched capacitance loads

z Some clock skew (difference in arrival times) is unavoidable
z Clock skew specification is an important parameter
z Clock tree carefully designed to meet a skew requirement
z Clock jitter (cycle to cycle skew variations) is also important but is treated as
being lumped in with the fixed clock skew to make a total skew (+jitter) budget
z Comes from phase noise in Phase Lock Loop, etc.

ECE 520 Class Notes
Flip-Flop based design

Edge triggered D-flip-flop
D Q Q becomes D after clock edge
Ck Set-up time:
Data can not change no later than this
point before the clock edge.
Ck Hold time:
tSetUp thold
Data can not change during this time after
D the clock edge.
tclock-Q
Q t_clock-Q
Delay on output (Q) changing from positi
Register Register
Q1
clock edge
D1 Comb D2
Logic Q2
tLogic-delay
+/-tskew

ECE 520 Class Notes
Preventing Set-Up Violations

Set-up violation:
Logic is too slow for the correct logic value to arrive at the inputs to the
register on the right before one set-up time before the clock edge
Constraint to prevent this:
tclock > tclock −Q − max + tlog ic − max + tset −up + tskew
clock
tskew
clock’
tclock-Q tset-up
Q1
tlogic
D2
•The amount of time required to turn ‘<‘ into ‘=‘ is referred to as timing slack

ECE 520 Class Notes
Preventing hold violations

Hold violations occur when race-through is possible
Constraint to prevent hold violations:
thold + tskew < tclock −Q − min + tlog ic − min

clock
tskew
clock’
thold Hold
tclock-Q Violation
Q1
tlogic
D2
Problem:
Q2 Race-through
•sometimes have to insert additional logic to prevent hold violations

ECE 520 Class Notes
Latch Based Design

D-latch
D Q
•Q follows D while clock is high
Ck •Value on D when clock goes low
is stored on Q
Set-up and hold times:

Ck D can not change close to the falling
tSetUp thold (‘latching’) clock edge.
D
tclock-Q
t_clock-Q
Q Delay from clock going high to Q
changing
Register Register
Q1
D1 Comb D2
Logic Q2
tLogic-delay
+/-tskew

ECE 520 Class Notes
Latch timing constraints

To prevent set-up violations:
tclock + tclock −high − max > tclock −Q − max + tlog ic − max + tset −up + tskew
clock
tskew
clock’
Q1 tclock-Q tslack tset-up

D2
tlogic
Notes:
•The percentage of time the clock is high is referred to as the duty-cycle
•If part of the following clock-high time is used to allow this logic to be slower, then
the logic-block connected to Q2 must be proportionally faster
•Using the clock-high time like this is called cycle-stealing
ECE 520 Class Notes
Latch Set Up Violations

Notes:
• The percentage of time the clock is high is referred to as the duty-cycle
• If part of the following clock-high time is used to allow this logic to be

slower, then the logic-block connected to Q2 must be proportionally
faster
• Using the clock-high time like this is called cycle-stealing
z Normally cycle stealing is not enabled

ECE 520 Class Notes
Latches … Cycle Stealing

ECE 520 Class Notes
…Latch timing constraints

To prevent hold violations:
tclock −high − max + thold + tskew < tclock −Q − min + tlog ic − min
clock
tskew
clock’
thold Hold
Q1 Violation
tlogic
D2
Note:
•Hold violations are harder to prevent in latch-based designs

ECE 520 Class Notes
Example:
D-flip-flop min : typ : max
Tclock-Q = 3 : 4 : 5
TNOR =1:2:3
D-flip-flop Tsu_max = 1
Thold_max = 2
Tskew = 1 ns
If this is the critical path, what is the fastest clock frequency?

Is there potential for a hold violation?
Setup: tclock > tclock −Q − max + tlog ic − max + tset −up + tskew
= 5 + 6 + 1 + 1 = 13 ns I.e. 76 MHz
Hold: Is thold + tskew < tclock −Q − min + tlog ic − min ?
2+1 3+1
< OK. No hold violation

ECE 520 Class Notes
CMOS Drive Strength

Revision: CMOS transistors operating in the linear region:
I ds = β ((Vgs − Vt )Vds − Vds2 / 2

where β = ( µε/tox )(W / L)
where W is the transistor width, and L is the channel length
i.e. To a first approximation,
I ds ≈ Vds / Ron
Ron ≈ 1/ β(VGS-VT)
Thus, delay in CMOS circuits depends largely on W/L of the drive transistor
and the capacitance of the load it is driving.
That capacitance consists of:
•Input gates of cells being driven, and τ = RonCload
•Capacitance of wiring

ECE 520 Class Notes
Data Sheet Example

z Using timing approximations in the datasheet, what is the maximum
clock frequency for this circuit (ignore wire load, Tskew):
Tclock-Q:
D-flip-flop
A0 D-flip-flop
A1
Tclock-Q Tsu
Tlogic CL: NOR2 DFF
Q -> A1:
DFF
CL_max = 0.0371+0.0117 pF = 0.0488 pF
T_cp-Q_max = max (1.12+1.59*CL, 1.09 + 1.32*CL)
= max (1.12+1.59*0.0488, 1.09 + 1.32*0.0488)
= max (1.2, 1.15) = 1.2 ns
Q->A2: T_cp-Q_max = 1.2 ns
ECE 520 Class Notes
… Example
Tlogic:
CL:
NOR2
DFF
NOR2
CL= 0.0136 + 0.0184 = 0.032 pF

From A0 : Tlogic = max (0.503 + 3.27*0.032, 0.371 +3.21*0.032)
= 0.61 ns
From A1 : Tlogic = max (0.420+2.17*0.032, 0.505+2.12*0.032)
= 0.57 ns

ECE 520 Class Notes
… Example
Tsu
Tclock > Tcp-Q_max + Tlogic_max + Tsu_max

= 1.2 + 0.61 + 1.4
= 3.21 ns
Fclock < 311 MHz

ECE 520 Class Notes
Timing in Design
z The ASIC vendor will develop delay models for each cell
z Best/worst models developed as transistor β changes with process,
temperature, etc.
z Models also developed for delay as a function of capacitive load (fan-out)
z Timing models are stored as part of the cell library
z Timing Verification is done with a combination of the following:
z Logic simulation using actual delays from the timing library
z Dynamic timing analysis
z Static timing analysis:
z A static timing analyzer traces through the actual logic and calculates slowest
and fastest delay for each true path through the logic
Sometimes, however, it can not tell the difference between true and false
(not possible) paths
Î Often the designer has to identify false paths
z It outputs a timing report for critical (Slowest) and other paths

ECE 520 Class Notes
Timing in ASIC Design Flow

Standard Delay Format (SDF) files:
z Tools communicate timing information using SDF files
z Specifies timing information as calculated from:
ASIC vendor timing models (Synopsys outputs as SDF file based on this
information)
Wiring delay calculations after place and route
z The SDF file is then used by the logic simulator or static timing verifier to verify
that the design will not have any setup or hold violations
Timing Closure
z = Ensuring no timing problems after “Place and Route”
z More than simple back-annotation needed
z Becoming more difficult with increasing clock frequency and finer lithography
z Interconnect RC delay becoming large compared with logic delay
z Current synthesis tools are “interconnect blind”
z New classes of tools emerging (Physical compilers)
z New planning tools needed in the future

ECE 520 Class Notes
Good and Bad Timing Design Practices

z Good General Practices:
z In each module, use only one clock and one edge.
z Pay attention to the amount of logic specified in long logic paths during
design
z Only use the globally distributed clock(s)
z Bad Design Practices:

z Feeding back combinational logic on itself
Specifying a latch!
z Using multiple clocks, multiple edges

z Gating the clock or generating local clocks

ECE 520 Class Notes
Exercise
z With a flip-flop based design, what is the minimum clock period?
z Is there potential for a hold violation?
z If t_clock_high is 50% of the clock period (“50% duty cycle”), and cycle
stealing is enabled, what is potentially the fastest possible latch-based
design?
Q1 T_clock_Q = 1-2 ns
D1 Comb D2 Tsu = 1 ns
Logic Q2
Thold = 2 ns
6-7 ns
Skew = 1ns
Flip Flop : T_clock > T_clock_Q_max + Tlogic_max + Tsu + Tskew

= 2 + 7 + 1 + 1 = 11 ns 90 MHz
Is thold+ tskew < Tclock-Q_min + tlogic_min : 2+1 < 1+6 No Hold violation
Latch : 1.5 T_clock > T_clock_Q_max + Tlogic_max + Tsu + Tskew
> 11
T_clock > 7.3 ns 136 MHz
ECE 520 Class Notes
Summary
z What determines the maximum clock frequency?
Total delay in critical path + Clock skew
• Determined in part by maximum logic depth
z What is a hold violation?

When logic change runs through storage elements in sequence
in one clock edge.
•Determined in part by minimum logic depth
z Why do we prefer flip-flop designs over using latches?
Less potential for hold violations
z What tool is used to check timing at design closure?
Static timing analysis, after logic & RC extraction.

ECE 520 Class Notes
Remember
z Methodology for purposes of ECE 520
z If at all possible one-edge of one clock
z If you need multiple clocks, they must have a common root and be
related by factors of 2
E.g. Root clock : Tclock = 5 ns
This is OK:
Î Tclock, Tclock10, Tclock20
Î Tclock10 = 10 ns (Tclock*2)
This is NOT OK
ONE CLOCK
Î Tclock, Tclock15, Tclock17
ONE EDGE
Î Tclock17 = 17 ns (??)

Timing Design in Digital Systems: Dr. Paul D. Franzon

Загружено:

Сведения о документе

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

Timing Design in Digital Systems: Dr. Paul D. Franzon

Загружено:

Авторское право:

Доступные форматы

ECE 520 Class Notes

Timing Design in Digital Systems

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 1

General Approach to Timing Design

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 2

Clock Level Timing

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 3

Critical path is longer

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 4

z Usually designed after synthesis: Matched buffers; matched capacitance loads

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 5

Flip-Flop based design

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 6

Preventing Set-Up Violations

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 7

Preventing hold violations

thold + tskew < tclock −Q − min + tlog ic − min

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 8

Latch Based Design

Set-up and hold times:

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 9

Latch timing constraints

Q1 tclock-Q tslack tset-up

Latch Set Up Violations

• If part of the following clock-high time is used to allow this logic to be

• Using the clock-high time like this is called cycle-stealing

z Normally cycle stealing is not enabled

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 11

Latches … Cycle Stealing

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 12

…Latch timing constraints

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 13

If this is the critical path, what is the fastest clock frequency?

Hold: Is thold + tskew < tclock −Q − min + tlog ic − min ?

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 14

CMOS Drive Strength

I ds = β ((Vgs − Vt )Vds − Vds2 / 2

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 15

Data Sheet Example

Tlogic CL: NOR2 DFF

CL= 0.0136 + 0.0184 = 0.032 pF

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 17

Tclock > Tcp-Q_max + Tlogic_max + Tsu_max

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 18

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 19

Timing in ASIC Design Flow

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 20

Good and Bad Timing Design Practices

z Bad Design Practices:

z Using multiple clocks, multiple edges

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 21

Flip Flop : T_clock > T_clock_Q_max + Tlogic_max + Tsu + Tskew

z What is a hold violation?

Less potential for hold violations

z What tool is used to check timing at design closure?

Static timing analysis, after logic & RC extraction.

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 23

Î Tclock, Tclock10, Tclock20

©2004 Dr. Paul D. Franzon, www.ece.ncsu.edu/erl/faculty/paulf.html 24

Вам также может понравиться