Академический Документы
Профессиональный Документы
Культура Документы
Pentium 4
125M transistors
850mW/mm2
90nm tech
103W
3.4GHz
Individual components
Flops & latches
(power and timing critical)
D Q
D Q
Logic
Clk
Clk
Individual components
Flops & latches
(power and timing critical)
D Q
D Q
Vdd2,
Vdd3,
Vth2
Vth3
Vdd4,
Vdd5,
Vth4
Vth5
Logic A
Clk
-Physical (Vdd,Vth,Sizing)
-Logic
-uArchitecture
(parallelism, pipelining)
D Q
Logic
Clk
System level,
VLSI blocks and circuits
Logic B
Clk
D Q
D Q
Logic A
D Q
Logic A
Logic B
Clk
D Q
Logic B
Clk
Clk
Clk
Channel
Transmitter
Receiver
Circuits
Communications
Optimization
Integrated Systems Group
High speed
link chip
Outline
Backplane environment
Package
On-chip parasitic
Line card trace
Package
via
Line card
via
Backplane via
Line attenuation
Reflections from stubs (vias)
Integrated Systems Group
10
Backplane channel
Loss is variable
Same backplane
Different lengths
Different stubs
-20
-30
-40
>30dB @ 3GHz
But is that bad?
-50
9" FR4
-10
Attenuation is large
Attenuation [dB]
26" FR4
9" FR4,
via stub
26" FR4,
via stub
-60
6
8
10
frequency [GHz]
11
pulse response
1
0.8
0.6
Tsymbol=160ps
0.4
0.2
Dispersion
short latency
(skin-effect,
dielectric loss)
Reflections
long latency
(impedance mismatches
connectors, via stubs,
device parasitics,
package)
0
0
3
ns
Integrated Systems
Group
12
ISI
1
Error!
Amplitude
0.8
0.6
0.4
0.2
0
0
8
10 12
Symbol time
14
16
18
13
Data Slicer
Channel
serializer
deserializer
dataOut
ref Clk
PLL
Clock, data
recovery
14
15
Mostly non-existent
Gaussian distributions
16
-2
-4
-6
-8
-10
-1 0
40m V erro r @ 10
25% o f eye h eig h t
25
50
75
100
re sidual ISI [m V ]
10
9% T s ym bol
-4
-6
-8
-10
4% T s ym bol
log
log
10
probability [cdf]
erro r @ 10
80
-1 0
17
A new model
Interference
18
Voltage noise
when receiver
clock is off
Jittered
sampling
Ideal
sampling
Voltage noise
19
bk
TX
k
kT
TX
k +1
(k + 1)T
kT
(k +1)T
kTX
bk
bk
TX
k +1
noise
bk kTX
bk kTX+1
bk
20
Transmitter jitter
kRx
Receive jitter
kRx
21
RefClk
Phase
+detector
Kpd
Icp
Icp
VCO
R Kvco/s
C
Clock
buffer
from
clock buffer supply
-10
-20
from
VCO supply
-30
N
5
10
Noise sources
10
10
10
10
10
10
frequency [Hz]
M. Mansuri, C-K.K. Yang, "Jitter optimization based on phase-locked loop design parameters,"
IEEE Journal Solid-State Circuits, Nov. 2002
E. Alon, V. Stojanovic, M. Horowitz Circuits and Techniques for High-Resolution Measurement
of On-Chip Power Supply Noise, IEEE Symposium on VLSI Circuits, June 2004.
Integrated Systems Group
22
dn
dataOut
dn
PD
en
data Clk
edge Clk
Phase
mixer
en (late)
ref Clk
PLL
dn-1
Phase
control
23
pdn,i
phold ,i
-5
-10
i 1 i
pup,i
i +1
-15
50
100
150
200
250
Phase Count
A.E. Payzin, "Analysis of a Digital Bit Synchronizer," IEEE Transactions on Communications, April 1983.
Integrated Systems Group
24
Outline
25
Attenuation [dB]
Baseline channels
0
-20
26" NELCO,
no stub
(b)
-40
-60
-80
26" FR4,
via stub
-100
0
10
15
20
frequency [GHz]
26
140
therm al noise
120
100
80
Capacity [Gb/s]
Capacity [Gb/s]
NELCO
140
120
100
therm al noise
80
60
60
40
40
20
20
0
-25
0
-25
-20
log10(Clipping probability)
-15
-10
-5
0
-20
log10(Clipping probability)
-15
-10
-5
0
27
Removing ISI
Linear transmit equalizer
Tx
Data
Sampled
Data
Anticausal taps
Deadband
Feedback taps
Channel
50
Causal
taps
outP
outN
I eq 0
TapSel
Logic
50
Decision-feedback equalizer
J. Zerbe et al, "Design, Equalization and Clock Recovery for a 2.5-10Gb/s 2-PAM/4-PAM Backplane
Transceiver Cell," IEEE Journal Solid-State Circuits, Dec. 2003.
Integrated Systems Group
28
Tx
Data
Anticausal taps
Attenuation [dB]
unequalized
-5
-10
-15
Channel
equalized
-20
Causal
taps
-25
0
frequency [GHz]
0.5
1.5
2.5
29
Optimization example:
Power constrained linear precoding
pow er constraint
ak
precoder
channel
pulse response
g
noise
ak
ek
ak
MSE( w, g ) = Ea 1 2 g w P1 + g 2 w PPT w + g 2 2
Ea ( w P1 ) 2
SINRunbiased ( w) =
T
T
T
Ea w P (I 1 1 )(I 1 1 )T P T w + 2
SINRunbiased
maximize =
T
T
T
w
Ea w P (I 1 1 I PD )(I 1 1 I PD )T P T w + 2
s.t.
1/ 2
w 1 1
2=wTS0TXw+wTS0RXw+2thermal
Minimize BER
31
Feedback
equalization
0.8
Amplitude
0.6
0.4
0.2
0
0
6 8 10 12 14 16 18
Symbol time
32
Pulse response
+1
-1
33
+1+
+1
+
0
-1+
-1
34
+1+
+1
+
+1-
0
-
-1+
-1
-1-
35
+1+
d n | d n 1 = 1
+
+1-
xn
D Q
dClk
-
-1+
d n | d n 1 = 0
-
-1-
d n 1
dClk
36
BER contours
5 tap Tx Eq
150
-5
-5
100
-10
50
-15
0
-50
-20
-100
-150
0
20
40
margin [mV]
margin [mV]
100
-10
50
-15
0
-50
-20
-25
-100
-25
-30
-150
0
20
40
-30
Voltage margin
37
Binary (NRZ)
1 bit / symbol
Symbol rate = bit rate
PAM4
2 bits / symbol
Symbol rate = bit rate/2
00
1
01
11
10
38
thermal noise
35
PAM16
30
PAM8
25
20
PAM16
PAM4
25
20
5
2
4 6
PAM4
15
8 10 12 14 16 18 20
Symbol rate [Gs/s]
PAM2
10
10
0
0 2 4 6 8 10 12 14 16 18 20
Symbol rate [Gs/s]
0
0 2 4 6 8 10 12 14 16 18 20
Symbol rate [Gs/s]
PAM2
10
PAM8
25
PAM2
PAM4
15
30
15
20
0
0
30
Data rate [Gb/s]
45
40
thermal noise +
offset+
jitter
PAM8
39
18
18
16
PAM4
14
12
20
20
20
18
16
16
14
14
PAM8
12 PAM16
10 PAM8
8
10
8
PAM2
thermal noise
thermal noise
+ offset+ jitter
PAM4
12
PAM4
10
PAM2
PAM8
PAM2
0
0 2 4 6 8 10 12 14 16
Symbol rate [Gs/s]
0
0 2 4 6 8 10 12 14 16
Symbol rate [Gs/s]
0
0 2
6 8 10 12 14 16
Symbol rate [Gs/s]
40
Outline
Low-cost adaptation
Dual-mode link (hardware re-use)
41
CDR
Logic
Phase
Mixers
PLL
Receiver
Reflection
Canceller
PAM2/PAM4
2-10Gb/s
0.13m
40mW/Gb/s
Transmitter
Backchannel RX
Backchannel TX
Adaptive equalization
Transmit and receive equalization
DFE with loop unrolling
error
adaptive
aClk sampler
Rx data
Channel
Adaptive
macro
dClk
thresholds
tap
updates
edge
CDR
eClk
Adaptive sampler
tap updates
43
errorinit
xn
dLevmid
p-p
dLevend
Sign(en )
Initial eye
Sign( xn )
Mid-way equalized
Equalized
Equalizer loop
wn +1 = wn + stepw sign(en ) sign( x n )
Scale the equalizer - output Tx constraint
Integrated Systems Group
44
1000
800
dLev [mV]
80
60
40
20
400
200
post2
pre1
-200
0
0
main tap
600
50
100
150
number of updates
200
-400
0
post1
50
100
150
200
number of updates
Both loops are stable within wide range 0.1 10x of relative
speeds
Integrated Systems Group
45
thresh (+)
D Q
D Q
dClk
in
D Q
D Q
prDFE enable
0
dClk
msb
D Q
thresh(-)
1
D Q
thresh (-)
D Q
prDFE enable
D Q
dClk
lsb(+)
D Q
thresh(+)
0
lsb(-)
D Q
PAM4
46
thresh (+)
D Q
D Q
dClk
in
D Q
prDFE enable
D Q
dClk
thresh (-)
clk
D Q
inP
outN
outP
inP
D Q
clk
prDFE enable
outP
D Q
dClk
PAM4
msb
D Q
thresh(-)
lsb(+)
D Q
thresh(+)
D Q
outN
inN
I
+ I thresh
2
I
I th resh
2
clk
comparator
47
lsb(-)
thresh (+)
D Q
D Q
lsb(+)
D Q
0
0
dClk
in
D Q
D Q
prDFE enable
0
dClk
msb
D Q
1
1
D Q
thresh (-)
D Q
prDFE enable
D Q
dClk
lsb(-)
D Q
PAM2
48
thresh (+)
D Q
D Q
lsb(+)
D Q
dClk
in
D Q
D Q
prDFE enable
0
dClk
msb
D Q
1
1
D Q
thresh (-)
D Q
prDFE enable
D Q
dClk
lsb(-)
D Q
49
thresh (+)
D Q
D Q
lsb(+)
D Q
thresh(+)
dClk
in
D Q
D Q
prDFE enable
0
dClk
thresh(-)
1
D Q
thresh (-)
D Q
prDFE enable
D Q
dClk
msb
D Q
lsb(-)
D Q
50
unequalized
0.3
-3
200
0.2
150
-3.5
100
0.1
-4
[ps]
0
0.25
1000
[V]
2000
3000
[mV]
50
4000
0.2
0.1
0
0
1000
2000
-5
0
0.05
-4.5
-100
0.15
0
-50
transmit equalized
with one tap DFE
[V]
50
100
150
200 [ps]
[ps]
3000
4000
Integrated Systems Group
51
log10(BER)
-2
-4
-6
-8
-10
-12
-14
80
60
40
20
52
Outline
53
bits/dimension
Nelco 64 Gb/s
FR4 38 Gb/s
4
2
0
10
12
GHz
54
data0
LPF
Nelco 64 Gb/s
dataN
LPF
10
12
GHz
BPF
ejw1t
BPF
data0
LPF
BPF
LPF
data1
ejw1t
# levels
LPF
FR4 38 Gb/s
data1
BPF
LPF
dataN
f
ejwNt
ejwNt
Challenge balancing the inter-symbol and
inter-channel interference
55
Multi-modal dispersion
0.4
0.8
0.2
0.6
0
0
10
15
20
25
0.4
0.2
0
0
0.8
0.6
0.4
Source - Corning
0.2
0
0
56
2000
500
0
5
-2000
5
x 10
-5
0
-5 -5
5
x 10
-5
-5
x 10
-5 -5
2000
2000
-2000
5
-2000
5
x 10
-5
0
-5 -5
5
-5
x 10
x 10
-5
-5 -5
5
x 10
-5
5
x 10
-5
57
0
5
x 10
-5
0
-5 -5
5
x 10
-5
MEMS
Spatial Light Modulator
Optimize to reduce modal dispersion
dmin
58
Conclusions
Baseband links
59