Академический Документы
Профессиональный Документы
Культура Документы
2013
COMPUTER ARCHITECTURE
CSE Fall 2013
BK
TP.HCM
dce
2013
Chapter 5
B nh my tnh
Fall 2013, CS
dce
2013
Ni dung trnh by
Cng ngh v thut ng lin quan n b nh
T chc / thit k b nh
S cn thit phi c b nh m
Phn loi b nh m
nh gi hiu nng ca b nh m
Fall 2013, CS
dce
2013
My tnh PC
Nhn t bn ngoi, b
x l kt ni vi RAM
v chip cu bc (qun
l cc kt ni vo bus
tc cao nh card
ha, khe PCI), chip cu
bc ni vi chip cu
nam (qun l USB bus,
network, cng,
CD)
RAM
Card
ha
CPU
RAM
Chip cu bc
Khe cm PCI
Chip
cu
nam
Fall 2013, CS
H thng b nh my tnh
B nh my tnh hiu
theo ngha rng l tt
c ni cha d liu
(thanh ghi, b nh
m, b nh chnh, b
nh ngoi)
H thng b nh t
chc theo m hnh
phn cp, trn cng l
b thanh ghi, n b
nh m L1, L2, b
nh chnh, b nh ph
Processor
Control
Devices
Memory
Datapath
Input
Output
Microprocessor
Registers
L1 Cache
L2 Cache
Memory Bus
Bigger
2013
Faster
dce
Main Memory
I/O Bus
Magnetic or Flash Disk
Fall 2013, CS
RW
Rd
0
1
A
L
U
BusB
BusW
32
0
1
2
3
1
0
ALUout
D-Cache
Address
32
Data_out
WB Data
RB
BusA
32
Data_in
32
Rd4
Address
Rt 5
RA
ALU result 32
PC
Instruction
Rs 5
Register File
I-Cache
Instruction
0
1
Rd3
Imm16
2013
Rd2
dce
Data Block
D-Cache miss
Block Address
Instruction Block
I-Cache miss
Block Address
clk
Fall 2013, CS
dce
Cu to thanh ghi
2013
C
D
C
D0
D1
D Flip-Flop
Q
Data_In
Data_Out
.
.
.
32
D
C
D31
32
D Latch
WE
Clock
Computer Architecture Chapter 5
Fall 2013, CS
dce
2013
RA
Register
File BusA
32
RB
32
BusB
RW
Clock
BusW
RegWrite
32
Fall 2013, CS
dce
2013
RAM
Address
Data
m
OE
WE
Yu cu ghi d liu
Fall 2013, CS
dce
2013
Cng ngh b nh
Static RAM (SRAM) dng cho b nh m
1 bit nh cn 6 transistor
Tc truy xut d liu cao (~ 1ns)
Fall 2013, CS
10
dce
2013
Cu to SRAM
Fall 2013, CS
11
dce
2013
Cu to DRAM
Fall 2013, CS
12
dce
2013
Fall 2013, CS
13
dce
Chu k lm ti DRAM
2013
Voltage
for 1
1 Written
Refreshed
Refreshed
Refreshed
Threshold
voltage
Voltage
for 0
0 Stored
Refresh Cycle
Time
Fall 2013, CS
14
dce
V d mt IC DRAM
2013
Ch thch:
11 ng a ch A0-A10
Ai
CAS
Dj
NC
OE
RAS
WE
Address bit i
Column address strobe
Data bit j
No connection
Output enable
Row address strobe
Write enable
10 11
12
Fall 2013, CS
15
dce
2013
V d cu trc ca DRAM
La chn ct c/ghi
...
Column decoder
Row Decoder
Row address
Row decoder
2r 2c m bits
Cell Matrix
Cell Matrix
Mng 2 chiu cc phn t nh
Sense/Write amplifiers
Sense/write amplifiers
Data
...
Column Decoder
c
Column address
Fall 2013, CS
16
dce
Thng s DRAM
2013
Year
Produced
1980
1983
1986
1989
64 Kbit
256 Kbit
1 Mbit
4 Mbit
DRAM
DRAM
DRAM
DRAM
Row
access
170 ns
150 ns
120 ns
100 ns
1992
16 Mbit
DRAM
80 ns
15 ns
120 ns
1996
64 Mbit
SDRAM
70 ns
12 ns
110 ns
1998
128 Mbit
SDRAM
70 ns
10 ns
100 ns
2000
256 Mbit
DDR1
65 ns
7 ns
90 ns
2002
512 Mbit
DDR1
60 ns
5 ns
80 ns
2004
2006
2010
2012
1 Gbit
2 Gbit
4 Gbit
8 Gbit
DDR2
DDR2
DDR3
DDR3
55 ns
50 ns
35 ns
30 ns
5 ns
3 ns
1 ns
0.5 ns
70 ns
60 ns
37 ns
31 ns
Chip size
Type
Column
access
75 ns
50 ns
25 ns
20 ns
Cycle Time
New Request
250 ns
220 ns
190 ns
165 ns
Fall 2013, CS
17
dce
2013
Fall 2013, CS
18
dce
2013
Memory
Bus Clock
Millions Transfers
per second
Module
Name
Peak
Bandwidth
DDR-200
100 MHz
200 MT/s
PC-1600
1600 MB/s
DDR-333
167 MHz
333 MT/s
PC-2700
2667 MB/s
DDR-400
200 MHz
400 MT/s
PC-3200
3200 MB/s
DDR2-667
333 MHz
667 MT/s
PC-5300
5333 MB/s
DDR2-800
400 MHz
800 MT/s
PC-6400
6400 MB/s
DDR2-1066
533 MHz
1066 MT/s
PC-8500
8533 MB/s
DDR3-1066
533 MHz
1066 MT/s
PC-8500
8533 MB/s
DDR3-1333
667 MHz
1333 MT/s
PC-10600
10667 MB/s
DDR3-1600
800 MHz
1600 MT/s
PC-12800
12800 MB/s
DDR4-3200
1600 MHz
3200 MT/s
PC-25600
25600 MB/s
Fall 2013, CS
19
dce
2013
Ni dung trnh by
Cng ngh v thut ng lin quan n b nh
T chc / thit k b nh
S cn thit phi c b nh m
Phn loi b nh m
nh gi hiu nng ca b nh m
Fall 2013, CS
20
dce
T chc b nh
2013
n
a
ch
Gii m
a ch
4
a ch 00 (0)
a ch 01 (1)
a ch 10 (2)
a ch 11 (3)
Gii m
a ch
2n
la
chn
4 x 8 bit
D liu 8 bit
a ch 000 (0)
a ch 001 (1)
a ch 010 (2)
a ch 011 (3)
a ch 100 (4)
a ch 101 (5)
a ch 110 (6)
a ch 111 (7)
Gii m
a ch
8 x 16 bit
D liu 16 bit
Fall 2013, CS
21
21
dce
2013
A15 A0
64K x 8 bit
64K x 8 bit
64K x 8 bit
2
A17
A16
4
64K x 8 bit
Gii m
a ch
D7 D0
2010, Dr. Dinh Duc Anh Vu
Fall 2013, CS
22
22
dce
2013
64K x 8 bit
64K x 8 bit
64K x 8 bit
64K x 8 bit
D31 D24
D23 D16
D15 D8
D7 D0
A15 A0
Fall 2013, CS
23
23
dce
Gii m a ch b nh
2013
A12
A11
00
0000 0000
08
0000 1000
B nh
0000
0000
07FF
0800
Bank 0
Tuyn a ch
Mch
gii m
a ch
0FFF
1000
17FF
07FF
0000
Bank 1
07FF
0000
1800
Bank 2
1FFF
07FF
0000
10
0001 0000
Bank 3
18
07FF
0001 1000
2010, Dr. Dinh Duc Anh Vu
Fall 2013, CS
24
24
dce
Gii m a ch b nh
2013
a ch
A15
A14
A13
A12
A11
A10
A9
A8
A7
A6
A5
A4
A3
A2
A1
A0
0000
0001
07FF
0800
0801
0FFF
1000
1001
17FF
1800
1801
...
...
...
...
1FFF
Fall 2013, CS
25
25
dce
2013
U5
A0
A1
A2
A3
A4
A5
A6
A7
A8
A9
A10
RD
WR
8
7
6
5
4
3
2
1
23
22
19
18
20
21
A0
A1
A2
A3
A4
A5
A6
A7
A8
A9
A10
U2A
A11
A12
2
3
MREQ
A
B
G
Y0
Y1
Y2
Y3
4
5
6
7
RD
WR
8
7
6
5
4
3
2
1
23
22
19
18
20
21
D0
D1
D2
D3
D4
D5
D6
D7
D0
D1
D2
D3
D4
D5
D6
D7
CS
OE
WE
U6
A0
A1
A2
A3
A4
A5
A6
A7
A8
A9
A10
U3
9
10
11
13
14
15
16
17
A0
A1
A2
A3
A4
A5
A6
A7
A8
A9
A10
A0
A1
A2
A3
A4
A5
A6
A7
A8
A9
A10
RD
WR
8
7
6
5
4
3
2
1
23
22
19
18
20
21
HM6116/SO
D0
D1
D2
D3
D4
D5
D6
D7
9
10
11
13
14
15
16
17
CS
OE
WE
D0
D1
D2
D3
D4
D5
D6
D7
A0
A1
A2
A3
A4
A5
A6
A7
A8
A9
A10
8
7
6
5
4
3
2
1
23
22
19
18
20
21
D0
D1
D2
D3
D4
D5
D6
D7
9
10
11
13
14
15
16
17
D0
D1
D2
D3
D4
D5
D6
D7
CS
OE
WE
U4
RD
WR
HM6116/SO
A0
A1
A2
A3
A4
A5
A6
A7
A8
A9
A10
A0
A1
A2
A3
A4
A5
A6
A7
A8
A9
A10
HM6116/SO
D0
D1
D2
D3
D4
D5
D6
D7
9
10
11
13
14
15
16
17
D0
D1
D2
D3
D4
D5
D6
D7
CS
OE
WE
HM6116/SO
74LS139
Fall 2013, CS
26
26
dce
Bi tp (1)
2013
MREQ
2
3
A
B
G
U8A
Y0
Y1
Y2
Y3
4
5
6
7
CS1
CS2
A11
A12
74LS139
2
3
A
B
G
Y0
Y1
Y2
Y3
4
5
6
7
CS5
U9A
1
CS3
CS4
74LS139
74LS08
U10A
1
2
74LS08
Fall 2013, CS
27
27
dce
Bi tp (2)
2013
A10
A11
A12
1
2
3
15
Y0
Y1
Y2
Y3
A15
A14
MREQ
Y4
6
4
5
G1
Y5
G2A
Y6
G2B
Y7
U2A
14
A8
13
A9
12
11
2
3
Y0
Y1
Y2
1
G
10
9
4
A
CS1
U3A
13
12
CS3
Y3
74LS139
74LS11
74LS138
U2B
A7
A8
14
13
12
A
Y0
Y1
Y2
15
G
11
10
CS2
Y3
74LS139
Fall 2013, CS
28
28
dce
2013
Bi tp (3)
5. Mt h thng my tnh
dng CPU c tuyn a ch
16 ng A15 A0 v c
bn s dng b nh
nh sau:
a) Hy xc nh tm a ch
s dng ca tng chip b
nh trong h thng.
b) Thit k mch gii m
a ch b nh y cho
CPU trn. Tn hiu chn b
nh c tn l MREQ.
0000H
4K SRAM1
1K khng c
2K EPROM1
3K khng c
4K SRAM 2
2K EPROM 2
Fall 2013, CS
29
29
dce
2013
Ni dung trnh by
Cng ngh v thut ng lin quan n b nh
T chc / thit k b nh
S cn thit phi c b nh m
Phn loi b nh m
nh gi hiu nng ca b nh m
Fall 2013, CS
30
dce
2013
Fall 2013, CS
31
31
dce
2013
Cc b v mt khng gian
Mt lnh ang c thc thi, lnh k tip nhiu kh nng c
thc thi
Mt phn t c tham kho, cc phn t gn c kh truy
xut sau
VD: Cc lnh c thc thi tun t; duyt cc phn t mng
lin tip nhau
Computer Architecture Chapter 5
Fall 2013, CS
32
dce
2013
Cc b v mt thi gian
sum =
for (i
sum
return
0;
= 0; i<n; i++)
+= a[i];
sum
Cc b v mt khng gian
Cc phn t ca mng a[] ln lc c tham kho qua mi ln
lp
Fall 2013, CS
33
dce
2013
Cc b v mt khng gian
Sch ca nhng mn khc
(vd: Cu trc d liu gii
thut) c ging dy trong
k ny cng c tham kho
Fall 2013, CS
34
dce
2013
S cn thit ca b nh m
Khon cch tc ngy cng xa gia CPU - DRAM
Fall 2013, CS
35
dce
2013
B nh m l g ?
B nh nhanh (SRAM), dung lng t
m mt phn d liu v lnh ca chng trnh ang thc thi
Mc ch t c
Tc truy xut nhanh ca SRAM dng trong b nh m
Fall 2013, CS
36
T chc b nh phn cp
Cc thanh ghi trn cng
Dung lng thng thng < 1 KB
Thi gian truy xut (access time) < 0.5 ns
B nh m cp 1 L1 Cache (8 64 KB)
Microprocessor
Access time: 1 ns
B nh m L2 (512KB 8MB)
Registers
Access time: 3 10 ns
L1 Cache
B nh chnh (4 16 GB)
L2 Cache
Memory Bus
Bigger
2013
Faster
dce
Main Memory
I/O Bus
Magnetic or Flash Disk
Fall 2013, CS
37
dce
c tnh ca b nh phn cp
2013
Chiu tng
ca thi
gian truy
xut theo
khong
cch n
b x l
n v dch
chuyn
gia mi
cp
Processor
4-8 bytes (word)
L1$
8-32 bytes (block)
L2$
1 to 4 blocks
Main Memory
1,024+ bytes
(disk sector = page)
Secondary Memory
Tnh bao gp
nhng g
trong L1$ l
tp con ca
nhng g
trong L2$ v
l tp con ca
nhng g
trong b nh
chnh v l tp
con ca b
nh th cp
38
Fall 2013, CS
38
dce
m d liu ca b nh m
2013
Cp k:
D liu c
dch chuyn
gia cc cp
theo n v
khi (block)
Cp k+1:
Cp m k c tc
nhanh, dung lng t,
d liu l tp con ca
cp m k+1
Cp m k+1 c tc
chm, dung lng ln,
d liu c chia
thnh tng khi
Fall 2013, CS
39
dce
Cc khc nim v b nh m
2013
Cache hit:
Cp k:
Cache miss:
b (vd: 12) khng c cp k, phi
ly t cp k+1, ng thi c
a vo cp k
Cch nh x? b s c a vo
v tr no (vd: 12 mod 4 = 0)
Cp
k+1:
Nu b nh m cp k y, mt
block no s c thay th:
Chin lc thay th? vd: thay th
bock c a vo sm nht
Computer Architecture Chapter 5
Fall 2013, CS
40
dce
2013
Hot ng ca b nh m
B x l yu cu load mt d liu w trong block b
Hit (tm
thy trong
b nh
m)
a block b vo b nh
m + thay th nu cn
Ly d liu w t block b
b nh m cho b x l
Computer Architecture Chapter 5
Fall 2013, CS
41
dce
2013
Ni dung trnh by
Cng ngh v thut ng lin quan n b nh
T chc / thit k b nh
S cn thit phi c b nh m
Phn loi b nh m
nh gi hiu nng ca b nh m
Fall 2013, CS
42
dce
2013
Bn cu hi lin quan ti b nh m
Q1: Mt khi s c a
vo u trong b nh m?
Direct Mapped, Set Associative,
Fully Associative
Fall 2013, CS
43
dce
2013
Q1: a mt block vo b nh m
Block 12 c Bt k v tr
no
a vo v tr
no?
Computer Architecture Chapter 5
Bt k v tr
no trong set
0 (12 mod 4)
Ti v tr 4
(12 mod 8)
Fall 2013, CS
44
dce
2013
Fall 2013, CS
45
dce
2013
Fall 2013, CS
46
dce
2013
nh x mt a ch vo mt block b nh m
V d:
Xt mt b nh m direct-mapped c 256 block (line)
Block size = 16 bytes
Li gii
32-bit a ch c chia:
Block Address
20
Tag
Index offset
Fall 2013, CS
47
dce
2013
Xt mt b nh m direct-mapped c 32 block
B nh m ban u rng, Block size = 16 bytes
Cc a ch sau (dng thp phn) c tham kho:
Li gii:
1000 = 0x3E8
1004 = 0x3EC
1008 = 0x3F0
2548 = 0x9F4
2552 = 0x9F8
2556 = 0x9FC
Tag
Index offset
48
dce
2013
00
0 miss
Mem(1) Mem(0)
00
1 hit
Mem(1) Mem(0)
00
00
3 hit
Mem(1) Mem(0)
Mem(3) Mem(2)
01
00
00
4 miss
5
4
Mem(1) Mem(0)
Mem(3) Mem(2)
01
00
4 hit
Mem(5) Mem(4)
Mem(3) Mem(2)
1101
00
00
00
2 miss
Mem(1) Mem(0)
Mem(3) Mem(2)
01
00
3 hit
Mem(5) Mem(4)
Mem(3) Mem(2)
15 miss
Mem(5) Mem(4)
15
14
Mem(3) Mem(2)
2011, Dr. Dinh Duc Anh Vu
Fall 2013, CS
49
49
dce
2013
Fall 2013, CS
50
dce
2013
Cc kt hp thng dng:
Write Through & No Write Allocate
Write Back & Write Allocate
Computer Architecture Chapter 5
Fall 2013, CS
51
dce
2013
Ni dung trnh by
Cng ngh v thut ng lin quan n b nh
T chc / thit k b nh
S cn thit phi c b nh m
Phn loi b nh m
nh gi hiu nng ca b nh m
Fall 2013, CS
52
dce
2013
Hit Rate
Fall 2013, CS
53
dce
2013
Fall 2013, CS
54
dce
2013
Fall 2013, CS
55
dce
2013
Fall 2013, CS
56
dce
2013
Fall 2013, CS
57
dce
2013
Instruction
data
Fall 2013, CS
58
dce
2013
Li gii:
AMAT = 1 + 0.05 20 = 2 cycles = 4 ns
Khi khng dng b nh m, AMAT s bng Miss penalty = 20 cycle
Computer Architecture Chapter 5
Fall 2013, CS
59
dce
2013
Fall 2013, CS
60
dce
2013
Fall 2013, CS
61
dce
2013
Miss Rate
14%
1-way
12%
2-way
10%
4-way
8%
8-way
6%
Capacity
Compulsory
4%
2%
0
1
16
32
64
62