Академический Документы
Профессиональный Документы
Культура Документы
Chapter 2
Memory Hierarchy Design
Introduction
Introduction
Introduction
Memory Hierarchy
Introduction
Introduction
Intel Core i7 can generate two references per core per clock
Four cores and 3.2 GHz clock
25.6 billion 64-bit data references/second +
12.8 billion 128-bit instruction references
= 409.6 GB/s!
Introduction
Introduction
Introduction
Write-through
Write-back
Miss rate
Introduction
Causes of misses
Compulsory
Capacity
Conflict
Introduction
10
Higher associativity
Introduction
11
Advanced Optimizations
12
Advanced Optimizations
13
Advanced Optimizations
14
Advanced Optimizations
Way Prediction
Way selection
Increases mis-prediction penalty
Copyright 2012, Elsevier Inc. All rights reserved.
15
Examples:
Pentium: 1 cycle
Pentium Pro Pentium III: 2 cycles
Pentium 4 Core i7: 4 cycles
Advanced Optimizations
Pipelining Cache
16
Advanced Optimizations
Nonblocking Caches
17
Advanced Optimizations
Multibanked Caches
18
Early restart
Advanced Optimizations
19
Advanced Optimizations
No write
buffering
Write buffering
20
Loop Interchange
Advanced Optimizations
Compiler Optimizations
Blocking
21
Advanced Optimizations
Hardware Prefetching
Pentium 4 Pre-fetching
Copyright 2012, Elsevier Inc. All rights reserved.
22
Cache prefetch
Advanced Optimizations
Compiler Prefetching
23
Advanced Optimizations
Summary
24
Performance metrics
Cycle time
Memory Technology
Memory Technology
25
SRAM
Memory Technology
Memory Technology
DRAM
Every ~ 8 ms
Each row can be refreshed simultaneously
One transistor/bit
Address lines are multiplexed:
26
Amdahl:
Memory Technology
Memory Technology
Some optimizations:
Wider interfaces
Double data rate (DDR)
Multiple banks on each DRAM device
27
Memory Technology
Memory Optimizations
28
Memory Technology
Memory Optimizations
29
DDR:
DDR2
DDR3
1.5 V
800 MHz
DDR4
Memory Technology
Memory Optimizations
1-1.2 V
1600 MHz
30
Graphics memory:
Memory Technology
Memory Optimizations
Lower voltage
Low power mode (ignores clock, continues to
refresh)
31
Memory Technology
32
Type of EEPROM
Must be erased (in blocks) before being
overwritten
Non volatile
Limited number of write cycles
Cheaper than SDRAM, more expensive than
disk
Slower than SRAM, faster than disk
Memory Technology
Flash Memory
33
Memory Technology
Memory Dependability
34
Role of architecture:
Virtual Memory
35
Virtual Machines
36
37