Академический Документы
Профессиональный Документы
Культура Документы
The Issues Silicon designers can choose a variety of methods to increase processor performance Commercial end-customers are demanding
More capable systems with more capable processors That new systems stay within their existing power/thermal infrastructure
Processor frequency and power consumption seem to be scaling in lockstep How can the industry-standard PC and Server industries stay on our historic performance curve without burning a hole in our motherboards? This session is not about process technology
Session Outline Definition: What is a processor? Core Design System Architecture Manufacturing, Power, and Thermals Multi-Core Processor Architecture Performance Impacts
What is a Processor? A single chip package that fits in a socket 1 core (not much point in <1 core)
Cores can have functional units, cache, etc. associated with them, just as today Cores can be fast or slow, just as today
Shared resources
More cache Other integration: Northbridge, memory controllers, high-speed serial links, etc.
Core Design
Frequency
Is only as good as the rest of the core architecture
Fetch L1 Icache 64KB
Branch Prediction
L1 Dcache 64KB
Data paths
Increasing bandwidth between functional units in a core makes a difference
Such as comprehensive 64-bit design, but then where to?
Core Design
Pipeline
Deeper pipeline buys frequency at expense of increased cache miss penalty and lower instructions per clock Shallow pipeline gives better instructions per clock at the expense of frequency scaling Max frequency per core requires deeper pipelines Industry converging on middle ground9 to 11 stages
Successful RISC CPUs are in the same range
Cache
Cache size buys performance at expense of die size, its a direct hit to manufacturing cost Deep pipeline cache miss penalties are reduced by larger caches Not always the best match for shallow pipeline cores, as cache misses penalties are not as steep
As manufacturing economies and frequency increase, power consumption is increasing disproportionately There are no process or architectural quick-fixes
Transistors Are Not Free The number of transistors in a core determines basic power consumption Architectural efficiency matters a lot when designing new cores
More functional units means more transistors Deeper pipelines mean more transistors Larger caches mean more transistors
Static Current
Very High Leakage and Power Embedded Parts Fast, Low Power Fast, High Power
1.0
Frequency
1.5
Power Consumption
Frequency
(Gross relative numbers summarized from a mountain of real data)
Total Effect on Dual-Core Frequencies Substantially lower power with lower frequency Thermals easier to handle at any frequency Result is dual-core running at n-2 in same thermal envelope as single-core running at top speed
What to integrate?
Northbridge crossbar switch is key
Look for innovation and differentiation in how cores are connected on-chip Must integrate Northbridge to integrate anything else
Memory controller to reduce memory latency and further reduce the need for cache High-speed serial links for system I/O
Crossbar (XBAR)
RAS/CAS/Cntl
DRAM Data
Frequencies
Platforms
OLTP Workload
4P/8C 4P/4C 2P/4C 2P/2C
Memory, disks, network, etc. % Comparison to Base 2P/2C Early dual-core validation SPEC and the benchmark name SPECint are registered system used, different trademarks of the Standard Performance Evaluation motherboards Corporation. SPEC scores for AMD Opteron Model 270
and 870 based systems are estimated
Call to Action Most application software doesnt need to do anything to benefit from dual-core Be aware that, for a processor within a given power envelope
Fewer cores will clock faster than more cores
Single-threaded performance-sensitive applications
Community Resources
Windows Hardware & Driver Central (WHDC)
www.microsoft.com/whdc/default.mspx
Technical Communities
www.microsoft.com/communities/products/default.mspx
Microsoft Blogs
www.microsoft.com/communities/blogs
Web Resources
AMD http://www.amd.com/ AMD Multi-Core http://www.amd.com/multicore/ AMD Opteron Processor http://www.amd.com/opteron/ AMD Multi-Core White Paper
http://enterprise.amd.com/downloadables/33211A_Multi-Core_WP.pdf