
Chapter 1

Computer System Overview

Spring 2015

2015 by Greg Ozbirn, UTD, for use with Stallings' 8th Ed. OS book

Introduction
Computer system:
Hardware
Software
Operating System (system software)
Applications

Hardware
Processor
The processor (CPU) is the brain of the system.
It fetches instructions from memory and executes them.
Main memory
Volatile storage for programs and data
I/O modules
Allow data to be moved to and from the computer and
I/O devices.
System bus
The pathway for data and instructions to move between
the processor, memory, and I/O modules.

Hardware Configuration

[Figure: the CPU, I/O Modules, and Main Memory connected by the System Bus; Input/Output Devices attach to the I/O Modules.]

The system bus connects the CPU, I/O Modules, and Main Memory.

Evolution of the Microprocessor


The invention of the microprocessor enabled desktop and
handheld computing.
A microprocessor is a processor contained on a single chip.
Originally slower but now faster than multichip
processors.
Modern microprocessors may have multiple processors on
a single chip, called processor cores.
Each core might be able to support two logical processors
by sharing an execution unit.
A modern microprocessor chip might contain 2 or 4 cores,
enabling a view of 4 to 8 logical processors.

Evolution of the Microprocessor

GPU: Graphical Processing Unit.


A GPU provides efficient computation on arrays of data using SIMD
(Single-Instruction Multiple Data) techniques.
Originally for graphics, now used in general numeric processing,
including physics simulations, games, and large spreadsheets.

DSP: Digital Signal Processor.

A DSP can process a stream of data such as audio or video.

Other supporting chips co-exist with the CPU to provide such things as
encoding/decoding video (codecs), or for encryption and security.

SoC: System on a Chip.


For handheld devices where many of the components (CPU, DSP,
GPU, Memory, etc.) are on a single chip.

Instruction Cycle
A program is a set of instructions to be executed in
memory.
A simplified view of the processor consists of two phases,
fetch and execute.
The fetch step retrieves an instruction from memory.
The execute step carries out the instruction.
After each fetch, the program counter (PC) is normally
incremented so that it is ready to fetch the next instruction.
The fetched instruction loads into the IR.
The instruction in the IR is then decoded to determine
what the processor should do.
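The fetch-execute loop above can be sketched in a few lines of Python. This is a toy machine invented for illustration; the instruction names and memory layout are not from the slides.

```python
# Minimal sketch of the fetch-execute cycle on a hypothetical toy machine.
memory = {0: ("LOAD", 10), 1: ("ADD", 11), 2: ("HALT", 0), 10: 5, 11: 7}

pc = 0      # program counter
ac = 0      # accumulator
ir = None   # instruction register

while True:
    ir = memory[pc]          # fetch: the instruction loads into the IR
    pc += 1                  # PC incremented so it points at the next instruction
    opcode, operand = ir     # decode the instruction in the IR
    if opcode == "LOAD":     # execute
        ac = memory[operand]
    elif opcode == "ADD":
        ac += memory[operand]
    elif opcode == "HALT":
        break

print(ac)  # 5 + 7 = 12
```

Real processors decode bit fields rather than tuples, but the fetch / increment / decode / execute ordering is the same.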

Processing

[Figure: the CPU and Main Memory, with Fetch Instruction and Execute arrows between them.]

The CPU fetches an instruction from memory and executes it.

Instruction Execution

An instruction cycle is the processing of a single instruction, involving the fetch cycle and execute cycle.

Instruction Actions
Processor-memory
transfer data between processor and memory
Processor-I/O
data transferred to or from a peripheral device
Data processing
arithmetic or logic operation on data
Control
alter sequence of execution (branch instruction)


Instruction and Data Formats

Example instruction format (16 bits):
bits 0-3: Opcode
bits 4-15: Address

Example data format (16-bit integer):
bit 0: Sign
bits 1-15: Magnitude

Instruction Format
Notice that a machine's architecture fixes an instruction size, such as 16-bit or 32-bit.
This limits the size of an instruction.
If part of the instruction is an opcode and part an address,
then the portion set aside for the opcode determines how
many opcodes there can be, and the portion set aside for the
address determines how much memory is directly
addressable.
For example, with 4-bit opcodes, only 16 opcodes are
possible (which isn't many).
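A quick check of the arithmetic for the 16-bit format above (4-bit opcode, 12-bit address):

```python
# 16-bit instruction: 4-bit opcode field, 12-bit address field.
opcode_bits = 4
address_bits = 16 - opcode_bits

num_opcodes = 2 ** opcode_bits         # 16 possible opcodes
addressable_words = 2 ** address_bits  # 4096 directly addressable words

print(num_opcodes, addressable_words)  # 16 4096
```

Widening either field narrows the other, which is the design tension the slide describes.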


Example Opcodes
0001 = Load AC from memory
0010 = Store AC to memory
0101 = Add to AC from memory
AC is the Accumulator register. It is a register
commonly used for loading and storing values to memory
and doing arithmetic.
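These three opcodes are enough to trace a small program. The sketch below assumes the 16-bit format from the earlier slide (high 4 bits opcode, low 12 bits address); the program and memory contents are illustrative:

```python
# Decode/execute for the three example opcodes:
# 0001 = Load AC from memory, 0010 = Store AC to memory, 0101 = Add to AC.
def step(memory, pc, ac):
    instr = memory[pc]
    opcode = instr >> 12          # high-order 4 bits
    addr = instr & 0x0FFF         # low-order 12 bits
    if opcode == 0b0001:          # Load AC from memory
        ac = memory[addr]
    elif opcode == 0b0010:        # Store AC to memory
        memory[addr] = ac
    elif opcode == 0b0101:        # Add to AC from memory
        ac += memory[addr]
    return pc + 1, ac

# Program: load mem[0x940], add mem[0x941], store the sum back to mem[0x941].
memory = {0x300: 0x1940, 0x301: 0x5941, 0x302: 0x2941, 0x940: 3, 0x941: 2}
pc, ac = 0x300, 0
for _ in range(3):
    pc, ac = step(memory, pc, ac)

print(ac, memory[0x941])  # 5 5
```

Each 16-bit instruction word, e.g. 0x1940, packs an opcode (0x1, Load) with an address (0x940).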


Interrupts
Interrupts are a mechanism for interrupting the processor's
execution in order to do something else.
Common types of interrupts are:
Program instruction exceptions:
arithmetic overflow
division by zero
executing an illegal instruction
referencing outside the user's memory space
Timer: system timer
I/O: completion of I/O or an I/O error
Hardware failure: power failure or memory parity error


Interrupts
Interrupts improve processor utilization by notifying the
processor of an event versus making the processor wait.
This frees the processor to do other processing.
Without interrupts, the processor would have to keep
checking to see if an event has occurred.
With interrupts, the processor is free to do something else
until the event occurs, at which time it will be notified.


Interrupts
An example of using interrupts to improve processor
utilization is in processing I/O commands.
Most I/O devices are much slower than the processor.
Rather than have the processor wait for an I/O to finish,
the processor could continue processing and be interrupted
when an I/O finishes. This is much more efficient.
The next slide illustrates this technique:


[Figure: program timing with a short I/O wait. Without interrupts, the processor waits through each I/O operation before continuing. With interrupts, program segments (2a/2b, 3a/3b) execute during the I/O operations, and the interrupt handler runs when each interrupt arrives.]

Program timing: Short I/O Wait


Interrupt Cycle
After instruction execution, the processor checks to see if
an interrupt has occurred.
If so, it executes an Interrupt Handler, which is code that
processes the interrupt.
The processor must save system state (the registers) on the
stack before executing the interrupt handler, or the
interrupt handler may be written to preserve system state.
The processor may have to determine which interrupt
handler to run for this interrupt.
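A minimal sketch of this check-after-every-instruction behavior, as a Python simulation; the interrupt source, handler, and log are invented for illustration:

```python
# Sketch of the interrupt cycle: after each instruction, check for a
# pending interrupt; if one occurred, save state, run the handler, restore.
pending = []          # pending interrupt requests
saved_state = []      # stack for saved processor state
log = []

def handler(irq):
    log.append(f"handled {irq}")

def run(program):
    pc = 0
    while pc < len(program):
        log.append(program[pc])       # "execute" the current instruction
        pc += 1
        if pending:                   # interrupt check after execution
            saved_state.append(pc)    # save state on the stack
            handler(pending.pop(0))   # run the interrupt handler
            pc = saved_state.pop()    # restore state, resume the program

program = ["i0", "i1", "i2"]
pending.append("timer")
run(program)
print(log)  # ['i0', 'handled timer', 'i1', 'i2']
```

Because the check happens only between instructions, the saved PC always points at a clean instruction boundary, which is what lets the program resume as if nothing happened.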


Interrupt Processing

[Figure: memory holding the program, the interrupt handler, and the stack, with the CPU switching between program and handler.]

When an interrupt occurs, the CPU switches from executing the program to executing the interrupt handler.
System state is saved on the stack so that execution can resume in the program when the handler is finished, as if the interrupt had never occurred.


Multiple Interrupts
It is possible for an interrupt to occur while another
interrupt is being processed.
Two solutions:
Disable interrupts while processing an interrupt.
Disadvantage: a high-priority interrupt may wait too
long and lose data while waiting for a low-priority
interrupt to finish.
Interrupt priorities: let a high-priority interrupt execute
while a low-priority interrupt is being processed. It interrupts the low-priority interrupt just like interrupting a program.
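The priority rule reduces to one comparison: preempt only when the incoming interrupt's priority is higher. A sketch, with hypothetical priority levels:

```python
# Sketch of priority-based interrupt nesting: a new interrupt preempts
# the current handler only when its priority is strictly higher.
PRIORITY = {"disk": 4, "comm": 5, "printer": 2}  # hypothetical levels

def should_preempt(current, incoming):
    """current is None when normal program code is running."""
    if current is None:
        return True                     # any interrupt preempts the program
    return PRIORITY[incoming] > PRIORITY[current]

print(should_preempt(None, "printer"))    # True
print(should_preempt("printer", "comm"))  # True  (5 > 2)
print(should_preempt("comm", "disk"))     # False (4 < 5)
```

A lower-priority request that loses the comparison stays pending until the current handler finishes, rather than being lost.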


Memory Hierarchies
Memory on a computer system is composed of a hierarchy
of differing kinds of memory.
Generally these tradeoffs apply across the spectrum of
memory technologies:
Faster access time, greater cost per bit
Greater capacity, smaller cost per bit
Greater capacity, slower access time.


Memory Hierarchies
The computer system designer wants greater capacity for
larger programs and reduced cost per bit, but also wants
fast access times which requires lower capacity memories
having higher cost.
Therefore, a single memory solution isn't appropriate; that
is why a hierarchy of memory types is used.

[Figure: the memory hierarchy pyramid. Moving down the hierarchy: decreasing cost per bit, increasing capacity, increasing access time, and decreasing frequency of access by the processor.]

Why does a Memory Hierarchy work?


Locality of reference: program execution of instructions
and data occurs in clusters.
Therefore, as long as a cluster is loaded high up in the
memory hierarchy, execution is fast until that cluster is
finished and the next cluster is loaded.
If we think of program execution as a series of clusters
which can load into the highest memory level and execute,
then we can see that program execution can be quite fast.


Cache Memory
Today most processors can execute instructions faster than
they can be retrieved from main memory.
Cache memory sits between the processor and main
memory and exploits the principle of locality of
reference by keeping a working set of instructions and
data in its small but high-speed memory.


Cache Principles
When the processor fetches an instruction, a block is read
into the cache.
When the next instruction is fetched, there is a high
probability it is contained in the same cache block, which
can be accessed quickly.
A cache consists of slots (or lines) of K words.
Main memory is divided into blocks of K words.
If main memory is of size 2^N words, then there are M = 2^N / K
blocks of memory.
If C is the number of cache slots, C << M.
Cache lines are addressed by a tag, which is some number
of high-order bits of the address.
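A toy direct-mapped cache makes these definitions concrete. The mapping policy and sizes below are chosen for illustration; real caches differ in associativity and policy:

```python
# Toy direct-mapped cache: C slots, blocks of K words.
# A block number splits into tag | slot.
K = 4                        # words per block
C = 8                        # number of cache slots
memory = list(range(256))    # a small "main memory" of 2^8 words

tags = [None] * C            # tag stored with each slot
lines = [None] * C           # cached block (K words) per slot
hits = misses = 0

def read(addr):
    global hits, misses
    block = addr // K              # which memory block the address is in
    slot = block % C               # which cache slot that block maps to
    tag = block // C               # high-order bits identify the block
    if tags[slot] == tag:
        hits += 1                  # found in the cache
    else:
        misses += 1                # miss: read the whole block in
        tags[slot] = tag
        lines[slot] = memory[block * K:(block + 1) * K]
    return lines[slot][addr % K]   # word within the block

# Sequential fetches: each miss loads a K-word block, so the next
# K - 1 accesses hit.
vals = [read(a) for a in range(8)]
print(hits, misses)  # 6 2
```

Loading a whole block on each miss is what turns locality of reference into hits.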


Cache Example
The hit ratio is the fraction of memory accesses that are
found in the cache.
Suppose memory has an access time of 1 μs and the cache
has an access time of 0.1 μs.
If the hit ratio is 95%, then the average access time is:
(0.95)(0.1 μs) + (0.05)(0.1 μs + 1 μs) = 0.15 μs
Thus, the access time is much closer to the cache rate than
the memory rate.
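The same calculation, written as a general formula (times in microseconds; the function name is just for illustration):

```python
# Average access time = H * Tc + (1 - H) * (Tc + Tm),
# where H is the hit ratio, Tc the cache access time,
# and Tm the main-memory access time.
def avg_access_time(hit_ratio, t_cache, t_memory):
    return hit_ratio * t_cache + (1 - hit_ratio) * (t_cache + t_memory)

t = avg_access_time(0.95, 0.1, 1.0)
print(round(t, 3))  # 0.15
```

Note that even a 5% miss rate contributes over a third of the average time, which is why high hit ratios matter so much.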


Cache Design
Considerations:
Cache size: how big is the entire cache? (Even a small
cache offers a lot of improvement.)
Block size: how big is a block? It needs to be around the
size of a locality cluster.
Mapping function: where to put a new block?
Replacement algorithm: which block to replace?
Write policy: when to write a block back to main
memory?
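Of these, the replacement algorithm is the easiest to make concrete. A sketch of LRU (least recently used) replacement, assuming a small fully associative cache for simplicity:

```python
from collections import OrderedDict

# LRU replacement for a small fully associative cache of 3 blocks.
class LRUCache:
    def __init__(self, capacity):
        self.capacity = capacity
        self.slots = OrderedDict()   # block number -> block data

    def access(self, block):
        if block in self.slots:
            self.slots.move_to_end(block)      # hit: mark most recently used
            return "hit"
        if len(self.slots) >= self.capacity:
            self.slots.popitem(last=False)     # evict least recently used
        self.slots[block] = f"data-{block}"    # miss: load the block
        return "miss"

cache = LRUCache(3)
results = [cache.access(b) for b in [1, 2, 3, 1, 4, 2]]
print(results)  # ['miss', 'miss', 'miss', 'hit', 'miss', 'miss']
```

Accessing block 4 evicts block 2 (the least recently used), so the later access to block 2 misses again.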


I/O Communication Techniques


Programmed I/O:
Processor moves data between I/O module and memory.
Processor must wait for I/O to complete.
Interrupt-driven I/O:
Processor issues I/O command and then goes on to other
processing.
An interrupt lets the processor know when the I/O is
complete, at which time the processor moves the data.
DMA (Direct Memory Access):
Processor delegates the I/O transfer to the DMA module.
When processor is notified of completion (by interrupt),
the data has already been moved.
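The contrast between the first two techniques can be simulated crudely; the device timing and tick counts below are invented:

```python
# Contrast programmed I/O (busy-wait polling) with interrupt-driven I/O.
# The simulated "device" finishes after a fixed number of time ticks.
DEVICE_TICKS = 5

def programmed_io():
    wasted = 0
    while wasted < DEVICE_TICKS:   # processor polls, doing no other work
        wasted += 1
    return wasted, 0               # (ticks wasted waiting, useful ticks)

def interrupt_driven_io():
    useful = 0
    for _ in range(DEVICE_TICKS):  # processor does other work each tick
        useful += 1
    # at this point the simulated interrupt fires and the
    # processor moves the data
    return 0, useful

print(programmed_io())        # (5, 0): every tick spent busy-waiting
print(interrupt_driven_io())  # (0, 5): every tick spent on other work
```

DMA goes one step further: even the data movement at the end is delegated, so the processor is involved only at the start and at the completion interrupt.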

Multiprocessor and Multicore Organization
Symmetric Multiprocessors (SMP)
A single computer system having these characteristics:
Two or more processors of comparable capability.
These processors share the same memory and I/O
facilities and are interconnected by a bus such that
memory access is about the same for each.
Processors share the I/O devices.
Processors perform the same functions (symmetric).
The system is controlled by an OS that manages the
interaction between programs and processors.

Multiprocessor and Multicore Organization
SMP Advantages:
Performance: if work can be done in parallel across the
various processors, then there can be a performance
gain.
Availability: since all processors have equal capability,
a failing processor does not halt the system, but only
reduces its performance.
Incremental growth: performance can be enhanced by
adding additional processors.
Scaling: vendors can offer a range of products with
different price and performance characteristics based on
the number of processors in the system.

Multiprocessor and Multicore Organization
Multicore computers
Combines two or more processors on a single chip (a
piece of silicon called a die).
Each core typically has all of the components of an
independent processor, such as registers, ALU, pipeline
hardware, control unit, and caches.
One reason for multicore designs is the limits of continuing
to improve a single processor.


Multiprocessor and Multicore Organization
An example is the Intel Core i7, which has four x86
cores, each with its own L2 cache, plus a shared L3 cache.
The processor employs prefetching, in which the hardware
examines memory access patterns and attempts to fill the
caches speculatively with data that's likely to be requested
soon.
The processor employs a DDR3 memory controller, which
permits 32 GB/s of data transfer.
It also uses QuickPath Interconnect (QPI) for high-speed
communication with other system chips.

End of Slides

based on Stallings' official slides
