DSP Final

DSP Architectures
PRESENTED BY – SUDHANSHU JAIN (16BEE0151)

SIDDHANT NAGAR (16BEE0407)
DSP Architectures
Harvard Architecture
The Harvard architecture requires two memory buses.

For example – DSP using 32 bit words and 32 address space requires at least 64 pins for each
memory bus - a total of 128 pins.
If Harvard architecture is brought off the chip , it results in a very large chip which are difficult to
design into a circuit.
DSP Architectures
Von Neumann Architecture

The von Neumann architecture only uses a single memory bus.
Which makes it relatively cheaper and requiring less pins so programmer
can place instruction or data anywhere throughout the available memory.
It does not permit the multiple memory access.
The modified von Neumann architecture allows multiple memory accesses
per instruction cycle by running the memory cycle faster than the
instruction cycle.
DSP Architectures
Von Neumann Architecture

Each instruction cycle can be divided into multiple ‘machine states’ and a
memory access can be made in each machine state, permitting memory
access per instruction cycle.
The Modified von Neumann architecture permits all the memory accesses
needed to support addition or multiplication: fetch of the instruction; fetch
of the two operands and a storage of result.
Why use architectures for DSPs …
Unique DATA pattern

Bit reversed addressing (FFT)
Multiple memory access
Math operation focus
Streams of data requiring high bandwidth
ALU with 16-bit operands and 32-bit result
Specialized peripherals on the board
Classification of current DSP Architecture
Modern Conventional DSPs

Similar to original DSPs of the early 1980s
Single instruction cycle. Example – TI TMS320C54x
Enhanced Conventional DSPs

Parallel execution units
Complex and Compound instructions. Example – TI TMS320C55x
Multiple – Issue DSPs

VLIW Example – TI TMS320C62xx , TI TMS320C64xx
Superscalar , Example – LSI Logic ZPS400
A conventional DSP - TI TMS320C54x
The ’54x DSPs use an advanced, modified Harvard architecture that maximizes
processing power by maintaining one program memory bus and three data memory
buses.
These processors also provide an arithmetic logic unit (ALU) that has a high degree of
parallelism, application-specific hardware logic, on-chip memory, and additional on-
chip peripherals.
Separate program and data spaces allow simultaneous access to program instructions
and data, providing the high degree of parallelism. Two reads and one write operation
can be performed in a single cycle.
TI TMS320C55x
TI TMS320C55x is based on earlier 54xx family but adds significant improvement to

its architecture and instruction set –
Two Instructions/ cycle –

Instructions are scheduled for parallel execution by the assembly programmer or
compiler.
Mixed – Width instructions with 8 bits to 48 bits
Complex and compound instructions
200 MHz @ 1.5v
TI TMS320C62x
This device is based on the high-performance, advanced very-long-

instruction-word (VLIW) architecture developed by Texas Instruments (TI),
making the C6204 an excellent choice for multichannel and multifunction
applications.
With performance of up to 1600 million instructions per second (MIPS) at a
clock rate of 200 MHz
This processor has 32 general-purpose registers of 32-bit word length and
eight highly independent functional units.
Program memory consists of a 64K-byte block that is user-configurable as
cache or memory-mapped as program space.
TI TMS320C64xx
Ti TMS320C64xx has 64 32-bit general- purpose registers, twice as many as

its predecessors
The instruction set is a superset of that used in previous DSPs and among
other enhancements , adds significant processing capabilities –
8 – bit operations for image/video processing
600 MHz clock speed with Dynamic caches
The only DSP family with compatible fixed and floating point versions
Very Long Instruction Word (VLIW)
VLIW describes the computer processing architecture in which a

language compiler or pre processor breaks program instruction
down into basic operations that can be performed by the
processor parallel manner.
It consists of multiple independent operations grouped together.
Each operation in the instruction is aligned to a functional unit.
Pipeline Architecture
Pipeline architecture is a parallel operation which enables every

component of the hardware to continue working at every
instance.
In pipeline there is a synchronous attempt of executing the
command.
Pipeline enables the continuous workflow, hence reducing the
time limit.
cntd…
Comparison between Parallel and serial Processing
Parallel
In parallel processing every Serial
hardware component is at
In serial processing the next
work at every instance of time
step is defined only when the
The efficiency is more than previous step is finished.
the serial processing.
The time taken to complete
It consists of different stages the task is much more than of
such as 1,2 3 etc parallel processing.

DSP Final

Загружено:

Сведения о документе

Оригинальное название

Авторское право

Доступные форматы

Поделиться этим документом

Поделиться или встроить документ

Параметры публикации

Этот документ был вам полезен?

Это неприемлемый материал?

Авторское право:

Доступные форматы

DSP Final

Загружено:

Авторское право:

Доступные форматы

DSP Architectures

PRESENTED BY – SUDHANSHU JAIN (16BEE0151)

The Harvard architecture requires two memory buses.

Von Neumann Architecture

Von Neumann Architecture

Unique DATA pattern

Modern Conventional DSPs

Enhanced Conventional DSPs

Multiple – Issue DSPs

TI TMS320C55x is based on earlier 54xx family but adds significant improvement to

Two Instructions/ cycle –

This device is based on the high-performance, advanced very-long-

Ti TMS320C64xx has 64 32-bit general- purpose registers, twice as many as

VLIW describes the computer processing architecture in which a

Pipeline architecture is a parallel operation which enables every

Вам также может понравиться