Академический Документы
Профессиональный Документы
Культура Документы
1
1. INTRODUCTION
Mobile computing has been the buzzword for quite a long time. Mobile
computing devices like laptops, web slates & notebook PCs are becoming common
nowadays. The heart of every PC whether a desktop or mobile PC is the
microprocessor. Several microprocessors are available in the market for desktop PCs
from companies like Intel, AMD, and Cyrix etc. The mobile computing market has
never had a microprocessor specifically designed for it. The microprocessors used in
mobile PCs are optimized versions of the desktop PC microprocessor.
2
Crusoe is the new microprocessor which has been designed specially for the
mobile computing market. It has been designed after considering the above
mentioned constraints. This microprocessor was developed by a small Silicon Valley
startup company called Transmeta Corp. after five years of secret toil at an
expenditure of $100 million. The concept of Crusoe is well understood from the
simple sketch of the processor architecture, called 'amoeba'. In this concept, the x86-
architecture is an ill-defined amoeba containing features like segmentation, ASCII
arithmetic, variable-length instructions etc. The amoeba explained how a traditional
microprocessor was, in their design, to be divided up into hardware and software.
Currently, this is used to allow the chips to emulate the Intel x86 instruction
set. In theory, it is possible for the CMS to be modified to handle other instruction
streams (i.e. to emulate other microprocessors).
The addition of an abstraction layer between the x86 instruction streams and
the hardware means that the hardware architecture can change without breaking
compatibility, just by modifying the CMS. For example Efficeon, the second-
generation Crusoe, has a 256-bit-wide VLIW core versus 128-bit in the first
generation.
3
produced two x86-compatible CPU architectures: Crusoe and Efficeon. These CPUs
have appeared in ultra-portable laptops, blade servers, tablet PCs, a personal cluster
computer, and a silent desktop, where low power consumption and heat dissipation
are of primary importance. History throughout Transmeta's first few years little was
known about exactly what it would be offering. Its web site went online in mid-1997,
and for approximately two and a half years displayed nothing but the text "This web
page is not yet here." Information gradually came out of the company, suggesting of
a very long instruction word-based (VLIW) design that translated x86 code into its
own native code. As Intel's then-forthcoming "Merced" processor was also a VLIW
design which could translate x86 code, speculation arose suggesting that Transmeta's
product could have supercomputer-level processing power while actually being
cheaper to manufacture than any offering by Intel, AMD or Cyrix.
4
to be a key licensee of this Transmeta technology. Timeline Founded in 1995.
Corporate launch on January 19, 2000. [3]
In July of 2002, Transmeta experience first set of layoffs equaling 40% of the
company. [4]
On May 31, 2005, Transmeta announced the signing of asset purchase and
license agreements with Hong Kong's Culture.com Technology Limited led by Chu
Bong-Foo, the inventor of the Cangjie method and one of the founding fathers of
modern Chinese computing. However, due to delays in obtaining the necessary
technology export licenses from the US Department of Commerce, the parties
announced the termination of this agreement on February 9, 2006.
On July 6 2007, AMD invested $7.5 million in Transmeta. AMD plans to use
Transmeta's patent portfolio related to energy-efficient technologies. [6]
Yes, there is a secret message, and this is it: Transmeta's policy has been to
remain silent about its plans until it had something to demonstrate to the world. On
January 19th, 2000, Transmeta is going to announce and demonstrate what Crusoe
processors can do. Simultaneously, all of the details will go up on this Web site for
everyone on the Internet to see. Crusoe will be cool hardware and software for
mobile applications. Crusoe will be unconventional, which is why we wanted to let
you know in advance to come look at the entire Web site in January, so that you can
5
get the full story and have access to all of the real details as soon as they are
available.
The company was largely successful in hiding its ambitions until the official
announcement. Over 2000 non-disclosure agreements (NDAs) were signed during
the stealth period [9].
On October 11, 2006, Transmeta announced that they have filed a lawsuit
against Intel Corporation for infringement of ten Transmeta U.S. patents covering
computer architecture and power efficiency technologies.
The complaint charges that Intel has infringed and is infringing Transmeta's
patents by making and selling a variety of microprocessor products including at least
Intel's Pentium III, Pentium 4, Pentium M, Core and Core 2 product line. Technology
the actual Transmeta processors are in-order very long instruction word (VLIW)
cores. To execute x86 codes, a pure software-based instruction translator
dynamically compiles or emulates x86 code sequences, using execution-hotspot
guided heuristics. While similar technologies existed (WABI for Sun, FX!32 for
Alpha and IA-32 EL for Itanium) in the 1990s, the Transmeta approach has set a
much higher bar for compatibility—able to execute all x86 instructions from initial
boot up to the latest multimedia instructions—while retaining most of its core
performance.
As the market leaders Intel and/or AMD would extend the core x86
instruction set, Transmeta could quickly upgrade their product with a software
upgrade rather than requiring a respin of their hardware.
The processor could emulate multiple other architectures, possibly even at the
same time. (At its initial Crusoe launch, Transmeta demonstrated pico-Java and x86
running intermixed on the native hardware.)
6
performance of the Crusoe based HP Compaq TC1000 tablet PC. It was used again
in 2004 when NX bit and SSE3 support were added to the Efficeon product line
without requiring hardware changes. In the field upgrades were rare in practice due
to system hardware vendors not wanting to incur additional customer support costs
or spend additional money on QA for the potential upgrades or bug fixes to shipped
products they had already closed the revenue books on.
Viability
Transmeta lost much credibility and endured significant criticism due to the
poor initial Crusoe showing with large discrepancies between projections and actuals
for both performance and power. Although power consumption was somewhat better
than Intel and AMD offerings, the end user experience (i.e. battery life) only showed
a marginal overall improvement. [2]
First, the Code Morphing Software (CMS) combined with cache architecture
artificially inflated comparisons between benchmarks and real-world applications.
This is due to the repetitive nature of benchmarks and their small footprints. The
CMS software overhead may have actually been a key cause of much lower
performance for many real-world applications; the simple VLIW core architecture
could not compete on computationally-intensive applications; and the southbridge
interface was limited by its low bandwidth for graphics or other I/O-intensive
applications. Some standard benchmarks even failed to run, questioning the claim of
full x86 compatibility.[3]
The notion of selling a product into a specific thermal envelope was typically
not understood by the mass of reviewers, who tended to compare Efficeon to the
gamut of x86 microprocessors, regardless of power consumption or application. One
such example of this criticism suggests the performance still significantly lagged
Intel's Pentium M (Banias) and AMD's Mobile Athlon XP. [4]
7
the show 24, the fictional character Tony Almeida is listed as a former systems
validation analyst at Transmeta.
The VLIW approach, on the other hand, executes operation in parallel based
on a fixed schedule determined when programs are compiled. Since determining the
order of execution of operations (including which operations can execute
simultaneously) is handled by the compiler, the processor does not need the
scheduling hardware that the three techniques described above require. As a result,
VLIW CPUs offer significant computational power with less hardware complexity
(but greater compiler complexity) than is associated with most superscalar CPUs.
Efficeon most closely mirrors the feature set of Intel Pentium 4 processors,
although, like AMD Opteron processors, it supports a fully integrated memory
controller, a HyperTransport IO bus, and the NX bit, or no-execute x86 extension to
PAE mode. NX bit support is available starting with CMS version 6.0.4.
8
Efficeon comes in two package types: a 783 and a 592 ball grid array. Its
power consumption is moderate (with some consuming as little as 3 watts at 1 GHz
and 7 watts at 1.5 GHz), so it can be passively cooled.
Two generations of this chip were produced. The first generation (TM8600)
was manufactured using a TSMC 0.13 micrometre process and productized at speeds
up to 1.1 GHz. The second generation (TM8800 and TM8820) was manufactured
using a Fujitsu 90 nm process and productized at speeds ranging from 1 GHz to 1.7
GHz).
The Efficeon has 128 k instruction + 64 k data level 1 cache and a 1Mb level
2 cache on the chip.
These additional functional units, which are typically, part of the core system
logic that surrounds the microprocessor, Allow the Crusoe processor to provide a
highly integrated and cost effective platform solution for the x86 mobile.
9
2. ARCHITECTURE
The Crusoe processor incorporates integer and floating point execution units,
separate instruction and data caches, a level-2 write-back cache, memory
management unit, and multimedia instructions. In addition to these traditional
processor features, the device integrates a DDR SDRAM memory controller, SDR
SDRAM memory controller, PCI bus controller and serial ROM interface controller.
These additional units are usually part of the core system logic that surrounds the
microprocessor. The VLIW processor, in combination with Code Morphing software
and the additional system core logic units, allow the Crusoe processor to provide a
highly integrated, ultra-low power, high performance platform solution for the x86
mobile markets. The Crusoe processor block diagram is shown in Figure 1.
CPU Core
Integer unit
MMU
L1 Instruction Cache
L1 Data Cache
10
64K
64K
Unified TLB
256 entries
DDR SDRAM
Controller
SDR SDRAM
Controller
PCI Controller
&
Southbridge
Interface
DMA
Multimedia Instructions
64
64
Serial ROM
Interface
L2 WB Cache
512K
Bus
11
Interface
7/5/2001 3 of 8
1 Processor Core
12
emulate all other features of the x86 architecture. The software that converts x86
programs into the core VLIW instructions is the Code Morphing software.
The Crusoe Processor includes a PCI bus controller that is PCI 2.1 compliant.
The PCI bus is 32 bits wide, operates at 33 MHz, and is compatible with 3.3V signal
levels. It is not 5V tolerant, however. The PCI controller on provides a PCI host
bridge, the PCI bus arbiter, and a DMA controller.
13
3. HIERARCHY MODEL
INSTRUCTION MOLUCULES
14
1. It has been designed purely for fast low power implementation.
2. It uses conventional COMS fabrication.
3. The surrounding software layer gives X-86 program impression that they are
running on X-86 hardware.
15
4. FEATURES OF VARIOUS CRUSOE PROCESSORS
16
Figure: Crusoe processor Architecture –Model TM5400
17
5. INSTRUCTION SET
18
6. PERFORMANCE
Execution time
Comparable to direct hardware implementation by intel or AMD
TM 5400 at 667 MHz is about the same as a Pentium III running at 500Mhz
Low Cost
Much simpler hardware.
Crusoe TM5400 is a about 7 million transistors (P4 is at 41 Million)
Easier to design, more scalable, easier to reach high clock rate, more room for
caches, better yield etc.
Doesn’t have to worry about binary compatibility!!
Low Power
Less hardware lower power
Additional power management features (such as variable supply voltage and
clock frequency)
19
7. CODDE MORPHING:CRUSOE’S KEY
20
8. SOME DRAWBACKS:
1. Code optimization does not start until a block of code has been executed more
the a few times.
2. 2. Code translation requires clock cycles which could otherwise be used in
performing application computation.
21
9. CONCLUSION
In 1995, Transmeta set out to expand the reach of microprocessors into new
markets by dramatically changing the way microprocessors are designed. The initial
market is mobile computing, in which complex power-hungry processors have forced
users to give up either battery running time or performance. The Crusoe processor
solutions have been designed for lightweight (two to four pound) mobile computers
and Internet access devices such as handhelds and web pads. They can give these
devices PC capabilities and unplugged running times of up to a day. To design the
Crusoe processor chips, the Transmeta engineers did not resort to exotic fabrication
processes. Instead they rethought the fundamentals of microprocessor design. Rather
than “throwing hardware” at design problems, they chose an innovative approach
that employs a unique combination of hardware and software. Using software to
decompose complex instructions into simple atoms and to schedule and optimize the
atoms for parallel execution saves millions of logic transistors and cuts power
consumption on the order of 60–70% over conventional approaches—while atthe
same time enabling aggressive code optimization techniques that are simply not
feasible in traditional x86 implementations. Transmeta’s Code Morphing software
and fast VLIW hardware, working together, achieve low power consumption without
sacrificing high performance for real-world applications. Although the model
TM3120 and model TM5400 are impressive first efforts, the significance of the
Transmeta approach to microprocessor design is likely to become more apparent
over the next several years. The technology is young and offers more freedom to
innovate (both hardware and software) than conventional hardware-only designs.
Nor is the approach limited to low-power designs or to x86-compatible processors.
Freed to render their ideas in a combination of hardware and software, and to evolve
hardware without breaking legacy code, Transmeta microprocessor designers may
produce one surprise after another in the coming years.
22
10. GLOSSARY
Cache -Generally a small chunk of fast memory that sits between either 1) a
smaller, faster chunk of memory and a bigger, slower chunk of memory, or 2) a
processor and a bigger, slower chunk of memory. This is to provide a bridge from
something that's comparatively very fast to something that's comparatively slow.
CISC -Intel x86 chips are CISC chips because of the complexity of the instruction
set. On the other side of the coin, you have RISC chips that use a reduced instruction
set. RISC chips split big operations into lots of simple, tiny instructions that are
processed very quickly.
Code -what computer scientists call the collection of computer instructions that
constitute an application program separate from the data.
Code Morphing software -Code Morphing software translates the ones and zeros of
the application program instructions into a more efficiently executed set of
instructions for the Crusoe processor.
Commit atom -an atom specific to the Crusoe VLIW engine that commits a set of
pending changes (to memory or to registers) in a single operation.
CPU (Central Processing Unit, also called "processor") -the active part of the
computer where all computation (such as addition) is performed. Today, the CPUs of
almost all computers are contained on a single chip.
Crusoe -for consumers, the Crusoe name will mean long battery life, full
Compatibility, and high performance. The Crusoe brand stands for a family of smart
microprocessors from Transmeta intended for use in Mobile Internet Computers.
23
DDR memory -a new type of SDRAM memory that is both higher performance and
lower power than standard SDRAM, making it appropriate for the latest portable
devices. "DDR" is short for "Double Data Rate," referring to the high speed at which
this memory transfers data.
Full Internet experience -the ability to get access to all the data and details found on
the World Wide Web, in the full form intended by the web site. This can only be
done with a computer that is fully Internet compatible
Gated store buffer -a hardware mechanism in the Crusoe VLIW engine that holds
writes to memory until those writes are either committed (via a commit atom) or
discarded (via a rollback atom). The "gated" refers to the notion of the write
operations being held behind a fence until they are ready to be released.
Load-and-protect atom -an atom that loads a data value from a memory location,
while at the same time "protecting" that memory location from being overwritten.
The Code Morphing software uses load-and-protect atoms to generate faster code.
24
11. REFERENCES
JOURNALS
i. IEEE SPECTRUM, May 2000, "Transmeta's Magic Show"
ii. CHIP, June 2000
iii. PC Magazine, November 2000, “The Mobile Edge”
WEBSITES
i. http://www.wikipedia.org/
ii. http://www.google.co.in/
iii. http://www.scribd.com/
iv. http://www.transmeta.com
v. http://www.mdronline.com
vi. http://www.techextreme.com
vii. http://www.ibm.com
viii. http://www.ieee.org
ix. http://www.mit.edu
x. http://www.eetimes.com
xi. http://www.zdnet.com
25