Академический Документы
Профессиональный Документы
Культура Документы
Mark Horowitz
and His Impact on
Computer Architecture
His work in energy efficiency
and parallel multiprocessing
I
n the early 1990s, computer archi- problems were still paramount: systems designers were
tecture research, both academic and pushing on instruction-level parallelism using reorder
industrial, was flush with big problems buffers and speculation, multilevel branch prediction,
worthy of attack. Designers were trans- and of course multiscalar machines. They were also
lating workloads from minicomputers to working to tear down the memory wall, leveraging inclu-
the microprocessor, and so many classic performance sion as a key cache design concept, and extending caches
to hold both eject and inject data.
Digital Object Identifier 10.1109/MSSC.2016.2580259 Among the many microprocessor architecture problem
Date of publication: 2 September 2016 areas attracting attention were two in particular: energy
1.4
delay product might still lead to sub-
1.2
optimal designs, since it assumes
1.0 that the designers goal is to evenly
0.8 trade off between energy and delay.
0.6 Clearly not all designs share this as-
0.5
0.4 sumption. Papers by Dinesh Patil and
0.8 Sameh Galal leveraged the concepts
0.2
of Pareto optimality curves in the
0.0
0.0 0.1 0.2 0.3 0.4 0.5 0.6 broad design space of energy-per-
Threshold Voltage (V) operation and performance. This al-
lowed them to articulate families of
FIGURE 1: Counters of a constant energy-delay-product for CMOS circuits under variability machines that are highly energy effi-
in voltage, threshold, and temperature (from [1]). cient for single-thread performance,
x
=
W/cm
has proven remarkably prescient 0.03 D Ma = 30
with todays emergent work on ax
x
/ AM
0.02 P Ma
work load-specific lang uages, al-
gorithms, and hardware [5][7] and
0.01
provides the hardware bridge to the Optimal Design
realization of those application-tar-
0
geted systems. 0 0.02 0.04 0.06 0.08 0.10 0.12 0.14
Interconnects, both off and on
A (mm2/GFlops)
chip, act as another key consumer of
energy. Marks impact on the field of
serial links will be explored in other FIGURE 2: Energy and throughput Pareto tradeoffs under different constraints (from [4]).
Observation 1 topics not in that set. By contrast, if Mark asserts something, you had
Anyone who volunteers for IEEE or works on a conference committee better have a really, really good reason for disputing it.
will recognize the syndrome. You find yourself in a room full of people
you dont know very well, but you know their names and their reputa- Observation 4
tions, and you think I sure hope I dont make a fool of myself. Then Theres a related issue, and again, Mark exemplifies the right way to do
the saying better be silent and be thought a fool, than to speak up and it: hes a good listener. When others are making legitimate points, he
remove all doubt pops into your head. The discussion begins, and listens intently, and then he often does a remarkable thing (remarkable
you spend half your time thinking what the heck did they just say? for how seldom it happens): he asks follow-up questions intended to
and the other half thinking wait, that cant be right. And then Mark help the other person with their exposition and, also presumably, to help
Horowitz speaks up, cuts right to the heart of whats being discussed, Mark clearly understand what theyre proposing. Too often, tech folks in
and in a reasonable, nonoffensive way, gets two key points across: a similar position are simply waiting for whoever has the floor to shut up
1) youre all on the wrong track and 2) heres the right one. Its a beauti- so that they can start talking. We should all be more like Mark.
ful thing to watch. Observation 5
Observation 2 Mark is world class at knowing what level of abstraction is the right one for
Mark doesnt disdain philosophical and abstract discussions per se. In- helping to resolve or clarify whatever is under discussion. Sometimes that
stead, he deftly substitutes a quantitative argument that illuminates the level is so low that atomic physics and thermodynamics are the entrance
actual issue and effectively indirectly quashes the angels on a pin ar- fee; other times the big picture is required so that the details dont over-
gument that had been holding sway. In so doing, he single-handedly whelm. No matter what level is the right one, Mark will think carefully, con-
lowers the frustration level in the room, even on the part of the people sider what others are saying, and then reliably offer up some trenchant ob-
who were so intently estimating angel populations. servation that is universally on target, interesting, insightful, and influential.
Observation 3 Observation 6
Marks not afraid to say the words that, in certain tech circles, are almost Finally, you know that guy in the ads, where they say hes the most
never uttered: I dont know. Nobody can possibly know everything, interesting man in the world? I really dislike that guy and that ad. But in
but the percentage of people willing to fake it is awfully high sometimes. thinking about why I hate the ad, I realized that I have come to really
The problem seems not quite so chronic these days, when all the silent value people who are, in fact, interesting. They notice things about the
observers (see Observation 1) can instantly Google-check any particu- world, and then they think deeply about what their observations might
larly sketchy assertions made by inexplicably confident patriarchs of the mean about the way the physical world is and the way it works. I think
field. (I cant think of any cases where women pulled this stunt. Its usu- Mark is one of these people: hes interesting!
ally the men.) Not so long ago, however, meetings were routinely de- Bob Colwell
railed by people who actually were bona fide experts in a set of certain About the Author
things but who were equally willingand equally likelyto opine on Bob Colwell most recently served as director of the Microsystems Tech-
nology Office at DARPA. He was an Intel Fellow and the chief IA-32 ar-
Digital Object Identifier 10.1109/MSSC.2016.2580260 chitect at Intel for the Pentium Pro, II, III, and 4 microprocessors, retiring
Date of publication: 2 September 2016 in 2000. He received the ACM Eckert-Mauchly Award in 2005.
enormous transformations over the past few processors for energy efficiency: A joint
architecture-circuit approach, Ph.D.
dissertation, Stanford University, CA,
decades, and hand-held compute systems we 2010.
[12] D. Lenoski, J. Laudon, K. Gharachorloo, W.
casually store in our pockets today are Weber, A. Gupta, J. Hennessy, M. Horow-
itz, and M. Lam, The Stanford DASH mul-
profoundly more powerful than the tiprocessor, IEEE Computer, vol. 25, pp. 3,
1992.
mainframes of the past. [13] J. Kuskin, D. Ofelt, M. Heinrich, J. Hein-
lein, R. Simoni, K. Gharachorloo, J. Chap-
in, D. Nakahira, J. Baxter, M. Horowitz, A.
Gupta, M. Rosenblum, and J. Hennessy,
The Stanford FLASH multiprocessor, in
ASICs fabricated with the sponsorship discussed advances in energy efficien- Proc. Int. Symp. Computer Architecture,
1994.
of companies like Silicon Graphics, Inc. cy and parallel distributed memory, and
and LSI and then taught the team how to we simply lacked the space to continue
package, test, and sort the chips that the narrative into CAD tools, circuits, About the Authors
resulted. He was equally at home at a dry microarchitectures, compilers, and Ricardo E. Gonzalez received the B.S.,
erase board drawing stick figures of lay- so on. But partly due to these contribu- M.S., and Ph.D. degrees in electrical
out to floor plan critical circuits, wield- tions, each generation of computers has engineering from Stanford University,
ing a soldering iron to swap out a bad been just powerful enoughin compute California. He was a member of found-
connector, showing the students proper performance, or storage, or interconnect ing teams at Tensilica and Stretch,
oscilloscope techniques (hint: never use bandwidth, or richness of displayto where he led the development of con-
the auto-scale button around Mark), build the next generation of comput- figurable and extensible processors.
or grabbing a screwdriver to assemble ers. This feedback loop has allowed He has also worked at Intel, VMware,
a chassis. the engineering community to repeat- and Pure Storage. In the fall, he plans
Finally, Mark embodied one of the edly bootstrap design after design and, to pursue an M.S. degree in ecosystems
most under-appreciated virtues of a thus, put incredibly powerful systems and climate change from Imperial Col-
faculty advisor: he pushed students in large data farms, in our cars, in our lege London.
hard to finish their doctoral degrees. He pockets, and inextricably woven into Jeffrey Kuskin received an un
was an exacting mentor, but he worked the fabric of everyday life. dergraduate degree from Dartmouth
to ensure that students were driving College and M.S. and Ph.D. degrees in
toward an actual conclusion to their References electrical engineering from Stanford Uni-
[1] R. Gonzalez and M. Horowitz, Energy versity, California. From 1997 to 2000, he
Ph.D. tenures. If he was one of your dissipation in general purpose micropro-
advisors, you knew the weekly grill- cessors, IEEE J. Solid-State Circuits, vol. designed large-scale distributed shared
31, pp. 9, Sept. 1996. memory multiprocessors at Silicon
ing, painful as it might be at times, was
[2] R. Gonzalez and M. Horowitz, Supply
bringing you closer to completion. and threshold voltage scaling for low- Graphics. From 2000 to 2004, he devel-
power CMOS, IEEE J. Solid-State Circuits, oped wireless networking chip sets at
vol. 32, pp. 8, Aug. 1997.
Final Thoughts [3] D. Patil, O. Azizi, R. Anantharaman, R. Ho, Atheros Communications. Since 2004,
Computer architecture, much like the and M. Horowitz, Robust energy-effi- he has been at D.E. Shaw Research,
cient adder topologies, in Proc. IEEE
rest of the silicon revolution, has un- Symp. Computer Arithmetic, 2007. designing s pecial-purpose hardware
dergone enormous transformations [4] S. Galal and M. Horowitz, Energy-efficient to accelerate molecular dynamics sim-
floating-point unit design, IEEE Trans.
over the past few decades, and the hand- Computers, vol. 60, pp. 7, June 2010. ulations of proteins and other biologi-
held computer systems we casually [5] B. Amrutur and M. Horowitz, Speed and cal macromolecules.
power scaling of SRAMs, IEEE J. Solid-
store in our pockets today are profound- State Circuits, vol. 35, pp. 2, Feb. 2000. Ron Ho (ronho@ieee.org) received
ly more powerful than the mainframes [6] K. Mai, T. Mori, B. Amrutur, R. Ho, and M. his undergraduate, masters, and
Horowitz, Low-power SRAM design us-
of the past. One of the hallmarks of this ing half-swing pulse-mode techniques, doctoral degrees from Stanford Uni-
progress has been a virtuous feedback IEEE J. Solid-State Circuits, vol. 33, pp. 11, versity, California. He is currently
Nov. 1998.
loop in which advances in all aspects of [7] K. Mai, T. Paaske, N. Jayasena, R. Ho, W. Dally, director of Interconnect IP at Altera,
computer technology become fun- and M. Horowitz, Smart memories: A modu- now part of Intel Corporation. From
lar reconfigurable architecture, in Proc. Int.
damental enablers for building new- Symp. Computer Architecture, 2000. 2003 to 2014, he was with Sun Micro-
er, faster, and more powerful machines. [8] G. Wei and M. Horowitz, A fully digital, systems (later Oracle Corporation),
energy-efficient adaptive power supply
Mark and his students have been an in- regulator, IEEE J. Solid-State Circuits, vol.
and from 1993 to 2003 he was with
tegral part of this cycle, making signifi- 34, pp. 4, Apr. 1999. Intel Corporation.
[9] R. Ho, K. Mai, and M. Horowitz, The fu-
cant and sometimes paradigm-shifting ture of wires, Proc. IEEE, vol. 89, pp. 4,
contributions across the space. Here we Apr. 2001.