Академический Документы
Профессиональный Документы
Культура Документы
5, OCTOBER 2001
Abstract—In this paper, we present an approach for applying the subthreshold current [9], [20]. In either case, an implicit as-
two supply voltages to optimize power in CMOS digital circuits sumption is that the reduced supply voltages are uniformly ap-
under the timing constraints. Given a technology-mapped net- plied to all logic modules. Instead, the entire circuit performance
work, we first analyze the power/delay model and the timing slack
distribution in the network. Then a new strategy is developed does not necessarily get worse if the reduced supply voltages are
for timing-constrained optimization issues by making full use of applied only to the logic modules on the noncritical paths. The
slacks. Based on this strategy, the power reduction is translated reason is that these modules typically have high timing slack
into the polynomial-time-solvable maximal-weighted-indepen- which may allow the increase of their delay without violating
dent-set problem on transitive graphs. Since different supply the timing constraints. Experience shows that, for most circuits,
voltages used in the circuit lead to totally different power con-
sumption, we propose a fast heuristic approach to predict the logic modules (or gates, at logic level) on critical paths only ac-
optimum dual-supply voltages by looking at the lower bound of count for a small portion of all modules. In Fig. 1, for example,
power consumption in the given circuit. To deal with the possible we plot the average distribution of gates with different slack for
power penalty due to the level converters at the interface of 16 MCNC91 benchmarks after technology mapping (note that
different supply voltages, we use a “constrained F-M” algorithm the slack value has been normalized to the longest path delay). It
to minimize the number of level converters. We have implemented
our approach under SIS environment. Experiment shows that the can be seen from this figure that, the number of gates on critical
resulting lower bound of power is tight for most circuits and that paths (i.e., with zero or close-to-zero slack) accounts for only
the predicted “optimum” supply voltages are exactly or very close about 14% of total gates, while more than 60% of gates have
to the best choice of actual ones. The total power saving of up to their slack larger than 0.2. This potentially provides much room
26% (average of about 20%) is achieved without degrading the for power reduction using the reduced supply voltage.
circuit performance, compared to the average power improve-
ment of about 7% by gate sizing technique based on a standard In general, power reduction technique with the reduced
cell library. Our technique provides the power-delay tradeoff voltage (or variable voltage) is called voltage scaling. De-
by specifying different timing constraints in circuits for power pending on how many supply voltages of different value are
optimization. available, voltage scaling may be classified as multiple voltage
Index Terms—Dual-supply voltages, gate level, maximal- approach or dual-voltage approach. Prior work on multiple
weighted-independent-set, power optimization, timing con- voltage approach includes [4], [12], [13], and [25]. Most of
straints. them basically focus on the scheduling problem for low power
at the behavioral level. For example, [12] proposes an optimal
I. INTRODUCTION scheduling algorithm to reduce The systems power while
meeting the timing constraint. In [13], a dynamic programming
supply voltage and, hence, the delay of node . This leads all edges incident with at least one of the nodes) from . This
to the following considerations. process repeats until . An exact and polynomial-time
1) Since the weight function denotes the power reduction by algorithm can be found in [11] and [14], respectively.
increasing a unit delay on a node, we need to select nodes
in MWIS3 instead of MINS for the delay increase of so B. Prediction of “Optimum” Supply Voltages
that the maximum power reduction can be obtained. It is interesting to look at the effect of low supply voltage
2) Since the weight function is related to the node delay, a on the delay and power reduction. For each gate, the reduced
small value of is generally preferable. Theoretically, is promises more power saving at the cost of increased delay
required to approach zero for the maximum power saving. of the gate. Under the same timing constraints, using the lower
In the real world, however, smaller is not always better. means that the fewer gates are permitted to work with.
First, too small value of can lead to prohibitively ex- Therefore, one can expect there is a specific “optimal” value
pensive computation cost. Second, since MWIS depends of supply voltage for the total power reduction, depending on
on the relative weight of nodes in , the small change the given circuits. Unfortunately, the existing dual-voltage al-
of weight for specific nodes will not necessarily affect gorithms work with the fixed supply voltages. These algorithms
MWIS. This is true especially when the weights of nodes cannot help if the given supply voltages are far from their op-
in are distributed in a wider range. In our experiment timum. Also, as mentioned in Section I, an exhaustive approach
part, we will show that for most circuits, the results are for selecting optimum supply voltages is computationally ex-
reasonably good when using . pensive. This motivates us to find a fast prediction of “optimal”
In the above discussion, we assumed the supply voltage is dual-supply voltages.
a continuous variable. Instead, when only dual-discrete supply When the lower-bound algorithm terminates, the supply
voltages are available, the actual power reduction is higher than voltage is obtained for each node in . In order to
it is with continuous voltage. From this point of view, our ap- meet the given timing constraints, the supply voltage, ,
proach provides a lower bound of power consumption with dual selected for node has to be kept greater than . Under
voltages. The procedure of our algorithm is outlined below: dual-voltage environment, either or can be used for
each node. It is reasonable to select .
The “optimal” value of can be estimated by finding
Lower-Bound-Algorithm {
input:
output: supply voltages for all nodes and
lower bound of the total power (6)
if
Calculate node delay, slack and weight where
function for each node in under if
using (1), (2) and (5); This can be done by calculating the specific summation in
Identify and in and let (6) for different values of ranging from
to and selecting the maximum sum. Equation
while { (6) cannot take its maximum value unless is equal to
Find and construct ; for a specific node . We prove this below by way of
Find MWIS of ; contradiction. Without loss of generality, suppose we have
Increase node delay by for each
node in MWIS; and (6) takes its maximum value
Update the node slack, voltage,
weight and ;
}
Calculate the final supply voltage,
, for each node in using (2); when , where . Note
Obtain the lower bound of power con- that for , and for . On the other
sumption using (4); hand, when , the value of (6) becomes
}
MWIS increases by , we check if some of these nodes are able Step 2: Let and ;
to work at . If yes, select them to work at , and update Step 3: while ( ) {
their weight to zero. The reason is that, once a node works at Find of the induced sub-
, no further increase of its delay is needed. By doing so, graph, , of on a set of nodes with
more delay budget can be provided for other nodes. The formal maximum slack;
description of power optimization algorithm based on predicted Increase node delay by for all
optimum supply voltages will be given in the next section. nodes in MWIS;
if the delay of node is so large
V. POWER OPTIMIZATION WITH DUAL-SUPPLY VOLTAGES that it is able to work at ,
, then
In this section, we describe an effective algorithm for power
;
optimization using dual-supply voltages. Since a level converter
Update the delay, slack , and the
is required at the interface of two voltages, we also discuss the
weight function of node , ;
reduction of the level converters’ power penalty.
endif
}
A. Power Optimization Algorithm Using Dual-Supply Voltages
}
Before presenting our algorithm, we take a look at the perfor-
mance of existing techniques for power optimization with two We now apply the dual-voltage power optimization (DVPO)
supply voltages. Example 5.1 shows how these techniques work algorithm to Fig. 3, as shown in Example 5.2. For comparison,
on Fig. 3. it is assumed that the supply voltages are the same as in CVS
Example 5.1: One of existing techniques is CVS scheme as and ZSA algorithms and, hence, the prediction of and
mentioned in Section I. Assuming, in Fig. 3, that is a primary is not included in this example.
output and the power gain, , of is as fol- Example 5.2: Consider Fig. 3 again. The weight functions
lows: , , , for all nodes are approximated as ,
and . The CVS algorithm [2] starts 1 6. We have , , ,
with selecting the primary output . Once works at , , and . In the first iteration
the slack of its immediate predecessor, , changes from three of DVPO algorithm, , , ,
to zero. To meet the timing constraint, cannot work at , , MWIS , and
and the algorithm terminates. The power reduction is two. An- , . No nodes are put into since
other technique available is zero-slack-algorithm (ZSA) [10]. , . The slack values of all nodes
The basic idea behind ZSA is that, at each step, it first finds are updated as: , and .
the nodes with minimum positive slack and then selects some In the second iteration, , , ,
of them to work at such that their slacks become zero (or , MWIS , and
small enough in this application). At the first step, since node , . The delay of nodes , and
has maximum power gain of three, is selected to work at are large enough to be put into , i.e., .
with the power gain of three. At the second step, nodes , The new slacks are: , ,
and have the minimum positive slack of two. Because three and , and are updated to be zero.
of them are nonfeasible, the algorithm stops with the final power In the third iteration, , , ,
reduction of three. , MWIS , and
As a matter of fact, however, the optimal solution of Fig. 3 is , . Thus, node is added into .
to select , , , and to work at . That will result in the Finally, the algorithm terminates with . As a result, we get
maximum power reduction of six while keeping the circuit safe. the optimal solution of with maximum
Both CVS and ZSA break down because they select the nodes power reduction of six.
locally (i.e., in a very greedy fashion) and ignore the interaction The time complexity of DVPO algorithm is analyzed as fol-
between the delay and slack changes of nodes. Thus, a more lows. The time complexity of lower-bound-algorithm is basically
effective algorithm, with a global view, is required to tackle this the same as that in Step 3 of DVPO. Finding the maximum value
problem. of requires time, where is the number of
Based on the discussions in Section IV, we next outline our nodes in . Estimation of the “optimal” value of takes
algorithm for power optimization. time, since at most values need to be tried for finding the max-
imum sum in equation (6). Therefore, the prediction of optimum
Dual-Voltage-Power-Optimization (DVPO) supply voltages just needs time. Calculating the delay,
Algorithm { slack and weight function for all nodes takes time. Finding
input: also needs time. The MWIS of can be obtained in
output: The set of gates, time because . Since MWIS , and
Step 1: Predict the “optimum” supply linear time is required to update the slacks and weight functions
voltages by calling lower-bound-algo- of all nodes in and for each node MWIS,
rithm ( ), unless they have been given each iteration in Step 3 can be completed in time. There-
by the users; fore, assuming the number of iterations is , the time complexity
CHEN et al.: ON GATE LEVEL POWER OPTIMIZATION USING DUAL-SUPPLY VOLTAGES 623
where the first term is the reduced power due to LCs and the
(8)
second term accounts for the increased power of the node it-
624 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 9, NO. 5, OCTOBER 2001
TABLE I
THE LOWER BOUND OF POWER CONSUMPTION
TABLE II
THE OPTIMUM SUPPLY VOLTAGE AND POWER REDUCTION USING DIFFERENT VALUES OF V (THE SHADED ENTRIES CORRESPOND TO THE ACTUAL VOLTAGE
WITH MAXIMUM POWER REDUCTION)
(a) (a)
(b) (b)
Fig. 11. Performance of lower-bound algorithm on a set of circuits. (a) Lower Fig. 12. The estimated lower bound of power consumption and “optimum”
bound on power consumption. (b) CPU time. supply voltages under different timing constraints. (a) Lower bound on power
consumption. (b) Optimum supply voltage.
in terms of both accuracy and efficiency of the
algorithm. This is because the timing penalty increases quickly when the
Also, we tested our algorithm under different timing con- supply voltage of a gate is reduced to the smaller value.
straints. Fig. 12 shows the lower bound of power and optimum
supply voltage given the timing constraints of , B. Comparison of Our Dual-Voltage Approach and Gate
and , where is the minimum delay of the circuit. Sizing Technique
As can be expected, the lower bound of power decreases as the In order to compare our dual-voltage approach with gate
timing constraints are relaxed. The “optimum” , however, sizing technique, we implemented a gate sizing technique [17]
basically stays at 5 V under various timing constraints. What is which is also based on the MWIS. Table III summarizes the
interesting is that looser timing constraints do not always lead results (with V and V) using a standard
to the lower “optimum” value of , as shown in Fig. 12(b). cell library. The average power reduction over tested circuits
626 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 9, NO. 5, OCTOBER 2001
TABLE III slack drastically increases after optimization. For circuit i3, in
POWER REDUCTION (%) COMPARISON OF DUAL-VOLTAGE TECHNIQUE AND contrast, only 6 out of 310 gates work at and, hence, the
GATE SIZING TECHNIQUE
slack change of the gates is much smaller, resulting in very lim-
ited power saving (only 0.1%).
D. Power-Delay Tradeoff
To provide the power-delay tradeoff, we specified the dif-
ferent timing constraints on the given circuits for our power
optimization. As an example, the power-delay tradeoff curves
for two circuits (C880 and apex6) are shown in Fig. 14, where
and are the two extreme points. corresponds to the
minimum timing constraint with relatively less power reduc-
tion, while stands for relatively loose timing constraint with
the possible maximum of power reduction which is reached
when all gates of the circuit work at . Using V
and V, the maximal percentage of power reduction
is 6.9% for gate sizing and 19.6% for dual-voltage approach. should be theoretically %. However,
When using the optimum supply voltage, more power saving since the primary inputs always work at , the actual min-
is possible with dual-voltage technique. However, it should imum is higher. As shown in Fig. 14, the actual minimum power
be pointed out that for dual-voltage technique, there will be for circuit C880 and apex6 are 62.8% and 66.1% of the initial
some power overheads at physical level, which could not be total power, respectively, and the power consumption no longer
accounted for at gate level. More recently, an approach that decreases when the delay (i.e., the given timing constraint) in-
uses gate sizing and dual-voltage techniques simultaneously creases further beyond point .
for low power has also been reported [17], [27]. In the above discussions, we are assuming the capacitive
loading of a gate remains the same whether it works at
C. Performance of Our Approach Compared With CVS or . In the real design, however, the gate with has
less parasitic capacitance than that with , indicating that
For comparison with CVS which is the existing dual-voltage our power estimation for gates tends to be pessimistic. In
approach, we also implemented CVS under SIS environment this sense, the actual power saving can be a little higher than
and tested it on benchmarks. Table IV shows the comparison of estimated here.
our algorithm and CVS ( V and V were
used for this experiment). Columns 2, 3, and 4 of this table are
the number of gates in the circuit, the circuit delay and initial E. Power Savings Under Different Input Statistics
power consumption (i.e., with supply voltage of 5 V), respec- In the above experiments we assume that the input activity
tively, before running the algorithms. With our algorithm, the is 0.5 for all circuits. Considering the fact that the actual input
number of gates working at in the circuit, on average, ac- conditions may not be known, it is interesting to look at how
counts for 65.7% of the total number of gates. Individual per- sensitive our optimization results are to input statistics. While
centage ranges from 34.6% to 86.3%, depending on the specific the proposed technique does not require any specific input pat-
circuits. The average power penalty due to the LCs is about 5%. terns for optimization, the weight function given by (5) does
Our algorithm achieves a total power reduction of 19.6%, on av- vary with switching activity, , which depends on the input
erage, with the maximum reduction of 26.3% for circuit C880. statistics. Thus, input activity affects the optimal dual voltages,
In contrast, only 11.4% average power reduction is obtained by final assignment of nodes to or and, hence, the power
CVS. A typical instance is circuit 9symml which has only one optimization results. For a case study, we use the random input
primary output. Under the timing constraints of minimum cir- activity ranging from zero to one for circuit 9symml, and run our
cuit delay, the slack of the primary output is zero and no power dual-voltage optimization algorithms. The results are plotted
reduction is possible by using CVS. Instead, our algorithm gets in Fig. 15. Fig. 15(a) shows the curve of initial power con-
15.4% power improvement for this circuit. Note that all results sumption versus random input activity, and Fig. 15(b) shows
in Table IV were achieved without degrading the timing perfor- the power saving using our approach with random input ac-
mance of the circuit. This demonstrates the great potential ob- tivity. From Fig. 15(a), the power consumption of this circuit
tained by our dual-voltage power optimization. The last column changes a lot due to different input patterns, ranging from 295.1
of Table IV shows the CPU time running on a SUN SPARCsta- to 1367.2. From Fig. 15(b), however, the power reduction varies
tion 5 with 32 MB RAM. from 15.9% to 17.6%, showing that percentage power saving is
To see how the excessive slacks in the circuit contribute to not sensitive to input statistics. This is a desirable feature of our
power reduction, we conducted experiments on the slack change approach. Our experiments with other benchmarks showed the
of all gates resulting from our algorithm. Fig. 13 shows the dis- similar feature. We believe that percentage power reduction is
tribution histogram of gates over the slack value for two circuits: dominated by other factors (such as timing constraints and cir-
c8 and i3. For circuit c8 whose longest path delay is 13.2 ns, 45 cuit topology which affect the slack distribution) instead of input
out of 130 gates work at . The number of gates with small statistics.
CHEN et al.: ON GATE LEVEL POWER OPTIMIZATION USING DUAL-SUPPLY VOLTAGES 627
TABLE IV
PERFORMANCE OF OUR ALGORITHM AND CVS ALGORITHM ON A SET OF BENCHMARK CIRCUITS
(a)
(a)
(b) (b)
Fig. 13. Gate distribution over the slack value for (a) circuit c8 and (b) circuit Fig. 14. Power-delay tradeoff for (a) circuit C880 and (b) circuit apex6.
i3.
VII. CONCLUSION AND FURTHER WORK connection delay cannot be reasonably estimated until the phys-
In this paper, we targeted gate level power optimization with ical design phase, while the slew effect depends strongly on the
dual-supply voltages. We have shown that dual-voltage approach specific value of dual-supply voltages applied. In addition, the
can achieve significant power saving without degrading timing delay given by (2) is based on the assumption that the input
performance of the circuit. We demonstrated that it is important to voltage of a gate at supply voltage is also . However,
predict optimum supply voltages before using dual-voltage tech- when a gate drives a gate, the delay of the gate
nique and that the power penalty due to level converters can be is more accurately proportional to instead
wellcontrolled.Byspecifyingdifferenttimingconstraints,weob- of . Therefore, in this case, (2) tends to over-
tained a power-delay tradeoff with dual voltages. estimate the delay. With these issues in mind, further work is
In this work, we used a simpler delay model which does not required to develop more accurate delay model with dual volt-
account for interconnection delay and signal slew effect. Inter- ages. Besides, there is the additional noise due to cross-coupling
628 IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 9, NO. 5, OCTOBER 2001
[3] M. Igarashi et al., “A low power design method using multiple supply
voltages,” in Proc. Int. Symp. Low Power Electronics and Design
(ISLPED), Monterey, CA, Aug. 1997, pp. 36–41.
[4] S. Raje and M. Sarrafzadeh, “Variable voltage scheduling,” in Proc. Int.
Symp. Low Power Design (ISLPD), Dara Point, CA, Apr. 1995, pp. 9–14.
[5] K. Usami et al., “Automated low power technique exploiting multiple
supply voltage applied to a media processor,” in Proc. Custom Integrated
Circuit Conf. (CICC), Santa Clara, CA, May 1997, pp. 131–134.
[6] C. M. Fiduccia and R. M. Mattheyses, “A linear time heuristic for im-
proving network partitions,” in Proc. IEEE/ACM Design Automation
Conference (DAC), Las Vegas, NV, June 1982, pp. 175–181.
[7] E. M. Sentovish et al., “SIS: A system for sequential circuit synthesis,”
Univ. California, Berkeley, Tech. Rep. UCB/ERL M92/41, May 1992.
[8] A. P. Chandrakasan and R. W. Brodersen, Low-Power CMOS Digital
Design. Norwell, MA: Kluwer, 1995.
(a) [9] S. Mutoh et al., “1-V power supply high-speed digital circuit technology
with multithreshold voltage CMOS,” IEEE J. Solid-State Circuits, vol.
30, pp. 847–853, Aug. 1995.
[10] R. Nair, C. L. Berman, P. S. Hauge, and E. J. Yoffa, “Generation of per-
formance constraints for layout,” IEEE Trans. Comput.-Aided Design,
vol. 8, pp. 860–874, Aug. 1989.
[11] R. H. Mohring, Graphs and Orders: The Role of Graphs in the Theory
of Ordered Sets and Its Application, I. Rival, Ed. New York: D. Reidel,
May 1984, pp. 41–101.
[12] S. Raje and M. Sarrafzadeh, “Scheduling with multiple voltages,” Inte-
gration VLSI J. 23, pp. 37–59, 1997.
[13] J. M. Chang and M. Pedram, “Energy minimization using multiple
supply voltages,” IEEE Trans. VLSI Syst., vol. 5, pp. 1–8, Dec. 1997.
[14] D. Kagaris and S. Tragoudas, “Maximum independent sets on transi-
tive graphs and their applications in testing and CAD,” in Proc. Int.
Conf. Computer-Aided Design (ICCAD), San Jose, CA, Nov. 1997, pp.
736–740.
(b)
[15] K. Usami et al., “Design methodology of ultra low-power MPEG4 codec
Fig. 15. Power consumption for circuit 9symml under different input statistics. core exploiting voltage scaling techniques,” in Proc. ACM/IEEE Design
(a) Initial power versus random input activity. (b) Power savings versus random Automation Conf. (DAC), San Francisco, CA, June 1998, pp. 483–488.
input activity. [16] C. Yeh, Y. Kang, S. Shieh, and J. Wang, “Layout techniques supporting
the use of dual supply voltages for cell-based designs,” in Proc.
ACM/IEEE Design Automation Conf. (DAC), New Orleans, LA, June
between signals with different voltages. Noise checks are thus 1999, pp. 62–67.
[17] C. Chen and M. Sarrafzadeh, “Power reduction by simultaneous voltage
increasingly necessary during layout. scaling and gate sizing,” in Proc. Asia and South Pacific Design Automa-
More recently, we are combining gate level design with phys- tion Conf. (ASPDAC), Yokohama, Japan, Jan. 2000, pp. 333–338.
ical-level for power optimization with dual-supply voltages. [18] C. Yeh, M. Chang, S. Chang, and W. Jone, “Gate-level design ex-
Since some important parameters (such as wiring capacitance) ploiting dual supply voltages for power-driven applications,” in Proc.
ACM/IEEE Design Automation Conf. (DAC), New Orleans, LA, June
cannot be available at gate level, some kind of feedback 1999, pp. 68–71.
between logic and physical levels is required in order to know [19] V. Sundararajan and K. K. Parhi, “Synthesis of low power CMOS VLSI
the exact effect of dual-voltage approach on physical level. circuits using dual supply voltages,” in Proc. ACM/IEEE Design Au-
tomation Conf. (DAC), New Orleans, LA, June 1999, pp. 72–75.
In the logic level, dual-voltage technique gives an estimate
[20] L. Wei, Z. Chen, K. Roy, M. C. Johnson, Y. Ye, and V. K. De, “Design
of gate power reduction. In the physical level, we can use and optimization of dual-threshold circuits for low-voltage low-power
simulated annealing with additional constraints for placement applications,” IEEE Trans. VLSI Syst., vol. 7, pp. 16–23, Mar. 1999.
and routing followed by some post processing with an ultimate [21] C. Chen and M. Sarrafzadeh, “Provably good algorithm for low power
consumption with dual supply voltages,” in Proc. Int. Conf. Comput.-
goal of reducing power consumption due to both gates and Aided Design (ICCAD), San Jose, CA, Nov. 1999, pp. 76–79.
interconnections [26]. Further work is under way. [22] C. Chen and M. Sarrafzadeh, “An effective algorithm for gate-level
power-delay tradeoff using two voltages,” in Proc. Int. Conf. Computer
Design (ICCD), Austin, TX, Oct. 1999, pp. 222–227.
[23] M. Nemani and F. N. Najm, “Toward a high level power estimation capa-
ACKNOWLEDGMENT bility,” IEEE Trans. Comput.-Aided Design, vol. 15, pp. 588–598, June
1996.
The authors would like to thank the anonymous reviewers for [24] G. Karypis, R. Aggarwal, V. Kumar, and S. Shekhar, “Multilevel
their comments, which improved the quality of the paper. hypergraph partitioning: Applications in VLSI design,” in Proc.
ACM/IEEE Design Automation Conf. (DAC), Anaheim, CA, June 1997,
pp. 526–529.
[25] M. Sarrafzadeh and S. Raje, “Scheduling with multiple voltages under
REFERENCES resource constraints,” in Proc. Int. Symp. Circuits Syst. (ISCAS), Or-
lando, FL, May 1999, pp. 350–353.
[1] A. P. Chandrakasan, S. Sheng, and R. W. Brodersen, “Low-power [26] A. Nayak, P. Banerjee, C. Chen, and M. Sarrafzadeh, “Power optimiza-
CMOS digital design,” J. Solid-State Circuits, vol. 27, no. 4, pp. tion issues in dual voltage design,” in Proc. Int. Conf. Chip Design Au-
473–484, April 1992. tomation (ICDA), Beijing, China, Aug. 2000, pp. 99–105.
[2] K. Usami and M. Horowitz, “Cluster voltage scaling technique for low- [27] A. Nayak, M. Haldar, P. Banerjee, C. Chen, and M. Sarrafzadeh, “Power
power design,” in Proc. Int. Symp. Low Power Design (ISLPD), Dara optimization of delay constrained circuits,” J. VLSI Design—Special
Point, CA, Apr. 1995, pp. 3–8. Issue Low Power System Design, to be published.
CHEN et al.: ON GATE LEVEL POWER OPTIMIZATION USING DUAL-SUPPLY VOLTAGES 629
Chunhong Chen (M’99) received the B.S. and M.S. degrees in electrical engi- Majid Sarrafzadeh (M’87–SM’92–F’96) received the B.S., M.S., and Ph.D.
neering from Tianjin University, Tianjin, China, and the Ph.D. degree in elec- degrees in 1982, 1984, and 1987, respectively, from the University of Illinois at
trical engineering from Fudan University, Shanghai, China. Urbana-Champaign in electrical and computer engineering.
From 1986 to 1996, he was with Zhejiang University of Technology, He joined Northwestern University, Evanston, IL, as an Assistant Professor
Hangzhou, China, as an Assistant/Associate Professor. From 1997 to 1998, in 1987. In 2000, he joined the Computer Science Department at University of
he was a Research Associate at the Hong Kong University of Science and California at Los Angeles (UCLA). He has collaborated with many industries in
Technology, Hong Kong. From 1999 to 2001, he was a Postdoctoral Fellow, the past ten years, including IBM and Motorola and many CAD industries. He
first at Northwestern University, Evanston, IL, and then at the University of contributed to Theory and Practice of VLSI Design. He has published approx-
California, Los Angeles. Since then, he has been an Assistant Professor at the imately 200 papers, is a Co-Editor of Algorithm Aspects of VLSI Layout (Sin-
University of Windsor, ON, Canada. His current research interests include gapore: World Scientific, 1994), coauthor of An Introduction to VLSI Physical
physical layout, logic synthesis, timing analysis, and power optimization for Design (New York: McGraw Hill, 1996), and the author of an invited chapter in
integrated circuits. the Encyclopedia of Electrical and Electronics Engineering in the area of VLSI
circuit layout. He is on the editorial board of the VLSI Design Journal and an
Associate Editor of ACM Transaction on Design Automation (TODAES). His re-
cent research interests include embedded and reconfigurable computing, VLSI
CAD, and design and analysis of algorithms.
Ankur Srivastava (S’00) received the Bachelor of Technology degree in elec- Dr. Sarrafzadeh received the NSF Engineering Initiation award, two Distin-
trical engineering from the Indian Institute of Technology, Delhi, India, in 1998 guished Paper Awards in the International Conference on Computer-Aided De-
and the M.S. degree in computer engineering from Northwestern University, sign (ICCAD), and the Best Paper Award in the Design Automation Conference
Evanston, IL, in 2000. He is currently working toward the Ph.D. degree at the (DAC) for his work in the area of physical design. He has served on the tech-
Computer Science Department, University of California, Los Angeles, under nical program committee of numerous conferences in the area of VLSI Design
the guidance of Prof. M. Sarrafzadeh. and CAD. He has served as committee chair of a number of these conferences,
He has served as an Intern at Fujitsu Laboratories of America, Sunnyvale, including International Conference on CAD and International Symposium on
CA, during the summer of 1999. His primary research interests include delay Physical Design. He was the General Chair of the 1998 International Sympo-
and power optimization in logic circuits, reconfigurable computing, and power sium on Physical Design. He is an Associate Editor of IEEE TRANSACTIONS ON
issues in sensor networks. COMPUTER-AIDED DESIGN.