What contributions have the authors mentioned in the paper "Thermal-induced leakage power optimization by redundant resource allocation" ?

In this paper, the authors propose a technique to reduce the total leakage power of a design by identifying the optimal number of resources during allocation and binding. The authors demonstrate that, contrary to the general tendency to minimize the number of resources, the best solution can actually be achieved if a certain degree of redundancy is allowed. In this paper, the authors show that there is a power density, hence, temperature, at which the total leakage power will reach its optimal value. The authors also present a high-level power density-aware leakage model. Distributing activity over a higher number of resources can reduce power density, remove potential hotspots and subsequently minimize thermal induced leakage.

How many resources are needed to achieve the optimal dynamic power?

Experimental results reported in past work [2] show that for a given design example the optimal dynamic power for five resources is 70.882, for six resources 67.872, and for seven resources it is 65.514.

How much power reduction did the authors achieve on av-erage?

As the authors can see from the results, the authors achieved at most 56.5%, on av-erage 35.7%, leakage power reduction compared to thermal-aware resource binding technique.

(Open Access) Thermal-induced leakage power optimization by redundant resource allocation (2006) | Min Ni

Q: What is the popular technique for reducing leakage power?

Assigning different threshold and/or supply voltages to transistors or gates, together with simultaneous gate sizing [6, 11, 14, 16] is one of the most popular techniques for both standby and operating mode leakage optimization.

Q: How can the authors find the lowest cost package for each binding?

the authors will find the lowest cost (highest h) feasible package for each binding based on the relationship between power density and package heat coefficient.

Thermal-Induced Leakage Power Optimization by

Redundant Resource Allocation

Min Ni and Seda Ogrenci Memik

Electrical Engineering and Computer Science

Northwestern University, Evanston, IL

mni166, seda

@ece.northwestern.edu

ABSTRACT

Traditionally, at early design stages, leakage power is associated

with the number of transistors in a design. Hence, intuitively an im-

plementation with minimum resource usage would be best for low

leakage. Such an allocation would generally be follo wed by switch-

ing optimal resource binding to achieve a low power design. This

treatment of leakage power is unaware of operating conditions such

as temperature. In this paper, we propose a technique to reduce the

total leakage power of a design by identifying the optimal num-

ber of resources during allocation and binding. We demonstrate

that, contrary to the general tendency to minimize the number of

resources, the best solution can actually be achieved if a certain de-

gree of redundancy is allowed. This is due to the fact that leakage is

strongly dependent on the on-chip temperature proﬁle. Distributing

activity over a higher number of resources can reduce power den-

sity, remove potential hotspots and subsequently minimize thermal

induced leakage. On the other hand, using an arbitrarily high num-

ber of resources will not yield the best solution. In this paper, we

sho w that there is a power density, hence, temperature, at which the

total leakage power will reach its optimal value. Such an optimal

resource number can be a better starting point for the subsequent

switching-driv en low power binding. We also present a high-level

po wer density-aware leakage model. Based on the estimates by this

model, we optimize the total leakage power by 53.8% on average

compared to the minimum resource binding, and 35.7% on average

compared to a temperature-aware resource binding technique.

1. INTRODUCTION

Due to technology scaling, the share of leakage power in the to-

tal power budget is on the rise. Supply voltage levels are lowered

with each technology generation, which in turn necessitates lower-

ing of the threshold voltage levels of devices in order to maintain

low delay. Leakage increases exponentially with decreasing thresh-

old voltage levels. As a result, leakage power starts to become sig-

niﬁcant, sometimes even dominant in total power budgets, which

could be up to 50% of the total power [5].

A plethora of techniques to reduce leakage power have been

proposed in literature. Majority of these techniques focus on the

Permission to make digital or hard copies of all or part of this work for

personal or classroom use is granted without fee provided that copies are

not made or distributed for proﬁt or commercial advantage and that copies

bear this notice and the full citation on the ﬁrst page. To copy otherwise, to

republish, to post on servers or to redistribute to lists, requires prior speciﬁc

permission and/or a fee.

ICCAD 2006, November 5–9, 2006, San Jose, California, USA.

gate or transistor-level optimizations. Assigning different thresh-

old and/or supply voltages to transistors or gates, together with si-

multaneous gate sizing [6, 11, 14, 16] is one of the most popular

techniques for both standby and operating mode leakage optimiza-

tion. Other techniques, such as using sleep transistors to put the

circuit into sleep mode whenever it idles for a certain period [5]

are also used for reducing standby state leakage power. All these

techniques are derived from the observation that the subthreshold

leakage current, which is the most signiﬁcant one among the four

main sources of leakage current [12], can be expressed by the fol-

lowing equation [15]:

sub

µv

sth

(

;

ηV

)

(

ηV

)

(

;

)

(1)

Therefore, subthreshold current is a function of device size, supply

voltage, temperature, and other process parameters, such as thresh-

old voltage (V

). Most of the above techniques trade-off leakage

po wer with the design complexity to manipulate the threshold volt-

age and supply voltage by adding extra power control components.

Another aspect of leakage is related to dynamic conditions such

as temperature. Leakage has a superlinear dependency on temper-

ature. Fallah et al. reported that the share of leakage power can

increase from 6% at the ambient temperature to as high as 56% of

total power at 110

C [5]. Another study reported that the leakage

po wer in an embedded processor can increase by about 30% due to

thermal-induced leakage [8].

Temperature on a chip is itself a function of various parame-

ters, where the foremost factors are the power density on the chip

and the properties of the package. The power density, hence tem-

perature, will continue increasing in future technologies according

to α-power law [13]. The abov ementioned techniques for leak-

age optimization generally do not address the power density on a

chip. Often times, they can in fact ex acerbate the effects of power

density while aiming to consolidate activity on fewer localized re-

sources (for instance in an effort to place parts of the chip in sleep

mode and channel computation towards a selected subset of com-

ponents).

In this work, we investigate a technique to consider the impact

of resource selection on the overall power density and consequently

on thermal-induced leakage in future technology nodes. Resource

allocation and binding is a proper stage during high-level synthe-

sis to consider the potential impact of area on po wer density. At

that stage it is decided how many resources and which type of re-

sources will be utilized in the design. More resources will result in

larger area and most likely in lower power density. In this paper,

we are trying to establish an effective tradeoff between the num-

ber of resources and the total leakage power. There exists an op-

timal point where the amount of resources used yields the most

favorable power density, which in turn results in the least thermal-

297

induced leakage po wer. Our study re veals that often times in order

to reach this point the amount of resources should be higher than

the amount which would be sufﬁcient to satisfy the same perfor-

mance constraint. A judicious introduction of redundant resources

when there is need to relieve power density, will ultimately help

reduce thermal-induced leakage and total leakage signiﬁcantly.

The major difference between our work and other hotspot-moving

resource allocation techniques is that in almost all the hotspot-

moving techniques [9, 10] a threshold temperature is assumed.

Based on this given constraint, they are trying to make sure that

there are no places on the chip where the static temperature will

exceed that threshold value. However, our work is to decide what

this threshold temperature is, in order to optimize performance,

e.g., to optimize leakage power in our work. Other low power re-

source binding techniques [2–4] which consider switching power

can be supported by our initial allocation. In this way, the low

power resource binding would address two components within two

stages. The ﬁrst stage is to ﬁnd the optimal resource number, re-

sulting in best power density and temperature, such that the leak-

age power will be minimized. The second stage is to optimize the

dynamic power and maintain control over thermal behavior by ex-

isting thermal-driv en techniques and switching-driven techniques

based on the results of the ﬁrst stage.

One reason rendering this distinction feasible is that with differ-

ent starting points (different number of resources and temperature

constraints), the optimal dynamic power considering switching ac-

tivity does not vary signiﬁcantly [2]. Experimental results reported

in past work [2] show that for a given design example the opti-

mal dynamic power for ﬁve resources is 70.882, for six resources

67.872, and for seven resources it is 65.514. Only less than 5%

change is observed when adding more resources. Often times, in-

troduction of redundancy to the resource set might in fact help re-

duce the impact of conﬂicts due to dependencies and scheduling

compatibility and create more opportunities for the switching op-

timal binding to ﬁnd a slightly lower switching assignment, which

reduces the dynamic power. Therefore, we can safely conclude

that the optimal dynamic power of functional units will not in-

crease when we add resource redundancy to achieve the optimal

leakage power. On the other hand, the leakage po wer is much more

sensitive to the selection of the resource set than dynamic power.

Even adding one more resource may probably reduce the leakage

po wer by more than 50%, because leakage power is strongly cou-

pled with power density and in turn the chip temperature. There-

fore, the two-stage optimization is meaningful and effective. We

will address mainly the ﬁrst stage, i.e. power density and resulting

thermal-induced leakage optimization during allocation.

The rest of this paper is organized as follows. Section 2 describes

the leakage power estimation model we will use in this paper. Main

ideas of our low power resource binding technique are discussed in

Section 3. Section 4 presents our experimental ﬂo w and results.

Conclusions are given in Section 5.

2. LEAKAGE ESTIMATION MODEL

Before we start to ﬁnd the optimal number of resources for leak-

age power, it is necessary to establish ﬁrst a simple model for leak-

age estimation. It is important to emphasize that the intention of

this model is not to compute exact temperature levels. This model

intends to establish the prevailing trend linking power density and

temperature and subsequent expected rate of increase in leakage.

Once we establish this trend it will be a reasonable tool for us to

search for the best resource allocation. Most importantly, it will

help us identify the point where the rate of increase in leakage

po wer due to addition of redundant resources will ﬁnally counter-

balance the decrease in thermal-induced leakage due to reduction of

power density after addition of each redundant resource. Up until

that point addition of redundant resources and distribution of oper-

ations onto them will be expected to progressively improve power

density and hence, the total leakage.

We need to establish the following in order to achieve this goal.

First, we need to have the means to compare the relative leakage

of different modules at ambient temperature. For this purpose, we

have used transistor-level (HSpice) simulation of simple building

blocks encountered within the resources in our library to obtain

leakage po wer values for each resource. After simulating the leak-

age power for a simple structure, such as a transistor or a gate, we

scale it to obtain ambient leakage power for individual modules.

Each module implementation in our library requires a customized

scaling factor. The scaling factor not only depends on the number

of transistors in the module, but also on the sizing of individual

transistors and the actual threshold voltage used in the design. We

used empirical data [12] to derive the leakage power scaling fac-

tors of each module type, under the basic idea that leakage po wer

becomes a certain fraction of total power at a gi ven temperature.

Next, we establish the trends to represent the rate of increase in

leakage in response to a change in temperature analytically. In-

stead of using Equation 1 directly, we use Lagrange’s interpolation

formula to implement the curve ﬁtting, as shown in Equation (2),

(

∑

∏

(

;

)

∏

(

;

)

(2)

where

(



)

is the leakage point obtained from the Hspice simu-

lation. Using analytic leakage formula such as Equation 1 directly

is also feasible. However, we prefer to let the simulation engine to

decide the physics details and then ﬁt the experiment data exactly

by Lagrange’s interpolation.

Having obtained the analytical form of the leakage power trend,

we can use a numerical method to establish the relationship be-

tween power density and temperature. At this point, we turn our

attention towards the two most important factors that affect the ther-

mal behavior: the power density P

A and the heat transfer coefﬁ-

cient.

Equation 3 [7] illustrates the relationship between power density,

heat transfer coefﬁcient (i.e. thermal properties of packaging), and

temperature.





(3)

where T

is the ambient temperature, P is the total power dissipa-

tion, A is the area of design, and h is the heat transfer coefﬁcient

as used in the heat transfer theory. The value of h represents how

well the chip package can dissipate the heat. A large value of h

always implies poor cooling package. An example of h value is

4.75cm



C/W, based on the operating chip temperature of 120

degree for the 180nm technology [7]. We will sho w that for ev-

ery power density level, there is always a maximum package heat

coefﬁcient (thus poorest acceptable package). Using a packaging,

which has an even larger heat coefﬁcient than this will be likely to

cause thermal run-away.

Figure 1 illustrates the relationship between average power den-

sity across a given chip, the heat coefﬁcient of the package and the

expected steady state temperature. In this ﬁgure, the lines starting

from the origin represent the heat transfer ability of the package.

It is proportional to the chip temperature. High temperature results

in need for fast heat dissipation by the package. The other three

curves represent the different power density levels of the chip. The

298

bending of the curve reﬂects the fact that the leakage power has

become a signiﬁcant part of total power consumption and the leak-

age power has a superlinear dependency on temperature. When

the heat generation equals the heat dissipation, the chip tempera-

ture will become steady. Therefore, the intersection point of both

po wer density curve and package heat coefﬁcient curve represents

the steady state point. It can be seen from Figure 1 that for power

density, the higher it is, the higher steady temperature it will reach

with respect to the same packaging conﬁguration.

50 60 70 80 90 100 110 120

0.5

1.5

2.5

x 10

−3

temperature(

leakage power(W)

power density 1

power density 2

power density 3

package cooling level

Figure 1: Establishing the relationship between temperature

and power density.

This relationship between power density and package heat coef-

ﬁcient is the base for our leakage estimation model. The analytical

formula for calculating the steady state temperature is,

(

;

)=(

∑

∏

(

;

)

∏

(

;

)



)

(4)

where A is the total area of resources, n is the number of resources,

and f is the leakage power scaling factor. In our experiments, f is

250 for a 16-bit multiplier module and 80 for a 32-bit adder mod-

ule. It is approximately proportional to the area of the module.

represents the dynamic power. Our purpose is to solve for the

steady state temperature T

from this equation. Before that, we ﬁrst

sho w that it is the superlinear relationship between leakage power

and temperature that leads to our conclusion that there exists an

optimal number of resources (corresponding to an optimal temper-

ature).

LE MMA 1. The steady state temperature T

monotonically de-

creases with the incr easing number of resources n if the Lagrange

formula is linear.

ROOF. After rearranging Equation 4, we have



(

)

(5)

where P

is the dynamic power, which is constant as we discussed

above. n is the number of resource, a

is the area of one resource.

Using linear Lagrange interpolation, we substitute L

(

into equation (5) and solve for T

;

(6)

it can be seen that T

decreases monotonically when n increases.

LE MMA 2. The leakag e power in the form of n



(

)

mono-

tonically increases with increasing number of resources.

HEOREM 1. The leakage power in the form of n



(

)

1, is not a monotonic function. It obtains a minimal value at some

resource number n



ROOF. We only analyze the situation where p

2 here. Higher

order Lagrange interpolation can be analyzed numerically in the

similar way. Suppose L

(

c, substitute it into Equa-

tion (5),

;

(

;

)

;

4ah

(

)

2ah

(7)

Therefore the total leakage power in the form of n



(

)

becomes,



(

(8)

where s



are some coefﬁcients. The optimal solution can

be found by setting the derivative to zero. It is in the form of a

quadratic equation.

We proved theoretically that there exists an optimal number of

resources which minimizes the total leakage power. In the next sec-

tion we will show how to reach the optimal solution by a numerical

method.

3. REDUNDANT RESOURCE ALLOCATION

FOR LEAKAGE OPTIMIZATION

Our main goal is to achieve low power density by introduction

of redundant resources in the search of the optimal point where the

reduction in thermal-induced leakage still brings a higher beneﬁt

compared to the additional leakage due to the redundant resources.

Ho wever, deriving an analytic formula for the optimal number of

resources is only possible for 2-degree Lagrange interpolation. In

reality, we will use at least a 10-degree Lagrange formula (there-

fore at least 10 experiment data points) in order to maintain good

accuracy. Another way to solve this problem is to perform an incre-

mental search in the solution space. This is feasible because of the

number of resources will take discrete values. The main algorithm

is illustrated in Figure 2.

Algorithm

Redundant Resource Allocation

Input

: Resource library with power

characterization, resource scheduled DFG,

minimum required leakage power reduction a%

Output

: Number of resources after redundant

allocation

For each resource type

find

avg dynamic power();

find

resnum bounds();

find

package parameter();

n = min

resource number;

While (

∆P

> a%)

add

resource redundancy(n);

steady

temperature = secant(n,

(

)

);

∆P

(

)

;

(

)

(

)

;

End

Return number of resources n in new allocation;

End

Figure 2: Pseudocode of the redundant resource allocation al-

gorithm.

The basic idea of this algorithm is to increment the number of re-

sources until the beneﬁts of leakage power reduction become less

than some expectation constraint. In each iteration, we use a nu-

merical method to solve equation (4). In this equation, T

is the

v ariable. Before we can solve it, we have to know the dynamic

299

po wer value P

and package heat coefﬁcient h. Leakage power

scaling factor f is derived empirically [12].

Therefore, based on the information given by the scheduled DFG,

we ﬁrst calculate the average dynamic power for each resource

type. At such a high level, we have to ignore the thermal coupling

between different resources because we have no physical position

information available. However, our methodology is still applica-

ble if thermal coupling information is av ailable. The new steady

state temperature can be calculated by combining our results and

the information of thermal coupling. Moreover, ignoring coupling

only underestimates the total leakage po wer, because when one

resource temperature reduces due to resource redundancy, other

resources can also reduce their temperature through thermal cou-

pling. In other words, we can at least get as much leakage reduc-

tion as our result shows. Higher beneﬁts can be expected if ther-

mal coupling is introduced into the leakage estimation model. The

lower bound and upper bound for the number of resources can also

be derived from these DFG ﬁles and incorporated into the search.

The next step is to decide the package heat coefﬁcient according

to different power density levels. Using a very lo w package heat

coefﬁcient h is always good, because the chip temperature can be

controlled effecti vely. However, such very low h always implies

high packaging cost. Therefore, we will ﬁnd the lowest cost (high-

est h) feasible package for each binding based on the relationship

between power density and package heat coefﬁcient. This packag-

ing characteristics will be used in our experiments.

We will discuss estimating the average dynamic power in sub-

section 3.1. The algorithm for identifying the lowest cost package

is presented in subsection 3.2. In subsection 3.3 we will show how

to use a numerical method to obtain the expected steady state tem-

perature, and relate it to the leakage trends.

3.1 Av erage Resource Dynamic Power

We assume that each resource will consume a typical av erage

dynamic power for executing one operation. In other words, the

total dynamic power will be represented by a constant after the

scheduled DFG is given. The total power will be decided by the

total number operations that will be executed in a given number of

control steps. This approximation helps us focus on the contribu-

tion of leakage power. This is a reasonable assumption as we have

discussed in Section 1. Also, at the high-level synthesis stage in-

put switching probabilities are highly unpredictable. Individual dy-

namic power consumptions of operations can be weighted with re-

spective input switching behavior if an appropriate statistical model

is provided.

We ﬁrst derive a typical dynamic power value of the module

by some existing power estimation technique. We have used

the po wer estimations obtained after synthesizing different mod-

ules using Synopsys Design Compiler. Assume the signal toggle

rate is TR. It represents how many logic transitions there are per

unit time when the dynamic power is P

. Given a scheduled DFG,

which spans a total of m control steps and with the clock cycle time

of the design being s, we can calculate the dynamic power of each

operation as:

opt



(9)

Dynamic power consumption per operation corresponds to the power

consumption when there is only one operation scheduled on the re-

source within m control steps. By using this metric, we can scale

the dynamic power of any resource by the total number of opera-

tions assigned to it.

3.2 Estimating the Package Prop erties

The chip temperature, hence leakage power, is highly related to

the cooling package. Using an arbitrarily low h package will al-

ways guarantee a low temperature. Ho wever, it also means the

package cost will increase. We show that for each po wer density

level, there is a maximum h (minimum cost) package. If the h

exceeds this maximum value, the package heat dissipation curve

and the chip heat generation curve will not have any intersection,

which means that the heat dissipation is always slower than heat

generation. Eventually, the chip temperature will increase to an

uncontrolled high level. This phenomenon is called thermal run-

away. Mathematically, we can get the minimum cost h value when

Equation (4) has only one root.

We use a binary search algorithm to ﬁnd the maximum package

coefﬁcient. The basic idea in this algorithm is to ﬁnd a point on

the power density curve such that its tangent line intersects the zero

point of the x-axis. We can select any two points as our initial

v alues as long as one of them intersects the x-axis at a negativ e

value and the other intersects at a positive value. The algorithm

runs recursively, and ﬁnally stops when the intersection point is

close enough to the zero point.

After getting the maximum package coefﬁcient, we will decrease

its by some constant value, e.g., 10%, in order to make sure that it is

safely far away from the thermal run-away condition, but still very

low cost. This may also be needed to identify the applicable safe

and lo west cost coefﬁcient among a discrete set of v alues. We will

use this package parameter in the process of estimating the steady

state temperature level.

3.3 Steady State Temperature

The calculation of steady state temperature is basically to ﬁnd

the solution of a nonlinear equation. Newton-Raphson method can

be a good candidate. However, this method is only applicable when

the order of Lagrange interpolation is not too high.

Therefore, we use the secant method, which has the iteration

expression as shown below.

;

(

)

(

)

;

(

)

;

(

)

;

(

;

)

]

(10)

It substitutes the derivative value by a secant estimation. The con-

vergence speed depends on how far the initial point is from the real

solution. Therefore, ﬁnding a good starting point is critical in order

to guarantee the running time of our algorithm.

One such good start point can be obtained by ﬁnding the inter-

section of two lines. One is the heat package dissipation line, the

other is the simpliﬁed heat generation line by assuming that there

is no leakage power.

(11)

It can be seen analytically that this point is very near the solution.

Starting from this initial point and searching in the positive direc-

tion, we can ﬁnd t he solution within a few iterations.

Having obtained the steady state temperature by the secant method,

we use P

(



∑

∏

(

;

)

∏

(

;

)

to calculate the total leakage

po wer for a given resource allocation, that is, for certain number of

resources.

4. EXPERIMENTAL RESULTS

4.1 Experimental Flow

300

arf ewf fdct fft jct1 jdm1 jdm3 jdm4 mot2 mot3 noi

x 10

leakage power(µw)

min−resource allocation

temperature−aware allocation

optimal−leakage allocation

(a)

arf ewf fdct fft jct1 jdm1 jdm3 jdm4 mot2 mot3 noi

0.5

1.5

2.5

3.5

x 10

total power(µw)

min−resource allocation

temperature−aware allocation

optimal−leakage allocation

(b)

arf ewf fdct fft jct1 jdm1 jdm3 jdm4 mot2 mot3 noi

100

120

temperature(

min−resource allocation

temperature−aware allocation

optimal−leakage allocation

(c)

arf ewf fdct fft jct1 jdm1 jdm3 jdm4 mot2 mot3 noi

100

150

temperature(

min−resource allocation

temperature−aware allocation

optimal−leakage allocation

(d)

Figure 3: (a)Leakage power of our redundancy resource allocation technique compared with thermal-aware resource allocation

technique and minimum resource number allocation; (b)Total power of our technique and other resource allocation techniques;

(c)Average temperature of adders in three different resource allocation schemes; (d)Average temperature of multiplier in three

different resource allocation schemes.

We used two types of functional units (adders and multipliers) to

bind operations in a set of scheduled DFGs. The minimum number

of resources required is determined by the compatibility between

operations as dictated by the schedule. The maximum number of

operations of the same type, which are scheduled in the same con-

trol step correspond to the minimum number of resources required

of that type.

The area value and the average dynamic power consumption of

each module type is obtained after synthesizing them using Synop-

sys Design Compiler with the tsmc 180nm library. We scale down

these values to 70nm technology by full-scale methodology after

synthesis.

4.2 Results

The relevant information regarding our benchmarks is given in

Table 1. Our benchmark DFGs are extracted from popular DSP

and multimedia kernels [1]. Their names are listed in the ﬁrst col-

umn. The second column is the total number of operations of each

type in these DFGs. The third column presents the minimum num-

ber of resources required by the schedule of each DFG. The re-

maining columns present the average dynamic power consumption

estimated per adder and multiplier module during the execution of

these DFGs, using the method described in Section 3.

Table 1: Properties and Relevant Information on the Scheduled

DFGs

Schedule Num. of Minimum Dyn. Dyn.

Name Nodes Resources Power µW Power µW

[add,mul] [add,mul] per ADD per MUL

arf [12,16] [2,2] 534.19 3446.26

ewf [26, 8] [3,2] 659.89 4257.15

fdct [26,16] [4,4] 934.84 6030.96

fft [26,16] [3,3] 747.87 4824.77

jctrans1 [13,2] [3,2] 801.29 5169.40

jdmerge1 [23,4] [3,3] 659.89 4257.15

jdmerge3 [30,4] [3,3] 487.74 3146.59

jdmerge4 [18,12] [3,3] 509.91 3289.62

motion2 [26,14] [4,3] 467.42 3015.48

motion3 [26,14] [5,3] 467.42 3015.48

noise est [17,9] [3,2] 659.89 4257.15

Figure 4 illustrates the trends for total leakage power of one

resource type (multiplier in this case) with allocations of the re-

source in the same design. The most important observation is that

there exists an optimal number of resources which achiev es the

least total leakage power. We have observed similar trends for all

test cases. As we mentioned before, adding extra resources is not

free. The total leakage power will start to increase after some point

with further increase in number of resources. The sharpest leak-

age po wer reduction happens at high temperatures, i.e., when using

few resources at high power densities. At that point allocating one

more resource impacts the power density and thermal-induced leak-

age most. As we introduce more and more redundancy the return

diminishes. This is expected, since the thermal-induced leakage

po wer only becomes signiﬁcant at high temperature levels.

When there are more than one resource type in a DFG, we ﬁrst

add redundancy for the module with highest power density. Be-

cause such a module will be very likely to contain a hotspot leading

to high thermal-induced leakage po wer.

In practice, we set a lo wer bound on leakage power reduction to

accept the addition of a new resource. Only if adding further redun-

dancy can reduce the leakage power by a percentage larger than a

predeﬁned level, we add an extra resource. In our experiment, we

set the value to be 20% for every additional resource. This value

plays the role of judging how important power is compared to area.

Ho wever, as seen from our results, there is an optimal number of

resources, which can achieve minimum total leakage power. In

the power -critical design, we can perform a full search and use as

many resources as that optimal number indicates. Otherwise, if we

choose to stop the search earlier we might not have reached that

optimal number yet.

4 5 6 7 8 9 10 11 12 13 14 15

0.5

1.5

2.5

3.5

x 10

resource number

total leakage power (µw)

Figure 4: Trends in leakage for different allocations of the mul-

tiplier module for FFT design.

Figure 3 illustrates our results. We compared our results against

the thermal-aware resource binding techniques [9, 10]. These tech-

niques try to meet a temperature constraint while using minimum

number of resources during binding. The temperature constraint is

100

C, exactly the same as what has been used in these works. As

we can see from the results, we achieved at most 56.5%, on av-

301

Thermal-induced leakage power optimization by redundant resource allocation

Figures

Citations

The effect of data center temperature on energy efficiency

An Efficient Application Mapping Approach for the Co-Optimization of Reliability, Energy, and Performance in Reconfigurable NoC Architectures

A Multi-Objective Model Oriented Mapping Approach for NoC-based Computing Systems

High-level Synthesis for Low-power Design

Hardware synthesis using thermally aware scheduling and binding

References

MediaBench: a tool for evaluating and synthesizing multimedia and communications systems

Alpha-power law MOSFET model and its applications to CMOS inverter delay and other formulas

Thermal Modeling, Analysis, and Management in VLSI Circuits: Principles and Methods

Standby and Active Leakage Current Control and Minimization in CMOS VLSI Circuits

Design and optimization of low voltage high performance dual threshold CMOS circuits

Related Papers (5)

An Integrated Approach to Thermal Management in High-Level Synthesis

Temperature-aware resource allocation and binding in high-level synthesis

Energy-efficient real-time task scheduling with temperature-dependent leakage

Standby and Active Leakage Current Control and Minimization in CMOS VLSI Circuits

TAPHS: thermal-aware unified physical-level and high-level synthesis

Frequently Asked Questions (9)

Q1. What is the popular technique for reducing leakage power?

Q2. What contributions have the authors mentioned in the paper "Thermal-induced leakage power optimization by redundant resource allocation" ?

Q3. What is the effect of lowering the threshold voltage levels of devices?

Q4. What are some other techniques used for reducing standby state leakage power?

Q5. How many resources are needed to achieve the optimal dynamic power?

Q6. What are the foremost factors in the equation of temperature on a chip?

Q7. How much power reduction did the authors achieve on av-erage?

Q8. What is the maximum value of the package heat dissipation curve?

Q9. How can the authors find the lowest cost package for each binding?