Algorithmic Problems in Power Management
Sandy Irani
School of Information and Computer Science
University of California, Irvine
Kirk R. Pruhs∗
Computer Science Department
University of Pittsburgh
1 Introduction
We survey recent research that has appeared in the theoretical computer science literature on algorithmic
problems related to power management. We will try to highlight some open problems that we feel are
interesting. This survey concentrates more heavily on the authors' own lines of research: managing power
using the techniques of speed scaling and power-down, which are also currently the dominant techniques in
practice.
1.1 Motivation
The power consumption rate of computing devices has been increasing exponentially. Since the early 1970s,
the power densities in microprocessors have doubled every three years [34]. This increased power usage
poses two types of difficulties:
• Energy Consumption: As energy is power integrated over time, supplying the required energy may
become prohibitively expensive, or even technologically infeasible. This is a particular difficulty in
devices that rely heavily on batteries for energy, and will become even more critical as battery
capacities are increasing at a much slower rate than power consumption. Anyone using a laptop on a
long flight is familiar with this problem.
• Temperature: The energy used in computing devices is in large part converted into heat. For
high-performance processors, the cost of cooling solutions is rising at $1 to $3 per watt of heat dissipated,
meaning that cooling costs are rising exponentially and threaten the computer industry’s ability
to deploy new systems [34]. In May 2004, Intel publicly acknowledged that it had hit a “thermal
wall” on its microprocessor line. Intel scrapped the development of its Tejas and Jayhawk
chips in order to rush to the marketplace a more efficient chip technology. Designers said that
the escalating heat problems were so severe that they threatened to cause its chips to fracture [27].
For a striking example of the grievous effect of removing the fan from a modern processor, see
http://www.cs.pitt.edu/~kirk/cool.avi. (You will need a DivX codec installed.)
These two factors have resulted in power becoming a first-class design constraint for modern computing
devices [28].
There is an extensive literature on power management in computing devices. Overviews can be found
in [11, 28, 36]. All of the techniques that have been investigated are similar in that they reduce or eliminate
power to some or all components of the device. Sensor networks have emerged as an important new
∗Supported in part by NSF grants CCR-0098752, ANI-0123705, CNS-0325353, and CCF-0448196.
paradigm in which power-aware computation is absolutely critical. The explosive interest in sensor networks
is the result of the development of low-cost, low-power multifunctional sensor devices, such as the
Smart Dust Mote [1, 22], that are small in size and communicate untethered at short distance.
There is an inherent conflict between power reduction and performance; in general, the more power
that is available, the better the performance that can be achieved. As a result, it is generally proposed
that power reduction techniques be preferentially applied during times when performance is less critical.
However, this requires a policy to determine how essential performance is at any given time and how to
apply a particular power reduction technique. Current tools and mechanisms for power management are
inadequate and require more research [14]. Furthermore, there is a growing consensus that these policies
must incorporate information provided by applications and high levels of the operating system in order to
achieve necessary advances [14].
We advocate formalizing power management problems as optimization problems, and then developing
algorithms that are optimal by these criteria. The goal is to develop effective algorithms for specific problems
within the domain of power management as well as to build a toolkit of widely applicable algorithmic
methods for problems that arise in energy-bounded and temperature-bounded computation.
2 Speed Scaling
2.1 Formulation as a Scheduling Problem
Speed scaling involves dynamically changing the voltage and/or frequency/speed of the processor. A
processor consumes less power when it is run at a lower speed. Both in academic research and practice,
dynamic voltage/frequency/speed scaling is the dominant technique to reduce switching loss, which is
currently the dominant form of energy consumption in microprocessors [11, 28, 36]. Current microprocessors
from AMD, Intel and Transmeta allow the speed of the microprocessor to be set dynamically. Informally,
speed scaling problems involve determining the speed of the processor at each point in time.
Theoretical investigations of speed scaling algorithms were initiated by Yao, Demers, and Shenker [37].
Yao et al. [37] propose formulating speed scaling problems as scheduling problems. The setting is a
collection of tasks, where each task i has a release time r_i when it arrives into the system, and an amount
of work w_i that must be performed to complete the task. A schedule specifies which task to run at each
time, and at what speed that task should be run.
In particular, Yao et al. [37] consider the case that there is also a deadline d_i associated with each task
that specifies the time by which the task should be completed. In some settings, for example, the playing
of a video or other multimedia presentation, there may be natural deadlines for the various tasks imposed
by the application. In other settings, the system may impose deadlines to better manage tasks or to ensure
a certain quality of service for each task [12]. Yao et al. [37] assume that the system’s performance measure
is deadline feasibility; that is, each task must finish by its deadline.
They study the problem of minimizing the total energy used subject to the deadline feasibility
constraints. Bansal, Kimbrel and Pruhs [7, 8] study the problem of minimizing the maximum temperature
attained subject to the deadline feasibility constraints.
2.2 Energy and Temperature
Before proceeding further, we need to explain how speed, power, energy, and temperature are modeled,
and how they are related. Yao et al. [37] assume a continuous function P(s) such that if the device runs
at speed s, then it consumes power at a rate of P(s). For example, the well known cube-root rule for
CMOS-based devices states that the speed s is roughly proportional to the cube root of the power P, or
equivalently, P(s) = s^3: the power is proportional to the speed cubed [11]. Yao et al. [37] only assume that
P(s) is strictly convex. This assumption implies that the slower a task is run, the less energy is used to
complete that task. Some simplicity of analysis, and little loss of applicability, comes from assuming that
P(s) = s^α for some constant α > 1.
The total energy used by the system is then ∫_0^∞ P(s(t)) dt, where s(t) is the speed of the device at time t.
We now turn our attention to temperature. Cooling, and hence temperature, is a complex phenomenon
that can not be modeled completely accurately by any simple model [33]. In [7], Bansal, Kimbrel and Pruhs
propose a model in which the environmental temperature is assumed to be constant. While this assumption
certainly is not strictly true, the hope is that it is sufficiently close to being true that insight gained with this
model will be useful in real settings. They also assume that the rate of cooling of the device adheres to
Fourier’s Law. Fourier’s law states that the rate of cooling is proportional to the difference in temperature
between the object and the environment. Without loss of generality one can scale temperature so that the
environmental temperature is zero. A first-order approximation for the rate of change T′ of the temperature
T is then T′ = aP − bT, where P is the supplied power, and a, b are constants.
Some modern processors are able to sense their own temperature, and thus can be slowed down or
shut down so that the processor temperature will stay below its thermal threshold [34]. If one views
http://www.cs.pitt.edu/~kirk/cool.avi, this is the reason why the Pentium only slows down, and
doesn’t fry like the AMD processor.
Bansal and Pruhs [8] show that the maximum temperature is within a factor of 4 of a times the maximum
energy used over any interval of length 1/b. This observation also shows that there is a relationship between
total energy and maximum temperature optimization, and it simplifies the task of reasoning about temperature.
If the cooling parameter b is 0, then the temperature minimization problem becomes equivalent (within a
constant factor) to the energy minimization problem. This also explains why some algorithms in the
literature for energy management are poor for temperature management; that is, these algorithms critically
use the fact that the parameter b = 0. If the cooling parameter b is ∞, then the temperature minimization
problem becomes equivalent to the problem of minimizing the maximum power, or equivalently, minimizing
the maximum speed. We say that an algorithm is cooling oblivious if it is simultaneously O(1)-approximate
for minimizing the maximum temperature for all values of a and b in the temperature equation T′ = aP − bT.
Thus a cooling-oblivious algorithm is also O(1)-approximate for total energy and maximum speed/power.
The energy minimization problem, when the speed-to-power parameter α is ∞, is also equivalent to
minimizing the maximum power.
2.3 Energy Minimization with Deadline Feasibility
Yao, Demers and Shenker [37] study the problem of minimizing the total energy used to complete all tasks
subject to the deadline feasibility constraints. They give an offline greedy algorithm (YDS) that optimally
solves this problem. The algorithm YDS proceeds in a series of iterations. During each iteration, tasks in
the maximum intensity interval are scheduled Earliest Deadline First at a speed equal to the intensity of this
interval; the intensity of a time interval is defined to be the sum of the work requirements of all tasks whose
release time and deadline are both contained within the interval, divided by the length of the interval. The
newly scheduled time interval is then blacked out, and all the remaining tasks must be executed during the
remaining time that is not blacked out. It was shown by Bansal and Pruhs [8] that the energy optimality of
the YDS schedule follows as a direct consequence of the well known KKT optimality conditions for convex
programs.
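The iteration can be sketched in Python (our own naive implementation; the task-triple representation, the interval scan, and the time-contraction bookkeeping are illustrative choices, not code from [37]):

```python
# Naive sketch of YDS.  tasks is a list of (release, deadline, work) triples;
# the function returns a list of speeds, one per task.  Each iteration finds
# the maximum-intensity interval among all pairs of release/deadline points,
# fixes the speed of the tasks wholly contained in it, and contracts ("blacks
# out") that interval for the remaining tasks.

def yds_speeds(tasks):
    # Carry the original index through the contractions so task identity
    # survives the rewriting of release times and deadlines.
    live = [(r, d, w, i) for i, (r, d, w) in enumerate(tasks)]
    speeds = [0.0] * len(tasks)
    while live:
        points = sorted({t for r, d, w, i in live for t in (r, d)})
        best, best_iv = -1.0, None
        for a, t1 in enumerate(points):
            for t2 in points[a + 1:]:
                work = sum(w for r, d, w, i in live if t1 <= r and d <= t2)
                if work / (t2 - t1) > best:
                    best, best_iv = work / (t2 - t1), (t1, t2)
        t1, t2 = best_iv
        inside = {i for r, d, w, i in live if t1 <= r and d <= t2}
        for i in inside:
            speeds[i] = best   # run these tasks EDF at the interval intensity
        shrink = t2 - t1
        def contract(t):
            return t if t <= t1 else (t - shrink if t >= t2 else t1)
        live = [(contract(r), contract(d), w, i)
                for r, d, w, i in live if i not in inside]
    return speeds

# Task (0, 2, 4) forces intensity 2 on [0, 2]; task (0, 10, 4) then spreads
# its work over the remaining 8 time units at speed 0.5.
speeds = yds_speeds([(0.0, 2.0, 4.0), (0.0, 10.0, 4.0)])
```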
Theorem 1. The YDS algorithm is optimal for energy minimization.
Proof. Consider a convex program

    min f_0(x)
    subject to f_i(x) ≤ 0,  i = 1, . . ., n.

Assume that this program is strictly feasible; that is, there is some point x where f_i(x) < 0 for
i = 1, . . ., n. Assume that the f_i are all differentiable. Let λ_i, i = 1, . . ., n, be a variable (Lagrangian
multiplier) associated with the function f_i(x). Then necessary and sufficient KKT conditions for solutions
x and λ to be primal and dual optimal are [10]:

    f_i(x) ≤ 0,  i = 1, . . ., n    (1)
    λ_i ≥ 0,  i = 1, . . ., n    (2)
    λ_i f_i(x) = 0,  i = 1, . . ., n    (3)
    ∇f_0(x) + Σ_{i=1}^{n} λ_i ∇f_i(x) = 0    (4)
To state the energy minimization problem as a convex program, we break time into intervals t_0, . . ., t_m at
the release times and deadlines of the tasks. Let J(i) be the set of tasks that can feasibly be executed during
the time interval I_i = [t_i, t_{i+1}], and let J^{-1}(j) be the set of intervals during which task j can feasibly
be executed. We introduce a variable w_{i,j}, for j ∈ J(i), that represents the work done on task j during the
time interval [t_i, t_{i+1}]. Our (interval-indexed) mathematical program P is then:
    min E    (5)
    w_j ≤ Σ_{i ∈ J^{-1}(j)} w_{i,j},  j = 1, . . ., n    (6)
    Σ_{i=1}^{m} ( (Σ_{j ∈ J(i)} w_{i,j}) / (t_{i+1} − t_i) )^α (t_{i+1} − t_i) ≤ E    (7)
    w_{i,j} ≥ 0,  i = 1, . . ., m,  j ∈ J(i)    (8)
By applying the KKT optimality conditions (see [8] for details), one can conclude that a sufficient
condition for a primal feasible solution to be optimal is that:
• For each task j, the processor runs at the same speed, call it s_j, during the intervals i in which task j
is run.
• And the processor runs at speed no less than s_j during the intervals i, with j ∈ J(i), in which task j
is not run.
The schedule produced by the YDS algorithm clearly has these properties and hence is optimal.
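A small numerical illustration of the first condition (a sketch with hypothetical numbers): a single task with 6 units of work that may be split across two available intervals of lengths 1 and 2 uses the least energy exactly when both pieces run at the same speed.

```python
# Energy of splitting w units of work between two intervals of lengths L1 and
# L2 under P(s) = s^alpha: put w1 units in the first interval and w - w1 in
# the second.  Convexity makes the equal-speed split w1 = w * L1/(L1 + L2)
# optimal; here that is w1 = 2, where both speeds equal 2.

ALPHA = 3

def energy_split(w1, w=6.0, L1=1.0, L2=2.0, alpha=ALPHA):
    return (w1 / L1) ** alpha * L1 + ((w - w1) / L2) ** alpha * L2

# Scan candidate splits on a grid; the minimum lands at w1 = 2.
candidates = [i * 0.05 for i in range(121)]
best_w1 = min(candidates, key=energy_split)
```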
A naive implementation of YDS runs in time O(n^3). This can be improved to O(n^2) if the intervals have
a tree structure [26]. It would be interesting to see if the cubic running time of YDS for arbitrary instances
can be improved. For jobs with a fixed priority, Yun and Kim [39] show that it is NP-hard to compute a
minimum-energy schedule. They also give a fully polynomial time approximation scheme for the problem.
Kwon and Kim [25] give a polynomial time algorithm for the case of a processor with discrete speeds.
In the online version of the problem, an algorithm only learns about a task at its release time, at which
time it is given the exact work requirements of the job as well as its deadline. Yao et al. [37] define two
simple online algorithms. The online algorithm Average Rate (AVR) runs each task in the optimal manner
under the assumption that it is the only task in the system. That is, the work on each task is spread evenly
between its release date and its deadline. The online algorithm Optimal Available (OA) at any point of time
schedules the unfinished work optimally under the assumption that no more tasks will arrive. They give a
lower bound of α^α on the approximation ratio for AVR and OA for energy minimization. In this instance,
r_i = 0, d_i = i/n and w_i = (n/i)^{(α+1)/α}, for i = 1, . . ., n. They also prove, using a rather complicated
spectral analysis, that the approximation ratio of AVR is at most 2^α α^α for energy minimization. It was
shown by Bansal, Kimbrel and Pruhs in [7], using a simple potential function argument, that OA is
α^α-competitive with respect to energy minimization. That is, they show that at all times t,
P_OA(t) + Φ′(t) ≤ α^α P_Opt(t), where P_OA(t) is the power of OA at time t, P_Opt(t) is the power of the
adversary at time t, and Φ′(t) is the change in a potential function Φ(t) that they define.
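The two rules can be sketched as speed computations at a single time t (the task representation and the bookkeeping of unfinished work are our own illustrative choices):

```python
# AVR spreads each task's work evenly over its [release, deadline] window, so
# its speed at time t is the sum of the densities w_i/(d_i - r_i) of the
# active tasks.  OA plans a YDS-optimal schedule for the currently unfinished
# work assuming no further arrivals, so its speed at time t is the maximum
# intensity of the remaining work over the intervals [t, d].
# Tasks are (release, deadline, work) triples.

def avr_speed(t, tasks):
    return sum(w / (d - r) for r, d, w in tasks if r <= t < d)

def oa_speed(t, remaining):
    # remaining maps a task (r, d, w) to its unfinished work at time t.
    deadlines = sorted({d for (r, d, w) in remaining if d > t})
    best = 0.0
    for dl in deadlines:
        work = sum(rem for (r, d, w), rem in remaining.items() if t < d <= dl)
        best = max(best, work / (dl - t))
    return best

# Two tasks released at time 0: at t = 0, AVR runs at 4/2 + 4/10 = 2.4, while
# OA runs at the intensity of the tighter deadline, max(4/2, 8/10) = 2.0.
tasks = [(0.0, 2.0, 4.0), (0.0, 10.0, 4.0)]
avr0 = avr_speed(0.0, tasks)
oa0 = oa_speed(0.0, {task: task[2] for task in tasks})
```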
The general lower bounds in [37] on the competitive ratio, with respect to energy minimization, for
an arbitrary algorithm are of the form Ω(c^α) for some constant c. This suggests the question of whether
an online algorithm exists that can achieve a competitive ratio that is O(c^α). Such an algorithm would be
better than OA or AVR for large α. Bansal, Kimbrel and Pruhs [7] introduce an online algorithm and
prove that it achieves such a competitive ratio. We refer to this algorithm as BKP. To explain the BKP
algorithm, we need to first introduce some notation. Let w(t, t_1, t_2) denote the amount of work that has
arrived by time t that has release time ≥ t_1 and deadline ≤ t_2. Let k(t) be the maximum over all t′ > t of
w(t, et − (e − 1)t′, t′) / (e(t′ − t)). Note that w(t, t_1, t_2) and k(t) may be computed by an online algorithm
at time t. At all times t, the BKP algorithm works at rate e · k(t) on the unfinished job with the earliest
deadline. Intuitively, k(t) is a lower bound, and a current estimate, of the speed of the optimal algorithm
YDS at time t. The online algorithm has to run at a higher rate than k(t) in case more work arrives in
the future and its lower bound of k(t) was too small. The constants are chosen to provide the best analysis.
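The speed rule can be sketched as follows (our own implementation; the observation that it suffices to evaluate the maximum at each task's "entry point" into the window is our simplification, since between entry points the ratio only decreases):

```python
import math

# Sketch of the BKP speed rule.  w(t, t1, t2) is the work among tasks that
# have arrived by time t with release time >= t1 and deadline <= t2; k(t) is
# the maximum of w(t, e*t - (e-1)*u, u) / (e*(u - t)) over u > t, and BKP
# runs at speed e * k(t).  Task j first contributes to the window at
# u = max(d_j, (e*t - r_j)/(e - 1)), so checking those points suffices.
# Tasks are (release, deadline, work) triples.

E = math.e

def w(t, t1, t2, tasks):
    return sum(wk for r, d, wk in tasks if r <= t and r >= t1 and d <= t2)

def bkp_speed(t, tasks):
    arrived = [(r, d, wk) for r, d, wk in tasks if r <= t and d > t]
    entry_points = {max(d, (E * t - r) / (E - 1)) for r, d, wk in arrived}
    k = max((w(t, E * t - (E - 1) * u, u, tasks) / (E * (u - t))
             for u in entry_points), default=0.0)
    return E * k

# A single unit-work task with window [0, 1]: at t = 0, k(0) = 1/e, so BKP
# runs at speed e * (1/e) = 1.
speed0 = bkp_speed(0.0, [(0.0, 1.0, 1.0)])
```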
Bansal, Kimbrel and Pruhs [7] show that BKP is also e-competitive with respect to the maximum speed;
that is, the objective is to minimize the maximum speed at which the processor runs, subject to the
constraint that all jobs finish by their deadlines. Furthermore, this is optimal among deterministic online
algorithms. Therefore, BKP is also strongly e^α-competitive with respect to maximum power. In the
instance that establishes this lower bound, the adversary works at a rate a(t) = −1/((ln x)(1 − t)) from
time 0 to time 1 − x. We look at the limit of the resulting instances as x goes to 0. Work is released at the
rate that the adversary does the work, and the deadline of all work is 1 − x. Understanding this instance is
useful in understanding the underlying motivation for the definition of the BKP algorithm.
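One can check the normalization of this instance numerically (our own sanity check, not from the survey; the quadrature parameters are arbitrary): since ln x < 0 for x < 1 the rate is positive, and its integral over [0, 1 − x] is exactly 1 for every x, because the integral of 1/(1 − t) over that range is −ln x. Meanwhile the rate near the common deadline blows up as x → 0.

```python
import math

# The adversary's rate a(t) = -1/(ln(x) * (1 - t)) on [0, 1 - x], and a
# midpoint-rule check that the total work released is 1 for every x.

def adversary_rate(t, x):
    return -1.0 / (math.log(x) * (1.0 - t))

def total_work(x, steps=50000):
    # Composite midpoint rule over [0, 1 - x].
    h = (1.0 - x) / steps
    return sum(adversary_rate((i + 0.5) * h, x) * h for i in range(steps))
```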
2.4 Maximum Temperature Minimization with Deadline Feasibility
Bansal, Kimbrel and Pruhs show in [7] that, in principle, the problem of minimizing the maximum temperature,
subject to deadline feasibility, can be stated as a convex program as follows. We break time into intervals
t_0, . . ., t_m at the release dates and deadlines of the jobs. We introduce a variable T_i that represents T(t_i),
the temperature at time t_i. Let J(i) be the set of jobs j that can feasibly be executed during the time
interval [t_i, t_{i+1}], that is, r_j < t_{i+1} and d_j > t_i. We introduce a variable W_{i,j}, for j ∈ J(i), that
represents the work done on job j during [t_i, t_{i+1}]. Let MaxW(x, y, X, Y) be the maximum work that can
be done starting at time x at temperature X and ending at time y at temperature Y, subject to the
temperature constraint T ≤ T_max throughout the interval [x, y]. Let MaxT(x, y, X, Y) be a corresponding
temperature curve. Let UMaxW(x, y, X, Y) and UMaxT(x, y, X, Y) be similarly defined, except that there
is no maximum temperature constraint. We can then express the temperature problem as: