
Stochastic Modeling of a Power-Managed System:
Construction and Optimization
Qinru Qiu, Qing Wu and Massoud Pedram
Department of Electrical Engineering-Systems
University of Southern California
Los Angeles, CA 90089
Abstract -- The goal of a dynamic power management policy is to reduce the power consumption of an electronic system by putting system components into different states, each representing a certain performance and power consumption level. The policy determines the type and timing of these transitions based on the system history, workload and performance constraints. In this paper, we propose a new abstract model of a power-managed electronic system. We formulate the problem of system-level power management as a controlled optimization problem based on the theories of continuous-time Markov decision processes and stochastic networks. This problem is solved exactly and efficiently using a "policy iteration" approach. Our method is compared with existing heuristic approaches for different workload statistics. Experimental results show that the power management method based on a Markov decision process outperforms heuristic approaches in terms of power dissipation savings for a given level of system performance.
I. INTRODUCTION
With the rapid progress in semiconductor technology, chip density and operating frequency have increased, making the power consumption of battery-operated portable devices a major concern.
High power consumption reduces the battery service life. The goal
of low-power design of battery-powered devices is thus to extend
the battery service life while meeting performance requirements.
Reducing power dissipation is a design goal even for non-portable
devices since excessive power dissipation results in increased
packaging and cooling costs as well as potential reliability
problems. Many low power design methodologies and techniques
that target digital VLSI circuits have been proposed [1]-[5].
Portable electronic devices tend to be much more complex
than a single VLSI chip. They contain many components, ranging
from digital and analog to electro-mechanical and electro-
chemical. Much of the power dissipation in a portable electronic
device comes from non-digital components. System designers
have started to respond to the requirement of power-constrained
system designs by a combination of technological advances and
architectural improvements. Dynamic power management – which
refers to selective shut-off or slow-down of system components
that are idle or underutilized – has proven to be a particularly
effective technique. Incorporating a dynamic power management
scheme in the design of an already-complex system is a difficult
process that may require many design iterations and careful
debugging and validation.
To simplify the design and validation of complex power-managed systems, a number of standardization attempts have started. Best known among them is the Advanced Configuration and Power Interface (ACPI) [6], which specifies an abstract and flexible interface between power-manageable hardware components (VLSI chips, disk drives, display drivers, etc.) and the power manager (the system component that controls the turn-on and turn-off of the system components). It is important to mention that ACPI defines multiple power modes for system components, which is a key requirement for approaches based on Markov decision processes to outperform heuristic approaches.
The problem of finding a power management scheme (or policy) that minimizes power dissipation under performance constraints is of great interest to system designers. A simple and well-known heuristic policy is the "time-out" policy, which is widely used in today's portable computers. In the "time-out" policy, a component is shut down after it has been idle for a certain amount of time. The predictive system shutdown approaches in [7][8] try to achieve a better power-delay trade-off by predicting the "on" and "off" times of the component. This prediction approach uses a regression equation based on the component's previous "on" and "off" times to estimate the next "turn-on" time, so that the component can be turned on immediately before the request arrives, improving system performance. However, this method is only applicable to the few cases in which requests are highly correlated.
Because heuristic policies have neither a robust system model nor a solid theoretical foundation, they have obvious shortcomings. First, they can never achieve the best power-delay trade-off for the system. Second, they cannot handle components that have more than two (on and off) operating modes, such as those defined in ACPI. In addition, they cannot handle complex systems with multiple, interacting components.
A power management approach based on Markov decision processes has been proposed in [9]. The system is modeled as a discrete-time Markov decision process by combining the stochastic models of its components. Once the model and its parameters are determined, an optimal power management policy can be obtained to achieve the best power-delay trade-off for the system. This approach offers significant improvements over previous power management techniques in terms of a theoretical framework for modeling and optimizing the system. There are, however, some shortcomings. First, because the system is modeled in the discrete-time domain, some assumptions about the system components may not hold for real applications. Second, the state transition probabilities of the system model cannot be obtained accurately. Moreover, the power management program needs to send control signals to the components in every time-slice, which results in heavy signal traffic and a heavy load on the system resources (and therefore more power).
The work of [10] overcomes the shortcomings of [9] by introducing a new system model (as well as component models) based on the continuous-time Markov decision process. In [10], a power-managed system is modeled in the continuous-time domain, which is closer to the situation encountered in practice; the component models are simpler and can accurately model many realistic applications.
In this paper, we improve the work of [10] in the following
ways:
1. We present a new model of the service provider that explicitly distinguishes between the two cases where the server is busy (on and servicing some request) and idle (on but not servicing any request).
2. We introduce a new model for the service requester to
capture complex workload characteristics.
3. We introduce a new model for the service queue that consists
of a normal queue and a high-priority queue. This is
important since some service requests are “urgent” and need
immediate response from the server.
4. We present a new system model that is composed of the new
component models.
*This work was supported in part by SRC under contract No. 98-DJ-606
and NSF under contract No. MIP-9628999.

This paper is organized as follows. Sections II and III describe the models for the components and the system. Sections IV and V present the experimental results and conclusions.
II. COMPONENT MODELING
We first give the notation that will be used throughout the paper:
P_{ij}(t): probability of a transition from state i (directly or indirectly) to state j during time 0 to t
p_i(t): probability that the system is in state i at time t
G: generator matrix of a continuous-time Markov process
λ: service request generation rate of the Service Requester (SR)
µ: service rate of the Service Provider (SP)
χ_{i,j}: transition rate from state i to state j
A_i: set of available actions when the system is in state i
π: power management policy
An introduction to continuous-time Markov decision processes is omitted to save space; please refer to [10] for detailed background. In this section, we describe the mathematical models of the components in a power-managed system.
We assume that the system is embedded in an environment with a single source of requests, which is defined as the service requester (SR). Requests generated by the SR can be divided into two categories, low-priority requests and high-priority requests, which are generated independently of each other. Requests generated by the SR are serviced by the system. The system itself consists of three components: a server that processes requests (the SP), a queue that stores the requests that cannot be serviced immediately upon arrival (the SQ), and a power manager (PM) that issues commands. The SR is an input source that is outside of, and independent of, the system.
Although we consider a relatively simple system in this paper,
our approach can be extended to a more complicated application
that may consist of multiple SR’s, SP’s, and SQ’s.
Both the request arrival event and the request service event are stochastic processes that follow Poisson statistics. For example, request arrivals follow a Poisson process (i.e., during time (0, t] the number of arrivals has a Poisson distribution with mean λt). Consequently, the request inter-arrival times follow an exponential distribution with mean 1/λ. We assume that a request is rejected if the SQ is full at the time it arrives.
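As a concrete illustration of this arrival model, the following sketch (assuming Python with NumPy, which are not used in the paper itself; the rate value is the one used later in Section IV) samples exponential inter-arrival times and checks that their empirical mean is 1/λ:

```python
import numpy as np

# Minimal sketch of the Poisson arrival model: inter-arrival times are
# exponential with mean 1/lam, so the count in (0, t] is Poisson(lam * t).
rng = np.random.default_rng(0)

lam = 1.0 / 80.0                                   # example request rate (per second)
inter_arrivals = rng.exponential(scale=1.0 / lam, size=20000)
arrival_times = np.cumsum(inter_arrivals)

print("empirical mean inter-arrival time:", inter_arrivals.mean())    # close to 80 s
print("requests in the first 10000 s:", np.searchsorted(arrival_times, 1e4))
```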
The SP can operate in a number of different power modes. We
also assume that the time needed for the SP to switch from one
state to another follows the exponential distribution. The PM is a
controller that reads the system state (the joint states of SP, SQ
and SR) and issues mode-switching commands to the SP.
In the remainder of this paper, we will use uppercase bold letters (e.g., M) to denote matrices, lowercase bold letters (e.g., v) to denote vectors, italicized Arial-font letters (e.g., S) to denote sets, uppercase italicized letters (e.g., S) to denote scalar constants, and lowercase italicized letters (e.g., x) to denote scalar variables.
A. Model of the Service Provider
The Service Provider (SP) is modeled as a stationary, continuous-time Markov decision process with state (operation mode) set S = {s_i : i = 1, 2, ..., S}, action set A, and parameterized generator matrix G_SP(a), a ∈ A. It can be described by a quadruple (χ, µ(s), pow(s), ene(s_i, s_j)) where: (i) χ is an S×S matrix; (ii) µ_l(s) and µ_h(s) are functions, µ_l, µ_h: S → R; (iii) pow(s) is a function, pow: S → R; (iv) ene(s_i, s_j) is a function, ene: S×S → R.
We call χ the switching speed matrix of the SP. The (i, j)-th entry of χ is denoted χ_{s_i,s_j} and represents the switching speed from state s_i to s_j. The average switching time from state s_i to state s_j is then 1/χ_{s_i,s_j}. We set χ_{s_i,s_i} to ∞, because the switch from state s_i to itself is instantaneous.
The entries of the parameterized generator matrix G_SP(a) can be calculated as:

σ_{s_i,s_j}(a) = δ(s_j, a) · χ_{s_i,s_j},   s_i ≠ s_j   (3.1)

σ_{s_i,s_i}(a) = −Σ_{j≠i} σ_{s_i,s_j}(a)   (3.2)

where

δ(s, a) = 1 if s is the destination state of action a, and 0 otherwise.   (3.3)
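The following sketch illustrates Eqns. (3.1)-(3.2) as stated; the function name and the representation of χ and of the action are illustrative assumptions, not part of the paper:

```python
import numpy as np

def sp_generator(chi, action_dest):
    """Sketch of Eqns. (3.1)-(3.2): SP generator matrix for one action.

    chi         -- S x S switching-speed matrix (diagonal entries are ignored here,
                   since Eqn. (3.1) only defines the off-diagonal entries)
    action_dest -- index of the destination state commanded by action a
    """
    S = chi.shape[0]
    G = np.zeros((S, S))
    for i in range(S):
        for j in range(S):
            if i != j:
                delta = 1.0 if j == action_dest else 0.0   # delta(s_j, a) of Eqn. (3.3)
                G[i, j] = delta * chi[i, j]                 # Eqn. (3.1)
        G[i, i] = -G[i].sum()                               # Eqn. (3.2): each row sums to zero
    return G
```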
The service rates µ_l(s) and µ_h(s) represent the service speed of the SP for low-priority requests and high-priority requests in state s, respectively. Therefore, 1/µ_l(s) or 1/µ_h(s) gives the average time needed by the SP to complete the service of one request when the SP is in state s.
A power consumption pow(s) is associated with each state s ∈ S. It represents the power consumption of the SP during the time it occupies state s. The cost rate c_{s,s} of state s is equal to pow(s). A switching energy ene(s_i, s_j) is associated with each state pair (s_i, s_j), s_i, s_j ∈ S, s_i ≠ s_j. It represents the energy needed for the SP to switch from state s_i to state s_j. The cost c_{s_i,s_j} is equal to ene(s_i, s_j).
From Eqn. (2.5), we know that the expected power consumption (earning rate) of the SP when it is in state s and action a_s is chosen can be calculated as:

c_s^{a_s} = pow(s) + Σ_{s'≠s} σ_{s,s'}(a_s) · ene(s, s').
In reality, the working modes of the SP can be divided into three groups: busy, idle, and power-down. In busy modes, the SP is fully powered and working on the first request in the SQ. In idle modes, the SP is fully powered, but it is not working on any request. In power-down modes, the SP is partially or completely shut down, i.e., it is not functional. We distinguish idle modes from busy modes because the SP cannot switch to another state while it is working on a request. In other words, if we want to turn the SP off (switch to a power-down mode), it must be switched off from an idle state.
Different busy modes may be used to model a component
working under different supply voltages. We associate different
power and delay (service rate) values to each of these modes to
model the server performance under different supply voltages.
Therefore, our policy optimization approach (cf. Section V) also
finds the best policy for dynamic voltage scaling as it finds the
optimal policy for power management.
For each busy mode, there exists a corresponding idle state.
The SP may have multiple power-down modes (e.g. standby, soft
off, hard off).
In our mathematical model of the SP, we divide the state set S into two subsets:
(1) The set of active states, S_active, where µ(s_act) is larger than 0 for each s_act ∈ S_active.
(2) The set of inactive states, S_inactive, where µ(s_ina) is 0 for each s_ina ∈ S_inactive.
The busy modes belong to the first subset. The idle and power-down modes belong to the second subset.
Not all actions in A are valid in all SP states. The constraints on a valid action can be stated as follows (a small validity check is sketched after the list):
1. An action cannot cause a direct transition from a busy mode to a power-down mode. Transitions between them must go through an idle mode.
2. An action cannot cause a transition from a busy mode to its corresponding idle mode. The transition from a busy mode to an idle mode occurs autonomously when the SP finishes a service (and is therefore not controllable).
3. An action cannot cause a transition between two busy modes. When the SP is in a busy mode, no transition to any other state is allowed.
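A minimal sketch of such a validity check follows; the state representation and all names are hypothetical, and only the three rules above are encoded:

```python
from dataclasses import dataclass

BUSY, IDLE, POWER_DOWN = "busy", "idle", "power_down"

@dataclass
class SPState:
    name: str
    kind: str          # one of BUSY, IDLE, POWER_DOWN

def action_is_valid(current: SPState, target: SPState) -> bool:
    """Encode the three action constraints listed above (illustrative only)."""
    if current.kind == BUSY and target.kind == POWER_DOWN:
        return False   # rule 1: busy -> power-down must pass through an idle mode
    if current.kind == BUSY and target.kind == IDLE:
        return False   # rule 2: busy -> idle happens autonomously, not by command
    if current.kind == BUSY and target.kind == BUSY and current.name != target.name:
        return False   # rule 3: no commanded transition between two busy modes
    return True
```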
Definition 3.1 An inactive state s_1 is more vigilant than an inactive state s_2 if the SP in state s_1 wakes up (switches to an active state) faster than the same SP in state s_2.

Example 3.1 Consider an SP with four states, S = {busy, idle, wait, sleep}. When the SP is in the busy state, it provides service for the requests. The average time needed for each service (for both low-priority and high-priority requests) is 5 seconds. Therefore, µ_l(busy) and µ_h(busy) are 0.2, while µ_l(idle), µ_h(idle), µ_l(wait), µ_h(wait), µ_l(sleep) and µ_h(sleep) are all 0. Let the command set be defined as A = {go_busy, go_idle, go_wait, go_sleep}. Notice that not all four commands are valid (or available) in all states. The switching speed matrix χ is a 4×4 matrix whose rows and columns are, by default, ordered as the states in S; its diagonal entries are ∞, its busy row is (∞, 0.2, 0, 0), the entries between the busy state and the wait or sleep states are 0, and the remaining entries are the finite switching rates among the other states.
χ_{s_i,s_j} = ∞ means that the SP can transfer from state s_i to s_j immediately; χ_{s_i,s_j} = 0 means that the SP can never transfer directly from state s_i to s_j. In this example, the SP needs no time to transfer from a state to itself. The SP can transfer from the busy state to the idle state with a transition rate equal to the service rate, because it goes to the idle state autonomously as soon as it finishes servicing a request. The SP cannot switch between the busy state and the wait state (or the sleep state) directly (it must go through the idle state); therefore the corresponding entries in the matrix are 0.
The power consumption is: pow(busy) = 2.3 W, pow(idle) = 2.3 W, pow(wait) = 0.8 W, pow(sleep) = 0.1 W.
The switching energy matrix ene(s_i, s_j) uses the same state ordering. An entry of ∞ means that the SP cannot switch directly between the corresponding states (between the busy state and the wait or sleep states); the energy cost of the autonomous state change (busy to idle) is zero, and the remaining entries are the finite switching energies in joules.
A graphical illustration of the SP is shown in Figure 1. The transition rates associated with the directed edges are not shown in the figure; they can be extracted from G_SP(a) for specific actions.
Figure 1 Markov process model of the SP
B. Model of the Service Requester
The Service Requester (SR) is modeled as a stationary, continuous-time Markov process with state set R = {r_i : i = 0, 1, 2, ..., R} and generator matrix G_SR. It can be characterized by a pair (τ, λ(r)), where: (i) τ is an R×R matrix; (ii) λ_l(r) and λ_h(r) are functions, λ: R → R.
We call τ the switching speed matrix of the SR. The (i, j)-th entry of τ is denoted τ_{r_i,r_j}. We assume that the time needed for the SR to switch from one operation state to another is a random variable with an exponential distribution. The average switching time from state r_i to state r_j is given by 1/τ_{r_i,r_j}. We set τ_{r_i,r_i} to ∞, because the switch from state r_i to itself is instantaneous. The SR model is a continuous-time Markov process with the generator matrix G_SR. The value of σ_{r_i,r_j} (the transition rate from state r_i to state r_j) can be calculated as:

σ_{r_i,r_j} = τ_{r_i,r_j},   r_i ≠ r_j;   σ_{r_i,r_i} = −Σ_{j≠i} σ_{r_i,r_j}   (3.4)
The request rates λ_l(r) and λ_h(r) are associated with each state r ∈ R. When the SR is in state r, the generation of low-priority requests follows a Poisson process with mean rate λ_l(r), and the generation of high-priority requests follows a Poisson process with mean rate λ_h(r).
C. Model of the Service Queue
A Single Service Queue (SSQ) is modeled as a stationary, continuous-time Markov process with state set Q_SSQ = {q_i : i = 0, 1, 2, ..., Q} and generator matrix G_SSQ(s, r), where s is the state of the SP and r is the state of the SR.
The shortcoming of using an SSQ as the stochastic model of the service queue is that we can assign only one delay constraint (i.e., the constraint on the average waiting time of the requests) during policy optimization. However, in real applications, some service requests may have higher priority than others. In particular, in a power-managed system the PM often buffers incoming service requests in order to achieve the best power-delay trade-off: the SP, under control of the PM, may not service an incoming request immediately even if there is no other request in the queue. However, there may exist high-priority requests that need immediate service by the SP. In this case, if we use a loose delay constraint, the power management policy does not serve the request immediately (in order to save power), and this long latency may not be acceptable for high-priority requests. We could instead use a tight delay constraint to make sure the high-priority requests are serviced immediately, but that tight constraint would also apply to low-priority requests. Consequently, there would be undesirable power dissipation caused by an unnecessarily tight delay constraint on low-priority requests.
We henceforth model the service queue as a combination of two SSQs: one (denoted HSQ) for the high-priority requests and the other (denoted LSQ) for the low-priority requests. The relations between these two queues are:
1. Two different delay constraints are assigned to the HSQ and the LSQ separately, such that the requests in the HSQ have a smaller waiting time than those in the LSQ.
2. The requests in the LSQ can be serviced by the SP (under the chosen PM policy) only when there is no request in the HSQ.
3. The SP will not start serving the requests in the LSQ until it has finished all the requests in the HSQ (under the PM policy).
Although we have introduced two queues in our stochastic model of the service queue, we are actually modeling a single priority queue in real applications. The SQ model can be used to model the priority queue commonly used in an operating system, where two different priorities are assigned to tasks and high-priority tasks, when they arrive, are inserted at the front of the queue. Obviously, the SQ model can also be extended to model a queue of requests that have more than two priority levels.
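The queueing discipline described above can be pictured with a small sketch; the class name, the capacity value, and the use of Python are our own illustrative choices rather than part of the paper:

```python
from collections import deque

class PrioritySQ:
    """Illustrative two-level service queue (HSQ + LSQ) with a shared capacity.

    High-priority requests are always served before low-priority ones, and
    requests arriving when the queue is full are rejected.
    """
    def __init__(self, capacity=7):
        self.capacity = capacity
        self.hsq = deque()          # high-priority requests
        self.lsq = deque()          # low-priority requests

    def enqueue(self, request, high_priority=False):
        if len(self.hsq) + len(self.lsq) >= self.capacity:
            return False            # queue full: the request is rejected
        (self.hsq if high_priority else self.lsq).append(request)
        return True

    def next_request(self):
        # the SP serves the LSQ only when the HSQ is empty
        if self.hsq:
            return self.hsq.popleft()
        if self.lsq:
            return self.lsq.popleft()
        return None
```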
The formal definition of the SQ model is as follows. The Service Queue (SQ) is modeled as a stationary, continuous-time Markov process that is the combination of two SSQs: the LSQ and the HSQ. The state set of the SQ is given by Q = Q_LSQ × Q_HSQ and its generator matrix is given by G_SQ(s, r) = G_LSQ(s, r) ⊕ G_HSQ(s, r), where s is the state of the SP, r is the state of the SR, and "⊕" is the tensor sum defined in Definition 4.1.
III. SYSTEM MODELING
We first show how to construct the model of the entire system by
combining the component models. Next we explain how the
power-managed system model is applied to practical applications.
A. Model of the Power-Managed System
The Power-Managed System (SYS) can be modeled as a continuous-time Markov process that is the composition of the models of the SP, the SR and the SQ. The state set is given by X = S × Q × R − {invalid states where the SP is busy and the SQ is empty}. The action set, containing all possible actions, is the same as A in the SP model. A parameterized generator matrix G_SYS(a) gives the state transition rates under action a. A cost function Cost(x, a) gives the system cost under action a when the SYS is in state x.
Similar to the situation of the SP model, not all actions are valid in every system state. The action constraints for the SP model (described in Section II.A) still apply to the SYS model. In addition, we add the following constraints related to the SYS model:
(1) When both the LSQ and the HSQ are full and the SP is in an inactive state, the SP cannot make a transition to another inactive state that is less vigilant (Definition 3.1) than the current one. This constraint is reasonable because the SP must go to a working mode as soon as possible in this situation.
(2) When both the LSQ and the HSQ are full and the SP is in an idle state, the SP cannot make a transition to a power-down state or to another idle state whose corresponding busy state has a slower service rate. This constraint is reasonable because, when the SP and SQ are in the above states, the service speed cannot keep up with the incoming rate of requests; therefore, we need to increase the service rate.
The SYS state can be represented as (s, r, (lq, hq)), where s ∈ S, r ∈ R, lq ∈ Q_LSQ and hq ∈ Q_HSQ. The SYS model is a connected Markov process. Consequently, the limiting distribution of the state probabilities exists and is independent of the initial state.
B. Calculating the generator matrix
We next introduce the method of calculating the generator matrix G_SYS(a) from the generator matrices of the system components: G_SP(a), G_SR, and G_SQ(s, r).
First, we show how to calculate the generator matrix of a joint
process of two independent continuous-time Markov processes.
Proposition 4.1 gives a method to obtain the joint transition rate of
two independent continuous-time Markov processes. Proposition
4.2 gives a method of generating the generator matrix of the joint
system using matrix operations.
Proposition 4.1 Given two independent stochastic processes X and Y, let σ_{(x,y),(x',y')} denote the transition rate of the joint process from joint state (x, y) to joint state (x', y'), where x and x' are in the state space of X, and y and y' are in the state space of Y. Let σ_{x,x'} denote the transition rate of process X from state x to state x', and σ_{y,y'} denote the transition rate of process Y from state y to state y'. Then σ_{(x,y),(x,y')} = σ_{y,y'}, σ_{(x,y),(x',y)} = σ_{x,x'}, and σ_{(x,y),(x',y')} = 0 when x ≠ x' and y ≠ y'.
Given two matrices A and B (written row by row) as follows:

A = [a_11 a_12; a_21 a_22],   B = [b_11 b_12 b_13; b_21 b_22 b_23; b_31 b_32 b_33]

Definition 4.1 The tensor product C = A ⊗ B is given by C = [a_11·B a_12·B; a_21·B a_22·B]. The tensor sum C = A ⊕ B is given by C = A ⊗ I_{n_2} + I_{n_1} ⊗ B, where n_1 is the order of A, n_2 is the order of B, and I_{n_i} is the identity matrix of order n_i.
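A small numerical sketch of Definition 4.1 (and of Proposition 4.2, stated next), assuming Python/NumPy, where np.kron computes the tensor product:

```python
import numpy as np

def tensor_sum(A, B):
    """Tensor (Kronecker) sum of Definition 4.1: A (+) B = A x I_n2 + I_n1 x B."""
    n1, n2 = A.shape[0], B.shape[0]
    return np.kron(A, np.eye(n2)) + np.kron(np.eye(n1), B)

# Two illustrative 2-state generator matrices; their tensor sum is the generator
# of the joint process of the two independent Markov processes (Proposition 4.2).
G_X = np.array([[-0.2, 0.2],
                [ 1.0, -1.0]])
G_Y = np.array([[-0.5, 0.5],
                [ 0.1, -0.1]])
G_joint = tensor_sum(G_X, G_Y)
print(G_joint)                 # 4 x 4 generator; each row still sums to zero
```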
Proposition 4.2 Given two independent continuous-time Markov processes with generator matrices A and B, the generator matrix of the joint process is given by A ⊕ B.
We have mentioned that the SR is independent of the rest of the system. Therefore, G_SYS(a) can be calculated as:

G_SYS(a) = G_SP-SQ(a, r) ⊕ G_SR   (4.1)

where G_SP-SQ(a, r) is the generator matrix of the joint process of the SP and the SQ. Notice that the G_SYS(a) generator matrix is also a parameterized matrix of action a.
The Markov processes of the SP and the SQ are, however, correlated, because whenever the SP makes a transition from a busy state to an idle state (i.e., finishes the service of a request), the SQ must make a transition that decreases the number of requests in the SQ by 1.
To show how to calculate G_SP-SQ(a, r) from G_SP(a) and G_SQ(s, r), we first partition G_SP(a) as follows:

G_SP(a) = [G_SP^II(a) G_SP^IA(a); G_SP^AI(a) G_SP^AA(a)]   (4.2)

Matrix G_SP^II(a) contains the transition rates for transitions between inactive states. Matrix G_SP^IA(a) contains the transition rates for transitions from any inactive state to any active state. Matrix G_SP^AI(a) contains the transition rates for transitions from any active state to any inactive state. Matrix G_SP^AA(a) contains the transition rates for transitions between active states.
We can partition G_SP-SQ(a, r) in the same way:

G_SP-SQ(a, r) = [G_SP-SQ^II(a, r) G_SP-SQ^IA(a, r); G_SP-SQ^AI(a, r) G_SP-SQ^AA(a, r)]   (4.3)

To calculate G_SP-SQ(a, r), we first calculate the four sub-matrices in Eqn. (4.3) except for the diagonal of G_SP-SQ(a, r). The entries on the diagonal are calculated using Eqn. (2.4) after the sub-matrices have been calculated.
G_SP-SQ^II(a, r) defines the transition rates for transitions between any two states (s_1, (lq_1, hq_1)) and (s_2, (lq_2, hq_2)) such that s_1, s_2 ∈ S_inactive (defined in Section II.A), lq_1, lq_2 ∈ Q_LSQ and hq_1, hq_2 ∈ Q_HSQ. It can be obtained as:

G_SP-SQ^II(a, r) = G_SP^II(a) ⊕ G_SQ(s, r)   (4.4)

Notice that, after the operation, the parameter s in G_SQ(s, r) has been removed by substituting the actual state of the SP. G_SP-SQ^II(a, r) can be calculated directly by the ⊕ operation because transitions between inactive SP states are not correlated with transitions of the SQ state.
We let g_X(x_1, x_2) denote the transition rate for the transition from state x_1 to x_2 of a Markov process X. Notice that g_X(x_1, x_2) may be a parameterized quantity, as in G_SP, G_SQ, G_SP-SQ, and G_SYS.
G_SP-SQ^IA(a, r) defines the transition rates for transitions between any two states (s_1, (lq_1, hq_1)) and (s_2, (lq_2, hq_2)) such that s_1 ∈ S_inactive, s_2 ∈ S_active, lq_1, lq_2 ∈ Q_LSQ and hq_1, hq_2 ∈ Q_HSQ. The rule for calculating the entries of G_SP-SQ^IA(a, r) is as follows: g_SP-SQ((s_1, (lq_1, hq_1)), (s_2, (lq_2, hq_2))) is equal to g_SP(s_1, s_2) if {(s_1 is an idle state) AND (s_2 is the busy state corresponding to s_1) AND (lq_1 == lq_2) AND (hq_1 == hq_2)} holds; otherwise, it is zero.
G_SP-SQ^AI(a, r) defines the transition rates for transitions between any two states (s_1, (lq_1, hq_1)) and (s_2, (lq_2, hq_2)) such that s_1 ∈ S_active, s_2 ∈ S_inactive, lq_1, lq_2 ∈ Q_LSQ and hq_1, hq_2 ∈ Q_HSQ. The rule for calculating the entries of G_SP-SQ^AI(a, r) is as follows: g_SP-SQ((s_1, (lq_1, hq_1)), (s_2, (lq_2, hq_2))) is equal to µ_l(s_1) if {(s_1 is a busy state) AND (s_2 is the idle state corresponding to s_1) AND (lq_1 == (lq_2 + 1)) AND (hq_1 == hq_2 == 0)} holds; it is equal to µ_h(s_1) if {(s_1 is a busy state) AND (s_2 is the idle state corresponding to s_1) AND (lq_1 == lq_2) AND (hq_1 == (hq_2 + 1))} holds; otherwise, it is zero.
G_SP-SQ^AA(a, r) defines the transition rates for transitions between any two states (s_1, (lq_1, hq_1)) and (s_2, (lq_2, hq_2)) such that s_1 ∈ S_active, s_2 ∈ S_active, lq_1, lq_2 ∈ Q_LSQ and hq_1, hq_2 ∈ Q_HSQ. The rule for calculating the entries of G_SP-SQ^AA(a, r) is as follows: g_SP-SQ((s_1, (lq_1, hq_1)), (s_2, (lq_2, hq_2))) is equal to λ_l(r) if {(s_1 == s_2) AND (lq_1 == (lq_2 − 1)) AND (hq_1 == hq_2)} holds; it is equal to λ_h(r) if {(s_1 == s_2) AND (lq_1 == lq_2) AND (hq_1 == (hq_2 − 1))} holds; otherwise, it is zero.
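Putting these rules together, the off-diagonal entries of G_SP-SQ(a, r) outside the inactive-inactive block can be sketched as follows. The dictionary-based state representation and all names are illustrative assumptions; the II block is obtained from the tensor sum of Eqn. (4.4) and the diagonal is filled in separately, as described above:

```python
def g_sp_sq(s1, q1, s2, q2, g_sp_12, lam_l, lam_h):
    """Illustrative off-diagonal entry of G_SP-SQ(a, r) for the IA, AI and AA blocks.

    s1, s2  -- dicts describing SP states, e.g.
               {"name": "busy", "kind": "busy", "twin": "idle", "mu_l": 0.2, "mu_h": 0.2}
    q1, q2  -- (lq, hq) queue states
    g_sp_12 -- SP transition rate g_SP(s1, s2) under the current action
    lam_l, lam_h -- SR request rates in the current SR state r
    """
    (lq1, hq1), (lq2, hq2) = q1, q2

    # IA block: an idle state wakes up into its corresponding busy state, queues unchanged
    if (s1["kind"] == "idle" and s2["kind"] == "busy"
            and s2["twin"] == s1["name"] and q1 == q2):
        return g_sp_12

    # AI block: a service completion moves busy -> idle and removes one request
    if (s1["kind"] == "busy" and s2["kind"] == "idle" and s1["twin"] == s2["name"]):
        if lq1 == lq2 + 1 and hq1 == hq2 == 0:
            return s1["mu_l"]              # finished a low-priority request
        if lq1 == lq2 and hq1 == hq2 + 1:
            return s1["mu_h"]              # finished a high-priority request

    # AA block: SP state unchanged, a new request joins one of the queues
    if s1["name"] == s2["name"]:
        if lq1 == lq2 - 1 and hq1 == hq2:
            return lam_l
        if lq1 == lq2 and hq1 == hq2 - 1:
            return lam_h

    return 0.0
```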
C. Calculating the cost function
The cost of the system is related to the state x of the SYS and the action a taken by the SYS in state x. As in [10], we use the average power consumption and the average number of waiting requests to capture the system cost. Therefore, we have three cost functions in our model: the power consumption of the SP, C_pow(x, a); the average number of requests in the LSQ of the SQ, C_lsq(x); and the average number of requests in the HSQ of the SQ, C_hsq(x). Let x be denoted as (s, r, (lq, hq)), where s ∈ S, r ∈ R, lq ∈ Q_LSQ and hq ∈ Q_HSQ.
The power cost can be calculated as:

C_pow(x, a) = pow(s) + Σ_{s'≠s} g(s, s') · ene(s, s')   (4.5)

where pow(s) and ene(s, s') were defined in Section II.A, and g(s, s') is the transition rate from state s to s' of the SP. Notice that g(s, s') is a function of a.
In addition, the delay costs are:

C_lsq(x) = lq   and   C_hsq(x) = hq   (4.6)

The average waiting time of the requests is often used as the cost of delay. However, it is shown in [10] that there exists a linear relationship between the average number of requests in the queue and the average waiting time. Therefore, Eqn. (4.6) can be used as the delay cost.
We define the total cost as a weighted sum of the power and delay costs:

Cost(x, a) = w_1·C_pow(x, a) + w_2·C_lsq(x) + w_3·C_hsq(x)   (4.7)

where w_1 + w_2 + w_3 = 1.
The optimal policy for the system model is then found using the policy iteration algorithm used in [10].
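For any fixed policy, the long-run value of this cost can be evaluated from the stationary distribution of G_SYS under that policy. The sketch below (assuming NumPy) illustrates only this policy-evaluation step, not the full policy iteration of [10]:

```python
import numpy as np

def average_cost(G, cost):
    """Sketch: long-run average cost of the SYS under one fixed policy.

    G    -- generator matrix G_SYS under that policy (each row sums to zero)
    cost -- per-state values of Cost(x, a) from Eqn. (4.7), with a fixed by the policy
    The stationary distribution p solves p G = 0 with sum(p) = 1.
    """
    n = G.shape[0]
    # replace one balance equation by the normalisation constraint sum(p) = 1
    A = np.vstack([G.T[:-1], np.ones(n)])
    b = np.zeros(n)
    b[-1] = 1.0
    p = np.linalg.lstsq(A, b, rcond=None)[0]
    return float(p @ cost)
```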
D. Application issues
Using the SYS model, a power-managed system works in the following way in a real application. When the SP of the system changes state, it sends an interrupt signal SWITCH_DONE to the PM. The PM then reads the states of all components in the power-managed system (and hence obtains the joint system state) and issues a command according to the chosen policy. The SP receives the command and immediately starts to switch to the state given by the command. Notice that the command may ask the SP to switch to its current state, in which case the SP state does not change. We assume that, after the SP finishes a service, it stays in the idle state long enough to accept the command from the PM and switch to another state. We also assume that the PM reads the states and issues commands in a time short enough not to affect system performance.
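A hypothetical sketch of this run-time loop follows; the hardware-interface names and the table-based policy lookup are assumptions for illustration, not part of the paper's specification:

```python
def power_manager_loop(policy, hw):
    # policy: dict mapping joint state (sp, sr, lq, hq) -> command for the SP
    # hw:     assumed hardware interface exposing the three calls used below
    while True:
        hw.wait_for_interrupt("SWITCH_DONE")   # SP signals a completed state switch
        sp, sr, lq, hq = hw.read_state()       # joint system state (s, r, (lq, hq))
        command = policy[(sp, sr, lq, hq)]     # look up the chosen policy's action
        hw.issue_command(command)              # may command the SP's current state
```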
IV. EXPERIMENTAL RESULTS
Experiments have been designed to evaluate the performance of
our system model and the optimization method.
A. Experiment for comparing models of the SQ
The original model (i.e., the model that does not consider request priority) [10] includes:
1. An SP model that is the same as in Example 3.1.
2. An SR model with only one state r, λ_l(r) = 1/80 and λ_h(r) = 1/100.
3. An SQ model with a single SSQ of length 7.
Our new system model includes:
1. An SP model that is the same as the one in Example 3.1.
2. An SR model with only one state r, λ_l(r) = 1/80 and λ_h(r) = 1/100.
3. An SQ model with an LSQ of length 5 and an HSQ of length 2.
Our goal is to apply a tight delay constraint C to the high-priority requests such that they are serviced within a required amount of time; for the low-priority requests, we only want to maintain their throughput (the same incoming and outgoing rates). Optimal policies for both models are calculated under the following two scenarios:
1. Since the original model cannot distinguish between high-priority and low-priority requests, it has to apply the constraint C to all requests to make sure that the delay of the high-priority requests meets C. Using the new model, we only need to apply C to the HSQ and can use a looser delay constraint on the LSQ to maintain the throughput of the low-priority requests.
2. The results from scenario 1 show that, by applying the constraint C to all requests, the original model always achieves a much smaller delay for both high- and low-priority requests than required. Therefore, in this scenario we further tighten the HSQ constraint of the new model so that the delay of high-priority requests matches that of the original model.
Different C values are used to generate the multiple rows in Table 1 and Table 2. The optimal policies are simulated using an event-driven simulator with the following setup:
1. The SP is modeled the same as in Example 3.1.
2. A total of 20,000 service requests are randomly generated. Low-priority and high-priority requests are generated independently such that they follow Poisson processes with parameters λ_l(r) = 1/80 and λ_h(r) = 1/100, respectively.
3. A queue of length 7 buffers the incoming service requests. An incoming high-priority request is inserted in front of all other requests except the high-priority requests that arrived earlier and/or the low-priority request that is currently being serviced by the SP.
Tables 1 and 2 show the experimental results for both scenarios.
Table 1 Experimental results for scenario 1

                Original model                              New model
Ave. # high-pri.  Ave. # low-pri.  Ave. power   Ave. # high-pri.  Ave. # low-pri.  Ave. power   Power
req. in queue     req. in queue    (W)          req. in queue     req. in queue    (W)          difference
0.26              0.83             1.99         0.62              1.60             0.89         55%
0.42              0.61             1.36         1.02              1.84             0.66         51%
0.63              0.86             1.04         1.26              2.41             0.54         48%

Table 2 Experimental results for scenario 2

                Original model                              New model
Ave. # high-pri.  Ave. # low-pri.  Ave. power   Ave. # high-pri.  Ave. # low-pri.  Ave. power   Power
req. in queue     req. in queue    (W)          req. in queue     req. in queue    (W)          difference
0.26              0.83             1.99         0.24              0.35             1.85         7%
0.42              0.61             1.36         0.38              1.18             1.14         16%
0.63              0.86             1.04         0.62              1.60             0.89         14%
Notice that the data in both tables are the same for the original model because it uses the same delay constraints in both scenarios.
From the results in Tables 1 and 2, we can draw the following conclusions:
1. The original model sets the same delay constraint on both low-priority and high-priority requests, which results in an undesirable increase in power dissipation. In addition, the original model always over-estimates the delay of high-priority requests, i.e., the simulated delay of high-priority requests is always smaller than the pre-set constraint, whereas with the new model the simulated delays of both high-priority and low-priority requests are always close to the pre-set constraints.
2. Even though the policy based on the old model achieves a smaller simulated delay for high-priority requests in scenario 1, with the new model we can always find an optimal policy that matches the HSQ delay constraint (which is the situation in scenario 2).
