
Asymptotic Probability Extraction for Nonnormal Performance Distributions

TL;DR: APEX begins by efficiently computing the high-order moments of the unknown distribution and then applies moment matching to approximate the characteristic function of the random distribution by a rational function; this moment-matching approach is proven to be asymptotically convergent when applied to quadratic response surface models.
Abstract: While process variations are becoming more significant with each new IC technology generation, they are often modeled via linear regression models so that the resulting performance variations can be captured via normal distributions. Nonlinear response surface models (e.g., quadratic polynomials) can be utilized to capture larger scale process variations; however, such models result in nonnormal distributions for circuit performance. These performance distributions are difficult to capture efficiently since the distribution model is unknown. In this paper, an asymptotic-probability-extraction (APEX) method for estimating the unknown random distribution when using nonlinear response surface modeling is proposed. APEX begins by efficiently computing the high-order moments of the unknown distribution and then applies moment matching to approximate the characteristic function of the random distribution by an efficient rational function. It is proven that such a moment-matching approach is asymptotically convergent when applied to quadratic response surface models. In addition, a number of novel algorithms and methods, including binomial moment evaluation, PDF/CDF shifting, nonlinear companding, and reverse evaluation, are proposed to improve the computation efficiency and/or approximation accuracy. Several circuit examples from both digital and analog applications demonstrate that APEX can provide better accuracy than a Monte Carlo simulation with 10^4 samples and achieve up to 10× more efficiency. The error incurred by the popular normal modeling assumption for several circuit examples designed in standard IC technologies is also shown.

Summary (5 min read)

Introduction

  • As IC technologies are scaled to the deep submicrometer region, process variations are becoming critical and significantly impact the overall performance of a circuit.
  • Given a circuit performance f (e.g., the delay of a digital circuit path or the gain of an analog amplifier), the response surface modeling approximates f as a polynomial function of the process variations (e.g., ∆VTH, ∆TOX, etc.).
  • The authors prove that the moment-matching approach utilized in APEX is asymptotically convergent when applied to quadratic response surface models.
  • The authors propose their APEX approach in Section III and discuss several implementation issues, including binomial moment evaluation, PDF/CDF shifting, nonlinear companding, and reverse evaluation in Section IV.

B. Response Surface Modeling

  • The linear regression model in (3) is accurate when process variations are small.
  • Without loss of generality, the authors assume that A is symmetric in this paper, since any asymmetric quadratic form X^T·A·X can be easily converted to an equivalent symmetric form 0.5·X^T·(A + A^T)·X [10].
  • Since generating each sampling point can be quite expensive, many response surface modeling algorithms [11]–[13] attempt to minimize the required number of sampling points while maintaining good modeling quality.
  • Even if a DOE algorithm (e.g., Latin hypercube sampling [15]) generates random sampling points, the number of these sampling points is typically too small (e.g., 10–100) to accurately estimate PDF/CDF functions.

C. Classical Moment Problem

  • The classical moment problem was first proposed and studied by T. Stieltjes in 1894.
  • The process variations are modeled as normal distributions, which are unbounded and distributed over (−∞,+∞).
  • Therefore, the circuit performance approximated by the quadratic model (4) is also unbounded.
  • The classical moment problem has been widely studied by mathematicians for over 100 years, focusing on the theoretical aspects of the problem, e.g., the existence and uniqueness of the solution.
  • The practical applications of this moment problem, especially the computation efficiency of solving the problem, have not been sufficiently explored.

III. APEX

  • The optimal approximation is determined by matching the first 2M moments between h(t) and pdf(f) for an Mth-order approximation.
  • The authors first describe the mathematical formulation of the APEX algorithm.
  • Then, the authors link the APEX to probability theory and explain why it is efficient in approximating PDF/CDF functions.
  • Finally, the authors prove that the moment-matching method utilized in APEX is asymptotically convergent when applied to quadratic response surface models.

A. Mathematical Formulation

  • In (8), the definition of time moments is identical to the traditional definition of moments in probability theory except for the scaling factor (−1)^k/k!.
  • The probability density function and the cumulative distribution function are functions of f.
  • The nonlinear equations in (11) can be solved using the algorithm proposed in [18], which first solves the poles {bi} and then the residues {ai}.
  • It should be noted that many implementation issues must be considered to make their proposed approach, APEX, feasible and efficient.
  • The aforementioned moment-matching method was previously applied to IC interconnect order reduction [18], [19], and it is related to the Padé approximation in linear control theory [20].

B. Connection to Probability Theory

  • In probability theory, given a random variable f whose PDF is pdf(f), the characteristic function is defined as the Fourier transform of pdf(f) [8].
  • Matching the first 2M moments in (11) is equivalent to matching the first 2M Taylor expansion coefficients between the original characteristic function Φ(ω) and the approximated rational function H(s).
  • Therefore, the optimally approximated H(s) in (9) is a low-pass system.
  • It is well known that a Taylor expansion is accurate around the expansion point.
  • These conclusions have been verified in other applications (e.g., IC interconnect order reduction [18], [19]), and they provide the theoretical background to explain why the APEX works well for PDF/CDF approximation, as will be demonstrated by the numerical examples in Section VII.

C. Proof of Convergence

  • In the previous section, the authors have intuitively explained why the moment-matching approach is efficient in approximating PDF/CDF functions.
  • There are special cases for which the moment problem is guaranteed to converge, i.e., the PDF/CDF functions are uniquely determined by the moments.
  • The proposed APEX approach is made practically feasible by applying several novel algorithms, including: 1) a binomial scheme for high-order moment computation; 2) a modified Chebyshev inequality for PDF/CDF shifting; 3) a nonlinear-companding method to improve approximation accuracy; and 4) a reverse-evaluation technique for best case/worst case analysis.

A. Binomial Moment Evaluation

  • In addition, remember that the random variables ∆Y satisfy the standard normal distribution N(0, 1), which yields closed-form values for their moments [8].
  • In addition, the following theorem guarantees that the random variables ∆Z defined in (24) are still independent and satisfy the standard normal distribution N(0, 1).
  • The main advantage of the binomial-moment-evaluation algorithm is that, unlike the direct moment evaluation in (21), it does not explicitly construct the high-order polynomial fk.
  • In addition, the matrix diagonalization in (23) is only required once and has a complexity of O(N³); a sketch of the resulting moment recursion is given after this list.
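The full derivation of the binomial scheme is in Section IV-A of the paper, which this page only summarizes. The sketch below is our minimal reading of the idea: after the diagonalization in (23)–(24), f decouples into a sum of independent single-variable terms λi·∆zi² + βi·∆zi plus a constant, and the moments of the running sum are accumulated with the binomial theorem. The helper names are ours and the decoupled coefficients (lams, betas, c) are assumed given; treat this as illustrative rather than as the paper's exact algorithm.

```python
import numpy as np
from math import comb

def normal_moment(m):
    """E[z^m] for z ~ N(0,1): 0 for odd m, (m-1)!! for even m."""
    if m % 2 == 1:
        return 0.0
    r = 1.0
    for j in range(m - 1, 0, -2):
        r *= j
    return r

def term_moments(lam, beta, K):
    """E[g^k], k = 0..K, for g = lam*z^2 + beta*z with z ~ N(0,1),
    via the binomial expansion of (lam*z^2 + beta*z)^k."""
    mom = np.zeros(K + 1)
    mom[0] = 1.0
    for k in range(1, K + 1):
        mom[k] = sum(comb(k, l) * lam**l * beta**(k - l) * normal_moment(k + l)
                     for l in range(k + 1))
    return mom

def binomial_moments(lams, betas, c, K):
    """E[f^k], k = 0..K, for f = sum_i (lam_i z_i^2 + beta_i z_i) + c,
    folding in one independent term at a time with the binomial theorem,
    so the high-order polynomial f^k is never constructed explicitly."""
    total = np.zeros(K + 1)
    total[0] = 1.0
    for lam, beta in zip(lams, betas):
        g = term_moments(lam, beta, K)
        total = np.array([sum(comb(k, j) * total[j] * g[k - j]
                              for j in range(k + 1)) for k in range(K + 1)])
    # finally absorb the constant term c: E[(S + c)^k]
    total = np.array([sum(comb(k, j) * total[j] * c**(k - j)
                          for j in range(k + 1)) for k in range(K + 1)])
    return total
```

As a quick check, binomial_moments([1.0], [0.0], 0.0, 3) returns [1, 1, 3, 15], the first moments of a chi-square variable with one degree of freedom.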

B. PDF/CDF Shifting

  • APEX approximates the unknown PDF pdf(f) as the impulse response h(t) of an LTI system.
  • The shifted function pdf(f + f0) can be accurately approximated by the impulse response h(t) of a low-order LTI system, since it has a much smaller phase shift in the frequency domain.
  • The previous analysis implies that it is crucial to determine the correct value of f0 for PDF/CDF shifting.
  • Therefore, any circuit performance f approximated by the quadratic model in (4) is also unbounded.
  • The authors have modified the second-order Chebyshev inequality to higher orders and, therefore, refer to (31) as the modified Chebyshev inequality; one way such a bound can be turned into a shift value is sketched after this list.
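The exact form of the modified Chebyshev inequality (31) is not reproduced on this page. As a hedged illustration only, the sketch below assumes a bound of the form P(|f − µ| ≥ t) ≤ E[(f − µ)^2k]/t^2k and takes the tightest t over the available even orders; the paper's actual rule for choosing f0 may differ.

```python
def shift_distance(central_moments, eps=1e-6):
    """Smallest t such that some even-order Chebyshev-type bound gives
    P(|f - mu| >= t) <= eps, where central_moments[k] = E[(f - mu)^k].
    The PDF is then shifted by f0 = mu - t so that pdf(f + f0) is
    (almost) supported on nonnegative values, as a causal h(t) requires."""
    ts = [(central_moments[two_k] / eps) ** (1.0 / two_k)
          for two_k in range(2, len(central_moments), 2)
          if central_moments[two_k] > 0]
    return min(ts)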

C. Nonlinear Companding

  • For most practical applications, the performance variation should be dominated by the linear term in the response surface model.
  • Therefore, applying the moment matching to pdf2(g), which has almost zero skewness, can achieve better approximation accuracy than directly estimating pdf1(f).
  • In order to make the nonlinear companding efficient, the companding function should be easy to use.
  • It should be noted that, even if the transformed PDF pdf2(g) does not have an exactly zero skewness, the proposed companding function defined in (41)–(43) with the positive coefficient α always pushes the negative skewness toward zero, thereby always improving the approximation accuracy.
  • 3) Moment Evaluation in Companding: After the companding function in (41) is determined, the high-order moments of the random variable g must be computed efficiently such that the transformed PDF pdf2(g) can be approximated using the moment matching; a toy illustration of the skewness-reducing effect follows this list.
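Equations (41)–(43) defining the companding function are not included on this page, so the toy below uses a hypothetical convex transform g = (e^(αf) − 1)/α (which reduces to g ≈ f for small α) purely to illustrate the stated effect: with a positive coefficient α, a negatively skewed sample is pushed toward zero skewness. The paper's actual companding function differs.

```python
import numpy as np

rng = np.random.default_rng(0)

def skewness(x):
    x = x - x.mean()
    return (x**3).mean() / (x**2).mean() ** 1.5

# a negatively skewed toy "performance" sample
f = -(rng.standard_normal(100_000) ** 2) + 0.5 * rng.standard_normal(100_000)

alpha = 0.2  # positive companding coefficient (hypothetical)
g = (np.exp(alpha * f) - 1.0) / alpha  # convex transform, ~ f for small alpha

print(skewness(f), skewness(g))  # |skewness(g)| < |skewness(f)|
```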

D. Reverse Evaluation

  • In many practical applications, such as robust circuit optimization [4], [21], the best case performance (e.g., the 1% point on CDF) and the worst case performance (e.g., the 99% point on CDF) are two important metrics to be evaluated.
  • As discussed in Section III-B, APEX matches the first 2M Taylor expansion coefficients between the original characteristic function Φ(ω) and the approximated rational function H(s).
  • This, in turn, implies that the proposed approach can accurately estimate the 99% point of the random distribution. The 1% point of the original pdf(f) becomes the 99% point of the flipped pdf(−f), which can be accurately evaluated by APEX.
  • After the reverse evaluation is applied, however, the skewness of the unknown probability distribution will be changed, since flipping f negates all odd moments (see the sketch below).
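Reverse evaluation needs no new machinery: flipping the performance, f → −f, negates the odd raw moments, and the 1% point of pdf(f) is read off as the negative of the 99% point of the extracted CDF of −f. A minimal sketch (helper name ours):

```python
def flipped_moments(moments):
    """Raw moments of -f from raw moments of f: E[(-f)^k] = (-1)^k E[f^k].
    moments[k] = E[f^k], k = 0..K."""
    return [((-1) ** k) * m for k, m in enumerate(moments)]

# The 1% point of pdf(f) is then the negative of the 99% point of the
# APEX-extracted CDF of the flipped model -f.
```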

E. Summary

  • Algorithm 2 (Asymptotic Probability Extraction) starts from the quadratic response surface model in (4) and a given approximation order M.
  • Algorithm 2 summarizes the overall implementation of APEX without the reverse evaluation; a compact sketch of this flow is given after the list.
  • In addition, it is worth mentioning that using an approximation order greater than ten can result in serious numerical problems [18], [19].
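Putting the bullets of Algorithm 2 together, a compact driver might look like the sketch below. It reuses the illustrative helpers sketched earlier on this page (binomial_moments, shift_distance) and fit_lti_pdf from the moment-matching sketch given after Section III-A of the full text below; it is our assembly of the summarized flow, not the paper's code.

```python
from math import comb, factorial

def apex(lams, betas, c, M, eps=1e-6):
    """Sketch of APEX without reverse evaluation: moments -> shift -> fit."""
    K = 2 * M
    raw = binomial_moments(lams, betas, c, K)             # E[f^k], k = 0..K
    mu = raw[1]
    central = [sum(comb(k, j) * raw[j] * (-mu) ** (k - j)
                   for j in range(k + 1)) for k in range(K + 1)]
    f0 = mu - shift_distance(central, eps)                # PDF/CDF shift
    shifted = [sum(comb(k, j) * raw[j] * (-f0) ** (k - j)
                   for j in range(k + 1)) for k in range(K + 1)]
    time_moms = [(-1) ** k * m / factorial(k) for k, m in enumerate(shifted)]
    a, b = fit_lti_pdf(time_moms, M)                      # poles and residues
    return a, b, f0   # pdf(f) is approximated by h(f - f0)
```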

V. HANDLING NONNORMAL PROCESS VARIATIONS

  • The moment-matching method used in APEX is general and is not limited to normal process variations.
  • The binomial moment evaluation proposed in Section IV-A relies on the normal distribution assumption for process parameters.
  • The nonlinear transform approach is not valid for correlated nonnormal distributions.
  • Correlated normal distributions can be decomposed into independent ones by principal component analysis.
  • As such, the APEX algorithm can be directly applied to the quadratic response surface model f(∆Y); the underlying CDF-matching idea is sketched after this list.
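For independent nonnormal parameters, the transform alluded to above can be realized by CDF matching: each nonnormal ∆x with known CDF F is expressed as ∆x = F⁻¹(Φ(∆z)) with ∆z standard normal, and this substitution is folded into the response surface model. A minimal sketch with a uniform variation as the stand-in nonnormal distribution (our example, not the paper's):

```python
import numpy as np
from scipy import stats

# CDF matching: a uniform variation dx on [-1, 1] (hypothetical example)
# is expressed through a standard normal dz via dx = F^{-1}(Phi(dz)).
dz = np.linspace(-3.0, 3.0, 7)                        # standard normal values
dx = stats.uniform(loc=-1.0, scale=2.0).ppf(stats.norm.cdf(dz))

# Substituting dx(dz) into the performance model leaves a function of the
# normal variable dz, to which the quadratic fit and APEX apply directly.
```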

B. Correlated Nonnormal Process Variations

  • At first glance, the aforementioned nonlinear transform approach seems applicable to the correlated nonnormal variations.
  • This property can be understood from the following example described in [8].
  • In particular, let pdf1(∆y1) and pdf2(∆y2) be normal distributions.
  • Both ∆y1 and ∆y2 are marginally normal; however, their joint PDF in (51) is not a multivariate normal distribution.
  • In addition, using the correlated nonnormal distributions also increases the difficulty of process characterization, because extracting the multidimensional joint PDF from the process testing data is not trivial.

VI. APPLICATIONS OF APEX

  • APEX can be applied to many statistical analysis and optimization problems (e.g., [3]–[7] and [21]) where estimating the probability distribution of the circuit performance is required.
  • The authors have applied the APEX to a robust analog design flow in [21].
  • Next, based on the model f(DV,∆Y ), a robust optimization is performed to search for the optimal design variable values that yield the best circuit performance.
  • Since the probability extraction is repeated many times, its computational cost can be quite high if Monte Carlo simulation is used in the loop.

VII. NUMERICAL EXAMPLES

  • The authors demonstrate the efficacy of the APEX using several circuit examples.
  • For testing and comparison, both linear and quadratic response surface models are created using the standard model fitting algorithm [12], and these models are utilized for generating performance distributions.
  • In order to make a fair comparison, the following two issues are considered when implementing the Monte Carlo simulation.
  • Applying these advanced techniques is efficient in those applications where evaluating the circuit performance is expensive, e.g., by running a transistor-level simulator.
  • For this reason, a simple random number generator is used to generate the sampling points, and a simple bin-based histogram is used to estimate the performance distributions in this paper; a minimal version of this Monte Carlo baseline is sketched below.
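The Monte Carlo baseline described above is straightforward to reproduce for a given quadratic model; a minimal sketch (the model coefficients here are arbitrary placeholders, not the paper's):

```python
import numpy as np

rng = np.random.default_rng(1)
N, samples = 6, 10_000

# arbitrary placeholder coefficients for f = dY^T A dY + B^T dY + C
A = 0.05 * rng.standard_normal((N, N))
A = 0.5 * (A + A.T)                       # symmetric quadratic part
B = rng.standard_normal(N)
C = 1.0

dY = rng.standard_normal((samples, N))    # dY ~ N(0, I) after PCA
f = np.einsum('ij,jk,ik->i', dY, A, dY) + dY @ B + C

hist, edges = np.histogram(f, bins=100)   # simple bin-based histogram
cdf = np.cumsum(hist) / samples           # empirical CDF on the bin grid
```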

A. Single-Variable Quadratic Model

  • Such a simple example allows us to make a full comparison between APEX and other traditional probability extraction techniques, since the exact CDF of a single-variable quadratic model can be computed in closed form (see the sketch after this list).
  • The linear regression approach approximates the quadratic function f in (38) by a best fitted linear model with least squares error, resulting in a normal probability distribution for f .
  • Therefore, for simplicity, the authors only consider the interdie variations for the CMOS transistors in this example.
  • 2) Moment Evaluation: Table III compares the computation time required for the direct moment evaluation and the proposed binomial moment evaluation.
  • The delay values at several specific points of the CDF are estimated by these probability extraction techniques.
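For a single-variable quadratic model f = a·∆y² + b·∆y + c with ∆y ~ N(0, 1), the exact CDF referenced by the comparison above follows from solving the quadratic inequality f ≤ t. A sketch with placeholder coefficients (the coefficients of the paper's model (38) are not reproduced here), assuming a > 0:

```python
import numpy as np
from scipy.stats import norm

a, b, c = 0.5, 1.0, 0.0   # placeholder coefficients for f = a*dy^2 + b*dy + c

def exact_cdf(t):
    """P(a*dy^2 + b*dy + c <= t) for dy ~ N(0,1), assuming a > 0."""
    disc = b * b - 4.0 * a * (c - t)
    if disc <= 0:
        return 0.0            # the parabola never dips below t
    r = np.sqrt(disc)
    lo, hi = (-b - r) / (2 * a), (-b + r) / (2 * a)
    return norm.cdf(hi) - norm.cdf(lo)   # dy must lie between the two roots
```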

C. Low-Noise Amplifier (LNA)

  • The variations on both MOS transistors and passive components (resistor, capacitor, and inductor) are considered.
  • The performance of the LNA is characterized by eight different specifications.
  • Table V shows the response surface modeling error and the model skewness for all these eight performances.
  • 2) Comparison of Accuracy and Speed: Tables VI and VII compare the estimation accuracy for five different statistical analysis approaches: corner simulation, linear regression, Legendre approximation, Monte Carlo simulation with 104 samples, and APEX.
  • The linear regression approach provides more accurate results than the corner simulation, but the errors are expected to increase as IC technologies continue to scale.

D. Operational Amplifier

  • 1) Response Surface Modeling: Fig. 17 shows a two-stage folded-cascode operational amplifier designed in the IBM BiCMOS 0.25-µm process.
  • Given a fixed circuit design, each circuit performance is a function of the process variations.
  • 2) Comparison of Accuracy and Speed: Tables X and XI compare the estimation accuracy for four different statistical analysis approaches: linear regression, Legendre approximation, Monte Carlo simulation with 104 samples, and APEX.
  • As shown in Tables X and XI, the linear regression and the Legendre approximation yield large estimation errors, especially when the circuit performance has a large skewness, e.g., gain and offset.
  • By using the proposed nonlinear-companding technique, APEX achieves a significant error reduction.

VIII. CONCLUSION

  • As IC technologies reach nanoscale, process variations are becoming relatively large and nonlinear (e.g., quadratic) response surface models might be required to accurately characterize the large-scale variations.


Asymptotic Probability Extraction for Nonnormal
Performance Distributions
Xin Li, Member, IEEE, Jiayong Le, Student Member, IEEE, Padmini Gopalakrishnan, Student Member, IEEE,
and Lawrence T. Pileggi, Fellow, IEEE
Abstract—While process variations are becoming more significant with each new IC technology generation, they are often modeled via linear regression models so that the resulting performance variations can be captured via normal distributions. Nonlinear response surface models (e.g., quadratic polynomials) can be utilized to capture larger scale process variations; however, such models result in nonnormal distributions for circuit performance. These performance distributions are difficult to capture efficiently since the distribution model is unknown. In this paper, an asymptotic-probability-extraction (APEX) method for estimating the unknown random distribution when using a nonlinear response surface modeling is proposed. The APEX begins by efficiently computing the high-order moments of the unknown distribution and then applies moment matching to approximate the characteristic function of the random distribution by an efficient rational function. It is proven that such a moment-matching approach is asymptotically convergent when applied to quadratic response surface models. In addition, a number of novel algorithms and methods, including binomial moment evaluation, PDF/CDF shifting, nonlinear companding and reverse evaluation, are proposed to improve the computation efficiency and/or approximation accuracy. Several circuit examples from both digital and analog applications demonstrate that APEX can provide better accuracy than a Monte Carlo simulation with 10^4 samples and achieve up to 10× more efficiency. The error, incurred by the popular normal modeling assumption for several circuit examples designed in standard IC technologies, is also shown.

Index Terms—Circuit performance, probability, process variation, response surface modeling.
I. INTRODUCTION

AS IC TECHNOLOGIES are scaled to the deep submicrometer region, process variations are becoming critical and significantly impact the overall performance of a circuit. Table I shows some typical process parameters and their 3σ variations as technologies are scaled from 250 to 70 nm. These large-scale variations introduce uncertainties in circuit behavior, thereby making IC design increasingly difficult. Low product yield or unnecessary overdesign cannot be avoided if process variations are not accurately modeled and analyzed within the IC design flow.

Manuscript received June 22, 2005; revised December 29, 2005 and March 29, 2006. This work was supported in part by the MARCO Focus Center for Circuit & System Solutions (C2S2, www.c2s2.org) under Contract 2003-CT-888. This paper was presented in part at the IEEE/ACM International Conference on Computer Aided Design (ICCAD) 2004. This paper was recommended by Associate Editor J. R. Phillips.

The authors are with the Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA 15213 USA (e-mail: xinli@ece.cmu.edu; jiayongl@ece.cmu.edu; pgopalak@ece.cmu.edu; pileggi@ece.cmu.edu).

Digital Object Identifier 10.1109/TCAD.2006.882593

TABLE I — Estimated technology parameters and 3σ variations [1]. (Table contents not reproduced in this transcription.)
During the past decade, various statistical analysis techniques [1]–[7] have been proposed and utilized in many applications such as statistical timing analysis, mismatch analysis, yield optimization, etc. The objective of these techniques is to model the probability distribution of the circuit performance under random process variations. Nassif [1] applies a linear regression to approximate a given circuit performance f (e.g., delay, gain, etc.) as a function of the process variations (e.g., VTH, TOX, etc.) and assumes that all random variations are normally distributed. As such, the performance f is also a normal distribution, since the linear combination of normally distributed random variables is still a normal distribution [8].

The linear regression model is efficient and accurate when process variations are sufficiently small. However, the large-scale variations in deep submicrometer technologies, which reach almost ±50% in Table I, suggest the need for higher order regression models in order to guarantee high approximation accuracy [4]–[7]. Using a higher order response surface model, however, brings about new challenges due to the nonlinear mapping between the process variations and the circuit performance f. The distribution of f is no longer normal, unlike the case of the linear model. The authors in [3]–[5] utilize the Monte Carlo simulation to evaluate the probability distribution of f, but this is computationally expensive. Note that reducing the computational cost for this probability extraction is crucial, especially when the extraction procedure is an inner loop within an optimization flow.
In this paper, we propose a novel asymptotic-probability-extraction (APEX) approach for estimating the unknown random distribution using the nonlinear response surface modeling. Given a circuit performance f (e.g., the delay of a digital circuit path or the gain of an analog amplifier), the response surface modeling approximates f as a polynomial function of the process variations (e.g., VTH, TOX, etc.). Since the process variations are modeled as random variables, the circuit performance f, which is a function of these random variables, is also a random variable. The APEX applies a moment matching to approximate the characteristic function of f (i.e., the Fourier transform of the probability density function (PDF) [8]) by a rational function H. We conceptually consider H to be of the form of the transfer function of a linear time-invariant (LTI) system, and the PDF and the cumulative distribution function (CDF) of f are approximated by the impulse response and the step response of the LTI system H, respectively. Fig. 1 shows the overall flow of the APEX. In this paper, we assume that the response surface model is already available and it is provided to the APEX for estimating the probability distribution of the circuit performance.

Fig. 1. Overall flow of APEX.
We prove that the moment-matching approach utilized in APEX is asymptotically convergent when applied to quadratic response surface models. In other words, given a quadratic response surface model f that is a quadratic function of normally distributed random variables, the PDF and the CDF of f can be uniquely determined by its moments {m_k, k = 1, 2, ..., K} when K approaches infinity, i.e., K → +∞. The PDF and the CDF extracted by the APEX can be used to characterize and/or optimize the statistical performance of analog and digital circuits under process variations.

APEX extends existing moment-matching methods via four important new contributions which significantly reduce the computational cost and improve the approximation accuracy for this particular application. First, a key operation required by the APEX is the computation of the high-order moments, which is extremely expensive when using traditional techniques such as the direct moment evaluation. In APEX, we propose a binomial evaluation scheme to recursively compute the high-order moments for a given quadratic response surface model. The binomial-moment-evaluation scheme is derived from a statistical independence theory and achieves a speedup of more than 10^6× over the direct moment-evaluation technique in our tested examples.
Second, the APEX approximates the unknown PDF by the impulse response of an LTI system. Directly applying such an approximation to any circuit performance with negative value is infeasible, since it results in an LTI system that is noncausal. To overcome this difficulty, the APEX applies a modified Chebyshev inequality for PDF/CDF shifting.

Third, in several practical applications, we observe that a direct moment matching is not efficient when the unknown PDF has a large negative skewness. In such cases, we propose to apply a nonlinear-companding scheme to automatically compress and/or expand the PDF such that the transformed PDF almost has a zero skewness and can be approximated accurately.

Finally, the best case performance (e.g., the 1% point on CDF) and the worst case performance (e.g., the 99% point on CDF) are two important metrics to be evaluated in many practical applications. Direct moment matching cannot capture the 1% point value accurately, since the moment-matching approximation is most accurate for low-frequency components (corresponding to the final value of a CDF) and least accurate for high-frequency components (corresponding to the initial value of a CDF). To address this problem, a reverse-evaluation technique is proposed in this paper to produce an accurate estimation of the 1% point.

The remainder of this paper is organized as follows. In Section II, we review the background on principal component analysis (PCA), response surface modeling, and the classical moment problem. We propose our APEX approach in Section III and discuss several implementation issues, including binomial moment evaluation, PDF/CDF shifting, nonlinear companding, and reverse evaluation, in Section IV. We extend the APEX algorithm to handle nonnormal process variations in Section V and discuss the applications of APEX in Section VI. The efficacy of APEX is demonstrated by several circuit examples in Section VII, followed by our conclusions in Section VIII.
II. BACKGROUND

A. PCA

The PCA [9] is a statistical method that finds a set of independent factors to represent a set of correlated random variables. Given N process parameters X = [x_1, x_2, ..., x_N]^T, the process variations ∆X = X − X_0, where X_0 contains the mean values of X, are often approximated as zero-mean normal distributions, and the correlations of ∆X can be represented by a symmetric positive semidefinite correlation matrix R. PCA decomposes R as

R = V · Σ · V^T    (1)

where Σ = diag(λ_1, λ_2, ..., λ_N) contains the eigenvalues of R and V = [V_1, V_2, ..., V_N] contains the corresponding eigenvectors that are orthonormal, i.e., V^T·V = I (I is the identity matrix). Based on Σ and V, the PCA defines a set of new random variables

∆Y = Σ^{−0.5} · V^T · ∆X.    (2)

These new random variables ∆Y are called the principal components or factors. It is easy to verify that ∆Y are independent and satisfy the standard normal distribution N(0, 1) (i.e., zero mean and unit standard deviation).

The essence of PCA can be interpreted as a coordinate rotation of the space defined by the original random variables. In addition, if the magnitude of the eigenvalues {λ_i} decreases quickly, it is possible to use a small number of random variables, i.e., a small subset of principal components, to approximate the original N-dimensional space. More details on PCA can be found in [9].
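Equations (1) and (2) translate directly into a few lines of linear algebra; a minimal sketch, assuming a small example correlation matrix R (ours, for illustration):

```python
import numpy as np

R = np.array([[1.0, 0.8, 0.3],
              [0.8, 1.0, 0.5],
              [0.3, 0.5, 1.0]])          # symmetric positive definite example

lam, V = np.linalg.eigh(R)               # R = V * diag(lam) * V^T  -- eq. (1)

def to_principal(dX):
    """dY = diag(lam)^(-1/2) * V^T * dX  -- eq. (2); dY ~ N(0, I)."""
    return (V.T @ dX) / np.sqrt(lam)
```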
B. Response Surface Modeling

Given a circuit topology, the circuit performance (e.g., gain, delay, etc.) is a function of the design parameters (e.g., bias current, transistor sizes, etc.) and the process parameters (e.g., VTH, TOX, etc.). The design parameters are optimized and fixed during the design phase; however, the process parameters must be modeled as random variables to account for any random variations. After the PCA shown in (1) and (2), the process variations can be represented by N independent random variables ∆Y that satisfy the standard normal distribution N(0, 1). Therefore, given a set of fixed design parameters, the circuit performance f can be approximated by a linear regression model of ∆Y [1]

f(∆Y) = B^T · ∆Y + C    (3)

where B ∈ R^N stands for the linear coefficients and C ∈ R is the constant term. The linear regression model in (3) is accurate when process variations are small. However, such a linear model is becoming increasingly inaccurate as process variations become relatively large in nanoscale technologies. In addition, the linear regression model in (3) always yields the conclusion that the worst case performance appears at one of the process corners. This conclusion might not be valid for many circuit performances, especially those analog circuit performances with strong nonlinearities, e.g., the offset voltage of an analog amplifier. For these reasons, applying quadratic response surface models might be required to provide a sufficient modeling accuracy [3]–[5] as

f(∆Y) = ∆Y^T · A · ∆Y + B^T · ∆Y + C    (4)

where A ∈ R^(N×N) denotes the quadratic coefficients, B ∈ R^N represents the linear coefficients, and C ∈ R is the constant term. Without loss of generality, we assume that A is symmetric in this paper, since any asymmetric quadratic form X^T·A·X can be easily converted to an equivalent symmetric form 0.5·X^T·(A + A^T)·X [10].

The model coefficients in (3) and (4) can be determined by solving a set of overdetermined linear equations over a number of sampling points [11]–[13]. These sampling points are typically generated from SPICE simulations or measurement results. Since generating each sampling point can be quite expensive, many response surface modeling algorithms [11]–[13] attempt to minimize the required number of sampling points while maintaining good modeling quality. For this purpose, a great number of algorithms for design of experiments (DOE) have been proposed [14], [15]. Many of these DOE algorithms (e.g., fractional factorial design, orthogonal array, etc.) generate deterministic sampling points. These sampling points do not represent the actual probability distributions of process variations and, therefore, cannot be used for estimating performance distributions. Even if a DOE algorithm (e.g., Latin hypercube sampling [15]) generates random sampling points, the number of these sampling points is typically too small (e.g., 10–100) to accurately estimate PDF/CDF functions. For this reason, after the response surface model is created, special techniques are required to extract the probability distribution.
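A minimal sketch of determining the coefficients of the quadratic model (4) by least squares from (∆Y, f) sampling pairs, as described above; the sampling plan and the simulator that produces f are abstracted away, and the function name is ours:

```python
import numpy as np

def fit_quadratic_rsm(dY, f):
    """Least-squares fit of f ~ dY^T A dY + B^T dY + C from samples.
    dY: (K, N) sampling points; f: (K,) simulated performances."""
    K, N = dY.shape
    iu = np.triu_indices(N)
    quad = dY[:, iu[0]] * dY[:, iu[1]]           # distinct monomials dy_i*dy_j
    Phi = np.hstack([quad, dY, np.ones((K, 1))])
    w, *_ = np.linalg.lstsq(Phi, f, rcond=None)  # overdetermined solve
    A = np.zeros((N, N))
    A[iu] = w[:len(iu[0])]
    A = 0.5 * (A + A.T)                          # symmetric quadratic part
    return A, w[len(iu[0]):-1], w[-1]            # A, B, C
```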
C. Classical Moment Problem

The classical moment problem was first proposed and studied by T. Stieltjes in 1894. The Stieltjes moment problem is defined as follows [16].

Definition 1 (Stieltjes Moment Problem): Given a sequence of numbers {m_k, k = 1, 2, ...}, find a nondecreasing function F(f), where f ∈ [0, +∞), such that

m_k = ∫_0^{+∞} f^k dF(f).    (5)

Note that if the symbol f in (5) represents a random variable, the numbers {m_k, k = 1, 2, ...} are its moments and the function F(f) is its CDF. In the Stieltjes moment problem, the random variable f is restricted to be nonnegative, i.e., its CDF F(f) is defined only for f ≥ 0. By varying the interval in which F(f) is valid, two further variations of the classical moment problem can be defined [16].

Definition 2 (Hamburger Moment Problem): Given a sequence of numbers {m_k, k = 1, 2, ...}, find a nondecreasing function F(f), where f ∈ (−∞, +∞), such that

m_k = ∫_{−∞}^{+∞} f^k dF(f).    (6)

Definition 3 (Hausdorff Moment Problem): Given a sequence of numbers {m_k, k = 1, 2, ...}, find a nondecreasing function F(f), where f ∈ [0, 1], such that

m_k = ∫_0^1 f^k dF(f).    (7)

As shown in (6) and (7), the Hamburger moment problem and the Hausdorff moment problem are defined in the intervals (−∞, +∞) and [0, 1], respectively. In this paper, the process variations are modeled as normal distributions, which are unbounded and distributed over (−∞, +∞). Therefore, the circuit performance approximated by the quadratic model (4) is also unbounded. The probability extraction problem that we aim to solve is the Hamburger moment problem in (6).

The classical moment problem has been widely studied by mathematicians for over 100 years, focusing on the theoretical aspects of the problem, e.g., the existence and uniqueness of the solution. Details of these theoretical results can be found in [16] or other recent publications, e.g., in [17]. However, the practical applications of this moment problem, especially the computation efficiency of solving the problem, have not been sufficiently explored. In this paper, we develop the APEX algorithm which aims to solve the moment problem efficiently, i.e., to improve the approximation accuracy and reduce the computational cost for practical applications.
III. APEX

Given the quadratic response surface model in (4), the objective of APEX is to estimate the PDF pdf(f) and the CDF cdf(f) for the performance f.¹ Instead of running expensive Monte Carlo simulations, the APEX tries to find an Mth order LTI system H whose impulse response h(t) and step response s(t) are the optimal approximations for pdf(f) and cdf(f), respectively.² The optimal approximation is determined by matching the first 2M moments between h(t) and pdf(f) for an Mth order approximation. In this section, we first describe the mathematical formulation of the APEX algorithm. Then, we link the APEX to probability theory and explain why it is efficient in approximating PDF/CDF functions. Finally, we prove that the moment-matching method utilized in APEX is asymptotically convergent when applied to quadratic response surface models.

¹In this paper, ∆X, ∆Y, and ∆Z represent the random variables for modeling process variations, and f represents the circuit performance of interest. The probability density function and the cumulative distribution function are functions of f; therefore, they are denoted as pdf(f) and cdf(f), respectively.

²The variable t in h(t) and s(t) corresponds to the variable f in pdf(f) and cdf(f).
A. Mathematical Formulation

We define the time moments [18] for a given circuit performance f whose PDF is pdf(f) as follows:

s_k = ((−1)^k / k!) · ∫_{−∞}^{+∞} f^k · pdf(f) · df.    (8)

In (8), the definition of time moments is identical to the traditional definition of moments in probability theory except for the scaling factor (−1)^k/k!.

Similarly, the time moments can be defined for an LTI system H [18]. Given an Mth order LTI system whose transfer function and impulse response are

H(s) = Σ_{i=1}^{M} a_i/(s − b_i)  and  h(t) = Σ_{i=1}^{M} a_i·e^{b_i·t} (if t ≥ 0), h(t) = 0 (if t < 0)    (9)

the time moments of H are defined as [18]

s_k = ((−1)^k / k!) · ∫_{−∞}^{+∞} t^k · h(t) · dt = −Σ_{i=1}^{M} a_i / b_i^{k+1}.    (10)
In (9), the poles {b_i, i = 1, 2, ..., M} and residues {a_i, i = 1, 2, ..., M} are the 2M unknowns that need to be determined. Matching the first 2M moments in (8) and (10) yields the following 2M nonlinear equations:

a_1/b_1 + a_2/b_2 + ··· + a_M/b_M = −s_0
a_1/b_1² + a_2/b_2² + ··· + a_M/b_M² = −s_1
  ⋮
a_1/b_1^{2M} + a_2/b_2^{2M} + ··· + a_M/b_M^{2M} = −s_{2M−1}.    (11)

The nonlinear equations in (11) can be solved using the algorithm proposed in [18], which first solves the poles {b_i} and then the residues {a_i}. In what follows, we briefly describe this two-step algorithm for solving (11).
1) Solving Poles: In order to solve the poles {b_i} in (11), Pillage and Rohrer [18] first formulate the following linear equations:

[ s_0      s_1   ···  s_{M−1}  ]   [ c_0     ]      [ s_M      ]
[ s_1      s_2   ···  s_M      ] · [ c_1     ] = −  [ s_{M+1}  ]
[  ⋮        ⋮     ⋱    ⋮       ]   [  ⋮      ]      [  ⋮       ]
[ s_{M−1}  s_M   ···  s_{2M−2} ]   [ c_{M−1} ]      [ s_{2M−1} ]    (12)

After solving (12) for {c_i, i = 0, 1, ..., M−1}, the poles {b_i} in (11) are equal to the reciprocals of the roots of the following characteristic polynomial:

c_0 + c_1·x + c_2·x² + ··· + c_{M−1}·x^{M−1} + x^M = 0.    (13)

The detailed proof of (12) and (13) can be found in [18].
2) Solving Residues: After the poles {b_i} are known, substitute {b_i} into (11), and the residues {a_i} can be solved by using the first M moments:

[ b_1^{−1}  b_2^{−1}  ···  b_M^{−1} ]   [ a_1 ]      [ s_0     ]
[ b_1^{−2}  b_2^{−2}  ···  b_M^{−2} ] · [ a_2 ] = −  [ s_1     ]
[    ⋮         ⋮       ⋱      ⋮     ]   [  ⋮  ]      [   ⋮     ]
[ b_1^{−M}  b_2^{−M}  ···  b_M^{−M} ]   [ a_M ]      [ s_{M−1} ]    (14)
The aforementioned algorithm assumes that the poles {b_i} are distinct. Otherwise, if repeated poles exist, the unknown poles and residues must be solved using a more comprehensive algorithm described in [18]. Once the poles {b_i} and residues {a_i} are determined, the PDF pdf(f) is optimally approximated by h(t) in (9) and the CDF cdf(f) is optimally approximated by the step response

s(t) = ∫_0^t h(τ)·dτ = Σ_{i=1}^{M} (a_i/b_i)·(e^{b_i·t} − 1) (if t ≥ 0), s(t) = 0 (if t < 0).    (15)

It should be noted that many implementation issues must be considered to make our proposed approach, APEX, feasible and efficient. For example, the impulse response of a causal LTI system is only nonzero for t ≥ 0, but a PDF in practical applications can be nonzero for f ≤ 0. In Section IV, we will propose several schemes to address these problems.

The aforementioned moment-matching method was previously applied to IC interconnect order reduction [18], [19], and it is related to the Padé approximation in linear control theory [20]. In the following section, we will explain why such a moment-matching approach is efficient in approximating PDF/CDF functions.
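A compact numerical sketch of the two-step solve of (12)–(14) follows. The printed equations above were reconstructed from a damaged extraction, so the signs reflect our reading of the definitions in (8)–(10) rather than a verbatim copy of the paper; as a self-check, for pdf(f) = e^{−f} (time moments s_k = (−1)^k) the sketch recovers the exact one-pole system a = 1, b = −1.

```python
import numpy as np

def fit_lti_pdf(time_moments, M):
    """Solve the 2M moment-matching equations (11) for poles and residues.
    time_moments: s_0 .. s_{2M-1} as defined in (8)."""
    s = np.asarray(time_moments, dtype=float)
    # (12): Hankel system for the characteristic-polynomial coefficients c
    Hk = np.array([[s[i + j] for j in range(M)] for i in range(M)])
    c = np.linalg.solve(Hk, -s[M:2 * M])
    # (13): poles are the reciprocals of the roots of
    # c0 + c1*x + ... + c_{M-1}*x^{M-1} + x^M
    x = np.roots(np.concatenate(([1.0], c[::-1])))
    b = 1.0 / x
    # (14): Vandermonde-type system for the residues
    Vd = np.array([[bi ** (-(k + 1)) for bi in b] for k in range(M)])
    a = np.linalg.solve(Vd, -s[:M])
    return a, b

def impulse_response(t, a, b):
    """h(t) in (9): the PDF approximation (zero for t < 0)."""
    t = np.atleast_1d(t)
    vals = np.real(np.exp(np.outer(t, b)) @ a)
    return np.where(t >= 0.0, vals, 0.0)

# Self-check: pdf(f) = e^{-f} has time moments s_k = (-1)^k, and the fit
# recovers the exact one-pole system H(s) = 1/(s + 1):
a1, b1 = fit_lti_pdf([1.0, -1.0], M=1)   # a1 ~ [1.0], b1 ~ [-1.0]
```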

Fig. 2. Characteristic functions of several typical distributions.
B. Connection to Probability Theory

In probability theory, given a random variable f whose PDF is pdf(f), the characteristic function is defined as the Fourier transform of pdf(f) [8]

Φ(ω) = ∫_{−∞}^{+∞} pdf(f)·e^{jωf}·df = ∫_{−∞}^{+∞} pdf(f)·[Σ_{k=0}^{+∞} (jωf)^k/k!]·df.    (16)

Substituting (8) into (16) yields

Φ(ω) = Σ_{k=0}^{+∞} s_k·(−jω)^k.    (17)

Equation (17) implies an important fact: the time moments defined in (8) are related to the Taylor expansion of the characteristic function at the expansion point ω = 0. Matching the first 2M moments in (11) is equivalent to matching the first 2M Taylor expansion coefficients between the original characteristic function Φ(ω) and the approximated rational function H(s).

To explain why the moment-matching approach is efficient, we first need to show two important properties of the characteristic function that are described in [8].

Property 1: A characteristic function has the maximal magnitude at ω = 0, i.e., |Φ(ω)| ≤ Φ(0) = 1.

Property 2: A characteristic function Φ(ω) → 0 when ω → ∞.

Fig. 2 shows the characteristic functions for several typical random distributions. The above two properties imply an interesting fact: namely, given a random variable f, the magnitude of its characteristic function decays as ω increases. Therefore, the optimally approximated H(s) in (9) is a low-pass system. It is well known that a Taylor expansion is accurate around the expansion point. Since a low-pass system is mainly determined by its behavior in the low-frequency range (around ω = 0), it can be accurately approximated by matching the first several Taylor coefficients at ω = 0, i.e., the moments. In addition, the rational function form utilized in APEX is an efficient form to approximate the transfer function H(s) of a low-pass system. These conclusions have been verified in other applications (e.g., IC interconnect order reduction [18], [19]), and they provide the theoretical background to explain why the APEX works well for PDF/CDF approximation, as will be demonstrated by the numerical examples in Section VII.
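Properties 1 and 2 are easy to check numerically for common distributions using closed-form characteristic functions; a small sketch:

```python
import numpy as np

w = np.linspace(0.0, 20.0, 201)
gauss = np.exp(-0.5 * w**2)              # N(0,1): Phi(w) = e^{-w^2/2}
expo = 1.0 / np.abs(1.0 - 1j * w)        # Exp(1): |Phi(w)| = 1/sqrt(1+w^2)
unif = np.abs(np.sinc(w / np.pi))        # U(-1,1): Phi(w) = sin(w)/w

# all three satisfy |Phi(0)| = 1 and decay toward 0 as w grows
assert all(m[0] == 1.0 and m[-1] < 0.1 for m in (gauss, expo, unif))
```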
C. Proof of Convergence

In the previous section, we have intuitively explained why the moment-matching approach is efficient in approximating PDF/CDF functions. However, there is a theoretical question which might be raised: given a random variable, can the PDF/CDF functions always be uniquely determined by its moments? In general, the answer is no. It has been observed in mathematics that some probability distributions cannot be uniquely determined by their moments. One example described in [16] is the following PDF:

pdf(f) = (e^{−0.5·[ln(f)]²}/(√(2π)·f))·{1 + a·sin[2π·ln(f)]} (if f > 0), pdf(f) = 0 (if f ≤ 0)    (18)

where a ∈ [−1, 1]. It can be verified that all moments of the PDF pdf(f) in (18) are independent of a, although varying a changes pdf(f) significantly [16]. This, in turn, implies that the PDF in (18) cannot be uniquely determined by its moments.

However, there are special cases for which the moment problem is guaranteed to converge, i.e., the PDF/CDF functions are uniquely determined by the moments. The following Carleman theorem states one of those special cases and gives a sufficient condition for the convergence of the moment problem.

Theorem 1 (Carleman [16]): A probability distribution on the interval (−∞, +∞), i.e., the Hamburger moment problem [see (6)], can be uniquely determined by its moments {m_k, k = 1, 2, ...} if

Σ_{k=1}^{+∞} (m_{2k})^{−1/(2k)} = ∞.    (19)

Based on the Carleman theorem, we can prove that the moment-matching approach utilized in APEX is asymptotically convergent when applied to quadratic response surface models. Namely, given a quadratic response surface model f that is a quadratic function of the normally distributed random variables, the probability distribution of f can be uniquely determined by its moments {m_k, k = 1, 2, ..., K} when K approaches infinity, i.e., K → +∞. The asymptotic convergence of APEX can be formally stated by the following theorem. The detailed proof of Theorem 2 is given in the Appendix.

Theorem 2: Given the quadratic response surface model f in (4) where the random variables ∆Y are independent and satisfy the standard normal distribution N(0, 1), the probability distribution of f can be uniquely determined by its moments {m_k, k = 1, 2, ...}.

IV. IMPLEMENTATIONS OF APEX

Our proposed APEX approach is made practically feasible by applying several novel algorithms, including: 1) a binomial scheme for high-order moment computation; 2) a modified Chebyshev inequality for PDF/CDF shifting; 3) a nonlinear-companding method to improve approximation accuracy; and 4) a reverse-evaluation technique for best case/worst case analysis.

References

O. Kempthorne, The Design and Analysis of Experiments. New York: Wiley, 1952.

B. W. Silverman, Density Estimation for Statistics and Data Analysis. London, U.K.: Chapman and Hall, 1986.

A. Papoulis and S. U. Pillai, Probability, Random Variables and Stochastic Processes, 4th ed. New York: McGraw-Hill, 2002.

R. H. Myers and D. C. Montgomery, Response Surface Methodology: Process and Product Optimization Using Designed Experiments. New York: Wiley, 1995.