Input-Output Uncertainty Comparisons for Discrete
Optimization via Simulation
Eunhye Song
Department of Industrial and Manufacturing Engineering, The Pennsylvania State University, University Park, PA 16802,
eus358@psu.edu
Barry L. Nelson
Department of Industrial Engineering and Management Sciences, Northwestern University, Evanston, IL 60208,
nelsonb@northwestern.edu
When input distributions to a simulation model are estimated from real-world data, they naturally have estimation error causing input uncertainty in the simulation output. If an optimization via simulation (OvS) method is applied that treats the input distributions as “correct,” then there is a risk of making a suboptimal decision for the real world, which we call input model risk. This paper addresses a discrete OvS (DOvS) problem of selecting the real-world optimal from among a finite number of systems when all of them share the same input distributions estimated from common input data. Since input uncertainty cannot be reduced without collecting additional real-world data—which may be expensive or impossible—a DOvS procedure should reflect the limited resolution provided by the simulation model in distinguishing the real-world optimal solution from the others. In light of this, our input-output uncertainty comparisons (IOU-C) procedure focuses on comparisons rather than selection: it provides simultaneous confidence intervals for the difference between each system’s real-world mean and the best mean of the rest with any desired probability, while accounting for both stochastic and input uncertainty. To make the resolution as high as possible (intervals as short as possible) we exploit the common input data effect to reduce uncertainty in the estimated differences. Under mild conditions we prove that the IOU-C procedure provides the desired statistical guarantee asymptotically as the real-world sample size and simulation effort increase, but it is designed to be effective in finite samples.
Key words: Optimization via simulation under input uncertainty, common input data effect, multiple comparisons with the best
History: First submitted in June 2016; revisions submitted in July 2017 and May 2018.
1. Introduction
Due to the flexibility of simulation, optimization via simulation (OvS) is a widely accepted tool
to improve system performance. Real-world problems typically involve stochastic processes, e.g.,
demand for a new product or arrivals of patients to an emergency room, which are often modeled
by probability distributions. Stochastic simulation is driven by random variates generated from
these input models to produce outputs that mimic real-world performance. Therefore, when we
make decisions based on the simulation outputs, we are subject to the risk of making suboptimal
decisions when the input models do not faithfully represent the real-world stochastic processes; this
is known as input model risk. Most standard OvS methods do not take into account input model
risk and instead optimize under the assumption that the input models are accurate representations
of the real-world randomness. However, the best system chosen conditional on the input models
may not be the best system with respect to real-world performance when implemented. We refine this point below and illustrate it further using an inventory management example with an estimated demand distribution in Section 2. Of course, there may also be a logical discrepancy between the simulation model and the real-world system, but that is beyond the scope of this paper.
The problem of interest is to compare $k$ systems, where the $i$th system’s performance measure is its simulation output mean, $\mathrm{E}[Y_i(F_i^c)]$, under the real-world input distribution $F_i^c$ ($c$ for correct), where $Y_i(\cdot)$ is the stochastic output performance, which depends on the chosen input distribution. When there are many input processes in the system, $F_i^c$ represents the joint distribution of all of the input random variables. Our specific goal is to find $\arg\max_i \mathrm{E}[Y_i(F_i^c)]$ (or $\arg\min_i \mathrm{E}[Y_i(F_i^c)]$) with a statistical guarantee (e.g., 95%) that the selected system is the real-world optimal. As mentioned earlier, in most cases $F_1^c, F_2^c, \ldots, F_k^c$ are unknown, which forces us to use estimates, $\widehat{F}_1, \widehat{F}_2, \ldots, \widehat{F}_k$, to run simulations and implicitly target $\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]$ instead of $\mathrm{E}[Y_i(F_i^c)]$ to evaluate the $i$th system’s performance. Typically, $\widehat{F}_i$ is estimated from finite real-world observations from $F_i^c$ and is therefore subject to estimation error. Input model risk arises as $\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]$ depends on the random $\widehat{F}_i$, and thus the conditional optimal, $\arg\max_i \mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]$, may not be the same as $\arg\max_i \mathrm{E}[Y_i(F_i^c)]$. In this paper we show that it is possible to provide a meaningful statistical guarantee with respect to the real-world optimal, rather than the conditional optimal.
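To make the risk concrete, the following sketch (a hypothetical two-system newsvendor in Python, not the example of Section 2; the demand model, prices, and order quantities are illustrative assumptions) estimates how often the conditional optimal under $\widehat{F}$ disagrees with the real-world optimal under $F^c$ when the input model is fitted to only 50 observations.

```python
# Toy illustration (not the paper's Section 2 example): two candidate
# order quantities under exponential demand with unknown mean theta_c.
# Fitting theta from a small sample and optimizing conditionally can
# select a system that is suboptimal in the real world.
import numpy as np

rng = np.random.default_rng(1)
theta_c = 14.0                 # true real-world mean demand (assumed)
orders = [6, 10]               # the k = 2 systems being compared
price, cost = 5.0, 3.0         # illustrative sell price and unit cost

def mean_profit(q, theta):
    """E[price*min(D, q) - cost*q] for D ~ Exp(theta). The closed form
    stands in for the simulation replications a real DOvS study would
    use to estimate this conditional mean."""
    return price * theta * (1.0 - np.exp(-q / theta)) - cost * q

real_best = int(np.argmax([mean_profit(q, theta_c) for q in orders]))

n_data, n_trials, wrong = 50, 10_000, 0
for _ in range(n_trials):
    theta_hat = rng.exponential(theta_c, n_data).mean()   # F-hat from data
    cond_best = int(np.argmax([mean_profit(q, theta_hat) for q in orders]))
    wrong += (cond_best != real_best)
print(f"P(conditional optimal != real-world optimal) ~ {wrong / n_trials:.2f}")
```

With these toy parameters the two systems' true means are close relative to the estimation error in $\widehat{\theta}$, so the conditional optimal is wrong a nontrivial fraction of the time, which is exactly the regime the paper targets.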
To accomplish this we first need to understand how much uncertainty in $\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]$ is caused by the estimation error in $\widehat{F}_i$. This is referred to as input uncertainty and is formally defined as $\mathrm{Var}(\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i])$, where the variance is taken with respect to the sampling distribution of $\widehat{F}_i$. Typically, we have only one “observation” of $\widehat{F}_i$ estimated from the real-world data, which makes it difficult to evaluate the variance. Another challenge is that the functional form of $\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]$ is generally unknown and can only be estimated via simulation. Several methods have been developed to quantify the marginal impact of input uncertainty on a single simulated system; see Barton (2012), Song et al. (2014), and Lam (2016) for surveys.
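One standard way to approximate this variance from a single dataset is to bootstrap the real-world sample, refit the input model, and re-evaluate the conditional mean for each resample. The sketch below does this for the toy system above; it is a generic bootstrap illustration in the spirit of the cited surveys, not the procedure developed in this paper.

```python
# Generic bootstrap sketch of input-uncertainty quantification for one
# system (in the spirit of the cited surveys; not this paper's method).
import numpy as np

rng = np.random.default_rng(2)
price, cost, q = 5.0, 3.0, 6     # same toy newsvendor as above

def mean_profit(q, theta):
    # Conditional mean E[Y(F-hat) | F-hat] for Exp(theta) demand;
    # in practice this would be estimated by simulation replications.
    return price * theta * (1.0 - np.exp(-q / theta)) - cost * q

real_data = rng.exponential(14.0, 50)    # the single real-world sample
B = 1000                                 # bootstrap resamples of F-hat
cond_means = np.empty(B)
for b in range(B):
    resample = rng.choice(real_data, size=real_data.size, replace=True)
    cond_means[b] = mean_profit(q, resample.mean())  # refit, re-evaluate

# Bootstrap estimate of Var(E[Y(F-hat) | F-hat]) for this system
print("estimated input-uncertainty variance:", cond_means.var(ddof=1))
```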
Unlike stochastic simulation error, which can be reduced by increasing the number of simulation
replications, input uncertainty can only be reduced by collecting more real-world data. However,
real-world data collection is typically much more expensive than simulation replications, or it may
be impossible if an implementation decision has to be made before having another chance to collect
data (e.g., logistics decisions for a natural disaster). Our DOvS procedure is designed to provide
statistical inference on the real-world optimal solution in the presence of input model risk that will
not be further reduced by collecting more real-world data.
Optimization under input model risk is more challenging than conditional DOvS since even with
an infinite number of simulation replications we may not be able to distinguish the real-world best
from the others due to the remaining input uncertainty. But effective DOvS under input model risk
requires more than just quantifying the marginal input uncertainty in each system’s simulation
output; instead we need to compare how systems are affected jointly by input uncertainty.
Recently, several DOvS procedures that incorporate input model risk have been proposed; they
can be categorized into three groups in terms of what they promise to deliver: the first group

of procedures selects a system that best hedges input model risk by identifying the worst-case
input distributions given real-world data for each system marginally, and then selects the system with the best worst-case performance. For a maximization problem this becomes selecting $\arg\max_i \min_{\widehat{F}_i \in U_i} \mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]$, where $U_i$ is the uncertainty set that contains the candidates for $F_i^c$
inferred from the real-world data. Such a formulation is used in the distributionally robust opti-
mization literature (Scarf 1958, Delage and Ye 2010, Ben-Tal et al. 2013). The robust selection of
the best procedure of Fan et al. (2013) and the optimal computational budget allocation scheme
of Gao et al. (2017) belong in this category. A benefit of this formulation is that we can always
select a single solution no matter how large input uncertainty is. However, the selected system may,
and often will, perform poorly under the true real-world input distributions. See Section 2.
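For intuition, the sketch below scores each toy system by its worst-case conditional mean over a crude uncertainty set for the demand mean; the confidence-interval construction of the set is an assumption of this sketch, not the construction used by the cited robust procedures.

```python
# Worst-case (max-min) selection sketch over a crude uncertainty set
# for the demand mean; the CI-based set is an assumption of this toy,
# not the one used by the cited distributionally robust procedures.
import numpy as np

rng = np.random.default_rng(3)
price, cost, orders = 5.0, 3.0, [6, 10]

def mean_profit(q, theta):
    return price * theta * (1.0 - np.exp(-q / theta)) - cost * q

data = rng.exponential(14.0, 50)
half = 1.96 * data.std(ddof=1) / np.sqrt(data.size)
U = np.linspace(data.mean() - half, data.mean() + half, 101)

worst = [min(mean_profit(q, th) for th in U) for q in orders]
print("worst-case means:", np.round(worst, 3),
      "-> robust choice: order", orders[int(np.argmax(worst))])
```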
The second category selects a system with the best performance averaged over input uncertainty, i.e., $\arg\max_i \mathrm{E}\{\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]\}$, where the outer expectation is taken with respect to the sampling or posterior distribution of $\widehat{F}_i$. Corlu and Biller (2015) propose a subset selection procedure that averages both stochastic and input uncertainties to find a subset of optimal/near-optimal systems, where $\widehat{F}_i$ is a Bayesian posterior distribution given real-world data. Even if the input uncertainty, $\mathrm{Var}(\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i])$, is large, the variance of an estimate of $\mathrm{E}\{\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]\}$ may be reduced by more simulation replications. Hence, with a sufficiently large simulation budget the size of the subset may be as small as one, provided that $\mathrm{E}\{\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]\}$ is distinct for each $i$. However, $\mathrm{E}\{\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]\} \neq \mathrm{E}[Y_i(F_i^c)]$ in general, and therefore $\arg\max_i \mathrm{E}\{\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]\}$ may not be $\arg\max_i \mathrm{E}[Y_i(F_i^c)]$. The bias of $\mathrm{E}\{\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]\}$ is larger when the number of real-world observations is smaller, causing this formulation to pose greater input model risk.
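The sketch below mimics this averaged formulation for the toy systems, using bootstrap resamples of $\widehat{F}$ in place of the Bayesian posterior used by Corlu and Biller (2015); the bootstrap substitution is an assumption of the sketch.

```python
# Averaged-performance sketch: rank systems by conditional means
# averaged over resampled versions of F-hat (bootstrap here stands in
# for the Bayesian posterior used by the cited procedure).
import numpy as np

rng = np.random.default_rng(4)
price, cost, orders = 5.0, 3.0, [6, 10]

def mean_profit(q, theta):
    return price * theta * (1.0 - np.exp(-q / theta)) - cost * q

data = rng.exponential(14.0, 50)
thetas = [rng.choice(data, data.size, replace=True).mean()
          for _ in range(1000)]
avg = [np.mean([mean_profit(q, th) for th in thetas]) for q in orders]
print("averaged means:", np.round(avg, 3),
      "-> pick: order", orders[int(np.argmax(avg))])
```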
The last category of procedures directly attacks the problem of finding $\arg\max_i \mathrm{E}[Y_i(F_i^c)]$. Corlu and Biller (2013) present a subset selection procedure that includes the real-world best system in the subset, assuming that $\max_i \mathrm{E}[Y_i(F_i^c)]$ is at least $\delta > 0$ better than the rest of the systems’ true means. This procedure is distinguished from the subset selection procedure in Corlu and Biller (2015) in that it does not average $\mathrm{E}[Y_i(\widehat{F}_i) \mid \widehat{F}_i]$ over the distribution of $\widehat{F}_i$, but uses $\delta$ to control the resolution to which the procedure can successfully separate the real-world best from the rest with a given statistical guarantee. Under the same indifference-zone (IZ) setting, Song et al. (2015) discuss a ranking-and-selection approach that guarantees the probability of correctly selecting $\arg\max_i \mathrm{E}[Y_i(F_i^c)]$ in the presence of input model risk. Both Corlu and Biller (2013) and Song et al. (2015) find that $\delta$ has an unknown nonzero lower bound, which is an increasing function of input uncertainty, reflecting the fact that the procedures may not distinguish the real-world best system from the rest if the mean difference is too small relative to input uncertainty. To put it differently, for $\delta$ below an unknown threshold the probability of correctly selecting the optimal (or including the optimal in the subset) has an upper bound less than 1, so that even with infinite simulation effort we may not achieve the desired statistical guarantee. Further, assuming an IZ mean configuration makes both procedures conservative, because they are designed to provide the statistical guarantee for the case where all suboptimal systems’ means are $\arg\max_i \mathrm{E}[Y_i(F_i^c)] - \delta$. When $F_1^c, F_2^c, \ldots, F_k^c$ are assumed known, this conservatism only makes us spend more simulation budget than necessary to correctly select the optimal solution with the target probability. In the presence of input model risk, however, the problem is much more severe, and we may conclude that we cannot provide the target probability guarantee at all when in fact we could if we did not assume an IZ configuration.
Our input-output uncertainty comparisons (IOU-C) procedure belongs in the third category. However, we focus on comparisons of systems, not selection, and we do not assume any configuration for the system means, which differentiates our approach from Corlu and Biller (2013) and Song et al. (2015). By extending the multiple comparisons with the best (MCB) framework of Chang and Hsu (1992) to incorporate input model risk, IOU-C provides $k$ joint confidence intervals (CIs) on the true mean differences between each system and the best of the rest that account for both stochastic and input uncertainties. With any given target probability guarantee, the CIs that contain 0 indicate systems that are statistically inseparable from the real-world optimal.
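To show the shape of this inference without the input-uncertainty machinery that is the paper’s contribution, the sketch below computes classic constrained MCB intervals for $\mu_i - \max_{\ell \neq i} \mu_\ell$ from simulation output alone; the system means and the critical value are placeholders, not quantities derived in the paper.

```python
# Bare-bones MCB sketch under stochastic error only: intervals for
# mu_i - max_{l != i} mu_l. IOU-C delivers intervals of the same form
# but accounts for both stochastic and input uncertainty; the critical
# value below is a placeholder, not the paper's.
import numpy as np

rng = np.random.default_rng(5)
k, n = 4, 100
true_means = [1.0, 1.2, 0.8, 1.15]                 # illustrative only
Y = rng.normal(true_means, 1.0, size=(n, k))       # simulation outputs
ybar, var = Y.mean(axis=0), Y.var(axis=0, ddof=1)

crit = 2.35                                        # placeholder quantile
for i in range(k):
    rest = np.delete(ybar, i).max()
    d = crit * np.sqrt(var[i] / n + np.delete(var, i).max() / n)
    diff = ybar[i] - rest
    lo, hi = min(diff - d, 0.0), max(diff + d, 0.0)  # constrained MCB CI
    verdict = "cannot be ruled out as best" if hi > 0 else "not the best"
    print(f"system {i}: [{lo: .3f}, {hi: .3f}]  ({verdict})")
```

Intervals whose upper endpoint is positive mark systems that are statistically inseparable from the best, which is exactly the verdict IOU-C reports once input uncertainty is folded in.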
We restrict our attention to the case where all systems share the same input distributions, i.e., $F_i^c = F^c$ and $\widehat{F}_i = \widehat{F}$ for $i = 1, 2, \ldots, k$, which is a common setting for DOvS problems. For instance,

References

Journal ArticleDOI
Multivariate stochastic approximation using a simultaneous perturbation gradient approximation
TL;DR: Presents an SA algorithm based on a simultaneous perturbation gradient approximation instead of the standard finite-difference approximation of Kiefer–Wolfowitz-type procedures; the algorithm can be significantly more efficient than standard algorithms in large-dimensional problems.

Journal ArticleDOI
Distributionally Robust Optimization Under Moment Uncertainty with Application to Data-Driven Problems
TL;DR: Proposes a model that describes uncertainty in both the distribution form (discrete, Gaussian, exponential, etc.) and the moments (mean and covariance matrix), and demonstrates that for a wide range of cost functions the associated distributionally robust stochastic program can be solved efficiently.

Journal ArticleDOI
Robust Solutions of Optimization Problems Affected by Uncertain Probabilities
TL;DR: Studies robust linear optimization problems with uncertainty regions defined by φ-divergences (for example, chi-squared, Hellinger, Kullback–Leibler) and shows that the robust counterpart of a linear optimization problem with φ-divergence uncertainty is tractable for most choices of φ typically considered in the literature.
Frequently Asked Questions (12)

Q1. What are the contributions mentioned in the paper "Input-output uncertainty comparisons for discrete optimization via simulation"?

Theorem 2 requires $B = m^{\gamma}$ for $0 < \gamma < 2$, which is the condition for asymptotic normality of $\sqrt{B/m}\,(\widehat{B}_i - B_i)$ in Proposition 3 in Section EC.5.

In a realistic DOvS setting, each system's performance measure is estimated via simulation replications, which introduces stochastic error.

The all-in IOU-C procedure is protected against such an error by accounting for the estimation error in the gradients, at the price of its conservatism.

A total of $L = 1{,}000$ values of $(\hat{\theta} - \theta^c)$ were sampled in the random search algorithm (see Section EC.2) to approximate the optimal solutions of $P_{i\ell}$, $i \neq \ell$.

The average subset size of the plug-in procedure is 1.82, which is much smaller than that of all-in IOU-C, yet the estimated simultaneous coverage probability of the plug-in procedure is 0.874 (dashed line).

The average size of $S_0$ is 1.03 for this procedure, which is the smallest among all three procedures since it ignores input uncertainty.

Assumption 1(vii) states that given the plug-in distribution of CID effects and $V_i(\hat{\theta})$, the authors can find the exact multidimensional quantile vectors for $-w^{(1)}_{i\ell}$ and $-w^{(2)}_{i\ell}$, respectively.

Figure 2 also shows that the simultaneous MCB coverage probability of the conditional procedure is 0.235 (dotted line), which is far lower than 0.9.

The actual number of units that arrive has a binomial distribution where the probability that each unit in the order arrives is 0.95.