What are the two tools the authors use to protect against faulty inference?

The authors see their ambiguity set technique and nonparametric analysis as important tools to protect against potentially faulty inference in these settings.

How can the authors generate lower and upper bounds on the function f(x) quickly?

Using software for linear optimization, it is possible to generate lower and upper bounds on the function f(x̂) for various choices of x̂ quickly and efficiently.

What is the cost of traveling arc a?

Note that because of interdependencies in the network, the cost of traveling arc a may depend not only on x_a, but on the flows on other arcs as well.

How can the authors use their approach to solve the inverse variational inequality problem?

3. Nonparametric estimation: Like existing methods in inverse optimization and structural estimation, their approach can be applied in a parametric setting.

What is the way to estimate a function in equilibrium?

In this paper, the authors propose a computationally tractable technique for estimation in equilibrium based on an inverse variational inequality formulation.

(Open Access) Data-driven estimation in equilibrium using inverse optimization (2015) | Dimitris Bertsimas

Q: What contributions have the authors mentioned in the paper "Data-driven estimation in equilibrium using inverse optimization" ?

The authors use this technique to estimate the utility functions of players in a game from their observed actions and to estimate the congestion function on a road network from traffic count data. A distinguishing feature of their approach is that it supports both parametric and nonparametric estimation by leveraging ideas from statistical learning (kernel methods and regularization operators). In computational experiments involving Nash and Wardrop equilibria in a nonparametric setting, the authors find that a) they effectively estimate the unknown demand or congestion function, respectively, and b) their proposed regularization technique substantially improves the out-of-sample performance of their estimators.

Q: What is the purpose of their estimate?

Their estimate can be used either to predict congestion on the network in the future or to inform subsequent network design problems.


Mathematical Programming

A Publication of the Mathematical Optimization Society

ISSN 0025-5610

Volume 153

Number 2

Math. Program. (2015) 153:595-633

DOI 10.1007/s10107-014-0819-4

Data-driven estimation in equilibrium using inverse optimization

Dimitris Bertsimas, Vishal Gupta & Ioannis Ch. Paschalidis


Your article is protected by copyright and all rights are held exclusively by Springer-Verlag Berlin Heidelberg and Mathematical Optimization Society. This e-offprint is for personal use only and shall not be self-archived in electronic repositories. If you wish to self-archive your article, please use the accepted manuscript version for posting on your own website. You may further deposit the accepted manuscript version in any repository, provided it is only made publicly available 12 months after official publication or later and provided acknowledgement is given to the original source of publication and a link is inserted to the published article on Springer's website. The link must be accompanied by the following text: “The final publication is available at link.springer.com”.

Math. Program., Ser. A (2015) 153:595–633

DOI 10.1007/s10107-014-0819-4

FULL LENGTH PAPER

Data-driven estimation in equilibrium using inverse optimization

Dimitris Bertsimas · Vishal Gupta · Ioannis Ch. Paschalidis

Received: 23 November 2012 / Accepted: 12 September 2014 / Published online: 30 September 2014

Abstract Equilibrium modeling is common in a variety of fields such as game theory and transportation science. The inputs for these models, however, are often difficult to estimate, while their outputs, i.e., the equilibria they are meant to describe, are often directly observable. By combining ideas from inverse optimization with the theory of variational inequalities, we develop an efficient, data-driven technique for estimating the parameters of these models from observed equilibria. We use this technique to estimate the utility functions of players in a game from their observed actions and to estimate the congestion function on a road network from traffic count data. A distinguishing feature of our approach is that it supports both parametric and nonparametric estimation by leveraging ideas from statistical learning (kernel methods and regularization operators). In computational experiments involving Nash and Wardrop equilibria in a nonparametric setting, we find that a) we effectively estimate the unknown demand or congestion function, respectively, and b) our proposed regularization technique substantially improves the out-of-sample performance of our estimators.

D. Bertsimas
MIT, Sloan School of Management, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
e-mail: dbertsim@mit.edu

V. Gupta
Operations Research Center, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
e-mail: vgupta1@mit.edu

I. Ch. Paschalidis
Department of Electrical and Computer Engineering, Boston University, Boston, MA 02215, USA
e-mail: yannisp@bu.edu


Author's personal copy


Keywords Equilibrium · Nonparametric estimation · Utility estimation · Traffic assignment

Mathematics Subject Classification 74G75 Equilibrium: Inverse Problems · 62G05 Nonparametric Inference: Estimation · 62P20 Applications to Economics · 90B20 Operations Research and Management Science: Traffic Problems

1 Introduction

Modeling phenomena as equilibria is a common approach in a variety of fields. Examples include Nash equilibrium in game theory, traffic equilibrium in transportation science and market equilibrium in economics. Often, however, the model primitives or “inputs” needed to calculate equilibria are not directly observable and can be difficult to estimate. Small errors in these estimates may have large impacts on the resulting equilibrium. This problem is particularly serious in design applications, where one seeks to (re)design a system so that the induced equilibrium satisfies some desirable properties, such as maximizing social welfare. In this case, small errors in the estimates may substantially affect the optimal design. Thus, developing accurate estimates of the primitives is crucial.

In this work we propose a novel framework to estimate the unobservable model primitives for systems in equilibrium. Our data-driven approach hinges on the fact that although the model primitives may be unobservable, it is frequently possible to observe equilibria experimentally. We use these observed equilibria to estimate the original primitives.

We draw on an example from game theory to illustrate. Typically, one specifies the utility functions for each player in a game and then calculates Nash equilibria. In practice, however, it is essentially impossible to observe utilities directly. Worse, the specific choice of utility function often makes a substantial difference in the resulting equilibrium. Our approach amounts to estimating a player’s utility function from her actions in previous games, assuming her actions were approximately equilibria with respect to her opponents. In contrast to her utility function, her previous actions are directly observable. This utility function can be used either to predict her actions in future games, or as an input to subsequent mechanism design problems involving this player in the future.

A second example comes from transportation science. Given a particular road network, one typically specifies a cost function and then calculates the resulting flow under user (Wardrop) equilibrium. However, measuring the cost function directly in a large-scale network is challenging because of the interdependencies among arcs. Furthermore, errors in estimates of cost functions can have severe and counterintuitive effects; the Braess paradox (see [13]) is one well-known example. Our approach amounts to estimating cost functions using current traffic count data (flows) on the network, assuming those flows are approximately in equilibrium. Again, in contrast to the cost function, traffic count data are readily observable and frequently collected on many real-life networks. Finally, our estimate can be used either to predict congestion on the network in the future, or else to inform subsequent network design problems.
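The Braess paradox mentioned above can be reproduced with a few lines of arithmetic. The sketch below uses the classic four-node textbook network rather than an example from the paper: the node names, the 4000 drivers, and the arc cost functions are all illustrative assumptions.

```python
# Classic four-node Braess network with textbook numbers (an assumption for
# illustration; none of these values come from the paper).
ROUTES = ["S-A-E", "S-B-E", "S-A-B-E"]  # the last route uses the new arc A->B


def route_times(flows):
    """Travel time (minutes) on each route, given the drivers on each route."""
    load_sa = flows["S-A-E"] + flows["S-A-B-E"]  # drivers on arc S->A
    load_be = flows["S-B-E"] + flows["S-A-B-E"]  # drivers on arc B->E
    t_sa = load_sa / 100.0  # congestible arc: time grows with flow
    t_be = load_be / 100.0  # congestible arc: time grows with flow
    t_ae = t_sb = 45.0      # uncongested arcs: constant time
    t_ab = 0.0              # the added shortcut is free
    return {
        "S-A-E": t_sa + t_ae,
        "S-B-E": t_sb + t_be,
        "S-A-B-E": t_sa + t_ab + t_be,
    }


def is_wardrop(flows, allowed, tol=1e-9):
    """True if every used route is as fast as the fastest allowed route."""
    times = route_times(flows)
    fastest = min(times[r] for r in allowed)
    return all(times[r] <= fastest + tol for r in allowed if flows[r] > 0)


# Without the shortcut, 4000 drivers split evenly over the two original routes.
before = {"S-A-E": 2000, "S-B-E": 2000, "S-A-B-E": 0}
# With the shortcut, every driver ends up on the route that uses it.
after = {"S-A-E": 0, "S-B-E": 0, "S-A-B-E": 4000}

print(route_times(before)["S-A-E"], is_wardrop(before, ROUTES[:2]))
print(route_times(after)["S-A-B-E"], is_wardrop(after, ROUTES))
print(is_wardrop(before, ROUTES))  # the old split stops being an equilibrium
```

In this toy network the equilibrium travel time rises from 65 to 80 minutes once the free arc is added, which illustrates why a design decision based on a misestimated cost function can backfire.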


In general, we focus on equilibria that can be modeled as the solution to a variational inequality (VI). VIs are a natural tool for describing equilibria with examples spanning economics, transportation science, physics, differential equations, and optimization. (See Sect. 2.1 or [26] for detailed examples.) Our model centers on solving an inverse variational inequality problem: given data that we believe are equilibria, i.e., solutions to some VI, estimate the function which describes this VI, i.e., the model primitives.
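For readers without Sect. 2.1 at hand, the forward and inverse problems can be written out as follows. This is a sketch in standard VI notation; the symbols $\mathcal{F}$, $f$, $x_j$, $N$ and $\epsilon$ are assumptions made for illustration and need not match the paper's own notation.

```latex
% Forward problem VI(f, F): given f : R^n -> R^n and a closed convex set F,
% find a point x^* such that
\[
  x^* \in \mathcal{F}, \qquad
  f(x^*)^\top (x - x^*) \ge 0 \quad \text{for all } x \in \mathcal{F}.
\]
% Inverse problem (sketch): given observations x_1, ..., x_N in F, look for
% an f in some candidate class such that, for a small epsilon >= 0,
\[
  f(x_j)^\top (x - x_j) \ge -\epsilon
  \quad \text{for all } x \in \mathcal{F}, \quad j = 1, \dots, N .
\]
```

Setting $\epsilon = 0$ would require every observation to be an exact solution of the VI; allowing $\epsilon > 0$ is one natural way to express that the data are only approximately in equilibrium.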

Our formulation and analysis are motivated in many ways by the inverse optimization literature. In inverse optimization, one is given a candidate solution to an optimization problem and seeks to characterize the cost function or other problem data that would make that solution (approximately) optimal. See [27] for a survey of inverse combinatorial optimization problems, [3] for the case of linear optimization and [28] for the case of conic optimization. The critical difference, however, is that we seek a cost function that would make the observed data equilibria, not optimal solutions to an optimization problem. In general, optimization problems can be reformulated as variational inequalities (see Sect. 2.1), so that our inverse VI problem generalizes inverse optimization, and this generalization allows us to address a variety of new applications.
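The statement that optimization problems can be recast as VIs can be checked numerically on a toy problem. The sketch below assumes a hypothetical objective F(x) = ½‖x − c‖² minimized over the unit box and takes f to be its gradient; the names `C`, `grad`, `project_to_box` and `vi_gap` are illustrative and do not come from the paper.

```python
import itertools

C = (1.5, -0.3)  # hypothetical target point lying outside the unit box


def grad(x):
    """Gradient of F(x) = 0.5 * ||x - C||^2, which plays the role of f in the VI."""
    return tuple(xi - ci for xi, ci in zip(x, C))


def project_to_box(x, lo=0.0, hi=1.0):
    """Minimizer of F over the box [lo, hi]^n (coordinate-wise clipping)."""
    return tuple(min(max(xi, lo), hi) for xi in x)


def vi_gap(x_star, grid_steps=10):
    """Smallest value of grad(x_star)^T (x - x_star) over a grid of box points.

    The expression is linear in x, so its minimum over the box is attained
    at a corner, and the grid contains every corner.
    """
    g = grad(x_star)
    pts = [i / grid_steps for i in range(grid_steps + 1)]
    return min(
        sum(gi * (xi - si) for gi, xi, si in zip(g, x, x_star))
        for x in itertools.product(pts, repeat=len(x_star))
    )


x_opt = project_to_box(C)       # the constrained minimizer
gap_opt = vi_gap(x_opt)         # nonnegative: the minimizer solves the VI
gap_bad = vi_gap((0.5, 0.5))    # negative: a non-optimal point violates it

print(x_opt, gap_opt, gap_bad)
```

The constrained minimizer satisfies the VI inequality at every grid point, while the interior point (0.5, 0.5) does not. The converse direction is what makes the VI setting broader: not every VI arises as the optimality condition of an optimization problem.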

To the best of our knowledge, we are the first to consider inverse variational inequality problems. Previous work, however, has examined the problem of estimating parameters for systems assumed to be in equilibrium, most notably the structural estimation literature in econometrics and operations management ([4,5,32,35]). Although there are a myriad of techniques collectively referred to as structural estimation, roughly speaking, they entail (1) assuming a parametric model for the system including probabilistic assumptions on random quantities, (2) deducing a set of necessary (structural) equations for unknown parameters, and, finally, (3) solving a constrained optimization problem corresponding to a generalized method of moments (GMM) estimate for the parameters. The constraints of this optimization problem include the structural equations and possibly other application-specific constraints, e.g., orthogonality conditions of instrumental variables. Moreover, this optimization problem is typically difficult to solve numerically, as it can be non-convex with large flat regions and multiple local optima (see [4] for some discussion).

Our approach differs from structural estimation and other specialized approaches in a number of respects. From a philosophical point of view, the most critical difference is in the objective of the methodology. Specifically, in the structural estimation paradigm, one posits a “ground-truth” model of a system with a known parametric form. The objective of the method is to learn the parameters in order to provide insight into the system. By contrast, in our paradigm, we make no assumptions (parametric or nonparametric) about the true mechanics of the system; we treat it as a “black-box.” Our objective is to fit a model (in fact, a VI) that can be used to predict the behavior of the system. We make no claim that this fitted model accurately reflects “reality,” merely that it has good predictive power.

This distinction is subtle, mirroring the distinction between “data modeling” in classical statistics and “algorithmic modeling” in machine learning. (A famous, albeit partisan, account of this distinction is [15].) Our approach is kindred to the machine learning point of view. For a more detailed discussion, please see Appendix 2.

This philosophical difference has a number of practical consequences:

