scispace - formally typeset
Open AccessJournal ArticleDOI

Data-driven estimation in equilibrium using inverse optimization

TLDR
This work develops an efficient, data-driven technique for estimating the parameters of these models from observed equilibria, and supports both parametric and nonparametric estimation by leveraging ideas from statistical learning (kernel methods and regularization operators).
Abstract
Equilibrium modeling is common in a variety of fields such as game theory and transportation science. The inputs for these models, however, are often difficult to estimate, while their outputs, i.e., the equilibria they are meant to describe, are often directly observable. By combining ideas from inverse optimization with the theory of variational inequalities, we develop an efficient, data-driven technique for estimating the parameters of these models from observed equilibria. We use this technique to estimate the utility functions of players in a game from their observed actions and to estimate the congestion function on a road network from traffic count data. A distinguishing feature of our approach is that it supports both parametric and nonparametric estimation by leveraging ideas from statistical learning (kernel methods and regularization operators). In computational experiments involving Nash and Wardrop equilibria in a nonparametric setting, we find that a) we effectively estimate the unknown demand or congestion function, respectively, and b) our proposed regularization technique substantially improves the out-of-sample performance of our estimators.

read more

Content maybe subject to copyright    Report

1 23
Mathematical Programming
A Publication of the Mathematical
Optimization Society
ISSN 0025-5610
Volume 153
Number 2
Math. Program. (2015) 153:595-633
DOI 10.1007/s10107-014-0819-4
Data-driven estimation in equilibrium
using inverse optimization
Dimitris Bertsimas, Vishal Gupta &
Ioannis Ch.Paschalidis

1 23
Your article is protected by copyright and
all rights are held exclusively by Springer-
Verlag Berlin Heidelberg and Mathematical
Optimization Society. This e-offprint is for
personal use only and shall not be self-
archived in electronic repositories. If you wish
to self-archive your article, please use the
accepted manuscript version for posting on
your own website. You may further deposit
the accepted manuscript version in any
repository, provided it is only made publicly
available 12 months after official publication
or later and provided acknowledgement is
given to the original source of publication
and a link is inserted to the published article
on Springer's website. The link must be
accompanied by the following text: "The final
publication is available at link.springer.com”.

Math. Program., Ser. A (2015) 153:595–633
DOI 10.1007/s10107-014-0819-4
FULL LENGTH PAPER
Data-driven estimation in equilibrium using inverse
optimization
Dimitris Bertsimas · Vishal Gupta ·
Ioannis Ch. Paschalidis
Received: 23 November 2012 / Accepted: 12 September 2014 / Published online: 30 September 2014
© Springer-Verlag Berlin Heidelberg and Mathematical Optimization Society 2014
Abstract Equilibrium modeling is common in a variety of fields such as game theory
and transportation science. The inputs for these models, however, are often diffi-
cult to estimate, while their outputs, i.e., the equilibria they are meant to describe,
are often directly observable. By combining ideas from inverse optimization with
the theory of variational inequalities, we develop an efficient, data-driven technique
for estimating the parameters of these models from observed equilibria. We use this
technique to estimate the utility functions of players in a game from their observed
actions and to estimate the congestion function on a road network from traffic count
data. A distinguishing feature of our approach is that it supports both parametric and
nonparametric estimation by leveraging ideas from statistical learning (kernel meth-
ods and regularization operators). In computational experiments involving Nash and
Wardrop equilibria in a nonparametric setting, we find that a) we effectively estimate
the unknown demand or congestion function, respectively, and b) our proposed reg-
ularization technique substantially improves the out-of-sample performance of our
estimators.
D. Bertsimas (
B
)
MIT, Sloan School of Management, Massachusetts Institute of Technology,
Cambridge, MA 02139, USA
e-mail: dbertsim@mit.edu
V. Gupta
Operations Research Center, Massachusetts Institute of Technology,
Cambridge, MA 02139, USA
e-mail: vgupta1@mit.edu
I. Ch. Paschalidis
Department of Electrical and Computer Engineering, Boston University,
Boston, MA 02215, USA
e-mail: yannisp@bu.edu
123
Author's personal copy

596 D. Bertsimas et al.
Keywords Equilibrium · Nonparametric estimation · Utility estimation ·
Traffic assignment
Mathematics Subject Classification 74G75 Equilibrium: Inverse Problems ·
62G05 Nonparametric Inference: Estimation · 62P20 Applications to Economics ·
90B20 Operations Research and Management Science: Traffic Problems
1 Introduction
Modeling phenomena as equilibria is a common approach in a variety of fields. Exam-
ples include Nash equilibrium in game theory, traffic equilibrium in transportation sci-
ence and market equilibrium in economics. Often, however, the model primitives or
“inputs” needed to calculate equilibria are not directly observable and can be difficult
to estimate. Small errors in these estimates may have large impacts on the resulting
equilibrium. This problem is particularly serious in design applications, where one
seeks to (re)design a system so that the induced equilibrium satisfies some desirable
properties, such as maximizing social welfare. In this case, small errors in the estimates
may substantially affect the optimal design. Thus, developing accurate estimates of
the primitives is crucial.
In this work we propose a novel framework to estimate the unobservable model
primitives for systems in equilibrium. Our data-driven approach hinges on the fact
that although the model primitives may be unobservable, it is frequently possible to
observe equilibria experimentally. We use these observed equilibria to estimate the
original primitives.
We draw on an example from game theory to illustrate. Typically, one specifies
the utility functions for each player in a game and then calculates Nash equilibria. In
practice, however, it is essentially impossible to observe utilities directly. Worse, the
specific choice of utility function often makes a substantial difference in the resulting
equilibrium. Our approach amounts to estimating a player’s utility function from her
actions in previous games, assuming her actions were approximately equilibria with
respect to her opponents. In contrast to her utility function, her previous actions are
directly observable. This utility function can be used either to predict her actions in
future games, or as an input to subsequent mechanism design problems involving this
player in the future.
A second example comes from transportation science. Given a particular road net-
work, one typically specifies a cost function and then calculates the resulting flow
under user (Wardrop) equilibrium. However, measuring the cost function directly in a
large-scale network is challenging because of the interdependencies among arcs. Fur-
thermore, errors in estimates of cost functions can have severe and counterintuitive
effects; Braess paradox (see [13]) is one well-known example. Our approach amounts
to estimating cost functions using current traffic count data (flows) on the network,
assuming those flows are approximately in equilibrium. Again, in contrast to the cost
function, traffic count data are readily observable and frequently collected on many
real-life networks. Finally, our estimate can be used either to predict congestion on
the network in the future, or else to inform subsequent network design problems.
123
Author's personal copy

Data-driven estimation in equilibrium 597
In general, we focus on equilibria that can be modeled as the solution to a variational
inequality (VI). VIs are a natural tool for describing equilibria with examples spanning
economics, transportation science, physics, differential equations, and optimization.
(See Sect. 2.1 or [26] for detailed examples.) Our model centers on solving an inverse
variational inequality problem: given data that we believe are equilibria, i.e., solutions
to some VI, estimate the function which describes this VI, i.e., the model primitives.
Our formulation and analysis is motivated in many ways by the inverse optimization
literature. In inverse optimization, one is given a candidate solution to an optimiza-
tion problem and seeks to characterize the cost function or other problem data that
would make that solution (approximately) optimal. See [27] for a survey of inverse
combinatorial optimization problems, [3] for the case of linear optimization and [28]
for the case of conic optimization. The critical difference, however, is that we seek
a cost function that would make t he observed data equilibria, not optimal solutions
to an optimization problem. In general, optimization problems can be reformulated
as variational inequalities (see Sect. 2.1), so that our inverse VI problem generalizes
inverse optimization, but this generalization allows us to address a variety of new
applications.
To the best of our knowledge, we are the first to consider inverse variational inequal-
ity problems. Previous work, however, has examined the problem of estimating para-
meters for systems assumed to be in equilibrium, most notably the structural estimation
literature in econometrics and operations management ([4,5,32,35]). Although there
are a myriad of techniques collectively referred to as structural estimation, roughly
speaking, they entail (1) assuming a parametric model for t he system including proba-
bilistic assumptions on random quantities, (2) deducing a set of necessary (structural)
equations for unknown parameters, and, finally, (3) solving a constrained optimization
problem corresponding to a generalized method of moments (GMM) estimate for the
parameters. The constraints of this optimization problem include the structural equa-
tions and possibly other application-specific constraints, e.g., orthogonality conditions
of instrumental variables. Moreover, this optimization problem is typically difficult to
solve numerically, as it can be non-convex with large flat regions and multiple local
optima (see [4] for some discussion).
Our approach differs from structural estimation and other specialized approaches in
a number of respects. From a philosophical point of view, the most critical difference is
in the objective of the methodology. Specifically, in the structural estimation paradigm,
one posits a “ground-truth” model of a system with a known parametric form. The
objective of the method is to learn the parameters in order to provide insight into
the system. By contrast, in our paradigm, we make no assumptions (parametric or
nonparametric) about the true mechanics of the system; we treat is as a “black-box.”
Our objective is to fit a model—in fact, a VI—that can be used to predict the behavior
of the system. We make no claim that this fitted model accurately reflects “reality,
merely that it has good predictive power.
This distinction is subtle, mirroring the distinction between “data-modelling” in
classical statistics and “algorithmic modeling” in machine learning. (A famous, albeit
partisaned, account of this distinction is [15].) Our approach is kindred to the machine
learning point of view. For a more detailed discussion, please see Appendix 2.
This philosophical difference has a number of practical consequences:
123
Author's personal copy

Figures
Citations
More filters
Journal ArticleDOI

Smart "Predict, then Optimize"

TL;DR: In this article, the standard paradigm is predict-then-optimize (P2O) approach, which involves two significant challenges: prediction and optimization, and because of the typically complex nature of each challenge, the traditional P2O paradigm is predicted then-optimized.
Journal ArticleDOI

Data-driven inverse optimization with imperfect information

TL;DR: This paper formalizes this inverse optimization problem as a distributionally robust program minimizing the worst-case risk that the predicted decision differs from the agent’s actual response to a random signal and shows that the emerging inverse optimization problems can be exactly reformulated as tractable convex programs when a new suboptimality loss function is used.
Journal ArticleDOI

A robust learning approach for regression models based on distributionally robust optimization

TL;DR: This work presents a Distributionally Robust Optimization (DRO) approach to estimate a robustified regression plane in a linear regression setting, when the observed samples are potentially contaminated with adversarially corrupted outliers, and establishes two types of performance guarantees for the solution to the formulation under mild conditions.
Journal ArticleDOI

Cooperative Operation for Wind Turbines and Hydrogen Fueling Stations With On-Site Hydrogen Production

TL;DR: Numerical experiments indicate that the cooperation can greatly reduce the hydrogen production cost for HFSs and the proposed algorithm outperforms conventional alternating direction method of multipliers (ADMM) algorithm in the case studies.
Journal ArticleDOI

Smart “Predict, then Optimize”

- 01 Jan 2022 - 
TL;DR: Ye et al. as mentioned in this paper proposed a new and very general framework, called Smart "Predict, then Optimize" (SPO), which directly leverages the optimization problem structure and constraints for designing better prediction models.
References
More filters
Book

Convex Optimization

TL;DR: In this article, the focus is on recognizing convex optimization problems and then finding the most appropriate technique for solving them, and a comprehensive introduction to the subject is given. But the focus of this book is not on the optimization problem itself, but on the problem of finding the appropriate technique to solve it.
Book

The Elements of Statistical Learning: Data Mining, Inference, and Prediction

TL;DR: In this paper, the authors describe the important ideas in these areas in a common conceptual framework, and the emphasis is on concepts rather than mathematics, with a liberal use of color graphics.
Book

Nonlinear Programming

Book

Spline models for observational data

Grace Wahba
TL;DR: In this paper, a theory and practice for the estimation of functions from noisy data on functionals is developed, where convergence properties, data based smoothing parameter selection, confidence intervals, and numerical methods are established which are appropriate to a number of problems within this framework.
Book

Optimization by Vector Space Methods

TL;DR: This book shows engineers how to use optimization theory to solve complex problems with a minimum of mathematics and unifies the large field of optimization with a few geometric principles.
Related Papers (5)
Frequently Asked Questions (9)
Q1. What contributions have the authors mentioned in the paper "Data-driven estimation in equilibrium using inverse optimization" ?

The authors use this technique to estimate the utility functions of players in a game from their observed actions and to estimate the congestion function on a road network from traffic count data. A distinguishing feature of their approach is that it supports both parametric and nonparametric estimation by leveraging ideas from statistical learning ( kernel methods and regularization operators ). In computational experiments involving Nash and Wardrop equilibria in a nonparametric setting, the authors find that a ) they effectively estimate the unknown demand or congestion function, respectively, and b ) their proposed regularization technique substantially improves the out-of-sample performance of their estimators. 

The authors see their ambiguity set technique and nonparametric analysis as important tools to protect against potentially faulty inference in these settings. 

Using software for linear optimization, it is possible to generate lower and upper bounds on the function f(x̂) for various choices of x̂ quickly and efficiently. 

errors in estimates of cost functions can have severe and counterintuitive effects; Braess paradox (see [13]) is one well-known example. 

measuring the cost function directly in a large-scale network is challenging because of the interdependencies among arcs. 

Note that because of interdependencies in the network, the cost of traveling arc a may depend not only on xa , but on the flows on other arcs as well. 

3. Nonparametric estimation: Like existing methods in inverse optimization and structural estimation, their approach can be applied in a parametric setting. 

their estimate can be used either to predict congestion on the network in the future, or else to inform subsequent network design problems. 

In this paper, the authors propose a computationally tractable technique for estimation in equilibrium based on an inverse variational inequality formulation.