scispace - formally typeset
Open AccessJournal ArticleDOI

Dynamic pricing and learning: historical origins, current research, and new directions

A.V. den Boer
- 01 Jun 2015 - 
- Vol. 20, Iss: 1, pp 1-18
TLDR
In this paper, the authors provide a brief introduction to the historical origins of quantitative research on pricing and demand estimation, point to different subfields in the area of dynamic pricing, and provide an in-depth overview of the available literature on dynamic pricing and learning.
About
This article is published in Surveys in Operations Research and Management Science.The article was published on 2015-06-01 and is currently open access. It has received 245 citations till now. The article focuses on the topics: Dynamic pricing.

read more

Citations
More filters
Journal ArticleDOI

Bandits with Knapsacks

TL;DR: This work presents two algorithms whose reward is close to the information-theoretic optimum: one is based on a novel "balanced exploration" paradigm, while the other is a primal-dual algorithm that uses multiplicative updates that is optimal up to polylogarithmic factors.
Posted Content

Introduction to Multi-Armed Bandits

TL;DR: In this article, a more introductory, textbook-like treatment of multi-armed bandits is provided, with a self-contained, teachable technical introduction and a brief review of further developments; many of the chapters conclude with exercises.
Journal ArticleDOI

Dynamic Pricing and Demand Learning with Limited Price Experimentation

TL;DR: A pricing policy is demonstrated that incurs a regret of O(log^(m) T), or m iterations of the logarithm, and it is shown that this regret is the smallest possible up to a constant factor.
Journal ArticleDOI

The impact of dynamic price variability on revenue maximization

TL;DR: In this article, the authors analyzed the effect of dynamic price variability on revenue maximization and found that higher price variability leads to higher hotel revenues, while the benefits from charging different prices for the same service (intertemporal price discrimination) and limiting the number of units available before the demand is known (inventory control) outweigh the potential negative effects of price unfairness and organizational culture.
Journal ArticleDOI

Personalized Dynamic Pricing with Machine Learning: High Dimensional Features and Heterogeneous Elasticity

TL;DR: This work considers a seller who can dynamically adjust the price of a product at the individual customer level, by utilizing information about customers’ characteristics encoded as a d-dimensional feature vector, and designs a near-optimal pricing policy for a “semi-clairvoyant” seller that achieves an expected regret of order s √Tlog T.
References
More filters
Journal ArticleDOI

Finite-time Analysis of the Multiarmed Bandit Problem

TL;DR: This work shows that the optimal logarithmic regret is also achievable uniformly over time, with simple and efficient policies, and for all reward distributions with bounded support.
Journal ArticleDOI

Automobile prices in market equilibrium

TL;DR: In this article, the authors developed techniques for empirically analyzing demand and supply in differentiated products markets and then applied these techniques to analyze equilibrium in the U.S. automobile industry.
Journal ArticleDOI

A New Product Growth for Model Consumer Durables

TL;DR: A growth model for the timing of initial purchase of new products is developed and tested empirically against data for eleven consumer durables, and a long-range forecast is developed for the sales of color television sets.
Book ChapterDOI

A new product growth model for consumer durables

Frank M. Bass
- 01 Jan 1976 - 
TL;DR: In this article, a growth model for the timing of initial purchase of new products is proposed, and a behavioral rationale for the model is offered in terms of innovative and imitative behavior.
Journal ArticleDOI

Mixed mnl models for discrete response

TL;DR: In this article, the adequacy of a mixing specification can be tested simply as an omitted variable test with appropriately definedartificial variables, and a practicalestimation of aarametricmixingfamily can be run by MaximumSimulated Likelihood EstimationorMethod ofSimulatedMoments, andeasilycomputedinstruments are provided that make the latter procedure fairly eAcient.
Related Papers (5)
Frequently Asked Questions (10)
Q1. What have the authors contributed in "Dynamic pricing and learning: historical origins, current research, and new directions" ?

The authors survey these literature streams: they provide a brief introduction to the historical origins of quantitative research on pricing and demand estimation, point to different subfields in the area of dynamic pricing, and provide an in-depth overview of the available literature on dynamic pricing and learning. The authors discuss relations with methodologically related research areas, and identify several important directions for future research. 

The focus of the paper is on properties and numerical performance of an online-learning algorithm suitable for the complicated process considered by the authors. 

The common goal of the literature on dynamic pricing and learning is to develop pricing policies that take the intrinsic uncertainty about the relation between price and expected demand into account. 

They show that if an infinite number of goods can be sold during a finite time interval, it is optimal to use a price-skimming strategy. 

Mason and Välimäki (2011) consider a seller of a single item in an infinite time horizon, with maximizing the expected discounted reward as objective criterion. 

In addition they prove a performance bound on decay balancing, showing that the resulting expected discounted revenue is always at least one third of the optimal value. 

In the static monopoly pricing problem considered by Cournot (1838), the demand function is deterministic and completely known to the firm. 

Lobel and Perakis (2011) attempt to bridge the gap between robust and data-driven approaches to dynamic pricing, by considering a setting where the uncertainty set is deduced from data samples. 

In theory an optimal price strategy can be calculated by dynamic programming, but in practice this is computationally intractable. 

They show that a certainty equivalent pricing policy is not strongly consistent, by showing in an example that the limit of the price sequence is with positive probability different from the optimal price.