Dynamic pricing and learning: historical origins, current research, and new directions

Question

Q1. What have the authors contributed in "Dynamic pricing and learning: historical origins, current research, and new directions" ?

Q2. What is the focus of the paper?

Q3. What is the goal of the literature on dynamic pricing and learning?

Q4. What is the way to use a price-skimming strategy?

Q5. What is the heuristic for a seller of a single item?

Q6. How does the authors prove a performance bound on decay balancing?

Q7. What is the definition of the demand function in the static monopoly pricing problem?

Q8. How do they bridge the gap between robust and data-driven approaches to dynamic pricing?

Q9. How can an optimal price strategy be calculated?

Q10. What is the way to show that a certainty equivalent pricing policy is not strongly consistent?

Accepted Answer

The authors survey these literature streams: they provide a brief introduction to the historical origins of quantitative research on pricing and demand estimation, point to different subfields in the area of dynamic pricing, and provide an in-depth overview of the available literature on dynamic pricing and learning. The authors discuss relations with methodologically related research areas, and identify several important directions for future research.

Accepted Answer

The focus of the paper is on properties and numerical performance of an online-learning algorithm suitable for the complicated process considered by the authors.

Accepted Answer

The common goal of the literature on dynamic pricing and learning is to develop pricing policies that take the intrinsic uncertainty about the relation between price and expected demand into account.

Accepted Answer

They show that if an infinite number of goods can be sold during a finite time interval, it is optimal to use a price-skimming strategy.

Accepted Answer

Mason and Välimäki (2011) consider a seller of a single item in an infinite time horizon, with maximizing the expected discounted reward as objective criterion.

Accepted Answer

In addition they prove a performance bound on decay balancing, showing that the resulting expected discounted revenue is always at least one third of the optimal value.

Accepted Answer

In the static monopoly pricing problem considered by Cournot (1838), the demand function is deterministic and completely known to the firm.

Accepted Answer

Lobel and Perakis (2011) attempt to bridge the gap between robust and data-driven approaches to dynamic pricing, by considering a setting where the uncertainty set is deduced from data samples.

Accepted Answer

In theory an optimal price strategy can be calculated by dynamic programming, but in practice this is computationally intractable.

Accepted Answer

They show that a certainty equivalent pricing policy is not strongly consistent, by showing in an example that the limit of the price sequence is with positive probability different from the optimal price.

Dynamic pricing and learning: historical origins, current research, and new directions

Citations

Bandits with Knapsacks

Introduction to Multi-Armed Bandits

Dynamic Pricing and Demand Learning with Limited Price Experimentation

The impact of dynamic price variability on revenue maximization

Personalized Dynamic Pricing with Machine Learning: High Dimensional Features and Heterogeneous Elasticity

References

Finite-time Analysis of the Multiarmed Bandit Problem

Automobile prices in market equilibrium

A New Product Growth for Model Consumer Durables

A new product growth model for consumer durables

Mixed mnl models for discrete response

Related Papers (5)

Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms

Simultaneously Learning and Optimizing Using Controlled Variance Pricing

Dynamic Pricing Under a General Parametric Choice Model

Optimal dynamic pricing of inventories with stochastic demand over finite horizons

The Theory and Practice of Revenue Management

Frequently Asked Questions (10)