Deep Reinforcement Learning for Dynamic Pricing of Perishable Products

Dynamic pricing is a strategy for setting flexible prices for products based on existing market demand. In this paper, we address the problem of dynamic pricing of perishable products using DQN value function approximator. A model-free reinforcement learning approach is used to maximize revenue for...

Full description

Saved in:
Bibliographic Details
Published inOptimization and Learning Vol. 1443; pp. 132 - 143
Main Authors Burman, Vibhati, Vashishtha, Rajesh Kumar, Kumar, Rajan, Ramanan, Sharadha
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2021
Springer International Publishing
SeriesCommunications in Computer and Information Science
Subjects
Online AccessGet full text
ISBN3030856712
9783030856717
ISSN1865-0929
1865-0937
DOI10.1007/978-3-030-85672-4_10

Cover

More Information
Summary:Dynamic pricing is a strategy for setting flexible prices for products based on existing market demand. In this paper, we address the problem of dynamic pricing of perishable products using DQN value function approximator. A model-free reinforcement learning approach is used to maximize revenue for a perishable item with fixed initial inventory and selling horizon. The demand is influenced by the price and freshness of the product. The conventional tabular Q-learning method involves storing the Q-values for each state-action pair in a lookup table. This approach is not suitable for control problems with large state spaces. Hence, we use function approximation approach to address the limitations of a tabular Q-learning method. Using DQN function approximator we generalize the unseen states from the seen states, which reduces the space requirements for storing value function for each state-action combination. We show that using DQN we can model the problem of pricing perishable products. Our results demonstrate that the DQN based dynamic pricing algorithm generates higher revenue when compared with conventional one-step price optimization and constant pricing strategy.
Bibliography:Rajan was an employee of TCS when this work was done.
ISBN:3030856712
9783030856717
ISSN:1865-0929
1865-0937
DOI:10.1007/978-3-030-85672-4_10