Optimal Transportation Network Company Vehicle Dispatching via Deep Deterministic Policy Gradient

With the popularity of smart phones and the maturity of civilian global positioning system (GPS) technology, transportation network company (TNC) services have become a prominent commute mode in many major cities, which can effectively pair the passengers with the TNC vehicles/drivers through mobile...

Full description

Saved in:

Bibliographic Details
Published in	Wireless Algorithms, Systems, and Applications Vol. 11604; pp. 297 - 309
Main Authors	Shi, Dian, Li, Xuanheng, Li, Ming, Wang, Jie, Li, Pan, Pan, Miao
Format	Book Chapter
Language	English
Published	Switzerland Springer International Publishing AG 2019 Springer International Publishing
Series	Lecture Notes in Computer Science
Subjects	Actor-critic algorithm Deterministic policy gradient TNC services TNC vehicle dispatching
Online Access	Get full text
ISBN	9783030235963 3030235963
ISSN	0302-9743 1611-3349
DOI	10.1007/978-3-030-23597-0_24

Cover

More Information
Summary:	With the popularity of smart phones and the maturity of civilian global positioning system (GPS) technology, transportation network company (TNC) services have become a prominent commute mode in many major cities, which can effectively pair the passengers with the TNC vehicles/drivers through mobile applications. However, given the growing number of TNC vehicles, how to efficiently dispatch TNC vehicles poses crucial challenges. In this paper, we propose a novel method for TNC vehicle dispatching in different areas of the city based on deep reinforcement learning (DRL) method with joint consideration of the TNC company, individual TNC vehicle, and customer/passenger. The proposed model optimizes the distribution of vehicles geographically to meet the customers’ demands, while improving the drivers’ profit. In particular, we consider the high dimensional state and action space in the urban city traffic dynamic environment, and develop a deep deterministic policy gradient, an actor-critic based DRL algorithm for dispatching vacant TNC vehicles. We leverage Didi Chuxing’s open data set to evaluate the performance of the proposed approach, and the simulation results show that the proposed approach improves the average income of the driver while satisfying the supply and demand relationship between TNC vehicles and customers/passengers.
ISBN:	9783030235963 3030235963
ISSN:	0302-9743 1611-3349
DOI:	10.1007/978-3-030-23597-0_24