Optimal Transportation Network Company Vehicle Dispatching via Deep Deterministic Policy Gradient
With the popularity of smart phones and the maturity of civilian global positioning system (GPS) technology, transportation network company (TNC) services have become a prominent commute mode in many major cities, which can effectively pair the passengers with the TNC vehicles/drivers through mobile...
        Saved in:
      
    
          | Published in | Wireless Algorithms, Systems, and Applications Vol. 11604; pp. 297 - 309 | 
|---|---|
| Main Authors | , , , , , | 
| Format | Book Chapter | 
| Language | English | 
| Published | 
        Switzerland
          Springer International Publishing AG
    
        2019
     Springer International Publishing  | 
| Series | Lecture Notes in Computer Science | 
| Subjects | |
| Online Access | Get full text | 
| ISBN | 9783030235963 3030235963  | 
| ISSN | 0302-9743 1611-3349  | 
| DOI | 10.1007/978-3-030-23597-0_24 | 
Cover
| Summary: | With the popularity of smart phones and the maturity of civilian global positioning system (GPS) technology, transportation network company (TNC) services have become a prominent commute mode in many major cities, which can effectively pair the passengers with the TNC vehicles/drivers through mobile applications. However, given the growing number of TNC vehicles, how to efficiently dispatch TNC vehicles poses crucial challenges. In this paper, we propose a novel method for TNC vehicle dispatching in different areas of the city based on deep reinforcement learning (DRL) method with joint consideration of the TNC company, individual TNC vehicle, and customer/passenger. The proposed model optimizes the distribution of vehicles geographically to meet the customers’ demands, while improving the drivers’ profit. In particular, we consider the high dimensional state and action space in the urban city traffic dynamic environment, and develop a deep deterministic policy gradient, an actor-critic based DRL algorithm for dispatching vacant TNC vehicles. We leverage Didi Chuxing’s open data set to evaluate the performance of the proposed approach, and the simulation results show that the proposed approach improves the average income of the driver while satisfying the supply and demand relationship between TNC vehicles and customers/passengers. | 
|---|---|
| ISBN: | 9783030235963 3030235963  | 
| ISSN: | 0302-9743 1611-3349  | 
| DOI: | 10.1007/978-3-030-23597-0_24 |