Journal of Computer Applications ›› 2022, Vol. 42 ›› Issue (11): 3395-3403.DOI: 10.11772/j.issn.1001-9081.2021122109

• CCF Bigdata 2021 • Previous Articles    

User incentive based bike‑sharing dispatching strategy

Bing SHI1, Xizi HUANG1, Zhaoxiang SONG1, Jianqiao XU2()   

  1. 1.School of Computer Science and Artificial Intelligence,Wuhan University of Technology,Wuhan Hubei 430070,China
    2.Department of Information Security,Naval University of Engineering,Wuhan Hubei 430033,China
  • Received:2021-12-15 Revised:2022-01-18 Accepted:2022-01-24 Online:2022-11-14 Published:2022-11-10
  • Contact: Jianqiao XU
  • About author:SHI Bing, born in 1982, Ph. D., professor. His research interests include artificial intelligence, multi‑agent systems.
    HUANG Xizi, born in 1997, M. S. candidate. Her research interests include artificial intelligence, multi‑agent systems.
    SONG Zhaoxiang, born in 1997, M. S. candidate. His research interests include artificial intelligence, multi‑agent systems.
    XU Jianqiao, born in 1979, M. S., lecturer. His research interests include network and information security, artificial intelligence.
  • Supported by:
    Humanity and Social Science Research Foundation of Ministry of Education of China(19YJC790111);Philosophy and Social Science Post?Foundation of Ministry of Education(18JHQ060)

基于用户激励的共享单车调度策略

石兵1, 黄茜子1, 宋兆翔1, 徐建桥2()   

  1. 1.武汉理工大学 计算机与人工智能学院, 武汉 430070
    2.海军工程大学 信息安全系, 武汉 430033
  • 通讯作者: 徐建桥
  • 作者简介:石兵(1982—),男,江苏泰兴人,教授,博士,CCF会员,主要研究方向:人工智能、多智能体系统
    黄茜子(1997—),女,湖北咸宁人,硕士研究生,主要研究方向为:人工智能、多智能体系统
    宋兆翔(1997—),男,湖北孝感人,硕士研究生,主要研究方向:人工智能、多智能体系统
    徐建桥(1979—),男,湖北武汉人,讲师,硕士,主要研究方向:网络与信息安全、人工智能。xujianqiao321@163.com
  • 基金资助:
    教育部人文社会科学研究项目(19YJC790111);教育部哲学社会科学研究后期资助项目(18JHQ060)

Abstract:

To address the dispatching problem of bike?sharing, considering the budget constraints, user maximum walking distance restrictions, user temporal and spatial demands and dynamic changes in the distribution of shared bicycles, a bike?sharing dispatching strategy with user incentives was proposed to improve the long?term user service rate of the bike?sharing platform. The dispatching strategy consists of a task generation algorithm, a budget allocation algorithm and a task allocation algorithm. In the task generation algorithm, the Long Short?Term Memory (LSTM) network was used to predict the future bike demand of users; in the budget allocation algorithm, the Deep Deterministic Policy Gradient (DDPG) algorithm was used to design a budget allocation strategy; after the budget was allocated to the tasks, the tasks needed to be allocated to the user for execution, so a greedy matching strategy was used for task allocation. Experiments were carried out on the Mobike dataset to compare the proposed strategy with the dispatching strategy with unlimited budget (that is, the platform is not limited by budget and can use any money to encourage users to ride to the target area), the greedy dispatching strategy, the dispatching strategy with truck hauling, and the situation without dispatching. Experimental results show that the proposed dispatching strategy with user incentive can effectively improve the service rate in the bike?sharing system compared to the greedy dispatching strategy and dispatching strategy with truck hauling.

Key words: bike?sharing dispatching, demand prediction, user incentive, Markov decision, deep reinforcement learning

摘要:

针对共享单车的调度问题,在考虑预算限制、用户最大步行距离限制、用户时空需求以及共享单车分布动态变化的情况下,提出一种用户激励下的共享单车调度策略,以达到提高共享单车平台长期用户服务率的目的。该调度策略包含任务生成算法、预算分配算法和任务分配算法。在任务生成算法中,使用长短期记忆(LSTM)网络预测用户未来的单车需求量;在预算分配算法中,采用深度策略梯度(DDPG)算法来设计预算分配策略;任务分配完预算后,需要将任务分配给用户执行,因此在任务分配算法中使用贪心匹配策略来进行任务分配。基于摩拜单车的数据集进行实验,并把所提策略分别与无预算限制的调度策略(即平台不受预算限制,可以使用任意金钱激励用户将车骑行至目标区域)、贪心的调度策略、卡车拖运下的调度策略以及未进行调度的情况进行对比。实验结果表明,与贪心调度策略和卡车托运下的调度策略相比,用户激励下的共享单车调度策略能有效提高共享单车系统中的用户服务率。

关键词: 共享单车调度, 需求预测, 用户激励, 马尔可夫决策, 深度强化学习

CLC Number: