基于用户激励的共享单车调度策略研究（CCF Bigdata 2021+139）

《计算机应用》唯一官方网站

• • 下一篇

基于用户激励的共享单车调度策略研究（CCF Bigdata 2021+139）

石兵¹,黄茜子¹,宋兆翔²,徐建桥³

1. 武汉理工大学计算机与人工智能学院
2. 武汉理工大学
3. 中国人民解放军海军工程大学信息安全系

收稿日期:2021-12-15 修回日期:2022-01-18 接受日期:2022-01-24 发布日期:2022-06-08 出版日期:2022-06-08
通讯作者: 黄茜子
基金资助:
教育部人文社科研究项目;教育部哲学社会科学研究后期资助项目

Users incentive bike-sharing dispatching: A reinforcement learning method（CCF Bigdata 2021+139）

Received:2021-12-15 Revised:2022-01-18 Accepted:2022-01-24 Online:2022-06-08 Published:2022-06-08
Supported by:
Humanity and Social Science Youth Research Foundation of Ministry of Education;Philosophy and Social Science Post-Foundation of Ministry of Education

摘要/Abstract

摘要： 针对共享单车的调度问题，考虑预算限制、用户最大步行距离限制、用户时空需求以及共享单车分布动态变化的情况下，提出一种用户激励下的共享单车调度策略，达到提高共享单车平台长期用户服务率的目的。该调度策略包含任务生成算法、预算分配算法和任务分配算法。任务生成算法中，基于LSTM预测用户未来的单车需求量。预算分配算法中，平台要顺序地为各个时段分配预算，这是一个序贯决策问题，因此可建模为马尔科夫决策过程，并采用深度强化学习算法DDPG来设计预算分配策略。在任务分配算法中，由于预算的限制导致无法使用主流的二部图匹配算法，选择使用贪心匹配策略来进行任务分配。最后，基于摩拜单车的数据集进行实验，并分别与无预算限制的调度策略（即平台不受预算限制，可以使用任意金钱激励用户将车骑行至目标区域）、基于贪心的调度策略、卡车拖运下的调度策略以及未进行调度的情况进行对比实验。结果表明用户激励下的共享单车调度策略效果仅次于无预算限制的调度策略，能够为共享单车的调度策提供有意义的指导。

关键词: 共享单车调度, 需求预测, 用户激励, 马尔科夫决策, 深度强化学习

Abstract: Focused on the issue that bike-sharing dispatching, considered budget constraints, restrictions on users' maximum walking distance, users' temporal and spatial needs, and dynamic changes in the distribution of shared bicycles. Devised a bike-sharing dispatching strategy with user participation to improve the long-term user service rate of the platform. The dispatching strategy includes task generation algorithm, budget allocation algorithm and task allocation algorithm. In the task generation algorithm, predicted the user’s future bicycle demand based on LSTM. In the budget allocation algorithm, the sequential allocation of budgets for each time period is a sequential decision-making problem, and thus modeled it as a Markov decision process. At the same time, considering that the problem has a high-dimensional and continuous state space and a continuous action space, designed a budget allocation strategy based on the deep deterministic strategy gradient algorithm DDPG. In the task allocation algorithm, due to the budget constraint that makes it impossible to use the mainstream bipartite graph matching algorithm, used the greedy matching strategy for task allocation. Finally, we run experiments based on the Mobike dataset to evaluate our strategy against the dispatching strategy with unlimited budget, the dispatching strategy with greedy budget allocation, the dispatching strategy under truck hauling, and the situation without dispatching. The results show that our shared bicycle dispatching strategy with user participation can achieve the best results except for the dispatching strategy with unlimited budget. The experimental results can provide some useful insights for dispatching shared bikes.

Key words: Bike-Sharing Dispatching, Demand Prediction, User Incentive, Markov decision, Deep Reinforcement Learning

中图分类号:

TP181

石兵黄茜子宋兆翔徐建桥. 基于用户激励的共享单车调度策略研究（CCF Bigdata 2021+139）[J]. 计算机应用.

[1]	秦鑫彤, 宋政育, 侯天为, 王飞越, 孙昕, 黎伟. 基于自适应p持续的移动自组网信道接入和资源分配算法[J]. 《计算机应用》唯一官方网站, 2024, 44(3): 863-868.
[2]	邓辅秦, 官桧锋, 谭朝恩, 付兰慧, 王宏民, 林天麟, 张建民. 基于请求与应答通信机制和局部注意力机制的多机器人强化学习路径规划方法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 432-438.
[3]	李源潮, 陶重犇, 王琛. 基于最大熵深度强化学习的双足机器人步态控制方法[J]. 《计算机应用》唯一官方网站, 2024, 44(2): 445-451.
[4]	龙杰, 谢良, 徐海蛟. 集成的深度强化学习投资组合模型[J]. 《计算机应用》唯一官方网站, 2024, 44(1): 300-310.
[5]	郭茂祖, 张雅喆, 赵玲玲. 基于空间语义和个体活动的电动汽车充电站选址方法[J]. 《计算机应用》唯一官方网站, 2023, 43(9): 2819-2827.
[6]	王昱, 任田君, 范子琳. 基于引导Minimax-DDQN的无人机空战机动决策[J]. 《计算机应用》唯一官方网站, 2023, 43(8): 2636-2643.
[7]	王子腾, 于亚新, 夏子芳, 乔佳琪. 融合好奇心和策略蒸馏的稀疏奖励探索机制[J]. 《计算机应用》唯一官方网站, 2023, 43(7): 2082-2090.
[8]	魏远, 林彦, 郭晟楠, 林友芳, 万怀宇. 融合出发地与目的地时空相关性的城市区域间出租车需求预测[J]. 《计算机应用》唯一官方网站, 2023, 43(7): 2100-2106.
[9]	方和平, 刘曙光, 冉泳屹, 钟坤华. 基于深度强化学习的多数据中心一体化调度优化[J]. 《计算机应用》唯一官方网站, 2023, 43(6): 1884-1892.
[10]	李校林, 江雨桑. 无人机辅助移动边缘计算中的任务卸载算法[J]. 《计算机应用》唯一官方网站, 2023, 43(6): 1893-1899.
[11]	黄晓辉, 杨凯铭, 凌嘉壕. 基于共享注意力的多智能体强化学习订单派送[J]. 《计算机应用》唯一官方网站, 2023, 43(5): 1620-1624.
[12]	曹腾飞, 刘延亮, 王晓英. 基于改进深度强化学习的边缘计算服务卸载算法[J]. 《计算机应用》唯一官方网站, 2023, 43(5): 1543-1550.
[13]	丁正凯, 傅启明, 陈建平, 陆悠, 吴宏杰, 方能炜, 邢镔. 结合注意力机制与深度强化学习的超短期光伏功率预测[J]. 《计算机应用》唯一官方网站, 2023, 43(5): 1647-1654.
[14]	王哲, 王启名, 李陶深, 葛丽娜. 基于深度强化学习的SWIPT边缘网络联合优化方法[J]. 《计算机应用》唯一官方网站, 2023, 43(11): 3540-3550.
[15]	邓晖奕, 李勇振, 尹奇跃. 引入通信与探索的多智能体强化学习QMIX算法[J]. 《计算机应用》唯一官方网站, 2023, 43(1): 202-208.

基于用户激励的共享单车调度策略研究（CCF Bigdata 2021+139）

Users incentive bike-sharing dispatching: A reinforcement learning method（CCF Bigdata 2021+139）

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics