Collaborative offloading strategy in internet of vehicles based on asynchronous deep reinforcement learning

Journal of Computer Applications, 2024, Vol. 44, Issue 5: 1501-1510. DOI: 10.11772/j.issn.1001-9081.2023050788

Special Issue: The 19th China Conference on Machine Learning (CCML 2023)

Xiaoyan ZHAO¹,², Wei HAN¹, Junna ZHANG¹,², Peiyan YUAN¹

¹ College of Computer and Information Engineering, Henan Normal University, Xinxiang, Henan 453007, China
² Henan Engineering Lab of Intelligence Business & Internet of Things (Henan Normal University), Xinxiang, Henan 453007, China

Received: 2023-06-20; Revised: 2023-07-14; Accepted: 2023-07-24; Online: 2023-08-03; Published: 2024-05-10
Contact: Junna ZHANG
About the authors: ZHAO Xiaoyan, born in 1981, Ph.D., associate professor. Her research interests include edge computing and D2D communication.
HAN Wei, born in 1995, M.S. candidate. His research interests include edge computing.
YUAN Peiyan, born in 1978, Ph.D., professor. His research interests include edge computing and crowd sensing.
Supported by: National Natural Science Foundation of China (62072159); Science and Technology Research Project of Henan Province (222102210011)

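Author biographies: ZHAO Xiaoyan (born 1981), female, from Xuchang, Henan; associate professor, Ph.D., CCF member. Research interests: edge computing, D2D communication.
HAN Wei (born 1995), male, from Shangqiu, Henan; M.S. candidate, CCF member. Research interests: edge computing.
YUAN Peiyan (born 1978), male, from Dengzhou, Henan; professor, Ph.D., CCF member. Research interests: edge computing, crowd sensing.
First contact: ZHANG Junna (born 1979), female, from Fugou, Henan; associate professor, Ph.D., CCF member. Research interests: edge computing, service computing.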

Abstract

With the rapid development of the Internet of Vehicles (IoV), smart connected vehicles generate large numbers of latency-sensitive and computation-intensive tasks, and neither the limited on-board computing resources nor the traditional cloud service model can meet the needs of in-vehicle users. Mobile Edge Computing (MEC) provides an effective paradigm for offloading tasks that involve massive data. However, in multi-task, multi-user scenarios, task offloading in IoV is highly complex because vehicle locations, task types and vehicle density change dynamically in real time, and the offloading process is prone to unbalanced edge resource allocation, excessive communication overhead and slow algorithm convergence. To address these problems, the study focused on a cooperative task offloading strategy across multiple edge servers for multi-task, multi-user mobile scenarios in IoV. First, a three-layer heterogeneous network model for multi-edge collaborative processing was proposed, and dynamic collaborative clusters were introduced to cope with the changing IoV environment, transforming the offloading problem into a joint optimization problem of delay and energy consumption. Then, the problem was divided into two subproblems, offloading decision and resource allocation; the resource allocation subproblem was further split into edge-server resource allocation and transmission-bandwidth allocation, both of which were solved using convex optimization theory. To find the optimal offloading decision set, a Multi-edge Collaborative Deep Deterministic Policy Gradient (MC-DDPG) algorithm that can handle continuous problems within collaborative clusters was proposed, and an Asynchronous MC-DDPG (AMC-DDPG) algorithm was designed on this basis. In AMC-DDPG, the training parameters of the collaborative clusters were uploaded asynchronously to the cloud for a global update, and the updated results were then returned to each collaborative cluster to improve convergence speed. Simulation results show that the AMC-DDPG algorithm improves convergence speed by at least 30% over the DDPG algorithm and achieves better results in terms of reward and total cost.
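The asynchronous update described in the abstract (clusters upload parameters to the cloud, the cloud performs a global update, and the result is returned to the cluster) can be illustrated with a small sketch. The abstract does not give the actual update rule, so everything below is an assumption made for illustration: the class names `CloudAggregator` and `ClusterLearner`, the blending weight `mix`, the upload periods, and the toy "training" step that simply moves parameters toward a fixed target instead of running actor-critic updates.

```python
from dataclasses import dataclass


@dataclass
class CloudAggregator:
    """Hypothetical cloud-side holder of the global parameters."""
    global_params: list
    mix: float = 0.5   # assumed blending weight, not a value from the paper
    version: int = 0   # number of uploads merged so far

    def push(self, local_params):
        """Blend one cluster's upload into the global parameters and return a copy."""
        self.global_params = [
            (1.0 - self.mix) * g + self.mix * p
            for g, p in zip(self.global_params, local_params)
        ]
        self.version += 1
        return list(self.global_params)


@dataclass
class ClusterLearner:
    """Hypothetical stand-in for the learner of one collaborative cluster."""
    params: list
    target: list       # toy optimum that replaces real actor-critic training
    period: int        # the cluster uploads every `period` ticks
    lr: float = 0.2

    def local_step(self):
        # Placeholder for a local training step: move toward the toy optimum.
        self.params = [p + self.lr * (t - p) for p, t in zip(self.params, self.target)]

    def sync(self, cloud):
        # Upload local parameters, then adopt the refreshed global parameters.
        self.params = cloud.push(self.params)


def run(ticks=300):
    cloud = CloudAggregator(global_params=[0.0, 0.0])
    clusters = [
        ClusterLearner(params=[0.0, 0.0], target=[1.0, -1.0], period=p)
        for p in (2, 3, 5)  # different paces: no cluster waits for the others
    ]
    for tick in range(1, ticks + 1):
        for cluster in clusters:
            if tick % cluster.period == 0:
                cluster.local_step()
                cluster.sync(cloud)
    return cloud


if __name__ == "__main__":
    result = run()
    print(result.version, result.global_params)
```

The point of the sketch is only the communication pattern: each cluster synchronizes on its own schedule, with no barrier that waits for every cluster, and the global parameters still settle near the shared optimum in this toy setting. It says nothing about the convergence gains reported for AMC-DDPG.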

Key words: Internet of Vehicles (IoV), Mobile Edge Computing (MEC), task offloading, collaboration, Deep Reinforcement Learning (DRL)

CLC Number: TP393.0

Xiaoyan ZHAO, Wei HAN, Junna ZHANG, Peiyan YUAN. Collaborative offloading strategy in internet of vehicles based on asynchronous deep reinforcement learning[J]. Journal of Computer Applications, 2024, 44(5): 1501-1510.

Figures/Tables 7

Fig. 1 Structure of edge computing system model

Fig. 2 Schematic diagram of MC-DDPG algorithm

Fig. 3 Actual network deployment of AMC-DDPG algorithm

Fig. 4 Rewards under different weight proportions

Fig. 5 Comparison of iterative training performance of algorithms with different number of users

Fig. 6 Comparison of iterative training performance of algorithms with different number of edge servers

Fig. 7 Comparison of total cost and total delay among different algorithms


