Reliable and secure service function chain deployment based on encoder-decoder structured reinforcement learning

doi:10.11772/j.issn.1001-9081.2024111677

Journal of Computer Applications

Received:2024-11-29 Revised:2025-03-29 Accepted:2025-03-31 Online:2025-04-08 Published:2025-04-08

基于编解码结构强化学习的安全可靠服务功能链部署

况翔¹,马震²,朱万春¹,张智¹,崔云飞¹

1. 贵阳信息科技学院信息工程学院
2. 贵阳信息科技学院智能工程学院

通讯作者: 马震
基金资助:
教育部产学合作协同育人项目;贵州省级“金课”

Abstract

Abstract: ToIn order to efficiently allocate limited network resources in cloud computing whileto ensuringe service quality, while and improving resource utilization and management efficiency, an encoder-decoder-based deep reinforcement learning algorithm (ED-DRL) was proposed for service function chain (SFC) deployment. this paper proposes a deep reinforcement learning algorithm based on an encoder-decoder structure (ED-DRL) for the deployment of Service Function Chains (SFCs). The SFC placement was formulated as a Markov decision process, and a reinforcement learning approach with a graph attention network (GAT) encoder and a gated recurrent unit (GRU) decoder was employed to effectively extract network topology features and inter-node dependencies.The algorithm first treats SFC placement as a Markov decision process, then employs a reinforcement learning method with a Graph Attention Network (GAT) encoder and a Gated Recurrent Unit (GRU) decoder structure to efficiently extract network topology features and inter-node dependencies. By integratingCombined with the Asynchronous Advantage Actor-Critic (A3C) method, the algorithm was capable of generates secure and reliable SFC placement strategies in dynamic environments.SFC placement strategies that are reliable and secure in dynamic environments. Simulation results demonstrateshow that the encoder-decoder structure reinforcement learning approachmethod, which considerings security and reliability, achieved an acceptance rate of 70.1% and an average reward of 0.0635, outperforming existing algorithmsoutperforms existing algorithms in terms of acceptance rate, average reward, and average running time.

Key words: Service Function Chain Deployment, Reinforcement Learning, Markov Decision Process, Asynchronous Advantage Actor-Critic, Graph Attention Network, Gated Recurrent Unit

摘要： 为了在云计算中高效分配有限网络资源以确保服务质量，同时提高资源利用率和管理效率，提出了一种基于编解码结构的深度强化学习算法(ED-DRL)用于服务功能链(SFC)部署。该算法首先将SFC放置看作一个马尔科夫决策过程，采用图注意力网络(GAT)编码器和门控循环单元(GRU)解码器结构的强化学习方法高效提取网络拓扑特征和节点间的依赖关系，结合异步优势Actor-Critic(A3C)方法，算法能在动态环境中生成安全可靠的SFC放置策略。仿真结果表明，考虑安全与可靠性的编解码结构强化学习方法能获得了70.7%的够在接受率与、0.0635的平均收益，与平均运行时间上优于现有算法。

关键词: 服务功能链部署, 强化学习, 马尔科夫决策过程, 异步优势Actor-Critic, 图注意力网络, 门控循环单元

CLC Number:

TP393.01

况翔马震朱万春张智崔云飞. 基于编解码结构强化学习的安全可靠服务功能链部署[J]. 《计算机应用》唯一官方网站, DOI: 10.11772/j.issn.1001-9081.2024111677.

[1]	Kaile YU, Jiajun LIAO, Jiali MAO, Xiaopeng HUANG. Multi-objective optimization of steel logistics vehicle-cargo matching under multiple constraints [J]. Journal of Computer Applications, 2025, 45(8): 2477-2483.
[2]	Jiaxin YAN, Yanping CHEN, Weizhe YANG, Ruizhang HUANG, Yongbin QIN. Heterogeneous graph attention network for relation extraction based on feature combination [J]. Journal of Computer Applications, 2025, 45(8): 2470-2476.
[3]	Shuo ZHANG, Guokai SUN, Yuan ZHUANG, Xiaoyu FENG, Jingzhi WANG. Dynamic detection method of eclipse attacks for blockchain node analysis [J]. Journal of Computer Applications, 2025, 45(8): 2428-2436.
[4]	Tianyu XUE, Aiping LI, Liguo DUAN. Vehicular edge computing scheme with task offloading and resource optimization [J]. Journal of Computer Applications, 2025, 45(6): 1766-1775.
[5]	Pengcheng XU, Lei HE, Chuan LI, Weiqi QIAN, Tun ZHAO. Deep symbolic regression method based on Transformer [J]. Journal of Computer Applications, 2025, 45(5): 1455-1463.
[6]	Jiaxin LI, Site MO. Power work order classification in substation area based on MiniRBT-LSTM-GAT and label smoothing [J]. Journal of Computer Applications, 2025, 45(4): 1356-1362.
[7]	Jing WANG, Xuming FANG. Intelligent joint power and channel allocation algorithm for Wi-Fi7 multi-link integrated communication and sensing [J]. Journal of Computer Applications, 2025, 45(2): 563-570.
[8]	Huahua WANG, Liang HUANG, Jiajie CHEN, Jiening FANG. Dynamic allocation algorithm for multi-beam subcarriers of low orbit satellites based on deep reinforcement learning [J]. Journal of Computer Applications, 2025, 45(2): 571-577.
[9]	Jianpeng HU, Lichen ZHANG. Deep spatio-temporal network model for multi-time step wind power prediction [J]. Journal of Computer Applications, 2025, 45(1): 98-105.
[10]	Liang ZHU, Jingzhe MU, Hongqiang ZUO, Jingzhong GU, Fubao ZHU. Location privacy-preserving recommendation scheme based on federated graph neural network [J]. Journal of Computer Applications, 2025, 45(1): 136-143.
[11]	Zijun MIAO, Fei LUO, Weichao DING, Wenbo DONG. Traffic signal control algorithm based on overall state prediction and fair experience replay [J]. Journal of Computer Applications, 2025, 45(1): 337-344.
[12]	Hang YANG, Wanggen LI, Gensheng ZHANG, Zhige WANG, Xin KAI. Multi-layer information interactive fusion algorithm based on graph neural network for session-based recommendation [J]. Journal of Computer Applications, 2024, 44(9): 2719-2725.
[13]	Guixiang XUE, Hui WANG, Weifeng ZHOU, Yu LIU, Yan LI. Port traffic flow prediction based on knowledge graph and spatio-temporal diffusion graph convolutional network [J]. Journal of Computer Applications, 2024, 44(9): 2952-2957.
[14]	Hailin XIAO, Tianyi HUANG, Qiuxiang DAI, Yuejun ZHANG, Zhongshan ZHANG. Safe reinforcement learning method for decision making of autonomous lane changing based on trajectory prediction [J]. Journal of Computer Applications, 2024, 44(9): 2958-2963.
[15]	Yi ZHOU, Hua GAO, Yongshen TIAN. Proximal policy optimization algorithm based on clipping optimization and policy guidance [J]. Journal of Computer Applications, 2024, 44(8): 2334-2341.

Reliable and secure service function chain deployment based on encoder-decoder structured reinforcement learning

基于编解码结构强化学习的安全可靠服务功能链部署

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics