Intelligent joint power and channel allocation algorithm for Wi-Fi7 multi-link integrated communication and sensing

Journal of Computer Applications, 2025, Vol. 45, Issue (2): 563-570. DOI: 10.11772/j.issn.1001-9081.2024020191

• Network and communications •

Jing WANG, Xuming FANG

School of Information and Communication Engineering, Southwest Jiaotong University, Chengdu, Sichuan 611756, China

Received: 2024-02-27 Revised: 2024-03-13 Accepted: 2024-03-14 Online: 2024-06-04 Published: 2025-02-10
Contact: Xuming FANG
About author: WANG Jing, born in 1998 in Suining, Sichuan, M. S. candidate. His research interests include Wi-Fi MAC resource management and reinforcement learning algorithms.
Supported by: National Natural Science Foundation of China (62071393)

Abstract

To address joint power and channel resource allocation for integrated communication and sensing in multi-link transmission on next-generation Wi-Fi7 devices, a multi-link multi-agent reinforcement learning algorithm based on QMIX (Q-learning Mixing Network) for Joint Power Control and channel allocation (JPCQMIX) was proposed, building on the special two-tier (upper and lower) Media Access Control (MAC) layer structure of the Multi-Link Device (MLD). In the algorithm, each lower-layer MAC, i.e., each link, was treated as an agent, and a mixing network was placed in the upper-layer MAC to process the local value functions of all lower-layer MACs, thereby achieving centralized training. After training, each lower-layer MAC entered distributed execution mode and interacted independently with its local environment to make power control and channel allocation decisions. Simulation results show that the proposed algorithm improves communication throughput by 20.51% and 29.10% compared with the Multi-Agent Deep Q Network (MADQN) algorithm and the traditional heuristic Particle Swarm Optimization (PSO) algorithm, respectively. The proposed algorithm is also more robust across different sensing accuracy thresholds and different minimum link Signal-to-Interference-plus-Noise Ratios (SINRs). JPCQMIX therefore effectively improves the system's communication throughput while satisfying the sensing accuracy requirement.

Key words: Wi-Fi7, multi-link, integrated communication and sensing, multi-agent, deep reinforcement learning
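
The centralized-training, distributed-execution structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the parameter layout, the single-layer hypernetworks, and the encoding of a joint (power level, channel) choice as one integer are all assumptions made for illustration. The sketch shows the core QMIX idea only: the upper-layer mixing network combines per-link Q-values with non-negative, state-dependent weights, so raising any link's local Q-value never lowers the mixed total, and each link can then act greedily on its own Q-values.

```python
import math
import random


def elu(x):
    """ELU activation, monotonically increasing in x."""
    return x if x > 0 else math.expm1(x)


def linear(state, weights):
    """Dot product of the global state with one hypernetwork weight column."""
    return sum(s * w for s, w in zip(state, weights))


def init_params(state_dim, n_agents, embed_dim, seed=0):
    """Random hypernetwork weights (hypothetical layout, for illustration)."""
    rng = random.Random(seed)

    def col():
        return [rng.gauss(0.0, 0.5) for _ in range(state_dim)]

    return {
        "w1": [[col() for _ in range(embed_dim)] for _ in range(n_agents)],
        "b1": [col() for _ in range(embed_dim)],
        "w2": [col() for _ in range(embed_dim)],
        "b2": col(),
    }


def mix_q_values(agent_qs, state, params):
    """Upper-layer MAC: mix per-link Q-values into one total value.

    abs() keeps every mixing weight non-negative, which gives the
    monotonicity property that QMIX relies on.
    """
    embed_dim = len(params["b1"])
    hidden = []
    for k in range(embed_dim):
        total = linear(state, params["b1"][k])
        for i, q in enumerate(agent_qs):
            total += abs(linear(state, params["w1"][i][k])) * q
        hidden.append(elu(total))
    q_tot = linear(state, params["b2"])
    for k in range(embed_dim):
        q_tot += abs(linear(state, params["w2"][k])) * hidden[k]
    return q_tot


def greedy_actions(local_qs):
    """Lower-layer MACs: each link picks the argmax of its own Q-values."""
    return [max(range(len(q)), key=q.__getitem__) for q in local_qs]


def decode_action(action, n_channels):
    """Assumed encoding: action index -> (power level index, channel index)."""
    return divmod(action, n_channels)
```

During training only `mix_q_values` needs the global state; after training, `greedy_actions` uses local Q-values alone, which matches the distributed execution mode described above.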

CLC Number: TN929

Jing WANG, Xuming FANG. Intelligent joint power and channel allocation algorithm for Wi-Fi7 multi-link integrated communication and sensing[J]. Journal of Computer Applications, 2025, 45(2): 563-570.

Figures/Tables: 9

Fig. 1 Multi-link integrated communication and sensing data transmission

Fig. 2 Interference diagram of nearby APs

Fig. 3 Network framework of JPCQMIX

Fig. 4 Simulation scenario

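Tab. 1 Main network structure parameters

Simulation parameter	Value
Training steps	150,000
Experience replay buffer size	256
Sampling batch size	32
Learning rate	0.0001
Discount factor	0.9
Target network update frequency	150
ε-greedy initial value	1
ε-greedy final value	0.005
ε-greedy total decay steps	50,000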
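
The three ε-greedy rows of Tab. 1 define an exploration schedule, which can be sketched as follows. The table gives only the initial value, the final value, and the total number of decay steps; the linear shape of the decay and the function names below are assumptions, not something the source states.

```python
import random

EPS_START = 1.0           # ε-greedy initial value (Tab. 1)
EPS_END = 0.005           # ε-greedy final value (Tab. 1)
EPS_DECAY_STEPS = 50_000  # ε-greedy total decay steps (Tab. 1)


def epsilon_at(step):
    """Exploration rate at a training step, assuming linear decay."""
    frac = min(max(step, 0) / EPS_DECAY_STEPS, 1.0)
    return EPS_START + frac * (EPS_END - EPS_START)


def select_action(q_values, step, rng=random):
    """Explore with probability epsilon, otherwise take the greedy action."""
    if rng.random() < epsilon_at(step):
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=q_values.__getitem__)
```

Under this assumption ε reaches its final value after 50,000 of the 150,000 training steps, so the last two thirds of training are almost entirely greedy.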

Fig. 5 Comparison of convergence performance under different learning rates

Fig. 6 Comparison of communication throughput variation of different algorithms

Fig. 7 Comparison of communication throughput variation under different sensing accuracy thresholds

Fig. 8 Comparison of communication throughput variation under different minimum SINRs
