Dynamic spectrum access mechanism of multi-users based on restless multi-armed bandit model in cognitive networks

Journal of Computer Applications, 2014, Vol. 34, Issue (10): 2782-2786. DOI: 10.11772/j.issn.1001-9081.2014.10.2782

• Network and communications •

ZHU Jiang, HAN Chao, YANG Jielei, PENG Zhuxun

Chongqing Key Lab of Mobile Communications Technology (Chongqing University of Posts and Telecommunications), Chongqing 400065, China

Received: 2014-05-08 Revised: 2014-06-16 Online: 2014-10-01 Published: 2014-10-30
Contact: HAN Chao
Supported by:
the Key Project of Chinese Ministry of Education; the Doctoral Foundation of CQUPT

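About the authors: ZHU Jiang (朱江, born 1977), male, from Jingzhou, Hubei; associate professor, Ph.D.; research interests: mobile communications, cognitive radio;
HAN Chao (韩超, born 1989), male, from Tai'an, Shandong; master's student; research interest: cognitive radio;
YANG Jielei (杨浩磊, born 1990), male, from Xuchang, Henan; master's student; research interest: cognitive radio;
PENG Zhuxun (彭著勋, born 1989), male, from Rongchang, Chongqing; master's student; research interest: cognitive radio.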
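Funding:
Humanities and Social Sciences Research Project of the Ministry of Education; Key Project of Science and Technology Research of the Ministry of Education; Natural Science Foundation of the Chongqing Science and Technology Commission; Science and Technology Research Project of the Chongqing Municipal Education Commission; Doctoral Start-up Foundation of Chongqing University of Posts and Telecommunications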

Abstract

To coordinate multiple cognitive users opportunistically accessing multiple idle channels, a dynamic spectrum access mechanism based on the Restless Multi-Armed Bandit (RMAB) model was proposed. Firstly, because channel sensing by cognitive users is imperfect in practical networks, a Whittle index policy that handles sensing errors effectively was derived. Under this policy, each user maintained a belief value for every channel, accumulated from historical experience, and chose the channel to sense and access by weighing the immediate and future rewards under the current belief values. Secondly, a coordination mechanism based on multi-bid auction was used to resolve collisions when multiple secondary users selected the same channel, thereby improving spectrum utilization. The simulation results demonstrate that, in the same environment, cognitive users under the proposed mechanism achieve higher throughput than under mechanisms that do not handle sensing errors or do not use the multi-bid auction.
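
The two ideas in the abstract, a per-channel belief value maintained under imperfect sensing and a bid-based resolution of collisions, can be illustrated with a minimal Python sketch. It is not the authors' derivation: the abstract gives no formulas, so the sketch assumes a two-state (idle/busy) Markov channel, applies a generic Bayes update with a false-alarm and a miss-detection probability, and replaces the multi-bid auction with a simple greedy "highest bid first" rule. All names (`update_belief`, `assign_channels`, `p11`, `p01`, `false_alarm`, `miss`) are hypothetical, and the Whittle index itself is not computed here.

```python
from typing import Dict, List, Optional


def predict(belief: float, p11: float, p01: float) -> float:
    """One-step Markov prediction of P(channel idle).

    p11 = P(idle -> idle), p01 = P(busy -> idle); both are assumed parameters.
    """
    return belief * p11 + (1.0 - belief) * p01


def update_belief(belief: float, observation: Optional[bool],
                  p11: float, p01: float,
                  false_alarm: float, miss: float) -> float:
    """Bayes update of the idle-probability belief, then one-step prediction.

    observation: True = sensed idle, False = sensed busy, None = not sensed.
    false_alarm = P(sensed busy | idle), miss = P(sensed idle | busy).
    """
    if observation is None:
        posterior = belief
    else:
        if observation:
            num = belief * (1.0 - false_alarm)
            den = num + (1.0 - belief) * miss
        else:
            num = belief * false_alarm
            den = num + (1.0 - belief) * (1.0 - miss)
        # An observation with zero probability carries no information.
        posterior = num / den if den > 0.0 else belief
    return predict(posterior, p11, p01)


def assign_channels(bids: List[List[float]]) -> Dict[int, int]:
    """Greedy stand-in for an auction: bids[u][c] is user u's bid for channel c.

    The highest remaining bid wins; each user gets at most one channel and
    each channel at most one user, so no two users collide.
    """
    ranked = sorted(
        ((bid, user, chan)
         for user, row in enumerate(bids)
         for chan, bid in enumerate(row)),
        reverse=True,
    )
    assignment: Dict[int, int] = {}
    taken = set()
    for _bid, user, chan in ranked:
        if user not in assignment and chan not in taken:
            assignment[user] = chan
            taken.add(chan)
    return assignment
```

For example, with `p11 = 0.8`, `p01 = 0.2` and 10% false-alarm and miss rates, `update_belief(0.5, True, 0.8, 0.2, 0.1, 0.1)` raises the belief from 0.5 to approximately 0.74, and `assign_channels([[0.9, 0.4], [0.8, 0.3]])` gives user 0 channel 0 and user 1 channel 1 instead of letting both contend for channel 0.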


CLC Number: TN926

ZHU Jiang, HAN Chao, YANG Jielei, PENG Zhuxun. Dynamic spectrum access mechanism of multi-users based on restless multi-armed bandit model in cognitive networks[J]. Journal of Computer Applications, 2014, 34(10): 2782-2786.

朱江, 韩超, 杨浩磊, 彭著勋. 认知无线网络中基于无休止多臂赌博机模型的多用户频谱接入机制[J]. 计算机应用, 2014, 34(10): 2782-2786.

