GPU parallel particle swarm optimization algorithm based on adaptive warp

doi:10.11772/j.issn.1001-9081.2016.12.3274

Journal of Computer Applications ›› 2016, Vol. 36 ›› Issue (12): 3274-3279.DOI: 10.11772/j.issn.1001-9081.2016.12.3274

Previous Articles Next Articles

GPU parallel particle swarm optimization algorithm based on adaptive warp

ZHANG Shuo, HE Fazhi, ZHOU Yi, YAN Xiaohu

School of Computer, Wuhan University, Wuhan Hubei 430072, China

Received:2016-06-03 Revised:2016-07-06 Online:2016-12-08 Published:2016-12-10
Supported by:
This work is partially supported by the National Natural Science Foundation of China (61472289), the Natural Science Foundation of Hubei Province (2015CFB254).

基于自适应线程束的GPU并行粒子群优化算法

张硕, 何发智, 周毅, 鄢小虎

武汉大学计算机学院, 武汉 430072

通讯作者: 何发智
作者简介:张硕(1992-),男,湖北仙桃人,硕士研究生,主要研究方向:计算机图形学、GPU并行计算;何发智(1968-),男,湖北武汉人,教授,博士,CCF会员,主要研究方向:计算机支持的协同工作、计算机图形学、图像处理、并行计算;周毅(1983-),男,湖北汉川人,高级工程师,博士研究生,主要研究方向:GPU通用计算、智能优化算法;鄢小虎(1986-),男,湖北武汉人,高级工程师,博士研究生,CCF会员,主要研究方向为:软硬件协同设计、智能优化算法。
基金资助:
国家自然科学基金资助项目（61472289）；湖北省自然科学基金资助项目（2015CFB254）。

Abstract

Abstract: The parallel Particle Swarm Optimization (PSO) algorithm was improved through Graphics Processor Unit (GPU) based on Compute Unified Device Architecture (CUDA). According to the structural characteristics of the CUDA hardware system, it can be concluded that block is executed serially and the basic scheduled and executive unit of Streaming Multiprocessor (SM) is warp. GPU parallel PSO algorithm based on adaptive warp was carried out in order to make full use of thread parallelism in the block. The dimensions of particles were corresponded to the threads of particles. Each particle was corresponded to one or more warps in accordance with its self-dimension adaptively by using the warp level parallelism of GPU. One or more particles were corresponded to each block. Comparison with the existing coarse-grained parallel approach (corresponding each particle to the thread) and fine-grained parallel approach (corresponding each particle to the block) was made, and the experimental results show that the proposed parallel approach achieves CPU speed-up ratio of 40 more than two kinds of approaches mentioned above.

Key words: Particle Swarm Optimization (PSO) algorithm, parallel computing, Graphic Processing Unit (GPU), Compute Unified Device Architecture (CUDA), adaptive warp

摘要： 基于统一计算设备架构（CUDA）对图形处理器（GPU）下的并行粒子群优化（PSO）算法作改进研究。根据CUDA的硬件体系结构特点，可知Block是串行执行的，线程束（Warp）才是流多处理器（SM）调度和执行的基本单位。为了充分利用Block中线程的并行性，提出基于自适应线程束的GPU并行PSO算法：将粒子的维度和线程相对应；利用GPU的Warp级并行，根据维度的不同自适应地将每个粒子与一个或多个Warp相对应；自适应地将一个或多个粒子与每个Block相对应。与已有的粗粒度并行方法（将每个粒子和线程相对应）以及细粒度并行方法（将每个粒子和Block相对应）进行了对比分析，实验结果表明，所提出的并行方法相对前两种并行方法，CPU加速比最多提高了40。

关键词: 粒子群优化算法, 并行计算, 图形处理器, 统一计算设备架构, 自适应线程束

CLC Number:

TP301.6

ZHANG Shuo, HE Fazhi, ZHOU Yi, YAN Xiaohu. GPU parallel particle swarm optimization algorithm based on adaptive warp[J]. Journal of Computer Applications, 2016, 36(12): 3274-3279.

张硕, 何发智, 周毅, 鄢小虎. 基于自适应线程束的GPU并行粒子群优化算法[J]. 计算机应用, 2016, 36(12): 3274-3279.

References

[1] KENNEDY J, EBERHART R. Particle swarm optimization[C]//Proceedings of the 1995 IEEE International Conference on Neural Networks. Piscataway, NJ:IEEE, 1995, 4:1942-1948.
[2] POLI R, KENNEDY J, BLACKWELL T. Particle swarm optimization[J]. Swarm Intelligence, 2007, 1(1):33-57.
[3] 张庆科,杨波,王琳,等.基于GPU的现代并行优化算法[J].计算机科学,2012,39(4):304-310.(ZHANG Q K, YANG B, WANG L, et al. Research on parallel modern optimization algorithms using GPU[J]. Computer Science, 2012, 39(4):304-310.)
[4] 左颢睿,张启衡,徐勇,等.基于GPU的并行优化技术[J].计算机应用研究,2009,26(11):4115-4118.(ZUO H R, ZHANG Q H, XU Y, et al. Parallel optimize technology based on GPU[J]. Application Research of Computers, 2009, 26(11):4115-4118.)
[5] 吴恩华,柳有权.基于图形处理器(GPU)的通用计算[J].计算机辅助设计与图形学学报,2004,16(5):601-612.(WU E H, LIU Y Q. General purpose computation on GPU[J]. Journal of Computer-Aided Design & Computer Graphics, 2004, 16(5):601-612.)
[6] LUEBKE D, HUMPHREYS G. How GPUs work[J]. Computer, 2007, 40(2):96-100.
[7] NVIDIA Corporation. CUDA Programming Guide 7. 0[EB/OL].[2016-06-01]. http://www.nvidia.com.
[8] VERONESE L D P, KROHLING R. Swarm's flight:accelerating the particles using C-CUDA[C]//CEC'09:Proceedings of the Eleventh Conference on IEEE Congress on Evolutionary Computation. Piscataway, NJ:IEEE, 2009:3264-3270.
[9] CALAZAN R M, NEDJAH N, DE MACEDO M L. Parallel GPU-based implementation of high dimension particle swarm optimizations[C]//Proceedings of the 2013 IEEE Fourth Latin American Symposium on Circuits and Systems. Piscataway, NJ:IEEE, 2013:1-4.
[10] ZHOU Y, TAN Y. GPU-based parallel particle swarm optimization[C]//CEC'09:Proceedings of the 2009 IEEE Congress on Evolutionary Computation. Piscataway, NJ:IEEE, 2009:1493-1500.
[11] MUSSI L, DAOLIO F, CAGNONI S. Evaluation of parallel particle swarm optimization algorithms within the CUDA architecture[J]. Information Sciences, 2011, 181(20):4642-4657.
[12] 邹岩,杨志义,张凯龙.CUDA并行程序的内存访问优化技术研究[J].计算机测量与控制,2009,17(12):2504-2506.(ZOU Y, YANG Z Y, ZHANG K L. Study on optimization techniques for accesses of CUDA[J]. Computer Measurement & Control, 2009, 17(12):2504-2506.)
[13] 陈风,田雨波,杨敏.基于CUDA的并行粒子群优化算法研究及实现[J].计算机科学,2014,41(9):263-268.(CHEN F, TIAN Y B, YANG M. Research and design of parallel particle swarm optimization algorithm based on CUDA[J]. Computer Science, 2014, 41(9):263-268.)
[14] 张德军,何发智,吴亦奇.一种基于定向变异粒子群算法的异构CAD模型奇异特征互操作方法[J].中国科学:信息科学,2015,45(5):634-649.(ZHANG D J, HE F Z, WU Y Q. Singular feature interoperability of heterogeneous CAD model based on directed mutation particle swarm optimization[J]. SCIENCE CHINA Information Sciences, 2015, 45(5):634-649.)
[15] Particle Swarm Central. Standard PSO version 2006[EB/OL].[2016-06-01]. http://www.particleswarm.info/Standard_PSO_2006.c.
[16] 蔡勇,李光耀,王琥.基于CUDA的并行粒子群优化算法的设计与实现[J].计算机应用研究,2013,30(8):2415-2418.(CAI Y, LI G Y, WANG H. Research and implementation of parallel particle swarm optimization based on CUDA[J]. Application Research of Computers, 2013, 30(8):2415-2418.)
[17] OWENS J D, HOUSTON M, LUEBKE D, et al. GPU computing[J]. Proceedings of the IEEE, 2008, 96(5):879-899.
[18] 赵明超,陈智斌,文有为.基于GPU图像去噪总变分对偶模型的并行计算[J].计算机应用,2016,36(5):1228-1231.(ZHAO M C, CHEN Z B, WEN Y W. Parallel computation for image denoising via total variation dual model on GPU[J]. Journal of Computer Applications, 2016, 36(5):1228-1231.)
[19] DOLPHIN Project Team. Metaheuristics on GPU[EB/OL].[2016-06-01]. http://www.sintef.no/globalassets/project/collab/presentations/the-van-metaheuristics-gpu-sintef.pdf.
[20] ROCKI K, SUDA R. Accelerating 2-opt and 3-opt local search using GPU in the travelling salesman problem[C]//Proceedings of the 201212th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. Washington, DC:IEEE Computer Society, 2012:705-706.
[21] BASTOS-FILHO C J A, OLIVEIRA M A C, NASCIMENTO D N O, et al. Impact of the random number generator quality on particle swarm optimization algorithm running on graphic processor units[C]//Proceedings of the 201010th International Conference on High Performance Computing and Simulation. Piscataway, NJ:IEEE, 2010:85-90.
[22] SUSSMAN M, CRUTCHFIELD W, PAPAKIPOS M. Pseudorandom number generation on the GPU[C]//Proceedings of the 21st ACM SIGGRAPH/EUROGRAPHICS Symposium on Graphics Hardware. New York:ACM, 2006:87-94.

GPU parallel particle swarm optimization algorithm based on adaptive warp

基于自适应线程束的GPU并行粒子群优化算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics

[1]	Runlian ZHANG, Mi ZHANG, Xiaonian WU, Rui SHU. Differential property evaluation method based on GPU for large-state cryptographic S-boxes [J]. Journal of Computer Applications, 2024, 44(9): 2785-2790.
[2]	Peigen GAO, Bin SUO. Experimental design and staged PSO-Kriging modeling based on weighted hesitant fuzzy set [J]. Journal of Computer Applications, 2024, 44(7): 2144-2150.
[3]	Xiaoxin DU, Wei ZHOU, Hao WANG, Tianru HAO, Zhenfei WANG, Mei JIN, Jianfei ZHANG. Survey of subgroup optimization strategies for intelligent algorithms [J]. Journal of Computer Applications, 2024, 44(3): 819-830.
[4]	Zhihui GAO, Meng HAN, Shujuan LIU, Ang LI, Dongliang MU. Survey of high utility itemset mining methods based on intelligent optimization algorithm [J]. Journal of Computer Applications, 2023, 43(6): 1676-1686.
[5]	Jun LIANG, Zehong HONG, Songsen YU. Image segmentation model based on improved particle swarm optimization algorithm and genetic mutation [J]. Journal of Computer Applications, 2023, 43(6): 1743-1749.
[6]	Zhenhua YU, Zhengqi LIU, Ying LIU, Cheng GUO. Feature selection method based on self-adaptive hybrid particle swarm optimization for software defect prediction [J]. Journal of Computer Applications, 2023, 43(4): 1206-1213.
[7]	Feng XIANG, Zhongzhi LI, Xi XIONG, Binyong LI. Inverse distance weight interpolation algorithm based on particle swarm local optimization [J]. Journal of Computer Applications, 2023, 43(2): 385-390.
[8]	Xuesen MA, Xuemei XU, Gonghui JIANG, Yan QIAO, Tianbao ZHOU. Hybrid adaptive particle swarm optimization algorithm for workflow scheduling [J]. Journal of Computer Applications, 2023, 43(2): 474-483.
[9]	Chunfeng LIU, Zheng LI, Jufeng WANG. Multi-objective optimization of minicells in distributed factories [J]. Journal of Computer Applications, 2023, 43(12): 3824-3832.
[10]	Qian LIU, Yangming ZHANG, Dingsheng WAN. Parallel computing algorithm of grid-based distributed Xin’anjiang hydrological model [J]. Journal of Computer Applications, 2023, 43(11): 3327-3333.
[11]	JIANG Songyan, LIAO Xiaojuan, CHEN Guangzhu. Optimal task scheduling method based on satisfiability modulo theory for multiple processors with communication delay [J]. Journal of Computer Applications, 2023, 43(1): 185-191.
[12]	Jingwen CAI, Yongzhuang WEI, Zhenghong LIU. GPU-based method for evaluating algebraic properties of cryptographic S-boxes [J]. Journal of Computer Applications, 2022, 42(9): 2750-2756.
[13]	Bing GAO, Ya ZHENG, Jing QIN, Qijie ZOU, Zumin WANG. Network intrusion detection algorithm based on sparrow search algorithm and improved particle swarm optimization algorithm [J]. Journal of Computer Applications, 2022, 42(4): 1201-1206.
[14]	Fangxin NIE, Yujia WANG, Xin JIA. Teaching and learning information interactive particle swarm optimization algorithm [J]. Journal of Computer Applications, 2022, 42(3): 874-882.
[15]	Jing ZHANG, Aihong ZHU. Optimization method of automatic train operation speed curve based on genetic algorithm and particle swarm optimization [J]. Journal of Computer Applications, 2022, 42(2): 599-605.