Journal of Computer Applications, 2015, Vol. 35, Issue (1): 58-61. DOI: 10.11772/j.issn.1001-9081.2015.01.0058


Implementation and performance analysis of Knuth39 parallelization based on many integrated core platform

ZHANG Baodong1, ZHOU Jinyu2, LIU Xiao1,2, HUA Cheng2, ZHOU Xiaohui1,2   

  1. School of Computer Science and Technology, Xi'an University of Posts and Telecommunications, Xi'an Shaanxi 710121, China;
    2. Parallel Computing Laboratory, Shaanxi Research Center for High Performance Computing, Xi'an Shaanxi 710121, China
  • Received: 2014-08-06 Revised: 2014-09-23 Online: 2015-01-01 Published: 2015-01-26

基于超多核心平台的Knuth39并行化实现及性能分析

张保东1, 周津羽2, 刘逍1,2, 华诚2, 周晓辉1,2   

  1. 西安邮电大学 计算机学院, 西安710121;
    2. 陕西省高性能计算研究中心 并行计算实验室, 西安710121
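  • Corresponding author: ZHOU Xiaohui
  • About the authors: ZHANG Baodong (1990-), male, born in Jincheng, Shanxi, M.S. candidate; research interest: high performance computing. ZHOU Jinyu (1976-), male, born in Beijing, research fellow; research interests: high performance computing, big data. LIU Xiao (1988-), male, born in Wendeng, Shandong, M.S. candidate; research interests: high performance computing, natural language processing. HUA Cheng (1987-), male, born in Tongcheng, Anhui, engineer; research interests: financial computing, high performance computing. ZHOU Xiaohui (1978-), male, born in Jinan, Shandong, professor, Ph.D.; research interests: high performance computing, big data.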
  • Supported by:

    Natural Science Basic Research Plan of Shaanxi Province (2013JM8028).

Abstract:

To address the slow running speed of the Knuth39 random number generator, a parallelization method for Knuth39 based on the Many Integrated Core (MIC) platform is proposed. First, the random number sequence of the Knuth39 generator is divided into subsequences at a fixed interval. Then, each thread generates random numbers beginning at the starting point of its own subsequence. Finally, the sequences generated by all threads are combined into the final sequence. Experimental results show that the parallelized Knuth39 generator passes 452 tests of TestU01, the same result as the serial Knuth39 generator. Compared with a single thread on a Central Processing Unit (CPU), the best speedup on the MIC platform is 15.69 times. The proposed method effectively improves the running speed of the Knuth39 generator while preserving the randomness of the generated sequences, which makes it better suited to high performance computing.
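The three steps in the abstract (split the sequence at a fixed interval, let each thread generate from the start of its subsequence, then concatenate) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the Knuth39 recurrence or the MIC code, so a simple multiplicative congruential generator is used as a stand-in, and the names `serial_stream`, `block_start`, and `parallel_stream` are hypothetical. The stand-in is chosen only because its state after `k` steps can be computed directly, which gives each worker its starting point.

```python
from concurrent.futures import ThreadPoolExecutor

# Stand-in generator (assumption, NOT Knuth39): x_{n+1} = A * x_n mod M.
A = 48271
M = 2**31 - 1


def serial_stream(state, count):
    """Generate `count` numbers sequentially, starting from `state`."""
    out = []
    x = state
    for _ in range(count):
        x = (A * x) % M
        out.append(x)
    return out


def block_start(seed, offset):
    """State reached after `offset` steps from `seed` (jump-ahead).

    For this stand-in generator the state is A**offset * seed mod M.
    """
    return (pow(A, offset, M) * seed) % M


def parallel_stream(seed, workers, interval):
    """Split the sequence into `workers` blocks of length `interval`."""
    # Step 1: starting point of each subsequence.
    starts = [block_start(seed, i * interval) for i in range(workers)]
    # Step 2: each thread generates its own block from its starting point.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        blocks = list(pool.map(lambda s: serial_stream(s, interval), starts))
    # Step 3: concatenate the blocks in order to form the final sequence.
    return [x for block in blocks for x in block]
```

Under these assumptions, `parallel_stream(seed, workers, interval)` returns the same list as `serial_stream(seed, workers * interval)`, which mirrors the abstract's claim that the parallel and serial generators give identical test results. Python threads are used here only to show the structure; the sketch says nothing about the speedup measured on the MIC platform.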

Key words: random number generator, Knuth39, parallelization, Many Integrated Core (MIC), TestU01

摘要:

针对Knuth39随机数发生器运行速度慢的问题,提出了一种基于超多核心(MIC)平台的Knuth39并行化方法.首先,将Knuth39发生器的随机数序列以固定间隔划分成多个子序列;然后,每个线程从各子序列的起点开始生成随机数;最后,将各个线程生成的随机数序列组合成最终的序列.实验结果表明,并行化后Knuth39通过了TestU01的452项测试,与串行程序相同.同中央处理器(CPU)单线程相比,并行化后MIC平台下的最优加速比可达到15.69倍.所提方法有效地提高了Knuth39发生器的运行速度,并且保证了生成序列的随机性,更加适用于高性能计算领域.

关键词: 随机数发生器, Knuth39, 并行化, 超多核心, TestU01

CLC Number: