[1] Intel Corporation. Intel 64 and IA-32 architectures software developer's manual[EB/OL]. [2013-10-10].http://www.intel.com/Assets/PDF/manual/252046.pdf. [2] Intel Corportation. Legal disclaimer & optimization notice[EB/OL]. [2013-10-10].https://gcc.gnu.org/wiki/cauldron2014?action=AttachFile&do=get&target=Cauldron14_AVX-512_Vector_ISA_Kirill_Yukhin_20140711.pdf. [3] STEWART J. An investigation of SIMD instruction sets [D]. Melbourne: University of Ballarat, 2005. [4] ALLEN R, KENNEDY K. Optimizing compilers for modern architectures: a dependence-based approach[M]. San Francisco: Morgan Kaufmann, 2002. [5] LARSEN S, AMARASINGHE S. Exploiting superword level parallelism with multimedia instruction sets[C]// PLDI 2000: Proceedings of the ACM SIGPLAN 2000 Conference on Programming Language Design and Implementation. New York: ACM Press, 2000:145-156. [6] KIM T, HOSKOTE Y. Automatic generation of custom SIMD instructions for superword level parallelism[C]// Proceedings of the 2014 Design, Automation and Test in Europe Conference and Exhibition. Piscataway: IEEE Press, 2014:1-6. [7] LIU P, ZHAO R, GAO W, et al. A new algorithm to exploit superword level parallelism[C]// Proceedings of the 2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing. Piscataway: IEEE Press,2013: 521-527. [8] ZHAO B, ZHAO R, LI Y, et al. SLP exploitation method for type conversion statements[J]. Computer Science,2014, 41(11): 16-21.(赵博, 赵荣彩, 李雁冰, 等. 类型转换语句的 SLP 发掘方法[J]. 计算机科学, 2014, 41(11): 16-21.) [9] WEI S. Reaserch of SIMD vectorization algorithm and optimization[D]. Zhengzhou: Information Engineering University, 2012.(魏帅. 面向 SIMD 的向量化算法及重组技术研究[D]. 郑州:信息工程大学, 2012.) [10] PRIETO M, PINUEL L, CATTHOOR F, et al. Improving superword level parallelism support in modern compilers[C]// CODES+ISSS 2005: Proceedings of the Third IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis. Piscataway: IEEE Press, 2005: 303-308. [11] KUDRIAVTSEV A, KOGGE P. Generation of permutations for SIMD processors[C]// LCTES 2005: Proceedings of the 2005 ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools for Embedded Systems. New York: ACM Press, 2005:147-156. [12] LIU J, ZHANG Y, JANG O, et al. A compiler framework for extracting superword level parallelism[C]// PLDI 2012: Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation. New York: ACM Press, 2012: 347-358. |