[1]HU C, ZHANG J, WANG J, et al. Overview of technologies for irregular and out-of-core parallel computing [J]. Journal of Chinese Computer Systems, 2008, 29(11): 1969-1978. (胡长军,张纪林,王钰,等.非规则、核外并行计算研究综述[J].小型微型计算机系统,2008,29(11):1969-1978.)
[2]FERNER C S. Revisiting communication code generation algorithms for message-passing systems [J]. International Journal of Parallel, Emergent and Distributed Systems, 2006, 21(5): 323-344.
[3]BONDHUGULA U, HARTONO A, RAMANUJAM J, et al. A practical automatic polyhedral parallelizer and locality optimizer [C]// PLDI'08: Proceedings of the 2008 ACM SIGPLAN Conference on Programming Language Design and Implementation. New York: ACM, 2008: 101-113.
[4]BONDHUGULA U. Automatic distributed-memory parallelization and code generation using the polyhedral framework, IISc-CSA-TR-2011-3 [R]. Bangalore: Indian Institute of Science, 2011.
[5]GUO M, PAN Y, LIU Z. Symbolic communication set generation for irregular parallel applications [J]. The Journal of Supercomputing, 2003, 25(3): 199-214.
[6]HU C, LI J, WANG J, et al. Communication set generation for a special case of irregular parallel applications [J]. Chinese Journal of Computers, 2008, 31(1): 120-126. (胡长军,李静,王珏,等.一类非规则并行应用问题的通信集生成算法[J].计算机学报, 2008, 31(1): 120-126.)
[7]RAVISHANKAR M, EISENLOHR J, POUCHET L-N, et al. Code generation for parallel execution of a class of irregular loops on distributed memory systems [C]// SC'12: Proceedings of the 2012 International Conference for High Performance Computing, Networking, Storage, and Analysis. Los Alamitos: IEEE Computer Society, 2012: 1-11.
[8]STROUT M M, GEORGE G, OLSCHANOWSKY C. Set and relation manipulation for the sparse polyhedral framework [C]// LCPC 2012: Proceedings of the 25th International Workshop on Languages and Compilers for Parallel Computing, LNCS 7760. Berlin: Springer-Verlag, 2012: 61-75.
[9]LAMIELLE A, STROUT M M. Enabling code generation within the sparse polyhedral framework, CS-10-102 [R]. Fort Collins, CO: Colorado State University, 2010.
[10]BASUMALLIK A, EIGENMANN R. Optimizing irregular shared-memory applications for distributed-memory systems [C]// PPOPP'06: Proceedings of the 11th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. New York: ACM, 2006: 119-128.
[11]CAMPANONI S, JONES T, HOLLOWAY G, et al. HELIX: automatic parallelization of irregular programs for chip multiprocessing [C]// CGO'12: Proceedings of the 10th International Symposium on Code Generation and Optimization. New York: ACM, 2012: 84-93.
[12]KIM H, JOHNSON N P, LEE J W, et al. Automatic speculative DOALL for clusters [C]// CGO'12: Proceedings of the 10th International Symposium on Code Generation and Optimization. New York: ACM, 2012: 94-103.
[13]ZHUANG X, EICHENBERGER A E, LUO Y, et al. Exploiting parallelism with dependence-aware scheduling [C]// PACT'09: Proceedings of the 2009 International Conference on Parallel Architectures and Compilation Techniques. Washington, DC: IEEE Computer Society, 2009: 193-202.
[14]DING R, ZHAO R, HAN L. Automatic computation and data decomposition algorithm based on dominate value [J]. Computer Science, 2012, 39(3): 290-294. (丁锐,赵荣彩,韩林.基于主导值的计算和数据划分算法[J].计算机科学,2012,39(3):290-294.)
[15]DING R, ZHAO R, LIU X, et al. Partition method for automatic parallelization of irregular problems [J]. Journal of Information Engineering University, 2013, 14(2): 235-242. (丁锐,赵荣彩,刘晓娴,等.自动并行化中不规则问题的划分方法[J].信息工程大学学报,2013,14(2):235-242.)
[16]AMARASINGHE S P, LAM M S. Communication optimization and code generation for distributed memory machines [C]// PLDI'93: Proceedings of the ACM SIGPLAN 1993 Conference on Programming Language Design and Implementation. New York: ACM, 1993: 126-138.