[1] GRAY J, LIU D T, NIETO-SANTISTEBAN M, et al. Scientific data management in the coming decade[J]. ACM SIGMOD Record, 2005, 34(4): 34-41. [2] 崔杰,李陶深,兰红星.基于Hadoop的海量数据存储平台设计与开发[J].计算机研究与发展,2012,49(Suppl.):12-18. (CUI J, LI T S, LAN H X. Design and development of the mass data storage platform based on Hadoop[J]. Journal of Computer Research and Development, 2012, 49(Suppl.): 12-18.) [3] 贺瑶,王文庆,薛飞.基于云计算的海量数据挖掘研究[J].计算机技术与发展,2013,23(2):69-72. (HE Y,WANG W Q, XUE F. Study of massive data mining based on cloud computing[J].Computer Technology and Development, 2013, 23(2): 69-72.) [4] 余永红,向晓军,高阳,等.面向服务的云数据挖掘引擎的研究[J].计算机科学与探索,2012,6(1):46-57. (YU Y H, XIANG X J, GAO Y, et al. Research on service-oriented data mining engine based on cloud computing[J]. Journal of Frontiers of Computer Science and Technology, 2012, 6(1): 46-57.) [5] HAN J, KAMBER M, PEI J. Data mining: concepts and techniques[M]. 3rd edition. San Francisco, CA: Morgan Kaufmann, 2011: 89-98. [6] 陆戌辰,王梅,乐嘉锦.列存储中的OLAP多查询优化方法[J].计算机科学与探索,2012,6(9):852-864. (LU X C, WANG M, LE J J. Multi-query optimization strategy in column-based OLAP system[J]. Journal of Frontiers of Computer Science and Technology, 2012, 6(9): 852-864.) [7] 周国亮,王桂兰,朱永利.多核处理器上的并行联机分析处理算法研究[J].计算机科学与探索,2013, 7(2):180-190. (ZHOU G L, WANG G L, ZHU Y L. Parallel on-line analysis processing algorithms research on multi-core CPUs[J]. Journal of Frontiers of Computer Science and Technology, 2013, 7(2): 180-190.) [8] 奚建清,游进国,汤德佑,等.基于MapReduce的封闭立方体并行计算方法[J].华南理工大学学报(自然科学版),2009,37(1):91-95,112. (XI J Q, YOU J G, TANG D Y, et al. A parallel closed-cubing algorithm based on MapReduce[J]. Journal of South China University of Technology (Natural Science Edition), 2009, 37(1): 91-95, 112.) [9] 宋杰,郭朝鹏,王智,等.大数据分析的分布式MOLAP技术[J].软件学报,2014,25(4):731-752. (SONG J, GUO C P, WANG Z, et al. Distributed MOLAP technique for big data analysis[J]. Journal of Software, 2014, 25(4): 731-752.) [10] 张娟.基于Hadoop的商立方体研究与实现[D].上海:华东师范大学,2013:11-15. (ZHANG J. The research and implementation of quotient cube based on Hadoop [D]. Shanghai: East China Normal University, 2013: 11-15.) [11] 梁彦.基于分布式平台Spark和YARN的数据挖掘算法的并行化研究[D].广州:中山大学,2014:8-12. (LIANG Y. Research on parallelization of data mining algorithm based on distributed platforms Spark and YARN [D]. Guangzhou: Sun Yat-sen University, 2014: 8-12.) [12] 李成华,张新访,金海,等.MapReduce:新型的分布式并行计算编程模型[J].计算机工程与科学,2011,33(3):129-135. (LI C H, ZHANG X F, JIN H, et al. MapReduce: a new programming model for distributed parallel computing[J]. Computer Engineering & Science, 2011, 33(3): 129-135.) [13] KARAU H. Fast data processing with Spark: high-speed distributed computing made easy with Spark[M]. Bermingham, UK: Packt Publishing, 2013: 5-13. [14] DEAN J, GHEMAWAT S. MapReduce: simplified data processing on large clusters[J]. Communications of the ACM, 2008, 51(1): 107-113. [15] ZAHARIA M, CHOWDHURY M, FRANKLIN M J, et al. Spark: cluster computing with working sets[C]//HotCloud '10: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing. Berkeley, CA: USENIX Association, 2010: 10-10. |