[1] WANG H X, WU B, LIU Y. Parallel graph data analysis system based on Spark[J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(9): 1066-1074. (in Chinese)
[2] YAN W, BRAHMAKSHATRIYA U, XUE Y, et al. p-PIC: parallel power iteration clustering for big data[J]. Journal of Parallel & Distributed Computing, 2013, 73(3): 352-359.
[3] LIU L, CHEN X, LIU M, et al. An influence power-based clustering approach with PageRank-like model[J]. Applied Soft Computing, 2015, 40(11): 17-32.
[4] LIN N P, CHANG C I, CHUEH H E, et al. A deflected grid-based algorithm for clustering analysis[J]. WSEAS Transactions on Computers, 2008, 7(4): 125-132.
[5] MENENDEZ H D, CAMACHO D. GANY: a genetic spectral-based clustering algorithm for large data analysis[C]//Proceedings of the 2015 IEEE Congress on Evolutionary Computation. Piscataway, NJ: IEEE, 2015: 640-647.
[6] LIU L, SUN L, CHEN S, et al. K-PRSCAN: a clustering method based on PageRank[J]. Neurocomputing, 2016, 175(11): 65-80.
[7] LIN F, COHEN W W. Power iteration clustering[C]//Proceedings of the 27th International Conference on Machine Learning. Haifa: [s.n.], 2010: 655-662.
[8] DARJI A, WAGHELA D. Parallel power iteration clustering for big data using MapReduce in Hadoop[J]. International Journal of Advanced Research in Computer Science and Software Engineering, 2014, 4(6): 1357-1363.
[9] ZAHARIA M, CHOWDHURY M, FRANKLIN M J, et al. Spark: cluster computing with working sets[C]//HotCloud 2010: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing. Berkeley: USENIX, 2010: 1765-1773.
[10] GONZALEZ J E, XIN R S, DAVE A, et al. GraphX: graph processing in a distributed dataflow framework[C]//OSDI 2014: Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation. Berkeley: USENIX, 2014: 599-613.
[11] ZAHARIA M, CHOWDHURY M, DAS T, et al. Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing[C]//NSDI 2012: Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation. Berkeley: USENIX, 2012: 2.
[12] CHEN Q A, LI F, CAO Y, et al. Parameter optimization for Spark jobs based on runtime data analysis[J]. Computer Engineering and Science, 2016, 38(1): 11-19. (in Chinese)