[1] PAGE L, BRIN S, MOTWANI R, et al. The PageRank citation ranking: bringing order to the Web [C]// Proceedings of the 7th International World Wide Web Conference. Brisbane: [s.n.], 1998: 161-172. [2] LANGVILLE A, MEYER C D. Google's PageRank and beyond: the science of search engine rankings [M]. Princeton: Princeton University Press, 2006: 1-2. [3] WHITE T. Hadoop: the definitive guide [M]. Sebastopol: O'Reilly Media, 2009: 103-108. [4] LIU P. Practice in Hadoop: open the gate of cloud computing [M]. Beijing: Publishing House of Electronics Industry, 2011: 89-91.(刘鹏.实战Hadoop:开启通向云计算的捷径[M].北京:电子工业出版社,2011:89-91.) [5] DEAN J, CHEMAWAT S. MapReduce: simplified data processing on large clusters [C]// OSDI 2004: Proceedings of the 6th Symposium on Operating System Design and Implementation. Berkeley: USENIX Association, 2004: 137-150. [6] CHEN G, NIU Z. Research on PageRank algorithm based on MapReduce [J]. Microelectronics and Computer, 2012, 29(5): 81-85.(陈宫,牛秦洲.基于MapReduce的PageRank算法的研究[J].微电子学与计算机,2012,29(5):81-85.) [7] LIN J, SCHATZ M. Design patterns for efficient graph algorithms in MapReduce [C]// MLG'10: Proceedings of the Eighth Workshop on Mining and Learning with Graphs. New York: ACM, 2010: 78-85. [8] ZHANG Y, YIN C, WU C. Research on PageRank algorithm optimization based on MapReduce [J]. Application Research of Computers, 2014, 31(2): 431-434.(张永,尹传晔,吴崇正.基于MapReduce的PageRank算法优化研究[J].计算机应用研究,2014,31(2):431-434.) [9] LIAO S, TAO Y, HE Z, et al. CGPR: an acceleration method for PageRank based on graph-clustering on MapReduce [J]. Journal of Chinese Computer Systems, 2012, 33(6): 1195-1201.(廖松博,陶岳,何震瀛,等.CGPR:一种在MapReduce平台上基于图划分的PageRank加速方法[J].小型微型计算机系统,2012,33(6):1195-1201.) [10] VISWANATHAN A. A guide to using LZO compression in Hadoop [J]. Linux Journal, 2012, 2012(220): Article No. 1. [11] ZHANG Y, SONG W, LIU T, et al. Query classification based on URL topic [J]. Application Research of Computers, 2012, 49(6): 1298-1305.(张宇,宋巍,刘挺,等.基于URL主题的查询分类方法[J].计算机研究与应用,2012,49(6):1298-1305.) [12] Wikipedia. Projects using Heritrix [EB/OL]. [2014-06-13]. http://crawler.archive.org/index.html. |