[1]蒋明原,孔令德,宁静静.一种海量数据下的Lucene全文检索解决方案[J].电脑开发与应用,2011,24(4):32-35.[2]MOFFAT A, WEBBER W, ZOBEL J. Load balancing for term-distributed parallel retrieval [C]// SIGIR'06: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM Press, 2006: 348-355.[3]曹宇,尹刚,李翔,等.聚类搜索引擎研究进展浅析[J].电脑知识与技术,2011,7(22):5398-5400.[4]徐文海,温有奎.一种基于TFIDF方法的中文关键词抽取算法[J].情报理论与实践,2008,31(2):298-302.[5]OWEN S, ANIL R, DUNNING T, et al. Mahout in action [M]. Greenwich: Manning Publications, 2010: 123-137.[6]ESTEVES R M, PAIS R, RONG C. K-means clustering in the cloud—a Mahout test [C]// Proceedings of the 2011 IEEE Workshops of International Conference on Advanced Information Networking and Applications. Washington, DC: IEEE Computer Society, 2011:514-519.[7]ESTEVES R M, RONG C. Using Mahout for clustering Wikipedia's latest articles: a comparison between K-means and fuzzy C-means in the cloud [C]// Proceedings of the 2011 IEEE Third International Conference on Cloud Computing Technology and Science. Washington, DC: IEEE Computer Society, 2011: 565-569.[8]李应安.基于Map/Reduce的聚类算法的并行化研究[D].广州:中山大学,2010.[9]BUTLER M H, RUTHERFORD J. Distributed Lucene: a distributed free text index for Hadoop [EB/OL]. [2012-03-25]. http://www.hpl.hp.com/techreports/2008/HPL-2008-64.pdf.[10]SAJJA K. Performance study of Lucene in parallel and distributed environments [D]. Boise: Boise State University, 2011.[11]HATCHER E, GOSPODNETIC O, McCANDLESS M. Lucene in action [M]. Greenwich: Manning Publications, 2009.[12]王浩,姚长利,郭琳,等.基于中文搜索引擎网络信息用户行为研究[J].计算机应用研究,2009,26(12):4665-4668. |