[1] WANG Y, JIN X, CHENG X. Network big data: present and future [J]. Chinese Journal of Computers, 2013, 36(6): 1125-1138.(王元卓,靳小龙,程学旗.网络大数据:现状与展望[J].计算机学报,2013,36(6):1125-1138.) [2] HUANG Y, SUN X, LIU Z, et al. The microblog retweeting prediction evaluation system and performance comparation [J]. Journal of Harbin University of Science and Technology, 2013, 18(4): 52-57.(黄英来,孙晓芳,刘镇波,等.微博转发预测算法评测系统的建立及性能比较[J].哈尔滨理工大学学报,2013,18(4):52-57.) [3] SUH B, HONG L C, PIROLLI P, et al. Want to be retweeted? Large scale analytics on factors impacting retweet in twitter network [C]// Proceedings of the 2010 IEEE International Conference on Social Computing. Piscataway: IEEE, 2010: 177-184. [4] XU Z, YANG Q. Analyzing user retweet behavior on twitter [C]// Proceedings of 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. Piscataway: IEEE, 2012: 46-50. [5] ROMERO D M, MEEDER B, KLEINBERG J. Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter [C]// Proceedings of the 20th International Conference on World Wide Web. New York: ACM, 2011: 695-740. [6] WENG J, LIM E P, JIANG J. TwitterRank: finding topic-sensitive influential twitterers [C]// Proceedings of the 3rd ACM International Conference on Web Search and Data Mining. New York: ACM, 2010: 261-270. [7] WELCH M J, SCHONFELD U, HE D. Topical semantics of twitter links [C]// Proceedings of the 4th ACM International Conference on Web Search and Data Mining. New York: ACM, 2011: 327-336. [8] MORCHID M, DUFOUR R, LINARES G, et al. Feature selection using principal component analysis for massive retweet detection [J]. Pattern Recognition Letters, 2014, 49(11): 33-39. [9] PENG H, ZHU J, PIAO D Z, et al. Retweet modeling using condi-tional random fields [C]// ICDMW'11: Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops. Washington, DC: IEEE Computer Society, 2011: 336-343. [10] ZHANG Y, LU R, YANG Q. Predicting retweeting in microblogs [J]. Journal of Chinese Information Processing, 2012, 26(4): 109-114.(张旸,路荣,杨青.微博客中转发行为的预测研究[J].中文信息学报,2012,26(4):109-114.) [11] LI Y, YU H, LIU L. Predict algorithm of micro-blog retweet scale based on SVM [J]. Application Research of Computers, 2013, 30(9): 2594-2597.(李英乐,于洪涛,刘力雄.基于SVM的微博转发规模预测方法[J].计算机应用研究,2013,30(9):2594-2597.) [12] XIE J, LIU G, SU B, et al. Prediction of user's retweet behavior in social network [J]. Journal of Shanghai Jiaotong University, 2013, 47(4): 584-588.(谢婧,刘功申,苏波,等.社交网络中的用户转发行为预测[J].上海交通大学学报,2013,47(4):584-588.) [13] LUO Z, WU X, CAI W, et al. Examining multi-factor interactions in microblogging based on log-linear modeling [C]// Proceedings of the 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. New York: ACM, 2012: 189-193. [14] LUO Z, CAI W, CHEN T. Microblogging retweet prediction algorithm based on random forest [J]. Computer Science, 2014, 41(4): 62-64.(罗知林,蔡皖东,陈挺.一种基于随机森林的微博转发预测算法[J].计算机科学,2014,41(4):62-64.) [15] FANG X, ZHANG H, GAO S. Web spam detection based on SMOTE and random forests [J]. Journal of Shandong University: Engineering Science, 2013, 43(1): 22-26.(房晓南,张化祥,高爽.基于SMOTE和随机森林的Web spam检测[J].山东大学学报:工学版,2013,43(1):22-26.) [16] YU H, GAO S, ZHAO J, et al. Classification for imbalanced microarray data based on oversampling technology and random forest [J]. Computer Science, 2012, 39(5): 190-194.(于化龙,高尚,赵靖,等.基于过采样技术和随机森林的不平衡微阵列数据分类方法研究[J].计算机科学,2012,39(5):190-194.) [17] LIAN J, ZHOU X, CAO W, et al. SINA microblog data retrieval [J]. Journal of Tsinghua University, 2011, 51(10): 1300-1305.(廉捷,周欣,曹伟,等.新浪微博数据挖掘方案[J].清华大学学报:自然科学版,2011,51(10):1300-1305.) [18] BREIMAN L. Random forests [J]. Machine Learning, 2001, 45(1): 5-32. [19] BREIMAN L. Bagging predictors [J]. Machine Learning, 1996, 24(2): 123-140. |