[1] ZHANG J F, XIA Y Q, YAO J M. A review towards microtext processing[J]. Journal of Chinese Information Processing, 2012, 26(4): 21-27.
[2] XU X L, QUE X R, CHENG S D. Information filtering and user modeling[J]. Computer Engineering and Applications, 2003, 39(9): 182-184.
[3] HE T, CAO X B, TAN H. An immune based algorithm for Chinese network short text clustering[J]. Acta Automatica Sinica, 2009, 35(7): 896-902.
[4] BLEI D M, NG A Y, JORDAN M I. Latent Dirichlet allocation[J]. Journal of Machine Learning Research, 2003, 3: 993-1022.
[5] WANG L, FENG S, XU W L, et al. A filtering approach for spam discrimination and content similarity double detection for microblog text stream[J]. Computer Applications and Software, 2012, 29(8): 25-29.
[6] GAO J B, MEI B. Research on microblog advertisement filtering model based on text content analysis[J]. Computer Engineering, 2014, 40(5): 17-20.
[7] FANG D H. Study and implementation for microblog's short text classification based on LDA[D]. Shenyang: Northeastern University, 2011: 23-28.
[8] DIAO Y F, YANG L, LIN H F. LDA-based opinion spam discovering[J]. Journal of Chinese Information Processing, 2011, 25(1): 41-47.
[9] XU T, OARD D W. Wikipedia-based topic clustering for microblogs[J]. Proceedings of the American Society for Information Science and Technology, 2011, 48(1): 1-10.
[10] LYU C Z, JI D H, WU F F. Short text classification based on expanding feature of LDA[J]. Computer Engineering and Applications, 2015, 51(4): 123-127.
[11] GRIFFITHS T L, STEYVERS M. Finding scientific topics[J]. Proceedings of the National Academy of Sciences of the United States of America, 2004, 101(S1): 5228-5235.
[12] LI W B, SUN L, ZHANG D K. Text classification based on labeled-LDA model[J]. Chinese Journal of Computers, 2008, 31(4): 620-627.
[13] ZHANG H P. NLPIR Chinese lexical analysis system[CP/OL]. [2015-07-17]. http://ictclas.nlpir.org/.
[14] SALTON G, WONG A, YANG C S. A vector space model for automatic indexing[J]. Communications of the ACM, 1975, 18(11): 613-620.
[15] SALTON G, YANG C S. On the specification of term values in automatic indexing[J]. Journal of Documentation, 1973, 29(4): 351-372.
[16] CAO J, XIA T, et al. A density-based method for adaptive LDA model selection[J]. Neurocomputing, 2009, 72(7/8/9): 1775-1781.
[17] CHANG C-C, LIN C-J. LIBSVM: a library for support vector machines[J]. ACM Transactions on Intelligent Systems and Technology, 2011, 2(3): Article No.27.