[1] SPIRIN N, HAN J. Survey on Web spam detection: principles and algorithms[J]. ACM SIGKDD Explorations Newsletter, 2012, 13(2):50-64.
[2] CHANDRA A, SUAIB M. A survey on Web spam and spam 2.0[J]. International Journal of Advanced Computer Research, 2014, 4(2):634-644.
[3] TAHIR M A, BOURIDANE A, KURUGOLLU F. Simultaneous feature selection and feature weighting using hybrid tabu search/K-nearest neighbor classifier[J]. Pattern Recognition Letters, 2007, 28(4):438-446.
[4] BONEV B, ESCOLANO F, CAZORLA M. Feature selection, mutual information, and the classification of high-dimensional patterns[J]. Pattern Analysis and Applications, 2008, 11(3/4):309-319.
[5] MOUSTAKIDIS S P, THEOCHARIS J B. A fast SVM-based wrapper feature selection method driven by a fuzzy complementary criterion[J]. Pattern Analysis and Applications, 2012, 15(4):379-397.
[6] LIN S, LEE Z, CHEN S, et al. Parameter determination of support vector machine and feature selection using simulated annealing approach[J]. Applied Soft Computing, 2008, 8(4):1505-1512.
[7] AHMED A. Feature subset selection using ant colony optimization[J]. International Journal of Computational Intelligence and Applications, 2005, 2(1):53-58.
[8] AHMAD F, ISA N A M, HUSSAIN Z, et al. A GA-based feature selection and parameter optimization of an ANN in diagnosing breast cancer[J]. Pattern Analysis and Applications, 2014, 18(4):861-870.
[9] MARINAKI M, MARINAKIS Y. A hybridization of clonal selection algorithm with iterated local search and variable neighborhood search for the feature selection problem[J]. Memetic Computing, 2015, 7(3):181-201.
[10] SAMADZADEGAN F, NAMIN S R, RAJABI M A. Evaluating the potential of clonal selection optimization algorithm to hyperspectral image feature selection[J]. Key Engineering Materials, 2012, 500(1):799-805.
[11] YEN S, LEE Y. Cluster-based under-sampling approaches for imbalanced data distributions[J]. Expert Systems with Applications, 2009, 36(3):5718-5727.
[12] SUN Y, KAMEL M S, WONG A K, et al. Cost-sensitive boosting for classification of imbalanced data[J]. Pattern Recognition, 2007, 40(12):3358-3378.
[13] HONG X, CHEN S, HARRIS C J. A kernel-based two-class classifier for imbalanced data sets[J]. IEEE Transactions on Neural Networks, 2007, 18(1):28-41.
[14] LU X Y, CHEN M S. Web spam detection based on random forest and under-sampling ensemble[J]. Journal of Computer Applications, 2016, 36(3):731-734. (in Chinese)
[15] FAWCETT T. An introduction to ROC analysis[J]. Pattern Recognition Letters, 2006, 27(8):861-874.
[16] DAVIS J, GOADRICH M. The relationship between precision-recall and ROC curves[C]//ICML 2006: Proceedings of the 23rd International Conference on Machine Learning. New York: ACM, 2006:233-240.
[17] DE CASTRO L N, VON ZUBEN F J. Learning and optimization using the clonal selection principle[J]. IEEE Transactions on Evolutionary Computation, 2002, 6(3):239-251.
[18] SCARSELLI F, TSOI A C, HAGENBUCHNER M, et al. Solving graph data issues using a layered architecture approach with applications to Web spam detection[J]. Neural Networks, 2013, 48:78-90.