[1] 韩家炜,坎伯M,裴健.数据挖掘:概念与技术[M].3版.范明,孟小峰,译.北京:机械工业出版社,2012:288.(HAN J W, KAMER M, PEI J. Data Mining:Concepts and Techniques[M]. 3rd ed. FAN M, MENG X F, translated. Beijing:China Machine Press, 2012:288.) [2] BERKHIN P. A survey of clustering data mining techniques[M]//KOGAN J, NICHOLAS C, TEBOULLE M. Grouping Multidimensional Data. Berlin:Springer, 2002:25-71. [3] AGGARWAL C C, REDDY C K. Data Clustering:Algorithms and Applications[M]. Boca Raton:Chapman and Hall/CRC, 2013:3-15. [4] HARTIGAN J A, WONG M A. Algorithm AS 136:a K-means clustering algorithm[J]. Journal of the Royal Statistical Society, 1979, 28(1):100-108. [5] JAIN A K. Data clustering:50 years beyond K-means[J]. Pattern Recognition Letters, 2010, 31(8):651-666. [6] XIONG H, WU J, CHEN J. K-means clustering versus validation measures:a data-distribution perspective[J]. IEEE Transactions on Systems, Man, and Cybernetics, Part B:Cybernetics, 2009, 39(2):318-331. [7] HE H, GARCIA E A. Learning from imbalanced data[J]. IEEE Transactions on Knowledge and Data Engineering, 2009, 21(9):1263-1284. [8] KUMAR N S, RAO K N, GOVARDHAN A, et al. Undersampled K-means approach for handling imbalanced distributed data[J]. Progress in Artificial Intelligence, 2014, 3(1):29-38. [9] KUMAR C N S, RAO K N, GOVARDHAN A. An empirical comparative study of novel clustering algorithms for class imbalance learning[C]//Proceedings of the 2nd International Conference on Computer and Communication Technologies, AISC 380. Berlin:Springer, 2016:181-191. [10] 刘云.不平衡数据的模糊聚类算法研究及在宏基因组重叠群分类中的应用[D].长春:吉林大学,2016:15-48.(LIU Y. Research of fuzzy clustering method on imbalanced dataset and its application in metagenomic contigs binning[D]. Changchun:Jilin University, 2016:15-48.) [11] LIANG J, BAI L, DANG C, et al. The K-means-type algorithms versus imbalanced data distributions[J]. IEEE Transactions on Fuzzy Systems, 2012, 20(4):728-745. [12] 程铃钫,杨天鹏,陈黎飞.不平衡数据的软子空间聚类算法[J].计算机应用,2017,37(10):2952-2957.(CHENG L F, YANG T P, CHEN L F. Soft subspace clustering algorithm for imbalanced data[J]. Journal of Computer Applications, 2017, 37(10):2952-2957.) [13] CHEN L, JIANG Q, WANG S. A probability model for projective clustering on high dimensional data[C]//ICDM 2008:Proceedings of the 8th IEEE International Conference on Data Mining. Washington, DC:IEEE Computer Society, 2008:755-760. [14] VIDAL R. Subspace clustering[J]. IEEE Signal Processing Magazine, 2011, 28(2):52-68. [15] XU L, JORDAN M I. On convergence properties of the EM algorithm for Gaussian mixtures[J]. Neural Computation, 1996, 8(1):129-151. [16] 李航.统计学习方法[M].北京:清华大学出版社,2012:162-165.(LI H. Statistical Learning Method[M]. Beijing:Tsinghua University Press, 2012:162-165.) [17] TASKAR B, SEGAL E, KOLLER D. Probabilistic classification and clustering in relational data[C]//IJCAI 2001:Proceedings of the 17th International Joint Conference on Artificial Intelligence. San Francisco, CA:Morgan Kaufmann, 2001, 2:870-876 [18] 朱杰,陈黎飞.类属数据的贝叶斯聚类算法[J].计算机应用,2017,37(4):1026-1031.(ZHU J, CHEN L F. Bayesian clustering algorithm for categorical data[J]. Journal of Computer Applications, 2017, 37(4):1026-1031.) [19] LI X, CHEN Z, YANG F. Exploring of clustering algorithm on class-imbalanced data[C]//Proceedings of the 8th International Conference on Computer Science and Education. Piscataway, NJ:IEEE, 2013:89-93. [20] STREHL A, GHOSH J. Cluster ensembles-a knowledge reuse framework for combining multiple partitions[J]. Journal of Machine Learning Research, 2003, 3(3):583-617. |