[1] LI Y, LU Y, ZHANG F, et al. Protein classification using family profiles[C]//FSKD 2010:Proceedings of the 7th International Conference on Fuzzy Systems and Knowledge Discovery. New York:ACM, 2010:2212-2216.
[2] YANG Y. Research on biological sequence classification based on machine learning methods[D]. Shanghai:Shanghai Jiao Tong University, 2009:1-2. (in Chinese)
[3] LARRAÑAGA P, CALVO B, SANTANA R, et al. Machine learning in bioinformatics[J]. Briefings in Bioinformatics, 2006, 7(1):86-112.
[4] PEARSON W R, LIPMAN D J. Improved tools for biological sequence comparison[J]. Proceedings of the National Academy of Sciences of the United States of America, 1988, 85(8):2444-2448.
[5] ALTSCHUL S F, GISH W, MILLER W, et al. Basic local alignment search tool[J]. Journal of Molecular Biology, 1990, 215(3):403-410.
[6] DAO F-Y, YANG H, SU Z-D, et al. Recent advances in conotoxin classification by using machine learning methods[J]. Molecules, 2017, 22(7):1057.
[7] WEI L, XING P, SHI G, et al. Fast prediction of protein methylation sites using a sequence-based feature selection technique[J]. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2017, PP(99):1.
[8] LIU B, XU J, FAN S, et al. PseDNA-Pro:DNA-binding protein identification by combining Chou's PseAAC and physicochemical distance transformation[J]. Molecular Informatics, 2015, 34(1):8-17.
[9] XIONG Y, CHEN Y, ZHU Y Y. ProFaM:an efficient algorithm for protein sequence family mining[J]. Journal of Computer Research and Development, 2007, 44(7):1160-1168. (in Chinese)
[10] ZHOU C, CULE B, GOETHALS B. Pattern based sequence classification[J]. IEEE Transactions on Knowledge and Data Engineering, 2016, 28(5):1285-1298.
[11] MULDER N J, KERSEY P, PRUESS M, et al. In silico characterization of proteins:UniProt, InterPro and Integr8[J]. Molecular Biotechnology, 2008, 38(2):165-177.
[12] ZHANG M, KAO B, CHEUNG D W, et al. Mining periodic patterns with gap requirement from sequences[C]//SIGMOD'05:Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data. New York:ACM, 2005:623-633.
[13] WANG X, DUAN L, DONG G, et al. Efficient mining of density-aware distinguishing sequential patterns with gap constraints[C]//DASFAA 2014:Proceedings of the 2014 International Conference on Database Systems for Advanced Applications, LNCS 8421. Cham:Springer, 2014:372-387.
[14] TSENG V S, LEE C-H. Effective temporal data classification by integrating sequential pattern mining and probabilistic induction[J]. Expert Systems with Applications, 2009, 36(5):9524-9532.
[15] LIU B, HSU W, MA Y. Integrating classification and association rule mining[C]//KDD'98:Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining. Menlo Park, CA:AAAI Press, 1998:80-86.
[16] LIN H, CHEN W. Prediction of thermophilic proteins using feature selection technique[J]. Journal of Microbiological Methods, 2010, 84(1):67-70.
[17] TANG H, ZOU P, ZHANG C M, et al. Identification of apolipoprotein using feature selection technique[J]. Scientific Reports, 2016, 6:30441.
[18] HALL M, FRANK E, HOLMES G, et al. The WEKA data mining software:an update[J]. ACM SIGKDD Explorations, 2009, 11(1):10-18.