[1] KINNUNEN T, LI H. An overview of text-independent speaker recognition: From features to supervectors [J]. Speech Communication, 2010, 52(1): 12-40.
[2] REYNOLDS D, QUATIERI T, DUNN R. Speaker verification using adapted Gaussian mixture models [J]. Digital Signal Processing, 2000, 10(1/2/3): 19-41.
[3] CAMPBELL W, STURIM D, REYNOLDS D. Support vector machines using GMM supervectors for speaker verification [J]. IEEE Signal Processing Letters, 2006, 13(5): 308-311.
[4] CAMPBELL W, KARAM Z, STURIM D, et al. Speaker comparison with inner product discriminant functions [C]// Proceedings of the 23rd Annual Conference on Neural Information Processing Systems, Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 2009: 207-215.
[5] VAPNIK V. Statistical learning theory [M]. New York: Wiley, 1998.
[6] National Institute of Standards and Technology. NIST speaker recognition evaluation [EB/OL]. [2011-01-02]. http://www.itl.nist.gov/iad/mig/tests/spk/2006/index.html.
[7] YANG Xingjun, CHI Huisheng. Digital processing of speech signals [M]. Beijing: Publishing House of Electronics Industry, 1994. (in Chinese)
[8] DEMPSTER A, LAIRD N, RUBIN D. Maximum likelihood from incomplete data via the EM algorithm [J]. Journal of the Royal Statistical Society, 1977, 39(1): 1-38.
[9] GAUVAIN J L, LEE C-H. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains [J]. IEEE Transactions on Speech and Audio Processing, 1994, 2(2): 291-298.
[10] HOFMANN T, SCHÖLKOPF B, SMOLA A. Kernel methods in machine learning [J]. The Annals of Statistics, 2008, 36(3): 1171-1220.
[11] HERSHEY J, OLSEN P. Approximating the Kullback-Leibler divergence between Gaussian mixture models [C]// ICASSP 2007: IEEE International Conference on Acoustics, Speech and Signal Processing. Piscataway, NJ: IEEE Press, 2008: 317-320.
[12] CONWAY B. Functional analysis [M]. New York: Springer-Verlag, 1990.
[13] ZHANG Xianda. Matrix analysis and applications [M]. Beijing: Tsinghua University Press, 2005. (in Chinese)
[14] HE J, LIU L, PALM G. A discriminative training algorithm for VQ-based speaker identification [J]. IEEE Transactions on Speech and Audio Processing, 1999, 7(3): 353-356.
[15] ARONOWITZ H, BURSHTEIN D. Efficient speaker identification and retrieval [C]// INTERSPEECH 2005: Proceedings of the 9th European Conference on Speech Communication and Technology. Lisbon: [s.n.], 2005: 2433-2436.
[16] ARONOWITZ H, BURSHTEIN D, AMIR A. Speaker indexing in audio archives using Gaussian mixture scoring simulation [C]// MLMI 2004: Proceedings of the First Workshop on Machine Learning for Multimodal Interaction, LNCS 3361. Berlin: Springer-Verlag, 2004: 243-252.
[17] National Institute of Standards and Technology. NIST speaker recognition evaluation [EB/OL]. [2011-01-05]. http://www.itl.nist.gov/iad/mig/tests/spk/2008/index.html.
[18] SOLOMONOFF A, CAMPBELL W, BOARDMAN I. Advances in channel compensation for SVM speaker recognition [C]// ICASSP'05: Proceedings of 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing. Piscataway, NJ: IEEE Press, 2005: 629-632.
[19] XIANG B, CHAUDHARI U, NAVRATIL J, et al. Short-time Gaussianization for robust speaker verification [J]. Acoustics, Speech, and Signal Processing, 2002, 1(1): 681-684.
[20] BRÜMMER N. FoCal toolkit [EB/OL]. [2011-01-08]. http://niko.brummer.googlepages.com/focal/.