计算机应用 ›› 2018, Vol. 38 ›› Issue (4): 1176-1180.DOI: 10.11772/j.issn.1001-9081.2017092316

• 虚拟现实与多媒体计算 • 上一篇    下一篇

稀疏正则非负矩阵分解的语音增强算法

蒋茂松, 王冬霞, 牛芳琳, 曹玉东   

  1. 辽宁工业大学 电子与信息工程学院, 辽宁 锦州 121001
  • 收稿日期:2017-09-26 修回日期:2017-10-27 出版日期:2018-04-10 发布日期:2018-04-09
  • 通讯作者: 王冬霞
  • 作者简介:蒋茂松(1989-),男,安徽六安人,硕士研究生,主要研究方向:现代信号处理、多媒体;王冬霞(1975-),女,辽宁锦州人,教授,博士,主要研究方向:阵列、语音处理与通信;牛芳琳(1971-),女,辽宁锦州人,副教授,博士,主要研究方向:信息论、信道编码、数字喷泉码;曹玉东(1975-),男,辽宁锦州人,副教授,博士,主要研究方向:图像识别、图像理解。
  • 基金资助:
    辽宁省科学事业公益研究基金资助项目(20170056)。

Speech enhancement method based on sparsity-regularized non-negative matrix factorization

JIANG Maosong, WANG Dongxia, NIU Fanglin, CAO Yudong   

  1. College of Electronic and Information Engineering, Liaoning University of Technology, Jinzhou Liaoning 121001, China
  • Received:2017-09-26 Revised:2017-10-27 Online:2018-04-10 Published:2018-04-09
  • Supported by:
    This work is partially supported by the Scientific Public Welfare Research Foundation of Liaoning Province (20170056).

摘要: 对于非负矩阵分解的语音增强算法在不同环境噪声的鲁棒性问题,提出一种稀疏正则非负矩阵分解(SRNMF)的语音增强算法。该算法不仅考虑到数据处理时的噪声影响,而且对系数矩阵进行了稀疏约束,使其分解出的数据具有较好的语音特征。该算法首先在对语音和噪声的幅度谱先验字典矩阵学习的基础上,构建联合字典矩阵,然后更新带噪语音幅度谱在联合字典矩阵下的系数矩阵,最后重构原始纯净语音,实现语音增强。实验结果表明,在非平稳噪声和低信噪比(小于0 dB)条件下,该算法较好地削弱了噪声的变化对算法性能的影响,不仅有较高的信源失真率(SDR),提高了1~1.5个数量级,而且运算速度也有一定程度的提高,使得基于非负矩阵分解的语音增强算法更实用。

关键词: 非负矩阵分解, 语音增强, 稀疏正则, 鲁棒性, 联合字典

Abstract: In order to improve the robustness of Non-negative Matrix Factorization (NMF) algorithm for speech enhancement in different background noises, a speech enhancement algorithm based on Sparsity-regularized Robust NMF (SRNMF) was proposed, which takes into account the noise effect of data processing, and makes sparse constraints on the coefficient matrix to get better speech characteristics of the decomposed data. First, the prior dictionary of the amplitude spectrum of speech and noise were learned and the joint dictionary matrix of speech and noise were constructed. Then, the SRNMF algorithm was used to update the coefficient matrix of the amplitude spectrum with noise in the joint dictionary matrix. Finally, the original pure speech was reconstructed, and enhanced. The speech enhancement performance of the SRNMF algorithm in different environmental noise was analyzed through simulation experiments. Experimental results show that the proposed algorithm can effectively weaken the influence of noise changes on performance under non-stationary environments and low Signal-to-Noise Ratio (SNR) (<0 dB), it not only has about 1-1.5 magnitudes improvement in Source-to-Distortion Ratio (SDR) scores, but also is faster than other algorithms, which makes the NMF-based speech enhancement algorithm more practical.

Key words: Non-negative Matrix Factorization (NMF), speech enhancement, sparsity-regularization, robustness, joint dictionary

中图分类号: