Single-channel speech enhancement based on gender-related deep neural networks and non-negative matrix factorization models

LI Xu; WANG Ziteng; WANG Xiaofei; FU Qiang; YAN Yonghong

doi:10.15949/j.cnki.0371-0025.2019.02.009

LI Xu, WANG Ziteng, WANG Xiaofei, FU Qiang, YAN Yonghong. Single-channel speech enhancement based on gender-related deep neural networks and non-negative matrix factorization models[J]. ACTA ACUSTICA, 2019, 44(2): 221-230. DOI: 10.15949/j.cnki.0371-0025.2019.02.009

Abstract

To recover clean speech from a noisy signal, a single-channel speech enhancement algorithm based on gender-related models is proposed. In the training stage, Deep Neural Networks (DNN) and Non-negative Matrix Factorization (NMF) are combined to train two gender-related DNN-NMF models using gender-specific training data. In the test stage, an algorithm based on NMF with a group sparsity penalty identifies the gender of the speaker in the test signal. The corresponding DNN-NMF model is then used to estimate the activations for speech enhancement. Experimental results show that the proposed algorithm suppresses noise more effectively than other NMF-based and DNN-based methods without degrading speech quality.
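
The gender identification step can be illustrated with a minimal sketch. The abstract does not give the cost function, the penalty form, or the decision rule, so everything below is an assumption made for illustration: a KL-divergence NMF with fixed dictionaries, a log-of-L1 group sparsity penalty weighted by a hypothetical parameter `lam`, and a decision by largest total activation per group. The function name `identify_group` and its parameters are likewise hypothetical, and noise handling and the DNN stage are omitted.

```python
import numpy as np

def identify_group(V, dictionaries, n_iter=200, lam=0.1, eps=1e-9, seed=0):
    """Return the index of the dictionary whose activations dominate V.

    V            : nonnegative magnitude spectrogram, shape (freq, frames)
    dictionaries : list of nonnegative basis matrices, each (freq, k_g),
                   e.g. [W_male, W_female] (assumed to be pre-trained)
    """
    rng = np.random.default_rng(seed)
    W = np.hstack(dictionaries)                       # fixed, concatenated bases
    bounds = np.cumsum([0] + [D.shape[1] for D in dictionaries])
    groups = list(zip(bounds[:-1], bounds[1:]))       # row ranges of H per group
    H = rng.random((W.shape[1], V.shape[1])) + eps    # random positive init
    col_sums = W.T @ np.ones_like(V)                  # constant KL denominator term

    for _ in range(n_iter):
        # Gradient of lam * sum_g log(eps + ||H_g||_1) with respect to H:
        # constant within a group, large when the group is weakly active.
        penalty = np.empty_like(H)
        for s, e in groups:
            penalty[s:e] = lam / (eps + H[s:e].sum())
        WH = W @ H + eps
        # Multiplicative KL update with the penalty added to the denominator.
        H *= (W.T @ (V / WH)) / (col_sums + penalty)

    energies = [float(H[s:e].sum()) for s, e in groups]
    return int(np.argmax(energies)), energies
```

With `W_male` and `W_female` standing for pre-trained speech dictionaries and `V` for the magnitude spectrogram of the test signal, `identify_group(V, [W_male, W_female])` returns the index of the selected group together with the per-group activation totals. The penalty pushes the activations of the weaker group toward zero, so the decision reduces to comparing the two totals.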
