EI / SCOPUS / CSCD 收录

中文核心期刊

GE Wanying, ZHANG Tianqi, FAN Congcong, ZHANG Tian. Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network[J]. ACTA ACUSTICA, 2021, 46(1): 55-66. DOI: 10.15949/j.cnki.0371-0025.2021.01.006
Citation: GE Wanying, ZHANG Tianqi, FAN Congcong, ZHANG Tian. Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network[J]. ACTA ACUSTICA, 2021, 46(1): 55-66. DOI: 10.15949/j.cnki.0371-0025.2021.01.006

Monaural noisy speech separation combining sparse non-negative matrix factorization and deep attractor network

  • The performance of monaural speech separation method is limited when the speech mixture is corrupted by background noise.To obtain the enhanced separated speeches from the noisy mixture,a monaural noisy speech separation method combining Sparse Non-negative Matrix Factorization(SNMF) and Deep Attractor Network(DANet)is proposed.This method firstly decomposes the noisy mixture into coefficients of speech and noise signal.Then the speech coefficient is projected to a high-dimensional embedding space and a DANet is trained to force the embeddings to move to different clusters.The attractor points are used to separate the speech coefficients by masking method,and finally the enhanced separated speeches are reconstructed by the speech basis and their corresponding coefficients.Experimental results in various background noise environments show that the proposed algorithm effectively suppress the noises without decreasing the speech quality of reconstructed speeches by comparison with different baseline methods.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return