Noise power estimation based on constrained sequential Gaussian mixture model for speech enhancement

XU Chundong; ZHANG Zhen; ZHAN Ge; YING Dongwen; LI Junfeng; YAN Yonghong

doi:10.15949/j.cnki.0371-0025.2017.05.015

XU Chundong, ZHANG Zhen, ZHAN Ge, YING Dongwen, LI Junfeng, YAN Yonghong. Noise power estimation based on constrained sequential Gaussian mixture model for speech enhancement[J]. ACTA ACUSTICA, 2017, 42(5): 633-640. DOI: 10.15949/j.cnki.0371-0025.2017.05.015

Citation:

Noise power estimation based on constrained sequential Gaussian mixture model for speech enhancement

Graphical Abstract

Graphical Abstract

Abstract

Abstract

An approach to estimate the noise logarithmic power was presented based on maximal likelihood. The two-component Gaussian mixture model (GMM) is utilized to describe the distribution of logarithmic power of noisy speech, where one component denotes the speech ("speech+noise") power distribution and the other component denotes the non-speech power distribution. The mean of non-speech component is optimal estimate of noise power. An on-line method is presented to update the parameter set of GMM frame by frame. Due to long-term speech absence, the on- line updation may fail. An on-line minimum description length (MDL) is presented to determine the long-term speechabsence/presence, which enables the model work well under long-term speech absence. The performance of the proposedmethod is evaluated by speech enhancement. The experimental results confirm GMM algorithm outperforms the typicalmethod such as classic MS and IMCRA algorithm.

FullText(HTML)

References (0)

Cited By

Noise power estimation based on constrained sequential Gaussian mixture model for speech enhancement

Graphical Abstract

Abstract

Catalog

Export File

Citation

Format

Content