融合盲源分离和轻量化模型的双通道语音增强

Dual-channel speech enhancement combining blind source separation and lightweight model

摘要: 基于深度学习的语音增强方法在实际部署中常受限于计算资源的约束, 且在低信噪比条件下性能会显著弱化。为此, 将盲源分离算法同轻量化语音增强模型相结合, 提出了一种融合盲源分离和轻量化模型的双通道语音增强方法。其中, 盲源分离算法的输出用作语音增强模型的辅助信息, 轻量化模型则进一步优化语音质量。实验结果表明, 所提方法能够以极低的模型参数量和计算复杂度实现高质量语音增强。

Abstract: The practical deployment of deep learning-based speech enhancement methods is usually constrained by limited computational resources, and the performance significantly deteriorates under low signal-to-noise ratio conditions. This paper proposes lightweight hybrid dual-channel speech enhancement approaches that integrate blind source separation (BSS) algorithms with lightweight speech enhancement models. The outputs of BSS algorithms are used as auxiliary information for the speech enhancement model, while the model further refines the speech quality. Experimental results demonstrate that the proposed approaches can achieve high-quality speech enhancement with minimal model parameters and low computational complexity.