EI / SCOPUS / CSCD 收录

中文核心期刊

语音识别中多种特征信息综合利用的方法

Methods for combining the information of various features in speech recognition

  • 摘要: 在基于特征的语音识别研究中,往往会发现其中有些特征的识别性能对一些音比另一些音更好,而另一些特征却与此相反。它们在一些音的识别特性上存在着一定程度的互补。本文基于目前话音识别研究主要方法之一的HMMM识别方法,提出了三种有效综合利用这种互补关系提高HMM识别性能的方法。作者*分别称它们为顶尖参数法,全部参数法和最可靠参数法。这三种方法在多发音人汉语数字的DHMM/VQ语音识别中,分别将识别率由89%提高到了92.3%、95.7%、94.3%。本文将详细介绍这三种方法,及其在多发育人汉语数字的DHMM/VQ语音识别中试验结果极及其分析。

     

    Abstract: In studies of speech recognition based on features, it is often discovered that the recognition performance of some features for some utterances is better than for the others while the other features have opposite effects. They have, to some extent, the complementary relation in the recognition of some utterances. Based on HMM which is now widely used in speech recognition studies, three efficient methods for combining multiple complementary features to improve the recognition performance of HMM are presented in this paper. They are defined as the maximum parameter method, the all parameter method and the most reliable parameter method. The three methods respectively improve the performance in multi-speaker Chinese digit DHMM/VQ recognition from 89% to 92.3%, 95.7%, 94.3%. This paper describes the three methods and gives experimental results and analysis in multi-speaker Chinese digit DHMM/VQ recognition in detail.

     

/

返回文章
返回