EI / SCOPUS / CSCD 收录

中文核心期刊

基于随机轨迹模型的汉语连续语音识别方法研究

A study on recognition of continuous Chinese speech based on stochastic trajectory models

  • 摘要: 本文在指出隐马尔可夫模型(HMM)不合理假设的基础上,介绍了随机轨迹模型(STM)的理论机制及优越性。随机轨迹模型将语音基元的声学观察表示为参数空间中轨迹的聚类,并将轨迹建模为状态随机序列概率密度函数的混合,该模型可以克服HMM的不合理假设,在理论上更合理。根据STM的特点及汉语语音特色,本文对汉语连续语音识别基元的选取进行了讨论,提出了音素类单元作为识别系统的识别基元。基于STM的汉语连续语音识别的实验结果证明了STM的有效性和音素类单元的一致性。

     

    Abstract: After pointed the unreality of the three basic HMM assumptions, this paper introduces the theory and the advantage of Stochastic Trajectory Models (STMs) which would possibly overcome these problems caused by HMM assumptions. STM represents the acoustic observations of a speech unit as clusters of trajectories in a parameter space. The trajectories are modeled by mixture of probability density functions of random sequence of states. After analyzing the characteristics of Chinese speech, the acoustic units for continuous Chinese speech recognition based on STM are discussed and phone-like units are suggested. The performance of continuous Chinese speech recognition based on STM is studied using VINICS system. The experimental results demonstrate the efficiency of STM and the consistency of phone-like units.

     

/

返回文章
返回