EI / SCOPUS / CSCD 收录

中文核心期刊

CHEN Bin, ZHANG Lianhai, WANG Bo, QU Dan. Boundary detection of Chinese initials and finals based on seneff's auditory spectrum features[J]. ACTA ACUSTICA, 2012, 37(1): 104-112. DOI: 10.15949/j.cnki.0371-0025.2012.01.012
Citation: CHEN Bin, ZHANG Lianhai, WANG Bo, QU Dan. Boundary detection of Chinese initials and finals based on seneff's auditory spectrum features[J]. ACTA ACUSTICA, 2012, 37(1): 104-112. DOI: 10.15949/j.cnki.0371-0025.2012.01.012

Boundary detection of Chinese initials and finals based on seneff's auditory spectrum features

  • A boundary detection method of Chinese initials and finals is proposed based on the energy distribute and formant structure characteristics.According to this method,the auditory spectrum is first of all got after speech signal passes the Seneff's auditory model,and then based on the spectrum the parameters of all-band energy,low-band energy, spectrum center of gravity,ratio of high and low frequency energy,middle and high energy,etc are chose to describe the energy distribute and formant structure characteristic of different kinds of Chinese initials and finals.Finally,the boundary is determined according to the parameter mutation,and modified using the first envelope difference and simplebased Kullback-Leibler distance.The experimental results show that under 8 kHz sampling frequency,the accuracy is 93.7%for clean speech,above 85.3%for noisy speech with the SNR of 10 dB and above 86.7%for codec speech.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return