EI / SCOPUS / CSCD 收录

中文核心期刊

CHEN Bin, ZHANG Lianhai, WANG Bo, QU Dan. Boundary detection of Chinese initials and finals based on seneff's auditory spectrum features[J]. ACTA ACUSTICA, 2012, 37(1): 104-112. DOI: 10.15949/j.cnki.0371-0025.2012.01.012
Citation: CHEN Bin, ZHANG Lianhai, WANG Bo, QU Dan. Boundary detection of Chinese initials and finals based on seneff's auditory spectrum features[J]. ACTA ACUSTICA, 2012, 37(1): 104-112. DOI: 10.15949/j.cnki.0371-0025.2012.01.012

Boundary detection of Chinese initials and finals based on seneff's auditory spectrum features

More Information
  • PACS: 
  • Received Date: July 27, 2010
  • Revised Date: December 09, 2010
  • Available Online: June 22, 2022
  • A boundary detection method of Chinese initials and finals is proposed based on the energy distribute and formant structure characteristics.According to this method,the auditory spectrum is first of all got after speech signal passes the Seneff's auditory model,and then based on the spectrum the parameters of all-band energy,low-band energy, spectrum center of gravity,ratio of high and low frequency energy,middle and high energy,etc are chose to describe the energy distribute and formant structure characteristic of different kinds of Chinese initials and finals.Finally,the boundary is determined according to the parameter mutation,and modified using the first envelope difference and simplebased Kullback-Leibler distance.The experimental results show that under 8 kHz sampling frequency,the accuracy is 93.7%for clean speech,above 85.3%for noisy speech with the SNR of 10 dB and above 86.7%for codec speech.
  • Related Articles

    [1]ZHANG Yuxiang, LI Zhuo, LU Jingze, SHANG Zengqiang, CHEN Shuli, WANG Wenchao, ZHANG Pengyuan. Spoof speech detection based on speaker features[J]. ACTA ACUSTICA, 2025, 50(1): 201-210. DOI: 10.12395/0371-0025.2023278
    [2]FAN Xiaohe, ZHAO Heming, CHEN Xueqin, ZHOU Yan. Deceptive Chinese speech detection based on sparse decomposition of cepstral feature[J]. ACTA ACUSTICA, 2018, 43(1): 121-128. DOI: 10.15949/j.cnki.0371-0025.2018.01.014
    [3]WU Di, ZHAO Heming, TAO Zhi, ZHANG Xiaojun, XIAO Zhongzhe, XU Yishen. Speech endpoint detection in low-SNRs environment based on perception spectrogram structure boundary parameter[J]. ACTA ACUSTICA, 2014, 39(3): 392-399. DOI: 10.15949/j.cnki.0371-0025.2014.03.015
    [4]LI Hao, TANG Chaojing. Initial/final segmentation using loss function and acoustic features[J]. ACTA ACUSTICA, 2012, 37(3): 339-345. DOI: 10.15949/j.cnki.0371-0025.2012.03.010
    [5]YIN Hui, XIE Xiang, KUANG Jingming. Acoustic features based on auditory model and adaptive fractional Fourier transform for speech recognition[J]. ACTA ACUSTICA, 2012, 37(1): 97-103. DOI: 10.15949/j.cnki.0371-0025.2012.01.011
    [6]ZHANG Baoqi, ZHANG Lianhai, QU Dan. Segmentation of Chinese initials and finals based on auditory event detection[J]. ACTA ACUSTICA, 2010, 35(6): 701-707. DOI: 10.15949/j.cnki.0371-0025.2010.06.013
    [7]SHAO Jian, ZHAO Qingwei, YAN Yonghong. Initial/final acoustic model based on separating nasal coda in Chinese Putonghua speech recognition[J]. ACTA ACUSTICA, 2010, 35(5): 587-592. DOI: 10.15949/j.cnki.0371-0025.2010.05.021
    [8]ZHANG Qingqing, PAN Jielin, YAN Yonghong. Tonal articulatory feature-based acoustic modeling for Chinese Putonghua speech recognition[J]. ACTA ACUSTICA, 2010, 35(2): 254-260. DOI: 10.15949/j.cnki.0371-0025.2010.02.026
    [9]MA Yuanfeng, CHEN Ke'an, WANG Na, ZHENG Wen. Application of auditory spectrum-based features into acoustic target recognition[J]. ACTA ACUSTICA, 2009, 34(2): 142-150. DOI: 10.15949/j.cnki.0371-0025.2009.02.006
    [10]LI Xueli, DING Hui, XU Boling. Entropy-based initial/final segmentation for Chinese whiskered speech[J]. ACTA ACUSTICA, 2005, 30(1): 69-75. DOI: 10.15949/j.cnki.0371-0025.2005.01.011

Catalog

    Article Metrics

    Article views (30) PDF downloads (8) Cited by()
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return