EI / SCOPUS / CSCD 收录

中文核心期刊

TAO Jianhua, CAI Lianhong. Study of prosody model on Chinese speech synthesis based on the classification of syllabic prosody features[J]. ACTA ACUSTICA, 2003, 28(5): 395-402. DOI: 10.15949/j.cnki.0371-0025.2003.05.003
Citation: TAO Jianhua, CAI Lianhong. Study of prosody model on Chinese speech synthesis based on the classification of syllabic prosody features[J]. ACTA ACUSTICA, 2003, 28(5): 395-402. DOI: 10.15949/j.cnki.0371-0025.2003.05.003

Study of prosody model on Chinese speech synthesis based on the classification of syllabic prosody features

More Information
  • PACS: 
    • 43.70  (Speech production)
  • Received Date: December 09, 2001
  • Revised Date: August 12, 2002
  • Available Online: August 03, 2022
  • A prosody modeling method based on statistic model is described. Based on this, a Chinese prosody model based on the classification of syllabic prosody features is presented, which makes automatic prosody prediction with prosody templates and prosody cost function. And the automatic training algorithm of the model in detail is described. Further more, according to statistic prosody modeling method, the influence to prosody template selection with the help of the analysis of the prosody interaction among prosody elements is analyzed. Finally, the error distribution of the statistic method based prosody prediction is given. The results show good naturalness and much flexible in application.
  • Related Articles

    [1]HAO Xiaoyang, ZHANG Pengyuan. Autoregressive multi-speaker model in Chinese speech synthesis based on variational autoencoder[J]. ACTA ACUSTICA, 2022, 47(3): 405-416. DOI: 10.15949/j.cnki.0371-0025.2022.03.004
    [2]LIANG Chunyan, YANG Lin, ZHOU Ruohua, YAN Yonghong. Modeling prosodic features with probabilistic linear discriminant analysis for speaker verification[J]. ACTA ACUSTICA, 2015, 40(1): 28-33. DOI: 10.15949/j.cnki.0371-0025.2015.01.004
    [3]ZHANG Qingqing, PAN Jielin, YAN Yonghong. Tonal articulatory feature-based acoustic modeling for Chinese Putonghua speech recognition[J]. ACTA ACUSTICA, 2010, 35(2): 254-260. DOI: 10.15949/j.cnki.0371-0025.2010.02.026
    [4]JIANG Xiaoqing, TIAN Lan, CUI Guohui. Statistical analysis of prosodie parameters and emotion recognition of multilingual speech[J]. ACTA ACUSTICA, 2006, 31(3): 217-221. DOI: 10.15949/j.cnki.0371-0025.2006.03.005
    [5]WANG Wei, CAI Lianhong. Research on predicting prosodic parameters for Chinese synthesis by data mining approach[J]. ACTA ACUSTICA, 2003, 28(1): 1-6. DOI: 10.15949/j.cnki.0371-0025.2003.01.001
    [6]TAO Jianhua, CAI Lianhong, ZHAO Shixia, WU Zhiyong. The study of the trainable prosodic model for Chinese text to speech system[J]. ACTA ACUSTICA, 2001, 26(1): 67-72. DOI: 10.15949/j.cnki.0371-0025.2001.01.012
    [7]YU Zhenli, CHENG Bozhong. Study of a new synthesis method based on speech production model and RTLA model[J]. ACTA ACUSTICA, 2000, 25(5): 455-462. DOI: 10.15949/j.cnki.0371-0025.2000.05.013
    [8]ZHANG Jialu, QI Shiqian, YU Ge. Assessment methods of speech synthesis systems for Chinese[J]. ACTA ACUSTICA, 1998, 23(1): 19-30. DOI: 10.15949/j.cnki.0371-0025.1998.01.003
    [9]MA Xiaohui, FU Yuqing, LU Jiren, GONG Yifan. A study on recognition of continuous Chinese speech based on stochastic trajectory models[J]. ACTA ACUSTICA, 1997, 22(2): 176-181. DOI: 10.15949/j.cnki.0371-0025.1997.02.012
    [10]GUAN Cun-tai, CHEN Yong-bin, WU Bo-xiu. A study on acoustic models of Chinese speech recognition system with whole Chinese syllables[J]. ACTA ACUSTICA, 1994, 19(5): 321-330. DOI: 10.15949/j.cnki.0371-0025.1994.05.001

Catalog

    Article Metrics

    Article views (51) PDF downloads (5) Cited by()
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return