Study of prosody model on Chinese speech synthesis based on the classification of syllabic prosody features

TAO Jianhua; CAI Lianhong

doi:10.15949/j.cnki.0371-0025.2003.05.003

Volume 28 Issue 5

Aug. 2022

Turn off MathJax

Article Contents

Abstract

ACTA ACUSTICA > 2003 > 28(5): 395-402. > DOI: 10.15949/j.cnki.0371-0025.2003.05.003 CSTR: 32049.14.11-2065.2003.05.003

TAO Jianhua, CAI Lianhong. Study of prosody model on Chinese speech synthesis based on the classification of syllabic prosody features[J]. ACTA ACUSTICA, 2003, 28(5): 395-402. DOI: 10.15949/j.cnki.0371-0025.2003.05.003

Citation:

PDF (2511 KB)

Study of prosody model on Chinese speech synthesis based on the classification of syllabic prosody features

Department of Computer Science and Technology Tsinghua University Beijing 100084

More Information

PACS:
- 43.70 (Speech production)
Received Date: December 09, 2001
Revised Date: August 12, 2002
Available Online: August 03, 2022

Graphical Abstract

Abstract

Abstract

A prosody modeling method based on statistic model is described. Based on this, a Chinese prosody model based on the classification of syllabic prosody features is presented, which makes automatic prosody prediction with prosody templates and prosody cost function. And the automatic training algorithm of the model in detail is described. Further more, according to statistic prosody modeling method, the influence to prosody template selection with the help of the analysis of the prosody interaction among prosody elements is analyzed. Finally, the error distribution of the statistic method based prosody prediction is given. The results show good naturalness and much flexible in application.

FullText(HTML)

References (0)

[1]	HAO Xiaoyang, ZHANG Pengyuan. Autoregressive multi-speaker model in Chinese speech synthesis based on variational autoencoder[J]. ACTA ACUSTICA, 2022, 47(3): 405-416. DOI: 10.15949/j.cnki.0371-0025.2022.03.004
[2]	LIANG Chunyan, YANG Lin, ZHOU Ruohua, YAN Yonghong. Modeling prosodic features with probabilistic linear discriminant analysis for speaker verification[J]. ACTA ACUSTICA, 2015, 40(1): 28-33. DOI: 10.15949/j.cnki.0371-0025.2015.01.004
[3]	ZHANG Qingqing, PAN Jielin, YAN Yonghong. Tonal articulatory feature-based acoustic modeling for Chinese Putonghua speech recognition[J]. ACTA ACUSTICA, 2010, 35(2): 254-260. DOI: 10.15949/j.cnki.0371-0025.2010.02.026
[4]	JIANG Xiaoqing, TIAN Lan, CUI Guohui. Statistical analysis of prosodie parameters and emotion recognition of multilingual speech[J]. ACTA ACUSTICA, 2006, 31(3): 217-221. DOI: 10.15949/j.cnki.0371-0025.2006.03.005
[5]	WANG Wei, CAI Lianhong. Research on predicting prosodic parameters for Chinese synthesis by data mining approach[J]. ACTA ACUSTICA, 2003, 28(1): 1-6. DOI: 10.15949/j.cnki.0371-0025.2003.01.001
[6]	TAO Jianhua, CAI Lianhong, ZHAO Shixia, WU Zhiyong. The study of the trainable prosodic model for Chinese text to speech system[J]. ACTA ACUSTICA, 2001, 26(1): 67-72. DOI: 10.15949/j.cnki.0371-0025.2001.01.012
[7]	YU Zhenli, CHENG Bozhong. Study of a new synthesis method based on speech production model and RTLA model[J]. ACTA ACUSTICA, 2000, 25(5): 455-462. DOI: 10.15949/j.cnki.0371-0025.2000.05.013
[8]	ZHANG Jialu, QI Shiqian, YU Ge. Assessment methods of speech synthesis systems for Chinese[J]. ACTA ACUSTICA, 1998, 23(1): 19-30. DOI: 10.15949/j.cnki.0371-0025.1998.01.003
[9]	MA Xiaohui, FU Yuqing, LU Jiren, GONG Yifan. A study on recognition of continuous Chinese speech based on stochastic trajectory models[J]. ACTA ACUSTICA, 1997, 22(2): 176-181. DOI: 10.15949/j.cnki.0371-0025.1997.02.012
[10]	GUAN Cun-tai, CHEN Yong-bin, WU Bo-xiu. A study on acoustic models of Chinese speech recognition system with whole Chinese syllables[J]. ACTA ACUSTICA, 1994, 19(5): 321-330. DOI: 10.15949/j.cnki.0371-0025.1994.05.001

Cited By

Get Citation

PDF

XML

Article Metrics

Article views (51) PDF downloads (5)

Study of prosody model on Chinese speech synthesis based on the classification of syllabic prosody features

Abstract

Related Articles

Catalog

Article Metrics

Related

Study of prosody model on Chinese speech synthesis based on the classification of syllabic prosody features

Abstract

Related Articles

Catalog

Article Metrics

Related

Export File

Citation

Format

Content