Prosody conversion for Mandarin emotional voice conversion
Abstract
A prosody conversion method is proposed for transforming neutral speech into a required target emotion. F0 is modeled by the discrete cosine transform (DCT) and converted by a Gaussian mixture model (GMM) based method at both the phrase level and the syllable level, while duration is converted by a classification and regression tree (CART) based method at the phoneme level. A corpus consisting of three basic emotions was used for training and testing. Objective evaluation and listening test results showed that the method converts emotional prosody effectively; the sad emotion conversion achieved an accuracy of nearly 100% in the listening test.
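As a rough illustration of the DCT-based F0 modeling mentioned above, the following sketch parameterizes a single syllable-level F0 contour with a few DCT coefficients and reconstructs it. It is not taken from the paper: the function names, the use of log-F0, the orthonormal DCT-II scaling, the choice of five coefficients, and the synthetic contour are all assumptions made for illustration. The GMM-based mapping of these coefficients is not shown.

```python
import numpy as np


def dct_basis(num_frames, num_coeffs):
    """Orthonormal DCT-II basis, shape (num_coeffs, num_frames)."""
    n = np.arange(num_frames)
    k = np.arange(num_coeffs)[:, None]
    basis = np.cos(np.pi * (n + 0.5) * k / num_frames)
    scale = np.full((num_coeffs, 1), np.sqrt(2.0 / num_frames))
    scale[0, 0] = np.sqrt(1.0 / num_frames)
    return scale * basis


def f0_to_dct(log_f0, num_coeffs):
    """Project a voiced log-F0 contour onto the first num_coeffs DCT bases."""
    return dct_basis(len(log_f0), num_coeffs) @ log_f0


def dct_to_f0(coeffs, num_frames):
    """Rebuild a log-F0 contour of num_frames frames from DCT coefficients."""
    return dct_basis(num_frames, len(coeffs)).T @ coeffs


# Hypothetical syllable contour: 40 voiced frames rising and falling around 200 Hz.
num_frames = 40
log_f0 = np.log(200.0 + 30.0 * np.sin(np.linspace(0.0, np.pi, num_frames)))

# Assumed model order: keep only the first 5 coefficients.
coeffs = f0_to_dct(log_f0, 5)
approx = dct_to_f0(coeffs, num_frames)
max_error = np.max(np.abs(approx - log_f0))
print("DCT coefficients:", coeffs)
print("max abs log-F0 error:", max_error)
```

In a scheme of this kind, the fixed-length coefficient vector (rather than the variable-length contour) would be the feature paired between neutral and emotional utterances, with the first coefficient carrying the mean log-F0 level and the higher ones the contour shape.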