EI / SCOPUS / CSCD 收录

中文核心期刊

GU Dong, JIAN Zhihua. An algorithm for voice conversion with limited corpus[J]. ACTA ACUSTICA, 2018, 43(5): 864-872. DOI: 10.15949/j.cnki.0371-0025.2018.05.018
Citation: GU Dong, JIAN Zhihua. An algorithm for voice conversion with limited corpus[J]. ACTA ACUSTICA, 2018, 43(5): 864-872. DOI: 10.15949/j.cnki.0371-0025.2018.05.018

An algorithm for voice conversion with limited corpus

  • Under the condition of limited target speaker's corpus, this paper proposed a new voice conversion algorithm using unified tensor dictionary with limited corpus. Firstly, parallel speech of N speakers was selected randomly from the speech corpus to build the base of tensor dictionary. And then, after the operation of multi-series dynamic time warping for those chosen speech, N two-dimension basic dictionaries can be generated which constituted the unified tensor dictionary. During the conversion stage, the two dictionaries of source and target speaker were established by linear combination of the N basic dictionaries using the two speakers' speech. The experimental results showed that when the number of the basic speaker was 14, our algorithm can obtain the compared perfornmnce of the traditional NMF-based method with few target speaker corpus, which greatly facilitate the application of voice conversion system.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return