기본주파수와 성도길이의 상관관계를 이용한HTS 음성합성기에서의 목소리 변환Voice transformation for HTS using correlation between fundamental frequency and vocal tract length

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 818
  • Download : 0
The main advantage of the statistical parametric speech synthesis is its flexibility in changing voice characteristics. A personalized text-to-speech(TTS) system can be implemented by combining a speech synthesis system and a voice transformation system, and it is widely used in many application areas. It is known that the fundamental frequency and the spectral envelope of speech signal can be independently modified to convert the voice characteristics. Also it is important to maintain naturalness of the transformed speech. In this paper, a speech synthesis system based on Hidden Markov Model(HMM-based speech synthesis, HTS) using the STRAIGHT vocoder is constructed and voice transformation is conducted by modifying the fundamental frequency and spectral envelope. The fundamental frequency is transformed in a scaling method, and the spectral envelope is transformed through frequency warping method to control the speaker's vocal tract length. In particular, this study proposes a voice transformation method using the correlation between fundamental frequency and vocal tract length. Subjective evaluations were conducted to assess preference and mean opinion scores(MOS) for naturalness of synthetic speech. Experimental results showed that the proposed voice transformation method achieved higher preference than baseline systems while maintaining the naturalness of the speech quality.
Publisher
한국음성학회
Issue Date
2017-03
Language
Korean
Keywords

voice transformation; HMM-based speech synthesis; STRAIGHT vocoder; fundamental frequency; vocal tract length

Citation

말소리와 음성과학, v.9, no.1, pp.41 - 47

ISSN
2005-8063
DOI
10.13064/KSSS.2017.9.1.041
URI
http://hdl.handle.net/10203/225499
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0