DSpace at KOASAS: Prediction of prosodic phrase boundaries for Korean text-to-speech conversion

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Theses_Ph.D.(박사논문)

Prediction of prosodic phrase boundaries for Korean text-to-speech conversion한국어 문서-음성 변환을 위한 운율경계 예측에 관한 연구

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 556
Download : 0

Export

Kim, Yeon-Jun / 김연준

This thesis describes a methodology in the prediction of prosodic phrase boundaries for Korean text-to-speech (TTS) conversion systems. The proposed method in this thesis is modeled using the temporal constraints of the human articulatory system and the syntactic influence emanating from dependency relation which is effective in freer word-order languages. TTS conversion, a well-known technique of communication between humans and computers, is the process of generating speech from text and the ultimate goal of speech synthesis; for any string of words a TTS system can approximate the way a human would read these same words. Although the need for communication between humans and computers is increasing as computers become more prevalent, currently, TTS systems are used only for several restricted applications because of their poor synthetic quality. Prosody plays an important role in speech production as well as speech understanding. In continuous speech, speakers tend to group words into phrases whose boundaries are marked by duration and intonational cues, and many phonological rules constrain operation only within such phrases, usually termed prosodic phrases. Therefore, a computational model for prosodic structure is necessary for high quality TTS conversion since the correct assignment of phrase breaks can increase the intelligibility of a sentence as well as improve its naturalness. In this work, several statistical models for predicting the prosodic phrase boundaries of speech are proposed. The computational prosody model in this work is automatically trainable only with syntactic information and can be incorporated into existing TTS conversion systems. This work makes use of dependency grammar, which is known to be more effective for parsing word-order free languages including Korean. For prosodic boundary prediction, various relevant features extracted from text analysis are incorporated instead of an input word sequence itself, whose motivation and ...

Advisors: Oh, Yung-Hwan researcher; 오영환 researcher

Description: 한국과학기술원 : 전산학전공,

Publisher: 한국과학기술원

Issue Date: 2000

Identifier: 157670/325007 / 000945077

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전산학전공, 2000.2, [ vi, 90 p. ]

Keywords: Prosody control; TTS; Speech synthesis; Prosodic phrasing; 운율경계 예측; 운율 조절; 문서-음성 변환; 음성합성

URI: http://hdl.handle.net/10203/33155

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=157670&flag=dissertation

Appears in Collection: CS-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Prediction of prosodic phrase boundaries for Korean text-to-speech conversion한국어 문서-음성 변환을 위한 운율경계 예측에 관한 연구

KOASAS

Communities & Collections