DSpace at KOASAS: Singing voice generation and phrase emphasis using glottal-waveform



Singing voice generation and phrase emphasis using glottal-waveform (Korean title: 성대파를 이용한 가창음성 생성 및 어구 강조)


Bae, Jae-Hyun / 배재현

Research on speech synthesis has focused mainly on generating plain read-speech sentences, and the quality of the synthesized speech has improved greatly. Recently, conversational, emotional, and expressive speech synthesis has been widely studied, as has voice quality, meaning the color of the voice. Among these, studies on expressive TTS focus mainly on the corpus-based method, which records utterances spoken in various circumstances and selects suitable units from them for a given context. In corpus-based synthesis, natural speech segments are used with almost no modification. The advantage of this approach is that the synthetic speech is very natural. But there are also disadvantages. One is that a huge number of speech sentences must be recorded to cope with various circumstances; therefore, in a context that was not prepared for, the naturalness of the synthetic speech may be degraded. Another is that the prosody of the synthetic speech may differ from the target prosody that the prosody module produces. Within research on voice color, the glottal waveform is widely studied: modeling and modifying the glottal waveform produces high-quality synthetic speech. In this paper, we aim to generate speech in which a key phrase is emphasized, starting from a plain read-speech sentence, by transforming the glottal waveform. The plain synthetic speech of a conventional TTS system cannot express the speaker's intention. In real environments, by contrast, people may emphasize the keyword or phrase that they want to deliver clearly. The emphasized keyword or phrase has a stronger voice color than the other phrases. Utilizing this phenomenon, we aim to emphasize the keyword or phrase that is the contextual core of the sentence. We use glottal waveforms to make the re-synthesized speech more natural. To estimate the glo...

Advisors: Oh, Yung-Hwan / 오영환

Description: Korea Advanced Institute of Science and Technology (KAIST): Department of Computer Science

Publisher: Korea Advanced Institute of Science and Technology (KAIST)

Issue Date: 2011

Identifier: 466474/325007 / 020005820

Language: eng

Description: Doctoral thesis (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST): Department of Computer Science, 2011.2, [viii, 59 p.]

Keywords: Glottal Waveform Transformation (성대파 변환); Phrase Emphasis (어구 강조); Speech Synthesis (음성 합성); Singing Voice Generation (가창음성 생성)

URI: http://hdl.handle.net/10203/33334

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=466474&flag=dissertation

Appears in Collection: CS-Theses_Ph.D. (Doctoral Theses)

Files in This Item: There are no files associated with this item.


KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
