DSpace at KOASAS: Very low bit rate speech coding based on temporal decomposition of line spectral frequencies

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Theses_Ph.D.(박사논문)

Very low bit rate speech coding based on temporal decomposition of line spectral frequencies선스펙트럼 주파수의 시간적 분해법에 기반한 극저전송률 음성부호화

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 586
Download : 0

Export

Kim, Sung-Joo / 김승주

Very low bit rate (VLBR) speech coding technology digitizes speech signal at bit rate about 1 kbps and below, so that it can transfer or store speech signal effectively. To develop a VLBR speech coder, it is essential to remove the temporal redundancy of spectral information of speech. Most of VLBR speech coders analyze the input speech as a sequence of phonetically meaningful segments like phonemes and then quantize them to remove the spectral redundancy. In this case, it is expected that the coded speech can be utilized by several interesting applications such as client-server model speech recognition, spoken document retrieval, speaker transformation, speaking rate change, and so on. It is because the VLBR speech coder abstracts the essential information of the input speech more efficiently compared with a fixed-frame speech coding system. In this paper, two important aspects of a VLBR speech coding are studied: 1) development of a novel method for quantizing spectral information of speech and 2) application of a VLBR speech coder output. Thus a VLBR speech coder is implemented and its applications are discussed. The implemented vocoder adopts temporal decomposition method, which does not requires training or matching patterns. For representing spectral information of input speech, line spectral frequency (LSF) parameters are used since several merits of LSF parameter are very applicable to a low bit rate speech coder, such as their robustness in quantization and transmission error. However, they also have an inherent property called LSF````s ordering property and this prohibits the temporal decomposition of LSF parameters. In order to solve this problem, a restricted temporal decomposition is proposed. Finally, a VLBR speech coder at the average bit rate of 996 bps is developed, and performance tests prove that the proposed vocoder reproduces a similar quality of the 2400 bps LPC-10E vocoder. As an application of the implemented VLBR speech coder, an automa...

Advisors: Oh, Yung-Hwan researcher; 오영환 researcher

Description: 한국과학기술원 : 전산학전공,

Publisher: 한국과학기술원

Issue Date: 2000

Identifier: 157669/325007 / 000945074

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 전산학전공, 2000.2, [ iv, 101 p. ]

Keywords: temporal decomposition; speech coding; very low bit rate; LSF; 선스펙트럼 주파수; 시간적 분해법; 음성압축; 음성부호기; 극저전송률

URI: http://hdl.handle.net/10203/33154

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=157669&flag=dissertation

Appears in Collection: CS-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Very low bit rate speech coding based on temporal decomposition of line spectral frequencies선스펙트럼 주파수의 시간적 분해법에 기반한 극저전송률 음성부호화

KOASAS

Communities & Collections