A new VBR WI coder with CW modification for embedded applications특성파형 성분분리 방법 개선 및 새로운 가변 비트 기법을 적용한 임베디드 어플리케이션용 파형보간 코더

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 544
  • Download : 0
This dissertation presents a modified characteristic waveform (CW) decomposition method to enhance the speech quality of the wideband waveform interpolation (WI) coder and a new variable bit rate (VBR) coding technique to reduce the bit rate of the WI coder. In the original WI scheme, a CW is decomposed into a slowly evolving waveform (SEW) and a rapidly evolving waveform (REW) in Cartesian coordinates. This may deteriorate the spectral shape of the reconstructed CWs. Especially, speech quality degradation is inevitable when the REW contains SEW components. To solve this problem, the decomposition of a CW is performed on polar coordinates so that the spectral envelop information of the reconstructed CWs holds consistency even for the REW with voiced components. The proposed CW modification results in the PESQ (Perceptual Evaluation of Speech Quality) score improvement by 0.3 from the baseline speech quality, i.e., the PESQ score of 2.8. The variable bit rate scheme in the WI coder was already proposed by Plante. It takes the benefit of time varying property of speech signals. On the other hand, the target of our variable bit rate scheme is to utilize the slowly varying property of the signal. After the WI coder extracted all parameters, the distortions between the current and the predicted parameters are measured. The predicted parameters are acquired by the prediction based on the past parameters to be transmitted. A parameter would not be transmitted unless the distortion exceeds the preset threshold. At the decoder, the non-transmitted parameter is reconstructed by the same prediction method used for the encoder. In this way, we can reduce 41 percent of the total bit rate while retaining the speech quality degradation below 0.1 PESQ score. Recently, demands for the speech coders which can provide good speech quality even at very low bit rates are increasing. Especially, adequate speech coders for the embedded applications are highly demanded. The final ve...
Advisors
Hahn, Min-Sooresearcher한민수researcher
Description
한국정보통신대학교 : 공학부,
Publisher
한국정보통신대학교
Issue Date
2008
Identifier
392974/225023 / 020025345
Language
eng
Description

학위논문(박사) - 한국정보통신대학교 : 공학부, 2008.2, [ iv, 91 p. ]

Keywords

VBR; Waveform interpolation; Multi-mode; 멀티 모드; 가변 비트율; 파형 보간

URI
http://hdl.handle.net/10203/54598
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392974&flag=dissertation
Appears in Collection
School of Engineering-Theses_Ph.D(공학부 박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0