DSpace at KOASAS: Speech enhancement and coding at medium-low rates

Speech enhancement and coding at medium-low rates (음질 향상 및 중대역에서의 음성 부호화에 관한 연구)

Author: Cho, Dong-Ho (조동호)

This dissertation studies speech enhancement and coding at medium-low rates (i.e., 4.8 to 16 kbit/s). The work is divided into five parts.

First, adaptive linear prediction based on the frequency-domain block least-mean-square (FBLMS) adaptation algorithm is studied. A new frequency-weighted block least-mean-square (FWBLMS) algorithm that minimizes the frequency-weighted block mean-squared error is proposed and applied to linear prediction of speech. The optimum convergence factors of various adaptive digital filter (ADF) algorithms are also derived analytically. In adaptive linear prediction of speech, the FWBLMS algorithm offers several advantages: direct residual extraction; availability of both time- and frequency-domain information for the input signal, the residual signal, and the prediction coefficients; an inherent noise spectral shaping effect; and simultaneous enhancement and coding by the spectral subtraction method in the frequency domain without block delay. Application of the FWBLMS algorithm to multi-rate vocoding is also discussed in detail.

Second, enhancement of noisy speech corrupted by white or colored noise is studied. The unconstrained FBLMS (UFBLMS) algorithm, which converges quickly for correlated input, is newly applied to speech processing. For enhancement of speech degraded by white noise, the spectral subtraction method, Wiener filtering, and the UFBLMS algorithm are investigated, and their performance is compared using various objective measures. The UFBLMS algorithm outperforms the spectral subtraction method and Wiener filtering by more than 3 dB in segmental frequency-weighted signal-to-quantization-noise ratio ($\mathrm{FWSQNR}_{\mathrm{SEG}}$) when the SNR of the speech is in the range of 0 to 10 dB. Furthermore, when the UFBLMS algorithm is used, high-pass filtering may be combined with the enhancement algorithm to improve speech quality and intelligibility. For enhancement of noisy speech corru...

Advisors: Un, Chong-Kwan (은종관)

Description: Korea Advanced Institute of Science and Technology: Department of Electrical and Electronic Engineering

Publisher: Korea Advanced Institute of Science and Technology (한국과학기술원)

Issue Date: 1985

Identifier: 60905/325007 / 000795252

Language: eng

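Description: Doctoral dissertation (Ph.D.), Korea Advanced Institute of Science and Technology, Department of Electrical and Electronic Engineering, 1985.2, [xxv, 374 p.]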

URI: http://hdl.handle.net/10203/35750

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=60905&flag=dissertation

Appears in Collection: EE-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.
