Speech enhancement and coding at medium-low rates음질 향상 및 중대역에서의 음성 부호화에 관한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 647
  • Download : 0
In this dissertation work, speech enhancement and coding at medium-low rates (i.e., 4.8 to 16 kbits/s) have been studied. This dissertation work may be divided into five parts. First, adaptive linear prediction based on the frequency-domain block least-mean-square (FBLMS) adaptation algorithm has been studied. A new frequency-weighted block least-mean-square (FWBLMS) algorithm that minimizes frequency-weighted block mean-squared error is proposed and applied to linear prediction of speech. Also, the optimum convergence factors of various adaptive digital filter (ADF) algorithms are derived analytically. In adaptive linear prediction of speech, the use of the FWBLMS algorithm gives several advantages. These include direct residual extraction, the existence of time- and frequency-domain information of input and residual signals and prediction coefficients, inherent noise spectral shaping effect and simultaneous enhancement and coding by the spectral subtraction method in the frequency domain without block delay. Application of the FWBLMS algorithm to multi-rate vocoding is also discussed in detail. Second, enhancement of noisy speech corrupted by white or colored noise is studied. The unconstrained FBLMS (UFBLMS) algorithm with fast convergence speed for correlated input is newly applied to speech processing. For enhancement of speech degraded by white noise, the spectral subtraction method, Wiener filtering and the UFBLMS algorithm are investigated, and their performances are compared by various objective measures. The UFBLMS algorithm is superior to the spectral subtraction method or Wiener filtering technique by more than 3 dB in segmental frequency-weighted signal-to-quantization noise ($\mbox{FWSQNR}_\mbox{SEG}$) when SNR of speech is in the range of 0 to 10 dB. Furthermore, when the UFBLMS algorithm is used, high-pass filtering may be combined with the enhancement algorithm to improve speech quality and intelligibility. For enhancement of noisy speech corru...
Advisors
Un, Chong-Kwanresearcher은종관researcher
Description
한국과학기술원 : 전기 및 전자공학과,
Publisher
한국과학기술원
Issue Date
1985
Identifier
60905/325007 / 000795252
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전기 및 전자공학과, 1985.2, [ xxv, 374 p. ]

URI
http://hdl.handle.net/10203/35750
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=60905&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0