Speech quality enhancement and pitch-dependent bit reduction scheme for WI coders

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 324
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorHahn, Min-Soo-
dc.contributor.advisor한민수-
dc.contributor.authorCho, Keun-Seok-
dc.contributor.author조근석-
dc.date.accessioned2011-12-30-
dc.date.available2011-12-30-
dc.date.issued2009-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=393111&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/55094-
dc.description학위논문(석사) - 한국정보통신대학교 : 공학부, 2009.2, [ vii, 45 p. ]-
dc.description.abstractIn this thesis, an improved SEW/REW decomposition method with pitchdependent phase generation and a noble variable bit rate (VBR) scheme are proposed to enhance the speech quality of the waveform interpolation (WI) coder and reduce the bit rate of the WI coder. In the original WI scheme, a characteristic waveform (CW) is decomposed into a slowly evolving waveform (SEW) and a rapidly evolving waveform (REW) in Cartesian coordinates. This may deteriorate the spectral shape of the reconstructed CWs. Especially, speech quality degradation is inevitable when the REW contains SEW components. To solve this problem, the proposed decomposition is performed in the magnitude domain to reduce spectral distortions. The phase of the characteristic waveforms is generated after classifying the signal into silence, unvoiced and voiced speech using the pitch value. The proposed VBR scheme is achieved by substituting white Gaussian noises with the excitation signal of silence and unvoiced speech and allocating bit rates variably. The performance of our proposed method was evaluated by the perceptual evaluation of speech quality (PESQ) score. The proposed CW modification results in the PESQ score improvement by 0.32 from the baseline speech quality, i.e., the PESQ score of 3.368. In addition, we confirmed that the required bit rate is decreased by 6.7% using the proposed novel VBR scheme. Experimental results show that our proposed algorithm achieves the improved speech quality while reducing the required bit rate compared to the conventional methods.eng
dc.languageeng-
dc.publisher한국정보통신대학교-
dc.subject음원 파형 보간-
dc.subject음원 이용 가변 비트율-
dc.subject특징 파형-
dc.subjectSource-Controlled Variable Bit Rate-
dc.subjectCharacteristic Waveform-
dc.subjectWaveform Interpolation-
dc.titleSpeech quality enhancement and pitch-dependent bit reduction scheme for WI coders-
dc.typeThesis(Master)-
dc.identifier.CNRN393111/225023-
dc.description.department한국정보통신대학교 : 공학부, -
dc.identifier.uid020064606-
dc.contributor.localauthorHahn, Min-Soo-
dc.contributor.localauthor한민수-
Appears in Collection
School of Engineering-Theses_Master(공학부 석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0