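Multi-channel audio processing techniques: angle information based spatial audio coding and frequency domain based audio source separation
(Korean title, translated: Multichannel audio processing techniques based on spatial audio coding using sound image information and frequency-domain audio source separation)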

This thesis addresses two issues. The first is multichannel audio coding, for which new spatial audio coding (SAC) schemes are proposed. SAC represents multichannel audio signals as a down-mixed signal together with spatial cues. Binaural cue coding (BCC) was introduced recently and has become an important SAC scheme. The inter-channel level difference (ICLD), one of the spatial cues of BCC, plays a pivotal role in removing redundant information. The accuracy of the ICLD, however, is easily degraded by quantization. A new method of representing the ICLD is therefore proposed, and it largely overcomes the quantization distortion. A second proposed SAC scheme is global vector split based virtual source location information (GS-VSLI). The GS-VSLI is analyzed on the semicircle plane and represented as angles. Spectral distortion measurements are conducted to confirm the usefulness of the GS-VSLI.

The second issue is audio source separation. Object-based audio rendering is a method for constructing an auditory scene automatically. The core technique for realizing object-based audio processing is blind source separation, which separates mixed audio into individual audio objects. For robustness, a frequency-domain block-based multichannel blind deconvolution (MBD) with a normalization matrix is proposed. The normalization is designed to overcome intrinsic problems of the time-domain MBD, such as the whitening effect and slow convergence. The experimental results confirm that the proposed MBD algorithm outperforms previous works.
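To make the ICLD quantization issue concrete, the following is a minimal sketch and not the thesis's method. It assumes the common definition of the ICLD as the power ratio of two channels in dB, a hypothetical uniform quantizer, and a generic level-to-angle mapping (`panning_angle_deg`) that only illustrates why a bounded angle can be easier to quantize than an unbounded dB value. It is not the GS-VSLI definition, which the abstract does not give. All function names and the step size are illustrative assumptions.

```python
import math


def icld_db(left, right, eps=1e-12):
    """Inter-channel level difference in dB for two equal-length sample blocks.

    Assumes the usual definition: 10 * log10(power_left / power_right).
    eps avoids division by zero for silent blocks.
    """
    p_left = sum(x * x for x in left) + eps
    p_right = sum(x * x for x in right) + eps
    return 10.0 * math.log10(p_left / p_right)


def quantize_uniform(value, step):
    """Round value to the nearest multiple of step (hypothetical quantizer)."""
    return step * round(value / step)


def panning_angle_deg(left, right, eps=1e-12):
    """Generic bounded alternative: map the two channel amplitudes to an angle.

    Illustrative assumption only. The result lies in [0, 90] degrees,
    whereas the ICLD in dB is unbounded as one channel approaches silence.
    """
    a_left = math.sqrt(sum(x * x for x in left) + eps)
    a_right = math.sqrt(sum(x * x for x in right) + eps)
    return math.degrees(math.atan2(a_right, a_left))


if __name__ == "__main__":
    left = [1.0, 0.0, -1.0, 0.0]
    right = [0.5, 0.0, -0.5, 0.0]

    icld = icld_db(left, right)
    icld_q = quantize_uniform(icld, step=2.0)
    print(f"ICLD = {icld:.4f} dB, quantized = {icld_q:.1f} dB, "
          f"error = {abs(icld - icld_q):.4f} dB")
    print(f"angle = {panning_angle_deg(left, right):.3f} degrees")
```

With a coarse step, the quantization error of the dB value can approach half the step size, while the angle stays within a fixed range regardless of how unbalanced the channels are.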
Advisors
Hahn, Min-Soo
Description
Information and Communications University : School of Engineering
Publisher
Information and Communications University
Issue Date
2005
Identifier
392580/225023 / 020015320
Language
eng
Description

Thesis (Ph.D.) - Information and Communications University : School of Engineering, 2005, [ xi, 144 p. ]

Keywords

Virtual Source Location Information; Binaural Cue Coding; MPEG-4 Spatial Audio Coding; Blind Source Separation

URI
http://hdl.handle.net/10203/54551
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392580&flag=dissertation
Appears in Collection
School of Engineering-Theses_Ph.D (School of Engineering doctoral theses)
Files in This Item
There are no files associated with this item.
