DSpace at KOASAS: A highly robust audio fingerprinting scheme in real environments

DSpace at KOASAS

College of Engineering(공과대학)KAIST-ICC School of Engineering-Theses_Ph.D(공학부 박사논문)

A highly robust audio fingerprinting scheme in real environments실제 환경에 강인한 오디오 핑거프린팅 기법

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 579
Download : 0

Export

Park, Man-Soo / 박만수

Recently, content-based audio identification techniques by an audio fingerprinting scheme that can retrieve audio information without any text-based query. They have been recognized as one of the state-of-the-art and attractive application services on the music portal market in wire/wireless communications. This dissertation introduces a methodology to this challenging task using an audio signal query to retrieve polyphonic music items by matching it to pre-indexed audio references. In real environments, however, sound recordings are commonly distorted by channel and background noise. As well, music signals can be easily distorted by time stretch (tempo change). The performance of audio identification is greatly degraded by those distortion factors. Thus, the robustness of an audio fingerprinting system is still one of the most important issues in music information retrieval by content-based audio identification techniques. This dissertation introduces the conventional audio fingerprinting schemes such as stochastic modeling and audio hashing. In the stochastic modeling scheme, spectral parameters are conventionally used to build a stochastic model. Foote proposed the stochastic modeling method for content-based music information retrieval (MIR) [16]. The stochastic model is based on the spectral envelope histogram, the histogram of spectral audio feature counts at the code vectors of vector quantization (VQ). In this dissertation, we propose a new distance metric to measure the similarity of two probability distributions and apply the dynamic matching method instead of the static matching method. As well, we proposed the stochastic modeling method which uses pitch histogram instead of spectral envelope histogram. Music can be identified by distinctive melody lines. Melody line consists of the harmony of musical notes. After all, pitch becomes very useful information because it is a basis of melody note. In addition, the number of histogram bins can be limit...

Advisors: Kim, Hoi-Rin researcher; 김회린 researcher

Description: 한국정보통신대학교 : 공학부,

Publisher: 한국정보통신대학교

Issue Date: 2006

Identifier: 392694/225023 / 020025342

Language: eng

Description: 학위논문(박사) - 한국정보통신대학교 : 공학부, 2006.8, [ xii, 88 p. ]

Keywords: Frequency Filtering; Pitch Histogram; Audio Fingerprint; Temporal Filtering; 시간축 필터링; 주파수 필터링; 피치 히스토그램; 오디오 핑거프린트

URI: http://hdl.handle.net/10203/54569

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392694&flag=dissertation

Appears in Collection: School of Engineering-Theses_Ph.D(공학부 박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

A highly robust audio fingerprinting scheme in real environments실제 환경에 강인한 오디오 핑거프린팅 기법

KOASAS

Communities & Collections