A highly robust audio fingerprinting scheme in real environments = 실제 환경에 강인한 오디오 핑거프린팅 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 236
  • Download : 0
Recently, content-based audio identification techniques by an audio fingerprinting scheme that can retrieve audio information without any text-based query. They have been recognized as one of the state-of-the-art and attractive application services on the music portal market in wire/wireless communications. This dissertation introduces a methodology to this challenging task using an audio signal query to retrieve polyphonic music items by matching it to pre-indexed audio references. In real environments, however, sound recordings are commonly distorted by channel and background noise. As well, music signals can be easily distorted by time stretch (tempo change). The performance of audio identification is greatly degraded by those distortion factors. Thus, the robustness of an audio fingerprinting system is still one of the most important issues in music information retrieval by content-based audio identification techniques. This dissertation introduces the conventional audio fingerprinting schemes such as stochastic modeling and audio hashing. In the stochastic modeling scheme, spectral parameters are conventionally used to build a stochastic model. Foote proposed the stochastic modeling method for content-based music information retrieval (MIR) [16]. The stochastic model is based on the spectral envelope histogram, the histogram of spectral audio feature counts at the code vectors of vector quantization (VQ). In this dissertation, we propose a new distance metric to measure the similarity of two probability distributions and apply the dynamic matching method instead of the static matching method. As well, we proposed the stochastic modeling method which uses pitch histogram instead of spectral envelope histogram. Music can be identified by distinctive melody lines. Melody line consists of the harmony of musical notes. After all, pitch becomes very useful information because it is a basis of melody note. In addition, the number of histogram bins can be limit...
Advisors
Kim, Hoi-Rinresearcher김회린researcher
Description
한국정보통신대학교 : 공학부,
Publisher
한국정보통신대학교
Issue Date
2006
Identifier
392694/225023 / 020025342
Language
eng
Description

학위논문(박사) - 한국정보통신대학교 : 공학부, 2006.8, [ xii, 88 p. ]

Keywords

Frequency Filtering; Pitch Histogram; Audio Fingerprint; Temporal Filtering; 시간축 필터링; 주파수 필터링; 피치 히스토그램; 오디오 핑거프린트

URI
http://hdl.handle.net/10203/54569
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392694&flag=dissertation
Appears in Collection
School of Engineering-Theses_Ph.D(공학부 박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0