A highly robust audio fingerprinting scheme in real environments실제 환경에 강인한 오디오 핑거프린팅 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 524
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKim, Hoi-Rin-
dc.contributor.advisor김회린-
dc.contributor.authorPark, Man-Soo-
dc.contributor.author박만수-
dc.date.accessioned2011-12-28T02:44:10Z-
dc.date.available2011-12-28T02:44:10Z-
dc.date.issued2006-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392694&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/54569-
dc.description학위논문(박사) - 한국정보통신대학교 : 공학부, 2006.8, [ xii, 88 p. ]-
dc.description.abstractRecently, content-based audio identification techniques by an audio fingerprinting scheme that can retrieve audio information without any text-based query. They have been recognized as one of the state-of-the-art and attractive application services on the music portal market in wire/wireless communications. This dissertation introduces a methodology to this challenging task using an audio signal query to retrieve polyphonic music items by matching it to pre-indexed audio references. In real environments, however, sound recordings are commonly distorted by channel and background noise. As well, music signals can be easily distorted by time stretch (tempo change). The performance of audio identification is greatly degraded by those distortion factors. Thus, the robustness of an audio fingerprinting system is still one of the most important issues in music information retrieval by content-based audio identification techniques. This dissertation introduces the conventional audio fingerprinting schemes such as stochastic modeling and audio hashing. In the stochastic modeling scheme, spectral parameters are conventionally used to build a stochastic model. Foote proposed the stochastic modeling method for content-based music information retrieval (MIR) [16]. The stochastic model is based on the spectral envelope histogram, the histogram of spectral audio feature counts at the code vectors of vector quantization (VQ). In this dissertation, we propose a new distance metric to measure the similarity of two probability distributions and apply the dynamic matching method instead of the static matching method. As well, we proposed the stochastic modeling method which uses pitch histogram instead of spectral envelope histogram. Music can be identified by distinctive melody lines. Melody line consists of the harmony of musical notes. After all, pitch becomes very useful information because it is a basis of melody note. In addition, the number of histogram bins can be limit...eng
dc.languageeng-
dc.publisher한국정보통신대학교-
dc.subjectFrequency Filtering-
dc.subjectPitch Histogram-
dc.subjectAudio Fingerprint-
dc.subjectTemporal Filtering-
dc.subject시간축 필터링-
dc.subject주파수 필터링-
dc.subject피치 히스토그램-
dc.subject오디오 핑거프린트-
dc.titleA highly robust audio fingerprinting scheme in real environments-
dc.title.alternative실제 환경에 강인한 오디오 핑거프린팅 기법-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN392694/225023-
dc.description.department한국정보통신대학교 : 공학부, -
dc.identifier.uid020025342-
dc.contributor.localauthorKim, Hoi-Rin-
dc.contributor.localauthor김회린-
Appears in Collection
School of Engineering-Theses_Ph.D(공학부 박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0