Music identification using pitch histogram and MFCC-VQ dynamic pattern (피치히스토그램과 MFCC-VQ 동적패턴을 이용한 음악 검색)

DC Field | Value | Language
dc.contributor.advisor | Kim, Hoi-Rin | -
dc.contributor.advisor | 김회린 | -
dc.contributor.author | Park, Chul-Eui | -
dc.contributor.author | 박철의 | -
dc.date.accessioned | 2011-12-30 | -
dc.date.available | 2011-12-30 | -
dc.date.issued | 2005 | -
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392508&flag=dissertation | -
dc.identifier.uri | http://hdl.handle.net/10203/55358 | -
dc.description | Thesis (Master's) - Information and Communications University (한국정보통신대학교): School of Engineering, 2005, [ x, 43 p. ] | -
dc.description.abstract | When we hear unfamiliar music on TV or a computer, we often want to know more about it. However, it is usually difficult to get that information directly from service providers. Content-based MIR offers a solution to this problem, so various content-based audio retrieval techniques based on QBE are needed to identify an unknown music signal efficiently. In this thesis, we propose two methods for music retrieval. The first is an MFCC-VQ temporal method that uses the temporal characteristics of melody. The second is a hybrid method based on pitch histograms and MFCC-VQ dynamic patterns, which uses both the static and the temporal patterns of melody for MIR. Our features are pitch and MFCC, which represent the characteristics of notes, and we describe melody patterns with a pitch histogram and a temporal sequence of codeword indices. We then compute the similarity between the test pattern and the reference patterns. When comparing patterns, a suitable pattern matching method is especially important for good performance, so we also present pattern matching methods appropriate for our retrieval methods. In the MFCC-VQ temporal method, a time alignment method compensates for the temporal difference between two patterns by shifting the reference sequence. In addition, a modified ED technique is employed, which divides the distance between two patterns by a weight equal to the number of frames with the same MFCC-VQ index. In the hybrid method, we use a TSO method that takes as the retrieved result the candidate with the minimum sum of order indices in the pitch histogram method and the MFCC-VQ temporal method. We tested the proposed methods in a small and a broader search area: two different TV drama OSTs and 1,005 popular songs, respectively. Compared with the baseline methods, the experimental results showed that our methods perform better in both s... | eng
dc.language | eng | -
dc.publisher | Information and Communications University (한국정보통신대학교) | -
dc.subject | QBE | -
dc.subject | Music Information Retrieval | -
dc.subject | MIR | -
dc.subject | MFCC | -
dc.subject | Vector quantization (벡터 양자화) | -
dc.subject | Query-based music retrieval (쿼리를 이용한 음악 검색) | -
dc.subject | Music information retrieval (음악정보검색) | -
dc.subject | VQ | -
dc.title | Music identification using pitch histogram and MFCC-VQ dynamic pattern | -
dc.title.alternative | 피치히스토그램과 MFCC-VQ 동적패턴을 이용한 음악 검색 | -
dc.type | Thesis(Master) | -
dc.identifier.CNRN | 392508/225023 | -
dc.description.department | Information and Communications University (한국정보통신대학교): School of Engineering | -
dc.identifier.uid | 020034541 | -
dc.contributor.localauthor | Kim, Hoi-Rin | -
dc.contributor.localauthor | 김회린 | -
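
The matching steps described in the abstract can be illustrated with a minimal Python sketch. It is not the thesis implementation, and it rests on several assumptions that the record does not confirm: ED is read as Euclidean distance, each frame is represented by the centroid of its MFCC-VQ codeword, the distance is divided by one plus the number of matching frames to avoid division by zero, and the query is no longer than the reference. The abstract does not expand "TSO", so the fusion step only follows its description as the minimum sum of order indices. All function names and the toy data are hypothetical.

```python
import math
from typing import Dict, Sequence, Tuple

# codebook[i] is the centroid vector of codeword i (assumed representation).
Codebook = Sequence[Sequence[float]]


def modified_ed(query: Sequence[int], window: Sequence[int],
                codebook: Codebook) -> float:
    """Summed per-frame Euclidean distance, divided by a match-count weight."""
    total = sum(math.dist(codebook[q], codebook[w])
                for q, w in zip(query, window))
    matches = sum(1 for q, w in zip(query, window) if q == w)
    # The +1 is our assumption; the abstract only says the distance is
    # divided by the number of frames with the same MFCC-VQ index.
    return total / (1 + matches)


def best_shift_distance(query: Sequence[int], reference: Sequence[int],
                        codebook: Codebook) -> Tuple[float, int]:
    """Slide the query along the reference; keep the smallest distance."""
    if len(reference) < len(query):
        raise ValueError("reference must be at least as long as the query")
    best = (math.inf, -1)
    for shift in range(len(reference) - len(query) + 1):
        window = reference[shift:shift + len(query)]
        d = modified_ed(query, window, codebook)
        if d < best[0]:
            best = (d, shift)
    return best


def order_indices(scores: Dict[str, float]) -> Dict[str, int]:
    """Map each song to its order index (1 = smallest distance)."""
    ordered = sorted(scores, key=lambda s: (scores[s], s))
    return {song: i + 1 for i, song in enumerate(ordered)}


def fuse_by_order_sum(hist_scores: Dict[str, float],
                      temporal_scores: Dict[str, float]) -> str:
    """Return the song whose two order indices have the minimum sum."""
    r_hist = order_indices(hist_scores)
    r_temp = order_indices(temporal_scores)
    return min(r_hist, key=lambda s: (r_hist[s] + r_temp[s], s))


# Toy data: a 3-codeword codebook and short index sequences.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
reference = [0, 0, 1, 2, 1, 0]
query = [1, 2, 1]
distance, shift = best_shift_distance(query, reference, codebook)

# Hypothetical distances from the two matchers for three songs.
hist_scores = {"a": 0.2, "b": 0.1, "c": 0.9}
temporal_scores = {"a": 0.1, "b": 0.5, "c": 0.3}
winner = fuse_by_order_sum(hist_scores, temporal_scores)
```

In the toy data the query matches the reference exactly at shift 2, so the best distance is zero there. In the fusion step, song "a" ranks second by pitch histogram and first by the temporal method, which gives it the smallest order-index sum.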
Appears in Collection
School of Engineering-Theses_Master(공학부 석사논문)
Files in This Item
There are no files associated with this item.
