Melody extraction based on dynamic bayesian network동적 베이지안 네트워크에 기반한 멜로디 추출

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 403
  • Download : 0
This thesis considers a melody extraction algorithm based on the state-space equation of the parameters that define melody. In this thesis, melody is defined to be the singing voice pitch sequence in the vocal part and the pitch sequence of leading instrument in non-vocal part of music. The main idea of the considered algorithm is that the parameters that consist of melody pitch and their harmonic amplitudes are assumed to follow two uncoupled first-order Markov processes, and the polyphonic audio is related to the parameters such that the current framed segment of the polyphonic audio is conditionally independent of other framed segments given the parameters. From this perspective, a dynamic Bayesian network (DBN) for melody extraction can be constructed. And the posterior probability is estimated from this DBN, and it is used to estimate the parameters for melody extraction. To obtain the posterior probability, the likelihood and transition probabilities need to be defined. In defining the likelihood, the accompaniment which is considered the difference between polyphonic audio and melody is assumed to follow a multivariate Gaussian distribution. The transition probability of the melody pitch is obtained based on the statistical characteristics of music that account for small and large variation in melody, and the transition probability of the harmonic amplitudes is assumed to be a Gaussian for reasons of mathematical tractability. To estimate the parameters, the sequential Monte Carlo (SMC) method is utilized. The SMC method relies on a so-called sequential importance density, and this density is designed using multiple-pitches which are estimated by a simple multiple-pitch extraction algorithm. Experimental results show that the performance of the considered algorithm is better than or comparable to those of other well known melody extraction algorithms in terms of the raw pitch accuracy and the raw chroma accuracy.
Advisors
Yoo, Chang-Dongresearcher유창동researcher
Description
한국과학기술원 : 전기 및 전자공학과,
Publisher
한국과학기술원
Issue Date
2011
Identifier
466463/325007  / 020047568
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전기 및 전자공학과, 2011.2, [ viii, 71 p. ]

Keywords

sequential Monte Carlo method; 멜로디 추출; 동적 베이지안 네트워크; 상태공간 방정식; dynamic Bayesian network; state-space equation; 순차적 몬테 카를로 기법; melody extraction

URI
http://hdl.handle.net/10203/35942
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=466463&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0