Real-time pitch tracking using weakly supervised convolutional recurrent neural network합성곱 순환 신경망을 사용한 실시간 음 높이 추적

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 346
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorNam, Juhan-
dc.contributor.advisor남주한-
dc.contributor.authorChoi, Soonbeom-
dc.date.accessioned2019-08-28T02:46:02Z-
dc.date.available2019-08-28T02:46:02Z-
dc.date.issued2018-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=733791&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/266015-
dc.description학위논문(석사) - 한국과학기술원 : 문화기술대학원, 2018.2,[iv, 28 p. :]-
dc.description.abstractExisting audio analysis algorithms are focused on using by composers. Performers also use similar techniques, but algorithms should work in real-time without post processing. Pitch tracking is one of the famous technology that is applied to various audio signal processing technologies. Especially for synthesizing new sound real-time high performance pitch tracking is necessary for performers. Digital signal processing(DSP) based pitch tracking algorithms like YIN or probabilistic YIN algorithm shows high accuracy and they are generally used for pitch tracking tasks. Still those algorithms are difficult to cope with various recording environments and have a long analysis time. In this paper, we propose a pitch analysis algorithm using neural network which can learn various recording environments based on data and reduce the number of operations. Especially we adopt convolutional neural network and convolutional recurrent neural network which show high pitch tracking accuracy. Also we applied post processing based on average mean difference function which is used in DSP pitch tracking and help finding fine pitch. The problem of neural network is that it needs large enough data to be trained. Here we propose weakly supervised learning idea which obtain annotation from DSP algorithm especially using PYIN algorithm. We found that the prediction from DSP annotation shows close accuracy compared to the prediction from human annotation. Though these process user can obtain continuous pitches complete automatically. We made our own dataset to train pitch tracking. The dataset is consist of jazz and blues style guitar solo. We compared result among several different setup networks and also compared our method with DSP algorithms in accuracy and computation speed. We mainly focused on voiced samples. Experiment is done using same test set from dataset and computing environment.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectaudio analysis▼apitch tracking▼aartificial intelligence▼aartificial neural network▼aperformance-
dc.subject오디오 분석▼a음 높이 분석▼a인공지능▼a인공 신경망▼a공연-
dc.titleReal-time pitch tracking using weakly supervised convolutional recurrent neural network-
dc.title.alternative합성곱 순환 신경망을 사용한 실시간 음 높이 추적-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :문화기술대학원,-
dc.contributor.alternativeauthor최순범-
Appears in Collection
GCT-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0