DSpace at KOASAS: Effective and compact neural autoregressive models for piano music transcription

Effective and compact neural autoregressive models for piano music transcription (피아노 음악 채보를 위한 효과적이고 간결한 자기회귀 신경망 모델)

DC Field	Value	Language
dc.contributor.advisor	Nam, Juhan	-
dc.contributor.author	Kwon, Taegyun	-
dc.contributor.author	권태균	-
dc.date.accessioned	2024-08-08T19:31:01Z	-
dc.date.available	2024-08-08T19:31:01Z	-
dc.date.issued	2024	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1098151&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/321996	-
dc.description	Thesis (Ph.D.) - Korea Advanced Institute of Science and Technology: Graduate School of Culture Technology, 2024.2, [x, 114 p.]	-
dc.description.abstract	In this dissertation, I focus on autoregressive models among neural network-based automatic transcription models. On the piano, every sound arises only from a note onset or from the continuation of a note that began earlier, so an autoregressive model is expected to have an advantage in capturing causal relationships in frame-by-frame prediction. I designed the autoregressive prediction model by combining an acoustic module with a music language module. To take advantage of the autoregressive formulation, I designed a model capable of real-time operation using a unidirectional RNN, and I suggest methods to overcome the disadvantages of the autoregressive model, which receives less information than models using a bidirectional RNN and is vulnerable to exposure bias. For stable learning, I propose a network and a learning method that express the states of notes in more detail and use recursive information effectively. In addition, I induce the model to learn the pitch-shift invariance of the piano and the independence of each pitch. To this end, neurons in the acoustic module are separated by pitch, and each pitch is processed through a shared network. The music language model is likewise simplified to model the state progression of each pitch's note. The results show that an autoregressive model can also achieve high performance when appropriately adjusted, and that the hypothesized factors contribute to the performance improvement. To confirm its practical performance, the model was verified on multiple datasets with varied recording environments, and the effectiveness of the proposed elements was examined through a detailed note-level analysis. The proposed model operated in real time with low complexity and showed performance equivalent to non-real-time models.	-
dc.language	eng	-
dc.publisher	Korea Advanced Institute of Science and Technology	-
dc.subject	피아노 채보; 딥러닝; 자기회귀 모델	-
dc.subject	Piano transcription; Deep learning; Autoregressive model	-
dc.title	Effective and compact neural autoregressive models for piano music transcription	-
dc.title.alternative	피아노 음악 채보를 위한 효과적이고 간결한 자기회귀 신경망 모델	-
dc.type	Thesis(Ph.D)	-
dc.identifier.CNRN	325007	-
dc.description.department	Korea Advanced Institute of Science and Technology: Graduate School of Culture Technology	-
dc.contributor.alternativeauthor	Nam, Juhan	-

Appears in Collection: GCT-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
