DSpace at KOASAS: Deep learning for vocal melody extraction

Deep learning for vocal melody extraction (보컬 멜로디 추출을 위한 딥러닝)

DC Field	Value	Language
dc.contributor.advisor	Nam, Juhan	-
dc.contributor.advisor	남주한	-
dc.contributor.author	Kum, Sangeun	-
dc.date.accessioned	2022-04-15T01:53:38Z	-
dc.date.available	2022-04-15T01:53:38Z	-
dc.date.issued	2021	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=956569&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/294535	-
dc.description.abstract	In this thesis, we propose several deep learning (DL) based methods for vocal melody extraction. Vocal melody extraction is the task of identifying the melody pitch contour of the singing voice in a mixture of multiple sound sources. Previous studies have proposed methods that calculate the pitch saliency from a spectrogram or isolate the melody source from the mixture. However, these methods are limited in their ability to produce optimal outputs across a wide variety of music. Although the performance of melody extraction has improved with recent advances in DL, limitations remain in overall performance, in the use of music-related knowledge within the model, and in the availability of labeled data. Here we report effective methods for estimating the melody pitch and detecting the singing voice by introducing novel DL models and a loss function. We also propose a multi-task network in which pitch estimation and voice detection are tightly coupled. To address the lack of labeled data, we apply semi-supervised learning that utilizes large amounts of unlabeled data. We explore the effects of three teacher-student model setups, data augmentation, and unlabeled data, and propose the most effective learning method for vocal melody extraction. In addition, we apply semi-supervised learning to singing voice detection and show that it can be extended to other MIR tasks that suffer from a lack of labeled data.	-
dc.language	eng	-
dc.title	Deep learning for vocal melody extraction	-
dc.title.alternative	보컬 멜로디 추출을 위한 딥러닝	-
dc.identifier.CNRN	325007	-
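dc.description.department	Korea Advanced Institute of Science and Technology (KAIST) : Graduate School of Culture Technology	-
dc.description.isOpenAccess	Thesis (Ph.D.) - Korea Advanced Institute of Science and Technology : Graduate School of Culture Technology, 2021.2, [iv, 75 p.]	-
dc.publisher.country	Korea Advanced Institute of Science and Technology	-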
dc.type.journalArticle	Thesis(Ph.D)	-
dc.contributor.alternativeauthor	금상은	-
dc.subject.keywordAuthor	Deep Learning; Vocal Melody Extraction; Singing Voice Detection; Semi-Supervised Learning; Teacher-Student Framework	-
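dc.subject.keywordAuthor	Deep Learning; Vocal Melody Extraction; Voice Activity Detection; Semi-Supervised Learning; Teacher-Student Framework (translated from the Korean keywords)	-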

Appears in Collection: GCT-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.