DSpace at KOASAS: Semantic Tagging of Singing Voices in Popular Music Recordings

DSpace at KOASAS

College of Liberal Arts and Convergence Science(인문사회융합과학대학)Graduate School of Culture Technology(문화기술대학원)GCT-Journal Papers(저널논문)

Semantic Tagging of Singing Voices in Popular Music Recordings

Cited 8 time in

Cited 6 time in

Hit : 506
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Kim, Keunhyoung Luke	ko
dc.contributor.author	Lee, Jongpil	ko
dc.contributor.author	Kum, Sangeun	ko
dc.contributor.author	Park, Chae Lin	ko
dc.contributor.author	Nam, Juhan	ko
dc.date.accessioned	2020-07-18T00:57:26Z	-
dc.date.available	2020-07-18T00:57:26Z	-
dc.date.created	2020-05-30	-
dc.date.created	2020-05-30	-
dc.date.created	2020-05-30	-
dc.date.issued	2020-05	-
dc.identifier.citation	IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.28, pp.1656 - 1668	-
dc.identifier.issn	2329-9290	-
dc.identifier.uri	http://hdl.handle.net/10203/275511	-
dc.description.abstract	Singing voice is a key sound source in popular music. As recent music streaming and entertainment services call for more intelligent solutions to retrieve songs or evaluate musical characteristics, automatic analysis of popular music targeted to singing voice has been a significant research subject. The majority of studies have focused on quantitative or objective information of singing voice such as pitch, lyrics or singer identity. However, singing voice has a wide variety of dimensions that are somewhat difficult to quantify and therefore we often describe by words. In this article, we address the qualitative analysis of singing voice as a music auto-tagging task that annotates songs with a set of tag words. To this end, we build a music tag dataset dedicated to singing voice. Specifically, we define a vocabulary that describes timbre and singing styles of K-pop vocalists and collect human annotations for individual tracks. We then conduct statistical analysis to understand the global and temporal characteristics of the tag words. Using the dataset, we train a deep neural network model to automatically predict the voice-specific tags from popular music recordings and evaluate the model in different conditions. We discuss the results by comparing them to the statistical analysis of tag words. Finally, we show potential applications of the vocal tagging system in music retrieval, music thumbnailing and singing evaluation.	-
dc.language	English	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.title	Semantic Tagging of Singing Voices in Popular Music Recordings	-
dc.type	Article	-
dc.identifier.wosid	000542977800004	-
dc.identifier.scopusid	2-s2.0-85086431815	-
dc.type.rims	ART	-
dc.citation.volume	28	-
dc.citation.beginningpage	1656	-
dc.citation.endingpage	1668	-
dc.citation.publicationname	IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING	-
dc.identifier.doi	10.1109/taslp.2020.2993893	-
dc.contributor.localauthor	Nam, Juhan	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Timbre	-
dc.subject.keywordAuthor	Instruments	-
dc.subject.keywordAuthor	Vocabulary	-
dc.subject.keywordAuthor	Tagging	-
dc.subject.keywordAuthor	Statistical analysis	-
dc.subject.keywordAuthor	Singing voice	-
dc.subject.keywordAuthor	vocal	-
dc.subject.keywordAuthor	semantic analysis	-
dc.subject.keywordAuthor	music tagging	-
dc.subject.keywordAuthor	convolutional neural networks	-
dc.subject.keywordAuthor	timbre	-
dc.subject.keywordAuthor	K-Pop	-
dc.subject.keywordPlus	MELODY EXTRACTION	-
dc.subject.keywordPlus	ACCOMPANIMENT	-
dc.subject.keywordPlus	EMOTION	-
dc.subject.keywordPlus	TIMBRE	-

Appears in Collection: GCT-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 8 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Semantic Tagging of Singing Voices in Popular Music Recordings

This item is cited by other documents in WoS

KOASAS

Communities & Collections