Semantic Tagging of Singing Voices in Popular Music Recordings

Cited 8 time in webofscience Cited 6 time in scopus
  • Hit : 506
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKim, Keunhyoung Lukeko
dc.contributor.authorLee, Jongpilko
dc.contributor.authorKum, Sangeunko
dc.contributor.authorPark, Chae Linko
dc.contributor.authorNam, Juhanko
dc.date.accessioned2020-07-18T00:57:26Z-
dc.date.available2020-07-18T00:57:26Z-
dc.date.created2020-05-30-
dc.date.created2020-05-30-
dc.date.created2020-05-30-
dc.date.issued2020-05-
dc.identifier.citationIEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.28, pp.1656 - 1668-
dc.identifier.issn2329-9290-
dc.identifier.urihttp://hdl.handle.net/10203/275511-
dc.description.abstractSinging voice is a key sound source in popular music. As recent music streaming and entertainment services call for more intelligent solutions to retrieve songs or evaluate musical characteristics, automatic analysis of popular music targeted to singing voice has been a significant research subject. The majority of studies have focused on quantitative or objective information of singing voice such as pitch, lyrics or singer identity. However, singing voice has a wide variety of dimensions that are somewhat difficult to quantify and therefore we often describe by words. In this article, we address the qualitative analysis of singing voice as a music auto-tagging task that annotates songs with a set of tag words. To this end, we build a music tag dataset dedicated to singing voice. Specifically, we define a vocabulary that describes timbre and singing styles of K-pop vocalists and collect human annotations for individual tracks. We then conduct statistical analysis to understand the global and temporal characteristics of the tag words. Using the dataset, we train a deep neural network model to automatically predict the voice-specific tags from popular music recordings and evaluate the model in different conditions. We discuss the results by comparing them to the statistical analysis of tag words. Finally, we show potential applications of the vocal tagging system in music retrieval, music thumbnailing and singing evaluation.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleSemantic Tagging of Singing Voices in Popular Music Recordings-
dc.typeArticle-
dc.identifier.wosid000542977800004-
dc.identifier.scopusid2-s2.0-85086431815-
dc.type.rimsART-
dc.citation.volume28-
dc.citation.beginningpage1656-
dc.citation.endingpage1668-
dc.citation.publicationnameIEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING-
dc.identifier.doi10.1109/taslp.2020.2993893-
dc.contributor.localauthorNam, Juhan-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorTimbre-
dc.subject.keywordAuthorInstruments-
dc.subject.keywordAuthorVocabulary-
dc.subject.keywordAuthorTagging-
dc.subject.keywordAuthorStatistical analysis-
dc.subject.keywordAuthorSinging voice-
dc.subject.keywordAuthorvocal-
dc.subject.keywordAuthorsemantic analysis-
dc.subject.keywordAuthormusic tagging-
dc.subject.keywordAuthorconvolutional neural networks-
dc.subject.keywordAuthortimbre-
dc.subject.keywordAuthorK-Pop-
dc.subject.keywordPlusMELODY EXTRACTION-
dc.subject.keywordPlusACCOMPANIMENT-
dc.subject.keywordPlusEMOTION-
dc.subject.keywordPlusTIMBRE-
Appears in Collection
GCT-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 8 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0