PrimaDNN':A Characteristics-aware DNN Customization for Singing Technique Detection

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 67
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorYamamoto, Yuyako
dc.contributor.authorNam, Juhanko
dc.contributor.authorTerasawa, Hirokoko
dc.date.accessioned2024-01-02T08:01:50Z-
dc.date.available2024-01-02T08:01:50Z-
dc.date.created2023-12-29-
dc.date.issued2023-09-07-
dc.identifier.citation31st European Signal Processing Conference, EUSIPCO 2023, pp.406 - 410-
dc.identifier.urihttp://hdl.handle.net/10203/317212-
dc.description.abstractProfessional vocalists modulate their voice timbre or pitch to make their vocal performance more expressive. Such fluctuations are called singing techniques. Automatic detection of singing techniques from audio tracks can be beneficial to understand how each singer expresses the performance, yet it can also be difficult due to the wide variety of the singing techniques. A deep neural network (DNN) model can handle such variety; however, there might be a possibility that considering the characteristics of the data improves the performance of singing technique detection. In this paper, we propose PrimaDNN, a CRNN model with a characteristics-oriented improvement. The features of the model are: 1) input feature representation based on auxiliary pitch information and multi-resolution mel spectrograms, 2) Convolution module based on the Squeeze-and-excitation (SENet) and the Instance normalization. In the results of J-POP singing technique detection, PrimaDNN achieved the best results of 44.9% at the overall macro-F measure, compared to conventional works. We also found that the contribution of each component varies depending on the type of singing technique.-
dc.languageEnglish-
dc.publisherEuropean Signal Processing Conference, EUSIPCO-
dc.titlePrimaDNN':A Characteristics-aware DNN Customization for Singing Technique Detection-
dc.typeConference-
dc.identifier.scopusid2-s2.0-85178324668-
dc.type.rimsCONF-
dc.citation.beginningpage406-
dc.citation.endingpage410-
dc.citation.publicationname31st European Signal Processing Conference, EUSIPCO 2023-
dc.identifier.conferencecountryFI-
dc.identifier.conferencelocationHelsinki-
dc.identifier.doi10.23919/EUSIPCO58844.2023.10290004-
dc.contributor.localauthorNam, Juhan-
dc.contributor.nonIdAuthorYamamoto, Yuya-
dc.contributor.nonIdAuthorTerasawa, Hiroko-
Appears in Collection
GCT-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0