Sparse feature learning for instrument identification: Effects of sampling and pooling methods

Cited 10 time in webofscience Cited 0 time in scopus
  • Hit : 697
  • Download : 0
Feature learning for music applications has recently received considerable attention from many researchers. This paper reports on the sparse feature learning algorithm for musical instrument identification, and in particular, focuses on the effects of the frame sampling techniques for dictionary learning and the pooling methods for feature aggregation. To this end, two frame sampling techniques are examined that are fixed and proportional random sampling. Furthermore, the effect of using onset frame was analyzed for both of proposed sampling methods. Regarding summarization of the feature activation, a standard deviation pooling method is used and compared with the commonly used max- and average-pooling techniques. Using more than 47 000 recordings of 24 instruments from various performers, playing styles, and dynamics, a number of tuning parameters are experimented including the analysis frame size, the dictionary size, and the type of frequency scaling as well as the different sampling and pooling methods. The results show that the combination of proportional sampling and standard deviation pooling achieve the best overall performance of 95.62% while the optimal parameter set varies among the instrument classes. (c) 2016 Acoustical Society of America.
Publisher
ACOUSTICAL SOC AMER AMER INST PHYSICS
Issue Date
2016-05
Language
English
Article Type
Article
Citation

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, v.139, no.5, pp.2290 - 2298

ISSN
0001-4966
DOI
10.1121/1.4946988
URI
http://hdl.handle.net/10203/212157
Appears in Collection
GCT-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 10 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0