Representation learning of music using artist labels아티스트 레이블을 이용한 음악의 표현 학습

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 536
  • Download : 0
While music is becoming easily accessible due to the increasing number of online music services, it is getting more difficult to find the songs that users like. Therefore, it is important to grasp information about a large number of released songs and build retrieval and recommendation systems based on the information. The most popular method is to use text-based meta data or user data. However, this approach has a problem of rarely being searched for new or unknown songs, and it is difficult and time consuming to construct the data. On the other hand, content-based methods that extract features from music data directly and use the features to train the system become important because it can somewhat solve these problems. Recently, representation learning or feature learning has drawn great attention in various types of machine learning tasks. In music domain, feature learning is either unsupervised or supervised by semantic labels such as music genre. However, finding discriminative features in an unsupervised way is challenging, and supervised feature learning using semantic labels may involve noisy or expensive annotation. In this thesis, we present a feature learning approach that utilizes artist labels attached in every single music track as objective meta data. We train a deep convolutional neural network to classify audio tracks into a large number of artists. We regard the trained model as a general feature extractor and apply it to other tasks such as artist recognition, genre classification and music auto-tagging in transfer learning settings. The results show that our approach outperforms or is comparable to previous state-of-the-art methods, indicating that the proposed approach effectively captures general music audio features. Finally, we utilize the proposed approach for a music retrieval system.
Advisors
Nam, Juhanresearcher남주한researcher
Description
한국과학기술원 :문화기술대학원,
Publisher
한국과학기술원
Issue Date
2018
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 문화기술대학원, 2018.2,[iv, 34 p. :]

Keywords

Representation learning▼aartist recognition▼atransfer learning▼agenre classification▼amusic auto-tagging▼amusic information retrieval; 표현 학습▼a아티스트 인식▼a전이 학습▼a장르 분류▼a음악 오토태깅▼a음악 정보 검색

URI
http://hdl.handle.net/10203/266014
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=733783&flag=dissertation
Appears in Collection
GCT-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0