Drum sample retrieval from mixed audio via a joint embedding space of mixed and single audio samples

DC Field | Value | Language
dc.contributor.advisor | Nam, Juhan | -
dc.contributor.advisor | 남주한 | -
dc.contributor.author | Kim, Wonil | -
dc.date.accessioned | 2021-05-12T19:36:42Z | -
dc.date.available | 2021-05-12T19:36:42Z | -
dc.date.issued | 2020 | -
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=910800&flag=dissertation | en_US
dc.identifier.uri | http://hdl.handle.net/10203/284008 | -
dc.description | Master's thesis - Korea Advanced Institute of Science and Technology (KAIST): Graduate School of Culture Technology, 2020.2, [iv, 29 p.] | -
dc.description.abstract | As advances in digital audio processing have made music production widely accessible, sample-based music creation has become a mainstream practice. One of the key tasks in the sample-based approach is searching for desired instrument samples in large collections. However, most commercial sample packages describe their samples using metadata, which makes it difficult to imagine the sound intuitively without listening to it. Inspired by music producers who often look for instrument samples using a reference song, we set up a query-by-example scheme that takes mixed audio as a query and retrieves single audio samples. Our method is based on deep metric learning: a triplet neural network is trained so that single audio samples and their mixtures with other instruments are located close together in the embedding space. We also propose a method for generating mixed audio to build the dataset. We compare retrieval performance across learning methods, model configurations, and input types to find the best model for retrieving single audio from mixed audio. The results show that our model achieves promising retrieval performance in the query-by-example task. We also verify the behavior of the neural network by visualizing both single and mixed audio samples in the embedding space. | -
dc.language | eng | -
dc.publisher | Korea Advanced Institute of Science and Technology (KAIST) | -
dc.subject | Representation learning; Metric learning; Music information retrieval; Data generation; Convolutional neural networks; Query-by-example | -
dc.title | Drum sample retrieval from mixed audio via a joint embedding space of mixed and single audio samples | -
dc.title.alternative | Drum sample retrieval from mixed audio via joint embedding of mixed and single audio samples (혼합 및 단일 오디오 샘플의 조인트 임베딩을 통한 혼합 오디오의 드럼 샘플 검색) | -
dc.type | Thesis(Master) | -
dc.identifier.CNRN | 325007 | -
dc.description.department | Korea Advanced Institute of Science and Technology (KAIST): Graduate School of Culture Technology | -
dc.contributor.alternativeauthor | 김원일 | -
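
The abstract describes training a triplet network so that a mixture and the single sample it contains lie close together in the embedding space, and then retrieving single samples for a mixed-audio query. The following is a minimal sketch of that idea, not the thesis implementation: the Euclidean distance, the margin values, the function names, and the 2-D embeddings are all assumptions chosen for illustration, and a trained network would produce the embeddings in practice.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def triplet_loss(anchor, positive, negative, margin=1.0):
    """Hinge-style triplet loss: max(0, d(a, p) - d(a, n) + margin).

    anchor:   embedding of a mixture (assumed role)
    positive: embedding of the single sample contained in the mixture
    negative: embedding of a single sample not in the mixture
    """
    return max(0.0, euclidean(anchor, positive) - euclidean(anchor, negative) + margin)

def retrieve(query_embedding, sample_embeddings, top_k=1):
    """Return the names of the top_k single samples nearest to the query."""
    ranked = sorted(
        sample_embeddings,
        key=lambda name: euclidean(query_embedding, sample_embeddings[name]),
    )
    return ranked[:top_k]

# Hypothetical embeddings (illustrative values only).
mix_embedding = [0.9, 0.1]  # mixture that contains the "kick" sample
library = {"kick": [1.0, 0.0], "snare": [0.0, 1.0], "hihat": [-1.0, 0.0]}

# The loss is zero once the positive is closer than the negative by the margin.
loss = triplet_loss(mix_embedding, library["kick"], library["snare"], margin=0.5)

# Query-by-example: rank single samples by distance to the mixture embedding.
best = retrieve(mix_embedding, library, top_k=1)
```

In this toy setup the mixture already sits near its source sample, so the triplet contributes no loss and the nearest-neighbor lookup returns that sample first.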
Appears in Collection
GCT-Theses_Master (Master's Theses)
Files in This Item
There are no files associated with this item.