DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Nam, Juhan | - |
dc.contributor.advisor | 남주한 | - |
dc.contributor.author | Kim, Wonil | - |
dc.date.accessioned | 2021-05-12T19:36:42Z | - |
dc.date.available | 2021-05-12T19:36:42Z | - |
dc.date.issued | 2020 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=910800&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/284008 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 문화기술대학원, 2020.2,[iv, 29 p. :] | - |
dc.description.abstract | As the development of digital audio processing has popularized the technology of making music easily, sample-based music creation has become a mainstream practice. One of the key tasks in the sample-based approach is to search desired instrument samples in the large collections. However, most commercial sample packages described the samples using metadata, making it difficult to intuitively imagine the sound without listening to it. Inspired by music producers who often find instrument samples with a reference song, we set up a query-by-example scheme that takes mixed audio as a query and retrieves single audio samples. Our method is based on deep metric learning where a triplet neural network is trained to have single audio samples and their mixtures with other instruments closely located in the embedding space. We also suggest a method to generate mixed audio to build the dataset. As a result, we observe the performance difference according to the learning method, the model configuration, and the learning input types to find the best model for retrieving single audio in mixed audio. The results show that our model achieves promising retrieval performance in the query-by-example task. We also ensure the operation of the neural network by visualizing both single and mixed audio samples in the embedding space. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Representation learning▼aMetric learning▼aMusic information retrieval▼aData generation▼aConvolutional neural networks▼aquery-by-example | - |
dc.subject | 표현 학습▼a메트릭 학습▼a음악 정보 검색▼a데이터 생성▼a회선 신경망▼a예시 질의 | - |
dc.title | Drum sample retrieval from mixed audio via a joint embedding space of mixed and single audio samples | - |
dc.title.alternative | 혼합 및 단일 오디오 샘플의 조인트 임베딩을 통한 혼합 오디오의 드럼 샘플 검색 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 :문화기술대학원, | - |
dc.contributor.alternativeauthor | 김원일 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.