DSpace at KOASAS: Neural audio fingerprinting for broadcast monitoring with source separation


Neural audio fingerprinting for broadcast monitoring with source separation


DC Field	Value	Language
dc.contributor.advisor	남주한	-
dc.contributor.author	Kim, Jongsoo	-
dc.contributor.author	김종수	-
dc.date.accessioned	2024-07-30T19:30:47Z	-
dc.date.available	2024-07-30T19:30:47Z	-
dc.date.issued	2024	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1096186&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/321401	-
dc.description	Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST): Graduate School of Culture Technology, 2024.2, [iv, 35 p.]	-
dc.description.abstract	Audio fingerprinting systems originally relied on frequency-analysis techniques, and deep neural networks have recently improved their performance significantly in noisy environments. However, while these systems work well for identifying music played in specific spaces, they perform worse in broadcast monitoring tasks. A major problem is that both deep neural network-based and frequency analysis-based systems often fail to detect music segments, mistaking them for non-musical content, primarily because speech overpowers the music in broadcast audio. To address this, our study employed a pre-trained source separation model to remove vocals from the query audio before feeding it into the fingerprint extraction model, which enhanced the performance of the broadcast monitoring system. Furthermore, we fine-tuned the source separation model to optimize it for speech removal by customizing the training dataset, replacing the vocal source with a speech source. As a result, speech removal performance improved, boosting the performance of the broadcast monitoring system.	-
dc.language	eng	-
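dc.publisher	Korea Advanced Institute of Science and Technology (KAIST)	-
dc.subject	Audio fingerprint; Deep neural network; Voice removal; Source separation; Fine-tuning (translated from the Korean keywords)	-
dc.subject	Audio fingerprinting; Deep neural network; Speech removal; Source separation; Fine-tuning	-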
dc.title	Neural audio fingerprinting for broadcast monitoring with source separation	-
dc.title.alternative	Neural network-based audio fingerprinting technique for broadcast monitoring with source separation	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	325007	-
dc.description.department	Korea Advanced Institute of Science and Technology (KAIST): Graduate School of Culture Technology	-
dc.contributor.alternativeauthor	Nam, Juhan	-
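
The abstract describes customizing the fine-tuning dataset by replacing the vocal source with a speech source. The record gives no implementation details, so the following is only a minimal sketch of that idea, not the thesis's method. The stem names ("vocals", "drums", "other"), the `speech_gain` parameter, and both function names are assumptions introduced for illustration. Signals are plain lists of samples to keep the example dependency-free.

```python
from typing import Dict, List, Tuple

Signal = List[float]


def make_speech_removal_example(
    stems: Dict[str, Signal],
    speech: Signal,
    speech_gain: float = 1.0,   # hypothetical knob: how loud speech is vs. music
    vocal_key: str = "vocals",  # assumed name of the vocal stem
) -> Tuple[Signal, Dict[str, Signal]]:
    """Swap the vocal stem for a speech clip and rebuild the mixture."""
    if vocal_key not in stems:
        raise KeyError(f"stem {vocal_key!r} not found")
    # Trim everything to the shortest signal so the stems stay aligned.
    n = min(len(speech), *(len(sig) for sig in stems.values()))
    new_stems = {
        name: list(sig[:n]) for name, sig in stems.items() if name != vocal_key
    }
    # The speech clip takes the place of the original vocal stem.
    new_stems[vocal_key] = [speech_gain * x for x in speech[:n]]
    # The training input is the sample-wise sum of all stems.
    mixture = [sum(sig[i] for sig in new_stems.values()) for i in range(n)]
    return mixture, new_stems


def music_only_target(stems: Dict[str, Signal], vocal_key: str = "vocals") -> Signal:
    """Sum every stem except the speech-bearing one (the separation target)."""
    n = min(len(sig) for sig in stems.values())
    return [
        sum(sig[i] for name, sig in stems.items() if name != vocal_key)
        for i in range(n)
    ]
```

In this sketch, a separation model fine-tuned on `mixture` with `music_only_target(new_stems)` as the target would learn to suppress speech rather than singing, and its output would then be passed to the fingerprint extraction model at query time.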

Appears in Collection: GCT-Theses_Master (Master's Theses)

Files in This Item: There are no files associated with this item.


Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
