DSpace at KOASAS: POLYPHONIC SOUND EVENT DETECTION USING CONVOLUTIONAL BIDIRECTIONAL LSTM AND SYNTHETIC DATA-BASED TRANSFER LEARNING

DSpace at KOASAS

College of Engineering(공과대학)Dept. of Bio and Brain Engineering(바이오및뇌공학과)BiS-Conference Papers(학술회의논문)

POLYPHONIC SOUND EVENT DETECTION USING CONVOLUTIONAL BIDIRECTIONAL LSTM AND SYNTHETIC DATA-BASED TRANSFER LEARNING

Cited 25 time in

Cited 15 time in

Hit : 264
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Jung, Seokwon	ko
dc.contributor.author	Park, Jungbae	ko
dc.contributor.author	Lee, Sangwan	ko
dc.date.accessioned	2020-06-26T03:21:04Z	-
dc.date.available	2020-06-26T03:21:04Z	-
dc.date.created	2020-06-17	-
dc.date.created	2020-06-17	-
dc.date.issued	2019-05	-
dc.identifier.citation	44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.885 - 889	-
dc.identifier.issn	1520-6149	-
dc.identifier.uri	http://hdl.handle.net/10203/274942	-
dc.description.abstract	This paper presents a novel approach to improve the performance of polyphonic sound event detection that combines a convolutional bidirectional recurrent neural network (CBRNN) with transfer learning. The ordinary convolutional recurrent neural network (CRNN) is known to suffer from a vanishing gradient problem, which significantly reduces the efficiency of information transfer to past events. To resolve this issue, we combine forward and backward long short-term memory (LSTM) modules and demonstrate that they complement each other. To effectively deal with the issue of overfitting that arises from increased model complexity, we apply transfer learning with a dataset that contains synthesized artifacts. We show that the model achieves faster and better performance with less data. Simulations with the 2016 TUT dataset show that the performance of the CBRNN with transfer learning is dramatically improved compared to the ordinary CRNN; the F1 score was 28.4% higher, and the error rate was 0.42 lower.	-
dc.language	English	-
dc.publisher	IEEE	-
dc.title	POLYPHONIC SOUND EVENT DETECTION USING CONVOLUTIONAL BIDIRECTIONAL LSTM AND SYNTHETIC DATA-BASED TRANSFER LEARNING	-
dc.type	Conference	-
dc.identifier.wosid	000482554001023	-
dc.identifier.scopusid	2-s2.0-85068970982	-
dc.type.rims	CONF	-
dc.citation.beginningpage	885	-
dc.citation.endingpage	889	-
dc.citation.publicationname	44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)	-
dc.identifier.conferencecountry	US	-
dc.identifier.conferencelocation	Brighton, ENGLAND	-
dc.identifier.doi	10.1109/ICASSP.2019.8682909	-
dc.contributor.localauthor	Lee, Sangwan	-
dc.contributor.nonIdAuthor	Jung, Seokwon	-
dc.contributor.nonIdAuthor	Park, Jungbae	-

Appears in Collection: BiS-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 25 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

POLYPHONIC SOUND EVENT DETECTION USING CONVOLUTIONAL BIDIRECTIONAL LSTM AND SYNTHETIC DATA-BASED TRANSFER LEARNING

This item is cited by other documents in WoS

KOASAS

Communities & Collections