DSpace at KOASAS: Deep Video Inpainting Guided by Audio-Visual Self-Supervision

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Conference Papers(학술회의논문)

Deep Video Inpainting Guided by Audio-Visual Self-Supervision

Cited 0 time in webofscience

Cited 0 time in

Hit : 97
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Kim, Kyuyeon	ko
dc.contributor.author	Jung, Junsik	ko
dc.contributor.author	Kim, Woojae	ko
dc.contributor.author	Yoon, Sung-Eui	ko
dc.date.accessioned	2022-08-24T08:00:18Z	-
dc.date.available	2022-08-24T08:00:18Z	-
dc.date.created	2022-06-09	-
dc.date.created	2022-06-09	-
dc.date.created	2022-06-09	-
dc.date.issued	2022-05-09	-
dc.identifier.citation	47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, pp.1970 - 1974	-
dc.identifier.issn	1520-6149	-
dc.identifier.uri	http://hdl.handle.net/10203/298078	-
dc.description.abstract	Humans can easily imagine a scene from auditory information based on their prior knowledge of audio-visual events. In this paper, we mimic this innate human ability in deep learning models to improve the quality of video inpainting. To implement the prior knowledge, we first train the audio-visual network, which learns the correspondence between auditory and visual information. Then, the audiovisual network is employed as a guider that conveys the prior knowledge of audio-visual correspondence to the video inpainting network. This prior knowledge is transferred through our proposed two novel losses: audio-visual attention loss and audio-visual pseudo-class consistency loss. These two losses further improve the performance of the video inpainting by encouraging the inpainting result to have a high correspondence to its synchronized audio. Experimental results demonstrate that our proposed method can restore a wider domain of video scenes and is particularly effective when the sounding object in the scene is partially blinded.	-
dc.language	English	-
dc.publisher	IEEE Signal Processing Society	-
dc.title	Deep Video Inpainting Guided by Audio-Visual Self-Supervision	-
dc.type	Conference	-
dc.identifier.wosid	000864187902049	-
dc.identifier.scopusid	2-s2.0-85131260855	-
dc.type.rims	CONF	-
dc.citation.beginningpage	1970	-
dc.citation.endingpage	1974	-
dc.citation.publicationname	47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022	-
dc.identifier.conferencecountry	SI	-
dc.identifier.conferencelocation	Virtual	-
dc.identifier.doi	10.1109/ICASSP43922.2022.9747073	-
dc.contributor.localauthor	Yoon, Sung-Eui	-

Appears in Collection: CS-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Deep Video Inpainting Guided by Audio-Visual Self-Supervision

KOASAS

Communities & Collections