DSpace at KOASAS: Preventing attention leakage in slot attention for object-centric learning

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Theses_Master(석사논문)

Preventing attention leakage in slot attention for object-centric learning객체중심학습을 위한 슬롯 어텐션에서의 어텐션 누출 방지 기법

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 7
Download : 0

Export

Choi, Janghyuk / 최장혁

Object-centric learning (OCL) aims for a compositional understanding of scenes, like humans recognize the visual world at the object level, by representing a scene as a set of object-centric representations. Although OCL has been successfully applied to the multi-view image and video datasets by leveraging geometric or temporal information to adopt various data-driven inductive biases, it faces challenges when applied to single-view images. This is due to the reduced availability of information regard- ing scene decomposition, resulting in the attention leakage problem where object-centric representation gives attention to not only individual objects but also the background around the objects. The attention leakage problem incurs deficient background separation, thereby object-centric representation can be constructed with the background noise. To address this challenge, we introduce SLot Attention via SHepherding (SLASH), a novel OCL framework for single-view images. SLASH integrates two simple- yet-effective modules into Slot Attention: the Attention Refining Kernel (ARK) and the Intermediate Point Predictor and Encoder (IPPE). These modules, respectively, prevent slots from being distracted by background noise and provide focus points to guide the learning of object-centric representations. We further propose a weak semi-supervision approach that leverages point-level annotation for OCL. Even though it is trained with annotation, our framework can be used without any assistant annotation during the inference phase. Experimental results demonstrate that our proposed method enhances the consistency in learning object-centric representations and delivers robust performance across four different datasets.

Advisors: 최호진 researcher

Description: 한국과학기술원 :전산학부,

Publisher: 한국과학기술원

Issue Date: 2023

Identifier: 325007

Language: eng

Description: 학위논문(석사) - 한국과학기술원 : 전산학부, 2023.8,[v, 42 p. :]

Keywords: 약한 반지도학습; 객체 중심 학습▼a객체 중심 표현▼a슬롯 어텐션▼a어텐션 누출 문제; Object-centric learning▼aobject-centric representation▼aslot attention▼aattention leakage problem▼aweak semi-supervision

URI: http://hdl.handle.net/10203/320732

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045964&flag=dissertation

Appears in Collection: CS-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Preventing attention leakage in slot attention for object-centric learning객체중심학습을 위한 슬롯 어텐션에서의 어텐션 누출 방지 기법

KOASAS

Communities & Collections