DSpace at KOASAS: Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Conference Papers(학술회의논문)

Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 166
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Kim, Yeongbin	ko
dc.contributor.author	Singh, Gautam	ko
dc.contributor.author	Park, Junyeong	ko
dc.contributor.author	Gulcehre, Caglar	ko
dc.contributor.author	Ahn, Sungjin	ko
dc.date.accessioned	2023-11-30T01:02:29Z	-
dc.date.available	2023-11-30T01:02:29Z	-
dc.date.created	2023-11-09	-
dc.date.issued	2023-12-14	-
dc.identifier.citation	The Thirty-seventh Conference on Neural Information Processing Systems, NeurIPS 2023	-
dc.identifier.uri	http://hdl.handle.net/10203/315450	-
dc.description.abstract	Systematic compositionality, or the ability to adapt to novel situations by creating a mental model of the world using reusable pieces of knowledge, remains a significant challenge in machine learning. While there has been considerable progress in the language domain, efforts towards systematic visual imagination, or envisioning the dynamical implications of a visual observation, are in their infancy. We introduce the Systematic Visual Imagination Benchmark (SVIB), the first benchmark designed to address this problem head-on. SVIB offers a novel framework for a minimal world modeling problem, where models are evaluated based on their ability to generate one-step image-to-image transformations under a latent world dynamics. The framework provides benefits such as the possibility to jointly optimize for systematic perception and imagination, a range of difficulty levels, and the ability to control the fraction of possible factor combinations used during training. We provide a comprehensive evaluation of various baseline models on SVIB, offering insight into the current state-of-the-art in systematic visual imagination. We hope that this benchmark will help advance visual systematic compositionality.	-
dc.language	English	-
dc.publisher	The Conference on Neural Information Processing Systems	-
dc.title	Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models	-
dc.type	Conference	-
dc.type.rims	CONF	-
dc.citation.publicationname	The Thirty-seventh Conference on Neural Information Processing Systems, NeurIPS 2023	-
dc.identifier.conferencecountry	US	-
dc.identifier.conferencelocation	New Orleans Ernest N. Morial Convention Center	-
dc.contributor.localauthor	Ahn, Sungjin	-
dc.contributor.nonIdAuthor	Kim, Yeongbin	-
dc.contributor.nonIdAuthor	Singh, Gautam	-
dc.contributor.nonIdAuthor	Park, Junyeong	-
dc.contributor.nonIdAuthor	Gulcehre, Caglar	-

Appears in Collection: CS-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models

KOASAS

Communities & Collections