StyleDrop: Text-to-Image Generation in Any Style

DC Field | Value | Language
dc.contributor.author | Sohn, Kihyuk | ko
dc.contributor.author | Ruiz, Nataniel | ko
dc.contributor.author | Lee, Kimin | ko
dc.contributor.author | Chin, Daniel Castro | ko
dc.contributor.author | Blok, Irina | ko
dc.contributor.author | Chang, Huiwen | ko
dc.contributor.author | Barber, Jarred | ko
dc.contributor.author | Jiang, Lu | ko
dc.contributor.author | Entis, Glenn | ko
dc.contributor.author | Li, Yuanzhen | ko
dc.contributor.author | Hao, Yuan | ko
dc.contributor.author | Essa, Irfan | ko
dc.contributor.author | Rubinstein, Michael | ko
dc.contributor.author | Krishnan, Dilip | ko
dc.date.accessioned | 2023-12-08T01:03:47Z | -
dc.date.available | 2023-12-08T01:03:47Z | -
dc.date.created | 2023-12-07 | -
dc.date.issued | 2023-12-13 | -
dc.identifier.citation | 37th Conference on Neural Information Processing Systems (NeurIPS) | -
dc.identifier.uri | http://hdl.handle.net/10203/316030 | -
dc.description.abstract | Pre-trained large text-to-image models synthesize impressive images with an appropriate use of text prompts. However, ambiguities inherent in natural language and out-of-distribution effects make it hard to synthesize image styles that leverage a specific design pattern, texture or material. In this paper, we introduce StyleDrop, a method that enables the synthesis of images that faithfully follow a specific style using a text-to-image model. The proposed method is extremely versatile and captures nuances and details of a user-provided style, such as color schemes, shading, design patterns, and local and global effects. It efficiently learns a new style by fine-tuning very few trainable parameters (less than 1% of total model parameters) and improving the quality via iterative training with either human or automated feedback. Better yet, StyleDrop is able to deliver impressive results even when the user supplies only a single image that specifies the desired style. An extensive study shows that, for the task of style tuning text-to-image models, StyleDrop implemented on Muse convincingly outperforms other methods, including DreamBooth and textual inversion on Imagen or Stable Diffusion. | -
dc.language | English | -
dc.publisher | Neural Information Processing Systems Foundation | -
dc.title | StyleDrop: Text-to-Image Generation in Any Style | -
dc.type | Conference | -
dc.type.rims | CONF | -
dc.citation.publicationname | 37th Conference on Neural Information Processing Systems (NeurIPS) | -
dc.identifier.conferencecountry | US | -
dc.identifier.conferencelocation | New Orleans Ernest N. Morial Convention Center | -
dc.contributor.localauthor | Lee, Kimin | -
dc.contributor.nonIdAuthor | Sohn, Kihyuk | -
dc.contributor.nonIdAuthor | Ruiz, Nataniel | -
dc.contributor.nonIdAuthor | Chin, Daniel Castro | -
dc.contributor.nonIdAuthor | Blok, Irina | -
dc.contributor.nonIdAuthor | Chang, Huiwen | -
dc.contributor.nonIdAuthor | Barber, Jarred | -
dc.contributor.nonIdAuthor | Jiang, Lu | -
dc.contributor.nonIdAuthor | Entis, Glenn | -
dc.contributor.nonIdAuthor | Li, Yuanzhen | -
dc.contributor.nonIdAuthor | Hao, Yuan | -
dc.contributor.nonIdAuthor | Essa, Irfan | -
dc.contributor.nonIdAuthor | Rubinstein, Michael | -
dc.contributor.nonIdAuthor | Krishnan, Dilip | -
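
The abstract above notes that StyleDrop learns a new style by fine-tuning fewer than 1% of the model's parameters. The sketch below illustrates that parameter-efficient idea in generic PyTorch: a frozen transformer backbone plus a small trainable adapter. The AdapterLayer class, the toy backbone, and the placeholder objective are illustrative assumptions, not the Muse architecture or the authors' implementation.

```python
# Minimal sketch (not the authors' code) of adapter-style fine-tuning:
# freeze a pre-trained backbone and train only a small bottleneck adapter,
# so well under 1% of the parameters are updated.
import torch
import torch.nn as nn

class AdapterLayer(nn.Module):
    """Bottleneck adapter: down-project, non-linearity, up-project, residual add."""
    def __init__(self, hidden_dim: int, bottleneck_dim: int = 8):
        super().__init__()
        self.down = nn.Linear(hidden_dim, bottleneck_dim)
        self.up = nn.Linear(bottleneck_dim, hidden_dim)
        nn.init.zeros_(self.up.weight)   # start as an identity mapping
        nn.init.zeros_(self.up.bias)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.up(torch.relu(self.down(x)))

# Toy frozen backbone standing in for a pre-trained text-to-image transformer.
hidden_dim = 512
backbone = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=hidden_dim, nhead=8, batch_first=True),
    num_layers=6,
)
for p in backbone.parameters():
    p.requires_grad = False              # keep the pre-trained weights fixed

adapter = AdapterLayer(hidden_dim)       # the only trainable module

trainable = sum(p.numel() for p in adapter.parameters())
total = trainable + sum(p.numel() for p in backbone.parameters())
print(f"trainable fraction: {trainable / total:.4%}")  # well below 1%

# Training would update only the adapter on images of the target style, e.g.:
optimizer = torch.optim.Adam(adapter.parameters(), lr=1e-4)
tokens = torch.randn(2, 16, hidden_dim)            # placeholder token embeddings
loss = adapter(backbone(tokens)).pow(2).mean()     # placeholder objective
loss.backward()
optimizer.step()
```

With these toy sizes the trainable fraction is far below 1%; the actual adapter placement, size, and training objective used by StyleDrop are described in the paper, not here.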
Appears in Collection: AI-Conference Papers (Conference Papers)
Files in This Item: There are no files associated with this item.
