ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure

DC Field: Value (Language)
dc.contributor.author: Yoon, Hee Suk (ko)
dc.contributor.author: Tee, Joshua Tian Jin (ko)
dc.contributor.author: Yoon, Eunseop (ko)
dc.contributor.author: Yoon, Sunjae (ko)
dc.contributor.author: Kim, Gwangsu (ko)
dc.contributor.author: Li, Yingzhen (ko)
dc.contributor.author: Yoo, Chang-Dong (ko)
dc.date.accessioned: 2023-06-13T01:00:52Z
dc.date.available: 2023-06-13T01:00:52Z
dc.date.created: 2023-06-13
dc.date.issued: 2023-05-02
dc.identifier.citation: International Conference on Learning Representations (ICLR) 2023
dc.identifier.uri: http://hdl.handle.net/10203/307224
dc.description.abstract: Studies have shown that modern neural networks tend to be poorly calibrated due to over-confident predictions. Traditionally, post-processing methods have been used to calibrate the model after training. In recent years, various trainable calibration measures have been proposed that can be incorporated directly into the training process. However, these methods all involve internal hyperparameters, and the performance of these calibration objectives relies on tuning them, which becomes increasingly costly as neural networks and datasets grow larger. We therefore present Expected Squared Difference (ESD), a tuning-free (i.e., hyperparameter-free) trainable calibration objective that views the calibration error as the squared difference between two expectations. Through extensive experiments on several architectures (CNNs, Transformers) and datasets, we demonstrate that (1) incorporating ESD into training improves model calibration across various batch-size settings without any internal hyperparameter tuning, (2) ESD yields the best-calibrated results compared with previous approaches, and (3) ESD drastically reduces the computational cost of calibration during training owing to the absence of internal hyperparameters. The code is publicly accessible at https://github.com/hee-suk-yoon/ESD. (An illustrative sketch of the squared-difference idea appears after the metadata listing below.)
dc.language: English
dc.publisher: International Conference on Learning Representations
dc.title: ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure
dc.type: Conference
dc.type.rims: CONF
dc.citation.publicationname: International Conference on Learning Representations (ICLR) 2023
dc.identifier.conferencecountry: RW
dc.identifier.conferencelocation: Kigali
dc.contributor.localauthor: Yoo, Chang-Dong
dc.contributor.nonIdAuthor: Tee, Joshua Tian Jin
dc.contributor.nonIdAuthor: Yoon, Eunseop
dc.contributor.nonIdAuthor: Li, Yingzhen
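The abstract describes ESD only at a high level: calibration error is framed as the squared difference between two expectations, optimized alongside the task loss with no internal hyperparameters. The sketch below is a rough, hypothetical PyTorch illustration of that squared-difference idea, not the authors' exact ESD estimator (the real objective is in the linked repository); the function name squared_difference_penalty and the toy batch are invented here for illustration.

    import torch
    import torch.nn.functional as F

    def squared_difference_penalty(logits: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
        """Squared gap between the batch-mean top-1 confidence and the
        batch-mean top-1 accuracy. Illustrative stand-in only; this is
        NOT the paper's exact ESD estimator."""
        probs = F.softmax(logits, dim=-1)
        conf, preds = probs.max(dim=-1)      # per-sample top-1 confidence and predicted class
        correct = (preds == labels).float()  # 0/1 correctness indicator (constant w.r.t. logits)
        return (conf.mean() - correct.mean()) ** 2

    # Toy usage: cross-entropy task loss plus the calibration penalty.
    logits = torch.randn(32, 10, requires_grad=True)  # batch of 32 samples, 10 classes
    labels = torch.randint(0, 10, (32,))
    loss = F.cross_entropy(logits, labels) + squared_difference_penalty(logits, labels)
    loss.backward()  # gradients flow through the confidence term

Here the penalty is simply added to the task loss for illustration; how the paper combines and weights its estimator against the task loss is left to the paper and repository.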
Appears in Collection
EE - Conference Papers (학술회의논문)
Files in This Item
There are no files associated with this item.
