DSpace at KOASAS: Provable Benefit of Mixup for Finding Optimal Decision Boundaries

Authors: Oh, Junsoo; Yun, Chulhee

We investigate how pair-wise data augmentation techniques like Mixup affect the sample complexity of finding optimal decision boundaries in a binary linear classification problem. For a family of data distributions with a separability constant 𝜅, we analyze how well the optimal classifier in terms of training loss aligns with the optimal one in test accuracy (i.e., Bayes optimal classifier). For vanilla training without augmentation, we uncover an interesting phenomenon named the curse of separability. As we increase 𝜅 to make the data distribution more separable, the sample complexity of vanilla training increases exponentially in 𝜅; perhaps surprisingly, the task of finding optimal decision boundaries becomes harder for more separable distributions. For Mixup training, we show that Mixup mitigates this problem by significantly reducing the sample complexity. To this end, we develop new concentration results applicable to 𝑛^2 pair-wise augmented data points constructed from 𝑛 independent data, by carefully dealing with dependencies between overlapping pairs. Lastly, we study other masking-based Mixup-style techniques and show that they can distort the training loss and make its minimizer converge to a suboptimal classifier in terms of test accuracy.
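To make the pair-wise construction concrete, the following is a minimal sketch of how 𝑛 samples yield 𝑛^2 Mixup points, using the standard Mixup rule of taking a convex combination of two inputs and of their labels. It is an illustration only, not the authors' code: the function name `mixup_pairs`, the fixed mixing weight `lam`, and the toy data are assumptions made here (in practice the weight is usually sampled at random rather than fixed).

```python
import numpy as np

def mixup_pairs(X, y, lam):
    """Form all n^2 Mixup points from n samples (hypothetical helper).

    X: (n, d) features; y: (n,) labels in {0, 1}; lam: weight in [0, 1].
    Assumption: one fixed lam for every pair, chosen for simplicity.
    Returns (n*n, d) mixed features and (n*n,) soft labels.
    """
    n, d = X.shape
    # Broadcasting builds every ordered pair (i, j), including i == j.
    X_mix = lam * X[:, None, :] + (1.0 - lam) * X[None, :, :]
    y_mix = lam * y[:, None] + (1.0 - lam) * y[None, :]
    # Row i * n + j holds the mix of sample i with sample j.
    return X_mix.reshape(n * n, d), y_mix.reshape(n * n)

# Toy data: the label is the sign of the first coordinate (an assumption).
rng = np.random.default_rng(0)
n, d = 5, 2
X = rng.normal(size=(n, d))
y = (X[:, 0] > 0).astype(float)

X_mix, y_mix = mixup_pairs(X, y, lam=0.7)
print(X_mix.shape, y_mix.shape)  # (25, 2) (25,)
```

Any two rows that share an index, such as pairs (1, 3) and (1, 4), reuse the same original sample, so the 𝑛^2 mixed points are not independent. This is the overlap between pairs that the abstract's concentration results have to handle.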

Publisher: International Conference on Machine Learning

Issue Date: 2023-07-26

Language: English

Citation: 40th International Conference on Machine Learning, ICML 2023, pp.26403 - 26450

ISSN: 2640-3498

URI: http://hdl.handle.net/10203/316022

Appears in Collection: AI-Conference Papers(학술대회논문)
