DSpace at KOASAS: Mitigating dataset bias for robust deep learning: from clean dataset to the practical noisy dataset

DSpace at KOASAS

College of Engineering(공과대학)Kim Jaechul Graduate School of AI(김재철AI대학원)AI-Theses_Ph.D.(박사논문)

Mitigating dataset bias for robust deep learning: from clean dataset to the practical noisy dataset강건한 심층 신경망 학습을 위한 데이터세트 편향 완화 방법

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 8
Download : 0

Export

Ahn, Sumyeong / 안수명

The demand for machine learning models in various industries is growing rapidly due to recent advancements in artificial intelligence research. However, the effectiveness of deep neural network models heavily depends on the quality of the training data they receive. If the target labels are strongly correlated to target features, it is resulted in the model with prejudice. These datasets are known as biased datasets, and this issue is referred to as the dataset bias problem. Therefore, having a well-structured training dataset is crucial for training robust deep neural networks. However, acquiring such datasets in real-world scenarios is challenging and expensive because it often requires significant human effort, such as additional labeling. To address this issue and minimize human involvement, researchers have been studying training methods that can prevent bias in models trained on biased datasets. This thesis involves an extensive investigation that examines biased training datasets and even explores scenarios involving different types of noise. Firstly, we propose an algorithm called PGD, which is a per-sample gradient-based resampling method. It constructs balanced-mini-batches to mitigate bias in biased datasets with accurate labels. Secondly, we describe DENEB, a training method based on entropy that aims to reduce dataset bias with noisy labels. DENEB leverages the entropy of the softmax function to alleviate the impact of noisy labels. Lastly, we introduce ORBIS, a method that boosts the debiasing of existing debiasing techniques by utilizing unlabeled or potentially corrupted labeled open datasets.

Advisors: 윤세영 researcher

Description: 한국과학기술원 :김재철AI대학원,

Publisher: 한국과학기술원

Issue Date: 2023

Identifier: 325007

Language: eng

Description: 학위논문(박사) - 한국과학기술원 : 김재철AI대학원, 2023.8,[vi, 63 p. :]

Keywords: 기계학습▼a인공지능▼a강건학습▼a신뢰가능한 인공지능▼a데이터세트 편향▼a편향완화; Machine learning▼aArtificial intelligence▼aRobust training▼aTrustworthy/reliable AI▼aDataset bias▼aDebias

URI: http://hdl.handle.net/10203/320816

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1046586&flag=dissertation

Appears in Collection: AI-Theses_Ph.D.(박사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Mitigating dataset bias for robust deep learning: from clean dataset to the practical noisy dataset강건한 심층 신경망 학습을 위한 데이터세트 편향 완화 방법

KOASAS

Communities & Collections