DynamicLabels: Supporting informed construction of machine learning label sets with crowd feedbackDynamicLabels: 크라우드 피드백을 활용한 머신러닝 레이블 셋 구축 지원 시스템

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 3
  • Download : 0
Label set construction --- deciding on a group of distinct labels --- is an essential stage in building a machine learning (ML) application, as a badly designed label set negatively affects other stages such as training dataset construction, model training, and model deployment. Despite its significance, it is challenging for ML practitioners to come up with a well-defined label set, especially when no external references are available. To mitigate this difficulty, ML practitioners often go through multiple iterations to gradually improve their label set. Through our formative study (n=4), we observed that there still remain challenges in collecting helpful feedback and utilizing them to make optimal refinement decisions. To support the iterative refinement, we present DynamicLabels, a system that aims to support a more informed label set-building process with crowd feedback. Crowd workers provide feedback as annotations and label suggestions to the ML practitioner's label set, and the ML practitioner can review the feedback through multi-aspect analysis and see the potential consequences of label refinements. Through a within-subjects study (n=16) using two datasets, we found that DynamicLabels enables better understanding and exploration of the collected feedback and supports a more structured, confident refinement process. The ML practitioners were also able to see surfacing conflicts and edge cases that could have been ignored. In addition, the crowd feedback helped ML practitioners to gain diverse perspectives, spot current weaknesses, and shop from crowd-generated labels. With DynamicLabels, ML practitioners can successfully gain concrete understanding and evidence from the crowd and make informed refinements to iteratively improve the label set.
Advisors
김주호researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2023
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2023.8,[v, 33 p. :]

Keywords

인간-컴퓨터 상호작용▼a크라우드소싱▼a크라우드 피드백▼a머신러닝▼a머신러닝 실무자 지원▼a레이블 셋 구축; Human-Computer Interaction▼aCrowdsourcing▼aCrowd Feedback▼aMachine Learning▼aMachine Learning Practitioner Support▼aLabel Set Construction

URI
http://hdl.handle.net/10203/320717
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045949&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0