Students and teaching assistants (TAs) in programming courses spend a large amount of time asking and answering questions that require an understanding of code. Models that understand code can reduce the time and effort needed to answer these questions by identifying relevant code snippets. I introduce CodeQA, a dataset for machine-in-the-loop question answering (QA) in the programming education domain. CodeQA's tasks, question type classification and code line selection, are designed to aid code-based question answering by producing outputs useful for answering a question, and to challenge models to understand both the question and the code, a fundamentally different kind of text from the documents found in other QA datasets. CodeQA contains 9,237 question-answer-code triples gathered from the chat logs of an introductory programming course, provided both in the original language (mostly Korean, with the rest in English) and with English translations of the Korean texts. I provide a detailed analysis of the CodeQA dataset and illustrate the behavior of baseline models through qualitative studies. The relatively low scores of the baseline models on CodeQA's tasks suggest that these tasks are challenging even for models that perform well on natural language QA.