Enhancing lexical representation of test coverage for failure clustering다중 실행 오류 클러스터링을 위한 테스트 커버리지의 어휘적 표현 향상

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 52
  • Download : 0
Failure clustering aims to group multiple test failures based on shared root causes, helping developers to comprehend and debug each root cause (i.e., the underlying fault) in isolation. Clustering of failing test executions requires distances between those executions, for which distance measures between coverage vectors are widely used. Lexical representation of coverage has been suggested as an alternative, representing each structural element covered by an execution with the lexical tokens in the element. This paper investigates whether the granularity of the lexical representation affects the effectiveness of the failure clustering. We evaluate varying levels of tokenisation granularity by using them for clustering coexisting real-world test failures in Defects4J benchmark. Our results show that the traditionally adopted subtokenisation can actually deconstruct larger meaningful semantic token units, resulting in suboptimal clustering. We further suggest a novel tokenisation strategy based on the semantically similar line groups.
Advisors
Yoo, Shinresearcher유신researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2022
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2022.2,[iii, 23 p. :]

URI
http://hdl.handle.net/10203/309553
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=997588&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0