Augmentation of sign language poses by including the understanding of the sign language domain by body part수어 도메인의 이해를 포함한 부위별 수어 포즈 데이터 증강

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 93
  • Download : 0
Sign language recognition refers to the process of recognizing sign language expressions, into spoken language expressions. Various studies are being conducted to solve this with deep learning models, but these models face the issue of not performing satisfactorily because of an insufficient amount of data to train the models stably, so typically, augmenting the data is used to solve such problems. In the case of existing sign language data, a method similar to augmenting images has been used. However, these general augmentation methods do not effectively capture the characteristics of sign language, such as the difference in meaning and structure included in each body part and the varying degrees to which they contribute to recognition. This study analyzes the impact of each augmentation method on different body parts, its effect on the structure of sign language, and the relationship with learning performance in perception. Additionally, two hypotheses are proposed for the augmentation methods for each body part. For hands, which are generally more important in sign language representation and contain critical information such as handshapes, methods that can preserve the structural relationship between key-points will be very useful in the recognition process. For the body, which has less representation ability of sign language, augmentation may not have a significant impact. However, since the body contains spatial information including the position of the wrists, which convey the location component, methods that preserve spatial information could be beneficial. Through recognition experiments, we confirmed the similarity between the learning of the sign language recognition model and the hypotheses proposed in this study, and we distinguished appropriate augmentation methods and their characteristics for each body part. Our research, through these findings, seeks to understand the relationship between the characteristics of different body parts and augmentation methods, and offers insights into augmentation techniques.
Advisors
박종철researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2024
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2024.2,[iv, 29 p. :]

Keywords

수어 인식▼a데이터 증강▼a포즈 데이터▼a수어 데이터 증강▼a수어 도메인; Sign language recognition▼aData augmentation▼aPose data▼aSign language data augmentation▼aSign language domain

URI
http://hdl.handle.net/10203/321793
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097318&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0