DC Field | Value | Language |
---|---|---|
dc.contributor.author | Shin, Jisu | ko |
dc.contributor.author | Song, Hoyun | ko |
dc.contributor.author | Lee, Huije | ko |
dc.contributor.author | Gaim, Fitsum | ko |
dc.contributor.author | Park, Jong-Cheol | ko |
dc.date.accessioned | 2023-11-14T03:01:32Z | - |
dc.date.available | 2023-11-14T03:01:32Z | - |
dc.date.created | 2023-11-13 | - |
dc.date.issued | 2023-11-02 | - |
dc.identifier.citation | The 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics | - |
dc.identifier.uri | http://hdl.handle.net/10203/314595 | - |
dc.description.abstract | The research for detecting offensive language on online platforms has much advanced. However, the majority of these studies have primarily focused on English. Given the unique characteristics of offensive language, where social and cultural contexts significantly influence content understanding, language-specific datasets are essential. Acquiring comprehensive datasets in Korean, a less-resourced language, has mostly relied on human annotations, suffering from inherent limitations in terms of labor intensity and potential annotator bias. Automatic generation of datasets using generative methods offers an alternative approach to address these limitations, yet faces challenges in capturing linguistic and cultural diversities while maintaining native-level fluency. To address these challenges, we introduce a prompt design methodology, Korean Offensive language Machine Generation (K-OMG), using large language models. By manipulating three prompt factors, we find an effective prompt design to generate culturally aligned offensive language with fluent expressions. Experimental results demonstrate the high quality and utility of our automatically generated dataset. Our detailed analysis shows that the proposed approach achieves exceptional fluency in generating texts while effectively incorporating social and cultural diversities. | - |
dc.language | English | - |
dc.publisher | Association for Computational Linguistics (ACL) | - |
dc.title | Generation of Korean Offensive Language by Leveraging Large Language Models via Prompt Design | - |
dc.type | Conference | - |
dc.type.rims | CONF | - |
dc.citation.publicationname | The 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics | - |
dc.identifier.conferencecountry | IO | - |
dc.identifier.conferencelocation | Bali | - |
dc.contributor.localauthor | Park, Jong-Cheol | - |
dc.contributor.nonIdAuthor | Gaim, Fitsum | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.