Generation of Korean Offensive Language by Leveraging Large Language Models via Prompt Design

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 60
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorShin, Jisuko
dc.contributor.authorSong, Hoyunko
dc.contributor.authorLee, Huijeko
dc.contributor.authorGaim, Fitsumko
dc.contributor.authorPark, Jong-Cheolko
dc.date.accessioned2023-11-14T03:01:32Z-
dc.date.available2023-11-14T03:01:32Z-
dc.date.created2023-11-13-
dc.date.issued2023-11-02-
dc.identifier.citationThe 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics-
dc.identifier.urihttp://hdl.handle.net/10203/314595-
dc.description.abstractThe research for detecting offensive language on online platforms has much advanced. However, the majority of these studies have primarily focused on English. Given the unique characteristics of offensive language, where social and cultural contexts significantly influence content understanding, language-specific datasets are essential. Acquiring comprehensive datasets in Korean, a less-resourced language, has mostly relied on human annotations, suffering from inherent limitations in terms of labor intensity and potential annotator bias. Automatic generation of datasets using generative methods offers an alternative approach to address these limitations, yet faces challenges in capturing linguistic and cultural diversities while maintaining native-level fluency. To address these challenges, we introduce a prompt design methodology, Korean Offensive language Machine Generation (K-OMG), using large language models. By manipulating three prompt factors, we find an effective prompt design to generate culturally aligned offensive language with fluent expressions. Experimental results demonstrate the high quality and utility of our automatically generated dataset. Our detailed analysis shows that the proposed approach achieves exceptional fluency in generating texts while effectively incorporating social and cultural diversities.-
dc.languageEnglish-
dc.publisherAssociation for Computational Linguistics (ACL)-
dc.titleGeneration of Korean Offensive Language by Leveraging Large Language Models via Prompt Design-
dc.typeConference-
dc.type.rimsCONF-
dc.citation.publicationnameThe 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics-
dc.identifier.conferencecountryIO-
dc.identifier.conferencelocationBali-
dc.contributor.localauthorPark, Jong-Cheol-
dc.contributor.nonIdAuthorGaim, Fitsum-
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0