Toward chatbot safety: Context-aware offensive language detection in chatbot-human conversation data안전한 챗봇을 위하여: 챗봇-인간 간 대화 데이터에서의 문맥을 고려한 불쾌 언어 탐지

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 8
  • Download : 0
Offensive language exchanged between chatbots and human users can have detrimental consequences on users, chatbot services, and society. Despite notable advancements in autonomous offensive language detection, extending these efforts to chat data remains challenging due to the inherent briefness and contextual complexity of chat messages. This paper introduces a novel offensive language dataset based on real-world chatbot-human conversations, annotated with a focus on context-awareness. Our dataset distinguishes context-dependent offensive language, which is offensive expression under specific context. To address this nuanced task, a neural network model named CALIOPER (Context-Aware modeL for Identifying Offensive language using Pre-trained Encoder and Retrieval) is proposed. CALIOPER leverages a context-aware encoder and attention mechanism to incorporate previous messages and retrieve relevant information from them for detecting offensiveness. Experimental results show that CALIOPER outperforms existing offensive language detection models on multi-turn datasets, particularly for context-dependent offensive language. This work provides a direction in offensive language detection in chat data, contributing to the mitigation of the harmful effects of chatbots and safer chatbot ecosystems.
Advisors
차미영researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2023
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2023.8,[iii, 18 p. :]

Keywords

불쾌 언어 탐지▼a다중 턴 대화▼a인간-챗봇 상호작용▼a대화형 에이전트; Offensive language detection▼amulti-turn dialogue▼ahuman-chatbot interaction▼aconversational agent

URI
http://hdl.handle.net/10203/320733
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045965&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0