Linguistic Methods for the Detection of Implicit Abuse
Linguistic Methods for the Detection of Implicit Abuse
Disciplines
Computer Sciences (15%); Linguistics and Literature (85%)
Keywords
-
Hate Speech,
Linguistic Analysis,
Implicitly Abusive Language,
Offensive Language,
Natural Language Processing
Recent years have seen a massive rise in abusive content on the web. Automatic classification methods are sought to assist operators of online platforms in finding such content. Since much abusive content is expressed in the form of written comments, natural language processing is a key technology in tackling this issue. The effectiveness of state-of-the-art methods for abusive language detection is limited. While explicit abuse, that is, abuse conveyed by unambiguously abusive words, such as swearwords, can now be fairly reliably detected, we currently have no indication that classifiers can also detect implicit forms of abuse. In this project, we want to address the classification of a set of subtypes of implicit abuse to fill this important gap in current research. In order to do so, we will create datasets that suitably represent these forms of abuse and develop classification methods that can also be evaluated on those datasets. Linguistic features will play a key role for classification. They are more important for detecting implicitly abusive language than for detecting explicitly abusive language.
- Universität Wien - 100%
- Benjamin Roth, Universität Wien , national collaboration partner
- Josef Ruppenhofer, FernUniversität Hagen - Germany
Research Output
- 3 Publications
-
2024
Title Oddballs and Misfits: Detecting Implicit Abuse in Which Identity Groups are Depicted as Deviating from the Norm DOI 10.18653/v1/2024.emnlp-main.132 Type Conference Proceeding Abstract Author Ruppenhofer J Pages 2200-2218 -
2025
Title Revisiting Implicitly Abusive Language Detection: Evaluating LLMs in Zero-Shot and Few-Shot Settings Type Conference Proceeding Abstract Author Dagmar Gromann Conference the 31st International Conference on Computational Linguistics (COLING) Pages 3879-3898 Link Publication -
2023
Title Euphemistic Abuse - A New Dataset and Classification Experiments for Implicitly Abusive Language DOI 10.18653/v1/2023.emnlp-main.1012 Type Conference Proceeding Abstract Author Kampfmeier J Pages 16280-16297