Conference Articles

Overview of the HASOC Subtracks at FIRE 2023: Hate Speech and Offensive Content Identification in Assamese, Bengali, Bodo, Gujarati and Sinhala

Tharindu Ranasinghe, Aston University
Koyel Ghosh, Central Institute of Technology
Aditya Shankar Pal, Indian Statistical Institute, Kolkata
Apurbalal Senapati, Central Institute of Technology
Alphaeus Eric Dmonte, Central Institute of Technology
Marcos Zampieri, George Mason University
Sandip Modha, LDRP-ITR
Shrey Satapara, Indian Institute of Technology Hyderabad

Document Type

Conference Article

Publication Title

ACM International Conference Proceeding Series

Abstract

The evaluation of content moderation systems requires reliable benchmark data. This task becomes particularly formidable for low-resource languages, where obtaining or curating such data poses significant challenges. Addressing this issue, HASOC 2023 organised various shared tasks focused on identifying offensive content in low-resource languages. This paper reports on tasks for hate speech detection in several Indo-Aryan languages - Assamese, Bengali, Gujarati, and Sinhala as well as a Sino-Tibetan language, Bodo, for which limited linguistic resources currently exist. The shared task involved the compilation of multiple datasets. In total, nearly 200 runs were submitted by more than 30 teams, which are presented and analysed in this report.

First Page

Last Page

DOI

10.1145/3632754.3633278

Publication Date

12-15-2023

Recommended Citation

Ranasinghe, Tharindu; Ghosh, Koyel; Pal, Aditya Shankar; Senapati, Apurbalal; Dmonte, Alphaeus Eric; Zampieri, Marcos; Modha, Sandip; and Satapara, Shrey, "Overview of the HASOC Subtracks at FIRE 2023: Hate Speech and Offensive Content Identification in Assamese, Bengali, Bodo, Gujarati and Sinhala" (2023). Conference Articles. 496.
https://digitalcommons.isical.ac.in/conf-articles/496

This document is currently not available here.

COinS

Conference Articles

Overview of the HASOC Subtracks at FIRE 2023: Hate Speech and Offensive Content Identification in Assamese, Bengali, Bodo, Gujarati and Sinhala

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Browse

Search

Author Corner

Links

Conference Articles

Overview of the HASOC Subtracks at FIRE 2023: Hate Speech and Offensive Content Identification in Assamese, Bengali, Bodo, Gujarati and Sinhala

Authors

Document Type

Publication Title

Abstract

First Page

Last Page

DOI

Publication Date

Recommended Citation

Share

Browse

Search

Author Corner

Links