Hate Speech Detection Research in South Asian Languages: a survey of tasks, datasets and methods

Sharma, Deepawali, Nath, Tanusree, Gupta, Vedika and Singh, Vivek Kumar (2025) Hate Speech Detection Research in South Asian Languages: a survey of tasks, datasets and methods. ACM Transactions on Asian and Low-Resource Language Information Processing, 24 (3). pp. 1-44. ISSN 2375-4699

Abstract

Social media has over the years emerged as a powerful platform for communicating and sharing views, thoughts, and opinions. However, at the same time it is being abused by certain individuals to spread hate against individuals, communities, religions, and so on. Such content can lead to serious issues of mental health, online well-being, and social order. Therefore, it is very important to have automated methods and approaches for detecting such content in the large volume of posts on social media. Recently there have been several efforts to develop computational approaches toward this end; however, most of these efforts are directed toward content in the English language. Only recently have studies started focusing on low-resource languages, including those from South Asia. This article attempts to present a detailed and comprehensive survey of hate speech related research in South Asian languages. The various definitions and terms related to hate speech on different social media platforms are discussed first. The different tasks in hate speech research, the available datasets, and the popular computational approaches used for South Asian languages are surveyed in detail. The major patterns identified and their practical implications are presented and discussed, along with the challenges and opportunities for further research in the area.

Item Type: Article
Keywords: Cyberbullying | Hate speech | Offensive speech | Low resource languages | South Asian languages
Subjects: Physical, Life and Health Sciences > Computer Science
Social Sciences and humanities > Social Sciences > Linguistics and Language
Social Sciences and humanities > Social Sciences > Communication and Transportation
JGU School/Centre: Jindal Global Business School
Depositing User: Mr. Gautam Kumar
Date Deposited: 13 May 2025 12:27
Last Modified: 13 May 2025 12:27
Official URL: https://doi.org/10.1145/3711710
URI: https://pure.jgu.edu.in/id/eprint/9496
