SHIELD: System for Harmful Explicit-Content Identification and Evaluation Through LLM-Driven Approach

Dimensions

Kapoor, Dishant, Ahuja, Karan, Kumar, Deepika, Puri, Paanav, Jangirala, Srinivas, Gupta, Vedika and Mandal, Anandadeep (2026) SHIELD: System for Harmful Explicit-Content Identification and Evaluation Through LLM-Driven Approach. IEEE Access, 14. pp. 29493-29522. ISSN 2169-3536

[thumbnail of SHIELD_System_for_Harmful_Explicit-Content_Identification_and_Evaluation_Through_LLM-Driven_Approach.pdf]

Preview

Text
SHIELD_System_for_Harmful_Explicit-Content_Identification_and_Evaluation_Through_LLM-Driven_Approach.pdf - Published Version
Download (4MB) | Preview

Abstract

The surge in access to explicit content across various platforms has sparked major concerns, yet existing content filtering systems find it difficult to analyze different media formats leading to the spread of unchecked dissemination of harmful content. To tackle these shortcomings, the authors proposed SHIELD, which is an optimized end-to-end pipeline to detect & analyze explicit content, using a large-language-model (LLM) driven approach. SHIELD processes multi media inputs by segregating and preprocessing them, followed by converting all formats into text through advanced models, extracting meaningful textual context and subjecting the resulting data to two parallel evaluation mechanisms: an LLM-based classifier for contextual analysis, and a semantic vector-based scoring system for quantitative measurement. Explicitness classifications are output in a JSON format, which allows easy integration into real-world systems. When benchmarked against a manually curated ground truth dataset, the LLM-based system surpasses vector-based approach, with an accuracy of 93.32%, as against 67.81%. The pipeline shows robustness across all media types and file sizes, confirming its viability as a scalable, context-aware solution.

Item Type:	Article
Uncontrolled Keywords:	Accuracy \| content moderation \| Data models \| Deep learning \| Explicit content detection \| Hate speech \| large language model (LLM) \| Large language models \| Media \| multimodal content analysis \| Pipelines \| Random forests \| Semantics \| vector embeddings \| Videos
Subjects:	Social Sciences and humanities > Business, Management and Accounting > Management of Technology and Innovation Physical, Life and Health Sciences > Computer Science
Divisions:	Jindal Global Business School
Depositing User:	Mr. Arjun Dinesh
Date Deposited:	17 Mar 2026 09:03
Last Modified:	17 Mar 2026 09:03
Official URL:	https://ieeexplore.ieee.org/document/11406083
URI:	https://pure.jgu.edu.in/id/eprint/11038

Downloads

Downloads per month over past year

Actions (login required)

: View Item