Identifying the relationship between hypothesis and premise

Jhunthra, Srishti, Garg, Harshit and Gupta, Vedika (2023) Identifying the relationship between hypothesis and premise. In: 5th International Conference, RTIP2R 2022, December 1-2, 2022, Kingsville,Texas, United States of America.

[thumbnail of Identifying the relationship between hypothesis and premise.pdf] Text
Identifying the relationship between hypothesis and premise.pdf - Published Version
Restricted to Repository staff only

Download (1MB) | Request a copy


Natural language processing is one the most interesting study leading to huge research solutions in the modern era. Multilingual toxic comment classification can be served as a huge benefit to the existing social media life where comments, tweets, etc. can be analyzed when a topic is known to the system. This would help in the prevention of false commenting and better interpretation and analysis of the miscommunication that occurred on different social media platforms over a certain issue. Multilingual toxic comment classification refers to the analysis of a hypothetical sentence proposed given a premise. This classification is divided into three categories are the hypothetical sentence proposed can be either an entailment to the premise, neutral, or contradictory to the known premise statement. Natural language inference is considered as one of the most trending problems under the field of natural language processing which helps to determine how two statements given the premise and hypothesis are related to each other. Thus, the paper proposes different models such as CBOW, ESIM, BiLSTM and fine-tuned XML-RoBERTa model to predict the relationship between two statements. The prediction helps in the determination of whether the given hypothesis is in an entailment, neutral, contradictory relation with the given premise. The paper shows a study over various algorithms that can be used to solve the natural language inference problem. After analysis, the paper also proposes a model that obtained an accuracy of 95.35% with a ROC score of 0.9629 for the entailment relationship, 0.97076 for the neutral relationships, and 0.9797 for the contradictory relationships between the sentence pairs.

Item Type: Conference or Workshop Item (Paper)
Keywords: Natural Language Processing | Natural Language Inference | Multilingual Toxic Comment | XLM-RoBERTa | Comments | Prediction
Subjects: Physical, Life and Health Sciences > Computer Science
JGU School/Centre: Jindal Global Business School
Depositing User: Amees Mohammad
Date Deposited: 16 Jan 2023 04:34
Last Modified: 31 Jul 2023 09:53
Official URL:


Downloads per month over past year

Actions (login required)

View Item
View Item