A comparison of performance of rough set theory with machine learning techniques in detecting phishing attack

Singh, Arpit and Misra, Subhas C. (2022) A comparison of performance of rough set theory with machine learning techniques in detecting phishing attack. In: Advances in computing, informatics, networking and cybersecurity: A book honoring Professor Mohammad S. Obaidat’s significant scientific contributions. Lecture Notes in Networks and Systems (289). Springer, Cham, pp. 631-650. ISBN 978-3-030-87049-2

[thumbnail of A comparison of performance of rough set theory with machine learning techniques in detecting phishing attack.pdf] Text
A comparison of performance of rough set theory with machine learning techniques in detecting phishing attack.pdf - Published Version
Restricted to Repository staff only

Download (18MB) | Request a copy

Abstract

Phishing is a deceptive social engineering trick that lures online users to disclose personal and confidential information to the fake websites disguising as legitimate ones in electronic communication. Several machine learning methods have been proposed to detect phishing websites. The major limitation of assumption of a peculiar distribution of the data plagues analysis with machine learning (ML) especially when the number of data points are limited in number. This paper applies classical rough sets analysis (CRSA) for detecting phishing websites from a collection of legitimate and fake websites sourced from an open UCI Repository. The work focuses primarily on comparing the classification performance of CRSA with the traditional ML tools. CRSA does not require any distributional assumption for analysis and does not need any user inputs for the parameters in the analysis unlike the traditional ML tools. The comparison revealed no significant difference in the classification performances of the two sets of algorithms. The precision of CRSA was 95.6% which is comparable and in some cases better than the other ML tools. Similarly, high ROC and f-measure places CRSA at par with the statistical ML tools. The relaxation in considering the statistical properties of data clearly gives CRSA edge over other data analysis tools. The CRSA based techniques provide a robust method of detecting phishing websites. The generalization of the analysis makes CRSA a quick-to-use framework for detection of phishing websites without consideration of statistical properties of the data.

Item Type: Book Section
Keywords: Phishing | Cyber-security | Rough sets | Machine learning | Classification
Subjects: Social Sciences and humanities > Business, Management and Accounting > Management of Technology and Innovation
Physical, Life and Health Sciences > Computer Science
JGU School/Centre: Jindal Global Business School
Depositing User: Mr. Syed Anas
Date Deposited: 27 Mar 2022 04:15
Last Modified: 19 Nov 2022 21:34
Official URL: https://doi.org/10.1007/978-3-030-87049-2_22
URI: https://pure.jgu.edu.in/id/eprint/1860

Downloads

Downloads per month over past year

Actions (login required)

View Item
View Item