Comparison of the efficiency of machine learning algorithms for phishing detection from uniform resource locator

Ahana Nandi Tultul,Romana Afroz,Md Alomgir Hossain
DOI: https://doi.org/10.11591/ijeecs.v28.i3.pp1640-1648
2022-12-01
Indonesian Journal of Electrical Engineering and Computer Science
Abstract:We are using cyberspace for completing our daily life activities because of the growth of Internet. Attackers use some approachs, such as phishing, with the use of false websites to collect personal information of users. Although, software companies launch products to prevent phishing attacks, identifying a webpage as legitimate or phishing, is a very defficult and these products cannot protect from attacks. In this paper, an anti-phishing system has been introduced that can extract feature from website’s URL as instant basis and use four classification algorithms named as K-Nearest neighbor, decision tree, support vector machine, random forest on these features. According to the comparison of the experimental results from these algorithms, random forest algorithm with the selected features gives the highest performance with the 95.67% accuracy rate. Then we have used one deep learning algorithm as enhanced of our experiment named as deep neural decision forests which have given performance with the 92.67% accuracy rate. Then we have created a system which can extract the features from raw URL and pass the features to our deep neural decision forest trained model and can classify the URL as Phishing or legitimate.
What problem does this paper attempt to address?