Phishing Website Detection Using Machine Learning: A Review

Authors

DOI:

https://doi.org/10.31185/wjps.145

Keywords:

Phishing Detection, Machine learning, Phish Tank

Abstract

Phishing, a form of cyber attack in which perpetrators employ fraudulent websites or emails to Deceive individuals into divulging sensitive information such as passwords or financial data, can be mitigated through various machine-learning algorithms for website detection.

These algorithms, including decision trees, support vector machines, and Random Forest, analyze multiple website features, such as URL structure, website content, and the presence of specific keywords or patterns, to ascertain the likelihood of a website being a phishing site.

This comprehensive review elucidates the concept of phishing website detection and the diverse techniques employed while summarizing previous studies, their outcomes, and their contributions. Overall, machine learning algorithms serve as a potent tool in the identification of phishing websites, thereby safeguarding users against falling prey to such malicious attacks.

References

Ashutosh Kumar Singh, A.J., and Keshav Singh, A Survey on Cyber Security Awareness and Perception among University Students in India. Journal of Advances in Mathematics and Computer Science, November 2021.

Kareem, N., Afaster Training Algorithm and Genetic Algorithm to Recognize Some of Arabic Phonemes.

Hashim, A.S., W.A. Awadh, and A.K. Hamoud. Student performance prediction model based on supervised machine learning algorithms. in IOP Conference Series: Materials Science and Engineering. 2020. IOP Publishing.

Chinaza Uchechukwu, H.H., and Jianguo Ding, A Survey of Machine Learning Techniques for Phishing Detection. IEEE Access, August 2020.

Kalaharsha, P. and B.M. Mehtre, Detecting Phishing Sites--An Overview. arXiv preprint arXiv:2103.12739, 2021.

Sabir, B., et al., Reliability and Robustness analysis of Machine Learning based Phishing URL Detectors. IEEE Transactions on Dependable and Secure Computing, 2022.

Almousa, M., et al., Phishing website detection: How effective are deep learning‐based models and hyperparameter optimization? Security and Privacy, 2022. 5(6): p. e256.

Nakano, H., et al., Canary in Twitter Mine: Collecting Phishing Reports from Experts and Non-experts. arXiv preprint arXiv:2303.15847, 2023.

Zhang, Q. Practical Thinking on Neural Network Phishing Website Detection Research Based on Decision Tree and Optimal Feature Selection. in Journal of Physics: Conference Series. 2021. IOP Publishing.

AlEroud, A. and G. Karabatis. Bypassing detection of URL-based phishing attacks using generative adversarial deep neural networks. in Proceedings of the sixth international workshop on security and privacy analytics. 2020.

Mijwil, M., et al., Exploring the Top Five Evolving Threats in Cybersecurity: An In-Depth Overview. Mesopotamian journal of cybersecurity, 2023. 2023: p. 57-63.

Yassine El Hajjaji, A.A.E.K., and Abdellah Ezzati, Phishing Attacks and Countermeasures: A Survey. IEEE Access, 2020.

Brandão, P.R. and G.P. Matos, Machine Learning and APTs.

Do, N.Q., et al., Deep learning for phishing detection: Taxonomy, current challenges and future directions. IEEE Access, 2022.

Alsmadi, M.H.A.a.A.A., Anti-Phishing Techniques: A Review. Journal of Emerging Trends in Computing and Information Sciences, December 2015.

Xu Chen, S.L., Wei Wang, and Xiaodan Zhang, A Real-Time Anti-Phishing Method Based on Online Learning and Semi-Supervised Learning. Journal of Computational Science, October 2021.

Anwekar, S.A. and V. Agrawal, PHISHING WEBSITE DETECTION USING MACHINE LEARNING ALGORITHMS.

Jain, S., Phishing Websites Detection Using Machine Learning. Available at SSRN 4121102.

Lakshmanarao, A., P.S.P. Rao, and M.B. Krishna. Phishing website detection using novel machine learning fusion approach. in 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS). 2021. IEEE.

Tang, L. and Q.H. Mahmoud, A Deep Learning-Based Framework for Phishing Website Detection. IEEE Access, 2021. 10: p. 1509-1521.

Kulkarni, A.D. and L.L. Brown III, Phishing websites detection using machine learning. 2019.

Tyagi, I., et al. A novel machine learning approach to detect phishing websites. in 2018 5th International conference on signal processing and integrated networks (SPIN). 2018. IEEE.

Karabatak, M. and T. Mustafa. Performance comparison of classifiers on reduced phishing website dataset. in 2018 6th International Symposium on Digital Forensic and Security (ISDFS). 2018. IEEE.

Zhang, X., et al. Boosting the phishing detection performance by semantic analysis. in 2017 ieee international conference on big data (big data). 2017. IEEE.

Fadheel, W., M. Abusharkh, and I. Abdel-Qader. On Feature selection for the prediction of phishing websites. in 2017 IEEE 15th Intl Conf on Dependable, Autonomic and Secure Computing, 15th Intl Conf on Pervasive Intelligence and Computing, 3rd Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress (DASC/PiCom/DataCom/CyberSciTech). 2017. IEEE.

Ahmed, A.A. and N.A. Abdullah. Real time detection of phishing websites. in 2016 IEEE 7th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON). 2016. IEEE.

Jain, A.K. and B.B. Gupta, A novel approach to protect against phishing attacks at client side using auto-updated white-list. EURASIP Journal on Information Security, 2016. 2016(1): p. 1-11.

Aydin, M. and N. Baykal. Feature extraction and classification phishing websites based on URL. in 2015 IEEE Conference on Communications and Network Security (CNS). 2015. IEEE.

Smadi, S., et al. Detection of phishing emails using data mining algorithms. in 2015 9th International Conference on Software, Knowledge, Information Management and Applications (SKIMA). 2015. IEEE.

Nguyen, L.A.T., et al. A novel approach for phishing detection using URL-based heuristic. in 2014 international conference on computing, management and telecommunications (ComManTel). 2014. IEEE.

Chu, W., et al. Protect sensitive sites from phishing attacks using features extractable from inaccessible phishing URLs. in 2013 IEEE international conference on communications (ICC). 2013. IEEE.

Downloads

Published

2023-06-29

Issue

Section

Computer

How to Cite

al saedi, marwa, & Abbas Flayh , N. (2023). Phishing Website Detection Using Machine Learning: A Review. Wasit Journal for Pure Sciences , 2(2), 270-281. https://doi.org/10.31185/wjps.145