Guardians of the Web: Harnessing Machine Learning to Combat Phishing Attacks

Authors

  • Mowafaq Salem Alzboon Jadara University, Faculty of Information Technology. Irbid, Jordan Author https://orcid.org/0000-0002-3522-6689
  • Mohammad Subhi Al-Batah Jadara University, Faculty of Information Technology. Irbid, Jordan Author https://orcid.org/0000-0002-9341-1727
  • Muhyeeddin Alqaraleh Zarqa University, Faculty of Information Technology. Zarqa, Jordan Author https://orcid.org/0009-0001-9103-2002
  • Faisal Alzboon Caucasus International University (CIU), Dental Medicine, Tbilisi, Georgia Author
  • Lujin Alzboon Caucasus International University (CIU), Dental Medicine, Tbilisi, Georgia Author

DOI:

https://doi.org/10.56294/gr202591

Keywords:

Phishing, Website Detection, Machine Learning, Feature Extraction, Cybersecurity

Abstract

Phishing remains one of the most dangerous threats to internet users and organizations today since it utilizes spoofed websites to coax users into revealing their data. This paper focuses on the effectiveness of algorithms in detecting such abusive websites. It goes on to analyze the dataset of phishing and non- phishing URLs providing explanatory attributes such as domain registration date, URL length or the existence of HTTPS. The models studied include Decision Tree, Random Forest, and Support Vector Machines. The results found that the Random Forest algorithm had the best performance of 97% in terms of classification accuracy, and Support Vector Machines performed the best in terms of generalization accuracy with precision and recall values of 0.92 and 0.95, respectively. The study investigates feature selection and determinants of URL structural features which are crucial in determining the efficiency of detection. Also, to enhance model assessment the stratified 10-fold cross-validation technique was performed to reduce bias and variance. These Results show the prospect of One Layer Neural Networks as a tool to improve Phishing Detection Systems and help to provide low-cost and fast solutions for current or future cyberspace struggles. This work aims to increase confidence in online security applications against modern phishing methods.The proposed modifications will help strengthen counter measures against phishing attacks in a shifting technological context while also working towards sustaining the organizations and thus require further inquiry into the facets such as the applicability of sophisticated artificial intelligence techniques the use of useful yet diverse sets of data and the incorporation of explainable intelligent systems

References

1. Al-batah M, Al-Batah M, Salem Alzboon M, Alzaghoul E. Automated Quantification of Vesicoureteral Reflux using Machine Learning with Advancing Diagnostic Precision. Data Metadata [Internet]. 2025 Jan 1;4:460. Available from: https://dm.ageditor.ar/index.php/dm/article/view/460

2. Alqaraleh M, Salem Alzboon M, Mohammad SA-B. Optimizing Resource Discovery in Grid Computing: A Hierarchical and Weighted Approach with Behavioral Modeling. LatIA [Internet]. 2025 Jan 1;3:97. Available from: http://dx.doi.org/10.62486/latia202597

3. Wahed MA, Alqaraleh M, Salem Alzboon M, Subhi Al-Batah M. Evaluating AI and Machine Learning Models in Breast Cancer Detection: A Review of Convolutional Neural Networks (CNN) and Global Research Trends. LatIA [Internet]. 2025 Jan 1;3:117. Available from: http://dx.doi.org/10.62486/latia2025117

4. Alqaraleh M, Salem Alzboon M, Subhi Al-Batah M, Solayman Migdadi H. From Complexity to Clarity: Improving Microarray Classification with Correlation-Based Feature Selection. LatIA [Internet]. 2025 Jan 1;3:84. Available from: http://dx.doi.org/10.62486/latia202584

5. Alqaraleh M, Salem Alzboon M, Subhi Al-Batah M. Real-Time UAV Recognition Through Advanced Machine Learning for Enhanced Military Surveillance. Gamification Augment Real [Internet]. 2025 Jan 1;3:63. Available from: http://dx.doi.org/10.56294/gr202563

6. Wahed MA, Alqaraleh M, Alzboon MS, Al-Batah MS. Application of Artificial Intelligence for Diagnosing Tumors in the Female Reproductive System: A Systematic Review. Multidiscip. 2025;3:54.

7. Wahed MA, Alqaraleh M, Alzboon MS, Subhi Al-Batah M, de la Salud R el C, la de la Inteligencia T. AI Rx: Revolutionizing Healthcare Through Intelligence, Innovation, and Ethics. Semin Med Writ Educ [Internet]. 2025 Jan 1;4(35):35. Available from: http://dx.doi.org/10.56294/mw202535

8. Mowafaq SA, Muhyeeddin A, Al-Batah MS. AI in the Sky: Developing Real-Time UAV Recognition Systems to Enhance Military Security. Data Metadata [Internet]. 2024 Sep 29;3:417. Available from: https://dm.ageditor.ar/index.php/dm/article/view/417

9. Al-Batah MS, Salem Alzboon M, Solayman Migdadi H, Alkhasawneh M, Alqaraleh M. Advanced Landslide Detection Using Machine Learning and Remote Sensing Data. Data Metadata [Internet]. 2024 Oct 7;3. Available from: http://dx.doi.org/10.56294/dm2024.419

10. Islam MS, Jyoti MNJ, Mia MS, Hussain MG. Fake Website Detection Using Machine Learning Algorithms. In: 2023 International Conference on Digital Applications, Transformation and Economy, ICDATE 2023. 2023.

11. Al-Batah MS, Alzboon MS, Alzyoud M, Al-Shanableh N. Enhancing Image Cryptography Performance with Block Left Rotation Operations. Ejbali R, editor. Appl Comput Intell Soft Comput [Internet]. 2024 Jan 23;2024(1):3641927. Available from: https://onlinelibrary.wiley.com/doi/10.1155/2024/3641927

12. Muhyeeddin A, Mowafaq SA, Al-Batah MS, Mutaz AW. Advancing Medical Image Analysis: The Role of Adaptive Optimization Techniques in Enhancing COVID-19 Detection, Lung Infection, and Tumor Segmentation. LatIA [Internet]. 2024 Sep 29;2:74. Available from: http://dx.doi.org/10.62486/latia202474

13. Al-Batah M, Salem Alzboon M, Alqaraleh M, Ahmad Alzaghoul F. Comparative Analysis of Advanced Data Mining Methods for Enhancing Medical Diagnosis and Prognosis. Data Metadata [Internet]. 2024 Oct 29;3(3):83–92. Available from: http://dx.doi.org/10.56294/dm2024.465

14. Al-shanableh N, Alzyoud M, Al-husban RY, Alshanableh NM, Al-Oun A, Al-Batah MS, et al. Advanced Ensemble Machine Learning Techniques for Optimizing Diabetes Mellitus Prognostication: A Detailed Examination of Hospital Data. Data Metadata [Internet]. 2024 Sep 2;3. Available from: http://dx.doi.org/10.56294/dm2024.363

15. Alqaraleh M, Alzboon MS, Al-Batah MS. Skywatch: Advanced Machine Learning Techniques for Distinguishing UAVs from Birds in Airspace Security. Int J Adv Comput Sci Appl [Internet]. 2024;15(11):1065–78. Available from: http://dx.doi.org/10.14569/IJACSA.2024.01511104

16. Mat Rani L, Mohd Foozy CF, Mustafa SNB. Feature Selection to Enhance Phishing Website Detection Based On URL Using Machine Learning Techniques. J Soft Comput Data Min. 2023;4(1):30–41.

17. Alqaraleh M, Alzboon MS, Al-Batah MS, Abdel Wahed M, Abuashour A, Alsmadi FH. Harnessing Machine Learning for Quantifying Vesicoureteral Reflux: A Promising Approach for Objective Assessment. Int J Online Biomed Eng [Internet]. 2024 Aug 8;20(11):123–45. Available from: https://online-journals.org/index.php/i-joe/article/view/49673

18. Abuashour A, Salem Alzboon M, Kamel Alqaraleh M, Abuashour A. Comparative Study of Classification Mechanisms of Machine Learning on Multiple Data Mining Tool Kits. Am J Biomed Sci Res 2024 [Internet]. 2024;22(1):1. Available from: www.biomedgrid.com

19. Alzboon MS, Bader AF, Abuashour A, Alqaraleh MK, Zaqaibeh B, Al-Batah M. The Two Sides of AI in Cybersecurity: Opportunities and Challenges. In: 2023 International Conference on Intelligent Computing and Next Generation Networks(ICNGN) [Internet]. IEEE; 2023. p. 1–9. Available from: https://ieeexplore.ieee.org/document/10396670/

20. Kalla D, Kuraku S. Phishing Website URL’s Detection Using NLP and Machine Learning Techniques. J Artif Intell. 2023;

21. Vyvaswini T, Rao MPPN, Kousalya B, Pallavi G, Abdullal S, Siddartha P. Phishing Website Detection using Machine Learning. Int J Adv Res Sci Commun Technol. 2023;

22. Mathankar S, Sharma S, Wankhede T, Sahu M, Thakur S. Phishing Website Detection using Machine Learning Techniques. 2023 11th Int Conf Emerg Trends Eng Technol - Signal Inf Process (ICETET - SIP). 2023;

23. T LN, R SR, Ida S. Enhancing Cybersecurity: A Multilayered Approach to Phishing Website Detection Using Machine Learning. 2023 Int Conf Res Methodol Knowl Manag Artif Intell Telecommun Eng. 2023;

24. Alzboon MS, Qawasmeh S, Alqaraleh M, Abuashour A, Bader AF, Al-Batah M. Pushing the Envelope: Investigating the Potential and Limitations of ChatGPT and Artificial Intelligence in Advancing Computer Science Research. In: 2023 3rd International Conference on Emerging Smart Technologies and Applications (eSmarTA) [Internet]. IEEE; 2023. p. 1–6. Available from: https://ieeexplore.ieee.org/document/10293294/

25. Alzboon MS, Al-Batah MS. Prostate Cancer Detection and Analysis using Advanced Machine Learning. Int J Adv Comput Sci Appl [Internet]. 2023;14(8):388–96. Available from: http://thesai.org/Publications/ViewPaper?Volume=14&Issue=8&Code=IJACSA&SerialNo=43

26. Alzboon MS, Al-Batah MS, Alqaraleh M, Abuashour A, Bader AFH. Early Diagnosis of Diabetes: A Comparison of Machine Learning Methods. Int J online Biomed Eng. 2023;19(15):144–65.

27. Adake MM, Belekar AM, Ambekar CU, Bhaiyya PDD. Real-Time Phishing Website Detection using Machine Learning and Updating Phishing Probability with User Feedback. Int J Recent Technol Eng. 2023;

28. Desai P, Shah M. Phishing Website Detection using Machine Learning: A Comprehensive Study. Int J Multidiscip Res. 2023;

29. Kumar HVK, K S P. Phishing Website Detection Using Machine Learning. Int J Res Appl Sci Eng Technol. 2023;11(7):1824–6.

30. Shrivastava A, Raturi A, Sharma A, Rao ALN, Singh S, Sankhyan A. Phishing Website Detection Using Machine Learning. 2023 1st Int Conf Circuits, Power, Intell Syst CCPIS 2023. 2023;

31. Anakal S, Maka K, Tadkal A, Humanabad S, Anakal S, Laxmikant E. Phishing Website Detection Using Machine Learning Methods. Int Conf Integr Intell Commun Syst ICIICS 2023. 2023;

32. Alzboon MS, Qawasmeh S, Alqaraleh M, Abuashour A, Bader AF, Al-Batah M. Machine Learning Classification Algorithms for Accurate Breast Cancer Diagnosis. In: 2023 3rd International Conference on Emerging Smart Technologies and Applications (eSmarTA) [Internet]. IEEE; 2023. p. 1–8. Available from: https://ieeexplore.ieee.org/document/10293415/

33. Putri AK, Alzboon MS. Doctor Adam Talib’s Public Relations Strategy in Improving the Quality of Patient Service. Sinergi Int J Commun Sci [Internet]. 2023 May 25;1(1):42–54. Available from: https://journal.sinergi.or.id/index.php/ijcs/article/view/19

34. Al-Batah MS, Alzboon MS, Alazaidah R. Intelligent Heart Disease Prediction System with Applications in Jordanian Hospitals. Int J Adv Comput Sci Appl [Internet]. 2023;14(9):508–17. Available from: http://thesai.org/Publications/ViewPaper?Volume=14&Issue=9&Code=IJACSA&SerialNo=54

35. Alzboon MS, Al-Batah M, Alqaraleh M, Abuashour A, Bader AF. A Comparative Study of Machine Learning Techniques for Early Prediction of Diabetes. In: 2023 IEEE Tenth International Conference on Communications and Networking (ComNet) [Internet]. IEEE; 2023. p. 1–12. Available from: https://ieeexplore.ieee.org/document/10366688/

36. Nikita Pawar, Dr. P. A. Tijare. A Review on Phishing Website Detection Using Machine Learning Approach. Int J Sci Res Comput Sci Eng Inf Technol [Internet]. 2023 Apr 9;267–72. Available from: https://ijsrcseit.com/CSEIT2390227

37. Alzboon MS. Survey on Patient Health Monitoring System Based on Internet of Things. Inf Sci Lett [Internet]. 2022 Jul 1;11(4):1183–90. Available from: https://www.naturalspublishing.com/Article.asp?ArtcID=25233

38. Alzboon M. Semantic Text Analysis on Social Networks and Data Processing: Review and Future Directions. Inf Sci Lett [Internet]. 2022 Sep 1;11(5):1371–84. Available from: https://www.naturalspublishing.com/Article.asp?ArtcID=25306

39. Alzboon MS, Aljarrah E, Alqaraleh M, Alomari SA. Nodexl Tool for Social Network Analysis. Turkish J Comput Math Educ. 2021;12(14):202–16.

40. Al-Batah MS, Zaqaibeh BM, Alomari SA, Alzboon MS. Gene Microarray Cancer Classification using Correlation Based Feature Selection Algorithm and Rules Classifiers. Int J Online Biomed Eng [Internet]. 2019 May 14;15(08):62–73. Available from: https://online-journals.org/index.php/i-joe/article/view/10617

41. Alzboon MS, Alomari S, Al-Batah MS, Alomari SA, Banikhalaf M. The characteristics of the green internet of things and big data in building safer, smarter, and sustainable cities Vehicle Detection and Tracking for Aerial Surveillance Videos View project Evaluation of Knowledge Quality in the E-Learning System View pr [Internet]. Vol. 6, Article in International Journal of Engineering and Technology. Science Publishing Corporation; 2017. p. 83–92. Available from: https://www.researchgate.net/publication/333808921

42. Alzboon MS, Sintok UUM, Sintok UUM, Arif S. Towards Self-Organizing Infrastructure : A New Architecture for Autonomic Green Cloud Data Centers. ARPN J Eng Appl Sci. 2015;1–7.

Downloads

Published

2025-01-16

How to Cite

1.
Salem Alzboon M, Subhi Al-Batah M, Alqaraleh M, Alzboon F, Alzboon L. Guardians of the Web: Harnessing Machine Learning to Combat Phishing Attacks. Gamification and Augmented Reality [Internet]. 2025 Jan. 16 [cited 2025 Feb. 5];3:91. Available from: https://gr.ageditor.ar/index.php/gr/article/view/91