Improving Oral Cancer Outcomes Through Machine Learning and Dimensionality Reduction

Authors

DOI:

https://doi.org/10.56294/dm2024.570

Keywords:

Oral Cancer Diagnosis, Machine Learning in Oncology, Data Mining Techniques, Neural Networks for Cancer Prediction, Prognosis Models, Benign vs. Malignant Classification

Abstract

Oral cancer presents a formidable challenge in oncology, necessitating early diagnosis and accurate prognosis to enhance patient survival rates. Recent advancements in machine learning and data mining have revolutionized traditional diagnostic methodologies, providing sophisticated and automated tools for differentiating between benign and malignant oral lesions. This study presents a comprehensive review of cutting-edge data mining methodologies, including Neural Networks, K-Nearest Neighbors (KNN), Support Vector Machines (SVM), and ensemble learning techniques, specifically applied to the diagnosis and prognosis of oral cancer. Through a rigorous comparative analysis, our findings reveal that Neural Networks surpass other models, achieving an impressive classification accuracy of 93.6% in predicting oral cancer. Furthermore, we underscore the potential benefits of integrating feature selection and dimensionality reduction techniques to enhance model performance. These insights underscore the significant promise of advanced data mining techniques in bolstering early detection, optimizing treatment strategies, and ultimately improving patient outcomes in the realm of oral oncology

References

1. Alzboon MS, Al-Batah M, Alqaraleh M, Abuashour A, Bader AF. A Comparative Study of Machine Learning Techniques for Early Prediction of Diabetes. In: 2023 IEEE 10th International Conference on Communications and Networking, ComNet 2023 - Proceedings. 2023. p. 1–12.

2. Alzboon MS, Al-Batah M, Alqaraleh M, Abuashour A, Bader AF. A Comparative Study of Machine Learning Techniques for Early Prediction of Prostate Cancer. In: 2023 IEEE 10th International Conference on Communications and Networking, ComNet 2023 - Proceedings. 2023. p. 1–12.

3. Al-Shanableh N, Alzyoud M, Al-Husban RY, Alshanableh NM, Al-Oun A, Al-Batah MS, et al. Advanced ensemble machine learning techniques for optimizing diabetes mellitus prognostication: A detailed examination of hospital data. Data Metadata. 2024;3:363.

4. Al-Batah MS, Salem Alzboon M, Solayman Migdadi H, Alkhasawneh M, Alqaraleh M. Advanced Landslide Detection Using Machine Learning and Remote Sensing Data. Data Metadata [Internet]. 2024 Oct 7;3. Available from: https://dm.ageditor.ar/index.php/dm/article/view/419/782

5. Muhyeeddin A, Mowafaq SA, Al-Batah MS, Mutaz AW. Advancing Medical Image Analysis: The Role of Adaptive Optimization Techniques in Enhancing COVID-19 Detection, Lung Infection, and Tumor Segmentation. LatIA [Internet]. 2024 Sep 29;2(74):74. Available from: https://latia.ageditor.uy/index.php/latia/article/view/74

6. Mowafaq SA, Alqaraleh M, Al-Batah MS. AI in the Sky: Developing Real-Time UAV Recognition Systems to Enhance Military Security. Data Metadata. 2024;3(417).

7. Wahed MA, Alqaraleh M, Alzboon MS, Subhi Al-Batah M. AI Rx: Revolutionizing Healthcare Through Intelligence, Innovation, and Ethics. Semin Med Writ Educ [Internet]. 2025 Jan 1;4:35. Available from: https://mw.ageditor.ar/index.php/mw/article/view/35

8. Abdel Wahed M, Alqaraleh M, Salem Alzboon M, Subhi Al-Batah M. Application of Artificial Intelligence for Diagnosing Tumors in the Female Reproductive System: A Systematic Review. Multidiscip [Internet]. 2025 Jan 1;3:54. Available from: https://multidisciplinar.ageditor.uy/index.php/multidisciplinar/article/view/54

9. Rahman F, Chauhan R, Singh A, others. Texture-based abnormalities in oral squamous cell carcinoma detection using SVM. J Oral Cancer Res. 2018;10(3):85–92.

10. Kumar D, Gupta R, Mehta P. Automated segmentation of oral tissue layers using CNN and texture-based features. Int J Biomed Imaging. 2018;14(4):312–22.

11. Das S, Mitra S, Ghosh R. Quantitative assessment of keratinization in oral squamous cell carcinoma. J Cancer Diagnosis. 2015;7(2):45–52.

12. Patil R, Pawar S, Shinde V. Hybrid deep learning model for oral cancer detection using genetic algorithms. Int J Med Imaging. 2021;16(1):203–10.

13. Sarkar A, Chaudhary R, Mahajan S. Transfer learning-based framework for oral cancer histopathology classification. IEEE Trans Biomed Eng. 2019;66(5):1245–54.

14. Jain A, Verma R, Gupta S. Machine learning-based system for early detection of oral cancer using salivary biomarkers. J Med Diagnostics. 2020;12(1):52–60.

15. Alqaraleh M, Al-Batah M, Salem Alzboon M, Alzaghoul E. Automated Quantification of Vesicoureteral Reflux using Machine Learning with Advancing Diagnostic Precision. Data Metadata [Internet]. 2025 Jan 1;4:460. Available from: https://dm.ageditor.ar/index.php/dm/article/view/460

16. Al-Batah M, Salem Alzboon M, Alqaraleh M, Ahmad Alzaghoul F. Comparative Analysis of Advanced Data Mining Methods for Enhancing Medical Diagnosis and Prognosis. Data Metadata. 2024;3:465.

17. Abuashour A, Salem Alzboon M, Kamel Alqaraleh M, Abuashour A. Comparative Study of Classification Mechanisms of Machine Learning on Multiple Data Mining Tool Kits. Am J Biomed Sci Res 2024 [Internet]. 2024;22(1):577–9. Available from: www.biomedgrid.com

18. Alzboon MS, Al-Batah MS, Alqaraleh M, Abuashour A, Bader AFH. Early Diagnosis of Diabetes: A Comparison of Machine Learning Methods. Int J online Biomed Eng. 2023;19(15):144–65.

19. Al-Batah MS, Alzboon MS, Alzyoud M, Al-Shanableh N. Enhancing Image Cryptography Performance with Block Left Rotation Operations. Appl Comput Intell Soft Comput. 2024;2024(1):3641927.

20. Wahed MA, Alqaraleh M, Salem Alzboon M, Subhi Al-Batah M. Evaluating AI and Machine Learning Models in Breast Cancer Detection: A Review of Convolutional Neural Networks (CNN) and Global Research Trends. LatIA [Internet]. 2025 Jan 1;3:117. Available from: https://latia.ageditor.uy/index.php/latia/article/view/117

21. Alqaraleh M, Salem Alzboon M, Subhi Al-Batah M, Solayman Migdadi H. From Complexity to Clarity: Improving Microarray Classification with Correlation-Based Feature Selection. LatIA [Internet]. 2025 Jan 1;3:84. Available from: https://latia.ageditor.uy/index.php/latia/article/view/84

22. Al-Batah M, Zaqaibeh B, Alomari SA, Alzboon MS. Gene Microarray Cancer classification using correlation based feature selection algorithm and rules classifiers. Int J online Biomed Eng. 2019;15(8):62–73.

23. Alqaraleh M, Alzboon MS, Al-Batah MS, Wahed MA, Abuashour A, Alsmadi FH. Harnessing Machine Learning for Quantifying Vesicoureteral Reflux: A Promising Approach for Objective Assessment. Int J online Biomed Eng. 2024;20(11):123–45.

24. Al-Batah MS, Alzboon MS, Alazaidah R. Intelligent Heart Disease Prediction System with Applications in Jordanian Hospitals. Int J Adv Comput Sci Appl. 2023;14(9):508–17.

25. Alzboon MS. Internet of things between reality or a wishing - list : a survey. Int J Eng & Technol. 2019;7(June):956–61.

26. Alzboon MS, Qawasmeh S, Alqaraleh M, Abuashour A, Bader AF, Al-Batah M. Machine Learning Classification Algorithms for Accurate Breast Cancer Diagnosis. In: 2023 3rd International Conference on Emerging Smart Technologies and Applications, eSmarTA 2023. 2023.

27. Alzboon MS, Aljarrah E, Alqaraleh M, Alomari SA. Nodexl Tool for Social Network Analysis. Turkish J Comput Math Educ. 2021;12(14):202–16.

28. Alqaraleh M, Salem Alzboon M, Mohammad SA-B. Optimizing Resource Discovery in Grid Computing: A Hierarchical and Weighted Approach with Behavioral Modeling. LatIA [Internet]. 2025 Jan 1;3:97. Available from: https://latia.ageditor.uy/index.php/latia/article/view/97

29. Alzboon MS, Al-Batah MS. Prostate Cancer Detection and Analysis using Advanced Machine Learning. Int J Adv Comput Sci Appl. 2023;14(8):388–96.

30. Alzboon MS, Qawasmeh S, Alqaraleh M, Abuashour A, Bader AF, Al-Batah M. Pushing the Envelope: Investigating the Potential and Limitations of ChatGPT and Artificial Intelligence in Advancing Computer Science Research. In: 2023 3rd International Conference on Emerging Smart Technologies and Applications, eSmarTA 2023. 2023.

31. Alqaraleh M, Salem Alzboon M, Subhi Al-Batah M. Real-Time UAV Recognition Through Advanced Machine Learning for Enhanced Military Surveillance. Gamification Augment Real [Internet]. 2025 Jan 1;3:63. Available from: https://gr.ageditor.ar/index.php/gr/article/view/63

32. Alzboon M. Semantic Text Analysis on Social Networks and Data Processing: Review and Future Directions. Inf Sci Lett. 2022;11(5):1371–84.

33. Alzboon MS. Survey on Patient Health Monitoring System Based on Internet of Things. Inf Sci Lett. 2022;11(4):1183–90.

34. Alzboon MS, Alomari S, Al-Batah MS, Alomari SA, Banikhalaf M. The characteristics of the green internet of things and big data in building safer, smarter, and sustainable cities Vehicle Detection and Tracking for Aerial Surveillance Videos View project Evaluation of Knowledge Quality in the E-Learning System View pr [Internet]. Vol. 6, Article in International Journal of Engineering and Technology. 2017. p. 83–92. Available from: https://www.researchgate.net/publication/333808921

35. Al Tal S, Al Salaimeh S, Ali Alomari S, Alqaraleh M. The modern hosting computing systems for small and medium businesses. Acad Entrep J. 2019;25(4):1–7.

36. Alzboon MS, Bader AF, Abuashour A, Alqaraleh MK, Zaqaibeh B, Al-Batah M. The Two Sides of AI in Cybersecurity: Opportunities and Challenges. In: Proceedings of 2023 2nd International Conference on Intelligent Computing and Next Generation Networks, ICNGN 2023. 2023.

37. Alomari SA, Alqaraleh M, Aljarrah E, Alzboon MS. Toward achieving self-resource discovery in distributed systems based on distributed quadtree. J Theor Appl Inf Technol. 2020;98(20):3088–99.

38. Alazaidah R. A Comparative Analysis of Discretization Techniques in Machine Learning. In: 2023 24th International Arab Conference on Information Technology, ACIT 2023. 2023. p. 1–6.

39. Alazaidah R, Owida HA, Alshdaifat N, Issa A, Abuowaida S, Yousef N. A comprehensive analysis of eye diseases and medical data classification. TELKOMNIKA (Telecommunication Comput Electron Control. 2024;22(6):1422–30.

40. Moubayed A, Injadat MN, Alhindawi N, Samara G, Abuasal S, Alazaidah R. A Deep Learning Approach Towards Student Performance Prediction in Online Courses: Challenges Based on a Global Perspective. In: 2023 24th International Arab Conference on Information Technology, ACIT 2023. 2023. p. 1–6.

41. Alazaidah R, Hassan M, Al-Rbabah L, Samara G, Yusof M, Al-Sherideh AS. Utilizing Machine Learning in Medical Diagnosis: Systematic Review and Empirical Analysis. In: 2023 24th International Arab Conference on Information Technology, ACIT 2023. 2023. p. 1–9.

42. Aziz DIABA, Yusoff M, Ibrahim N, Alazaidah R. Paddy Diseases Multi-Class Classification using CNN Variants. In: 2023 24th International Arab Conference on Information Technology, ACIT 2023. 2023. p. 1–8.

43. Alzyoud M, Alazaidah R, Aljaidi M, Samara G, Qasem MH, Khalid M, et al. Diagnosing diabetes mellitus using machine learning techniques. Int J Data Netw Sci. 2024;8(1):179–88.

44. Al-Batah MS, Al-Eiadeh MR. An improved binary crow-JAYA optimisation system with various evolution operators, such as mutation for finding the max clique in the dense graph. Int J Comput Sci Math. 2024;19(4):327-38.

45. Alzyoud M, Alazaidah R, Alzoubi H, Al-Shanableh N, Aljaidi M, Almatarneh S. Toward Identifying The Best Base Classifier in Multi Label Classification-an Investigative Study. In: 2023 24th International Arab Conference on Information Technology, ACIT 2023. 2023. p. 1–9.

46. Al-Batah MS. Modified recursive least squares algorithm to train the hybrid multilayered perceptron (HMLP) network. Appl Soft Comput. 2010;10(1):236-44.

47. Alazaidah R, Al-Qerem A, Qasem MH, Al-Shaikh A, Almilli N, Injadat MN. Feature Selection in Associative Classification-A Review and Comparative Analysis. In: 2023 24th International Arab Conference on Information Technology, ACIT 2023. 2023. p. 1–5.

38. Al-Batah MS. Testing the probability of heart disease using classification and regression tree model. Annu Res Rev Biol. 2014;4(11):1713-25.

49. Qasem MH, Aljaidi M, Samara G, Alsarhan A, Alazaidah R, Ali Al-Gumaei YO, et al. Towards Advancing Distributed Data Mining: Intelligent Agent Systems. In: 2nd International Engineering Conference on Electrical, Energy, and Artificial Intelligence, EICEEAI 2023. 2023. p. 1–5.

50. Al-Batah MS. Integrating the principal component analysis with partial decision tree in microarray gene data. IJCSNS Int J Comput Sci Netw Secur. 2019;19(3):24-29.

51. Al-Batah MS, Al-Eiadeh MR. An improved discreet Jaya optimisation algorithm with mutation operator and opposition-based learning to solve the 0-1 knapsack problem. Int J Math Oper Res. 2023;26(2):143-69.

52. Alazaidah R, Samara G, Aljaidi M, Haj Qasem M, Alsarhan A, Alshammari M. Potential of Machine Learning for Predicting Sleep Disorders: A Comprehensive Analysis of Regression and Classification Models. Diagnostics. 2024;14(1):27.

53. Al-Batah MS. Ranked features selection with MSBRG algorithm and rules classifiers for cervical cancer. Int J Online Biomed Eng. 2019;15(12):4.

Downloads

Published

2025-01-02

Issue

Section

Original

How to Cite

1.
Subhi Al-Batah M, Alqaraleh M, Salem Alzboon M. Improving Oral Cancer Outcomes Through Machine Learning and Dimensionality Reduction. Data and Metadata [Internet]. 2025 Jan. 2 [cited 2025 Mar. 14];3:.570. Available from: https://dm.ageditor.ar/index.php/dm/article/view/570