Transformer guided and GAN augmented deep learning for medical image diagnostics
DOI:
https://doi.org/10.56294/dm20261301Keywords:
Attention Mechanisms, Capsule Networks, Convolutional Neural Networks, Disease Detection, Generative Adversarial Networks, Medical Imaging, Predictive Modeling, Recurrent Neural Networks, Resource Optimization, Transfer LearningAbstract
Introduction: Medical imaging serves as a crucial tool for disease diagnosis but current image analysis techniques fail to handle noisy data and insufficient annotations and different imaging modalities. Deep learning techniques have transformed medical imaging but achieving high diagnostic accuracy alongside computational efficiency remains a key challenge in clinical deployment.
Objective: The research proposes a single deep learning system which combines CNNs with RNNs and GANs to enhance automated disease detection from medical images through improved accuracy, better interpretability and faster processing times.
Method: The proposed Transformer-guided hybrid model uses CNNs to extract spatial features and RNNs to detect temporal patterns while GANs perform data augmentation and anomaly detection. Use consistent passive or active voice. The model was trained, validated on multimodal datasets and subsequently evaluated against ten baseline models, including SVM, transfer learning, and attention-based architectures. The evaluation metrics consisted of accuracy and precision and sensitivity and ROC-AUC.
Results: The integrated framework achieved superior diagnostic performance with 90% accuracy, 88% precision, 86% sensitivity and 0.95 ROC-AUC which outperformed all baseline models. The system delivered achieved faster processing without sacrificing diagnostic accuracy across imaging modalities without compromising its diagnostic accuracy for different imaging techniques.
Conclusions: The research developed an AI diagnostic system which uses CNN, RNN and GAN technologies to achieve efficient and ethical medical image analysis. The system enhances precision and speed while ensuring patient data security and transparent clinical reporting, enabling scalable AI-driven diagnostics.
References
[1] R. F. Buckley, A. P. Schultz, T. Hedden et al., "Functional network integrity presages cognitive decline in preclinical Alzheimer disease," Neurology, vol. 89, no. 1, pp. 29-37, 2017. DOI: https://doi.org/10.1212/WNL.0000000000004059
[2] K. N. H. Dillen, H. I. L. Jacobs, J. Kukolja et al., "Functional disintegration of the default mode network in prodromal Alzheimer's disease," Journal of Alzheimer's Disease, vol. 59, no. 1, pp. 169-187, 2017. DOI: https://doi.org/10.3233/JAD-161120
[3] M. Shabaz and U. Garg, "Predicting future diseases based on existing health status using link prediction," World Journal of Engineering, vol. 19, no. 1, pp. 29-32, 2021. DOI: https://doi.org/10.1108/WJE-10-2020-0533
[4] R. Poonguzhali, S. Ahmad, P. T. Sivasankar et al., “Automated brain tumor diagnosis using deep residual u-net segmentation model,” Computers, Materials & Continua, vol. 74, no. 1, pp. 2179–2194, 2023. DOI: https://doi.org/10.32604/cmc.2023.032816
[5] B. L. Y. Agbley, J. P. Li, A. U. Haq et al., “Federated Fusion of Magnified Histopathological Images for Breast Tumor Classification in the Internet of Medical Things,” in IEEE Journal of Biomedical and Health Informatics, 2023. DOI: https://doi.org/10.1109/JBHI.2023.3256974
[6] S. Chaudhury, A. N. Krishna, S. Gupta et al., "Effective image processing and segmentation-based machine learning techniques for diagnosis of breast cancer," Computational and Mathematical Methods in Medicine, vol. 1, Article ID 6841334, 2022. DOI: https://doi.org/10.1155/2022/6841334
[7] J. Godara, R. Aron, and M. Shabaz, "Sentiment analysis and sarcasm detection from social network to train health-care professionals," World Journal of Engineering, vol. 19, no. 1, pp. 124-133, 2021. DOI: https://doi.org/10.1108/WJE-02-2021-0108
[8] Haq AU, Li JP, Ahmad S, Khan S, Alshara MA, Alotaibi RM. Diagnostic approach for accurate diagnosis of COVID-19 employing deep learning and transfer learning techniques through chest X-ray images clinical data in E-healthcare. Sensors. 2021 Dec 9;21(24):8219. DOI: https://doi.org/10.3390/s21248219
[9] J. G. Kotwal, P. M. Shafi, "Artificial Driving based EfficientNet for Automatic Plant Leaf Disease Classification," Multimed Tools Appl, 2023. [Online]. Available: https://doi.org/10.1007/s11042-023-16882-w DOI: https://doi.org/10.1007/s11042-023-16882-w
[10] Alharbi, M., Ahmad, S. Enhancing COVID-19 detection using CT-scan image analysis and disease classification: the DI-QL approach. Health Technol. 15, 477–488 (2025). https://doi.org/10.1007/s12553-025-00952-0 DOI: https://doi.org/10.1007/s12553-025-00952-0
[11] J. Jaya, K. Thanushkodi, and M. Karnan, "Tracking algorithm for denoising of MR brain images," International Journal of Computer Science and Network Security, vol. 9, pp. 262-267, 2009.
[12] C. Ramalakshmi and A. J. Chandran, "Automatic brain tumor detection in MR images using neural network based classification," Biometrics and Bioinformatics, vol. 5, no. 6, pp. 221-225, 2013.
[13] A. M. Wink and J. B. Roerdink, "Denoising functional MR images: a comparison of wavelet denoising and Gaussian smoothing," IEEE Transactions on Medical Imaging, vol. 23, no. 3, pp. 374-387, 2004. DOI: https://doi.org/10.1109/TMI.2004.824234
[14] Ahmad S, Neal Joshua ES, Rao NT, Ghoniem RM, Taye BM, Bharany S. A multi stage deep learning model for accurate segmentation and classification of breast lesions in mammography. Scientific Reports. 2025 Oct 23;15(1):37103. DOI: https://doi.org/10.1038/s41598-025-21146-8
[15] S. Basu, T. Fletcher, and R. Whitaker, "Rician noise removal in diffusion tensor MRI," MICCAI Rician noise removal in diffusion tensor MRI, vol. 9, no. 1, pp. 117-125, 2006. DOI: https://doi.org/10.1007/11866565_15
[16] H. P. Sahu, "FINE_DENSEIGANET: Automatic medical image classification in chest CT scan using Hybrid Deep Learning Framework," International Journal of Image and Graphics [Preprint], 2023. [Online]. Available: https://doi.org/10.1142/s0219467825500044 DOI: https://doi.org/10.1142/S0219467825500044
[17] H. Byeon, R. Nair, V. Mahalakshmi, M. I. Khalaf, B. Kaushik, and M. Shabaz, "Enhancing medical image-based diagnostics through the application of convolutional neural networks techniques," in 2024 Third International Conference on Distributed Computing and Electrical Circuits and Electronics (ICDCECE), Ballari, India, 2024, pp. 1–6. doi: 10.1109/ICDCECE60827.2024.10548500. DOI: https://doi.org/10.1109/ICDCECE60827.2024.10548500
[18] Almadhor A, Ojo S, Nathaniel TI, Ahmad S, Hejaili AA. “Deep feature-driven SVM model with XAI for reliable colorectal cancer imaging analysis”, Signal, Image and Video Processing. 2025 Dec;19(15):1-8. DOI: https://doi.org/10.1007/s11760-025-04885-z
[19] Rajawat AS, Ahmad S, Muqeem M, Abdeljaber HA, Alanazi S, Nazeer J. “Advanced Deep Learning Integration for Early Pneumonia Detection for Smart Healthcare”, International Journal of Online & Biomedical Engineering. 2025 Mar 1;21(3). DOI: https://doi.org/10.3991/ijoe.v21i03.53107
[20] Querbes, F. Aubry, J. Pariente et al., "Early diagnosis of Alzheimer's disease using cortical thickness: impact of cognitive reserve," Brain, vol. 132, no. 8, pp. 2036-2047, 2009. DOI: https://doi.org/10.1093/brain/awp105
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Ahmed A.F Osman, Rajit Nair, Theyazn H.H Aldhyani, Sultan Ahmad, Mosleh Hmoud Al-Adhaileh, Hikmat A. M. Abdeljaber, Mohammed Ataelfadiel (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.
The article is distributed under the Creative Commons Attribution 4.0 License. Unless otherwise stated, associated published material is distributed under the same licence.
