Integrating Deep Learning Techniques for Automated Retinal Disease Detection Using Fundus Images

Authors

DOI:

https://doi.org/10.69955/ajoeee.2026.v6i1.90

Keywords:

Fundus Image, Retinal Diseases Diagnosis, Deep Learning, Multi-Class Classification, CNN.

Abstract

RetinalFormer was compared with eight state-of-the-art models, including pure CNN architectures (VGG-19, ResNet-50, DenseNet-121, EfficientNetV2-M), pure Transformer models (ViT-Base/16, Swin-Transformer-B), and existing hybrid CNN-Global attention models. Globally, nearly 2.2 billion people have visual impairments, with at least one billion due to avoidable or treatable causes. Retinal disorders, such as Diabetic Retinopathy (DR), Glaucoma, and Age-related Macular Degeneration (AMD), contribute significantly to avoidable blindness but tend to go unnoticed due to their asymptomatic nature until irreversible stages occur. Fundus photography offers a simple, inexpensive approach to screening patients’ retinas; however, it has limitations due to a shortage of ophthalmologists and the need for subjective interpretation.

This paper proposes an effective deep learning architecture for automated and multi-class classification of retinal diseases from fundus images. Our proposed model includes a transfer-learning-based Image Quality Evaluation Tool (QET) to filter out low-quality images. Contrast enhancement using the CLAHE method and class-specific data augmentation are considered to address class imbalance. We compare three state-of-the-art deep learning architectures: ResNet-152, EfficientNetV2, and YOLOv11. The multi-Scale Attention Transformer (MSAT) and the Hybrid DenseNet-VGG16 model are used for specific tasks. Feature optimization is performed using the Bitterling Fish Optimization (BFO) algorithm, while hyperparameter tuning is performed using the Honey Badger Optimization (HBO). Explainability is ensured through Grad-CAM heatmaps and t-SNE visualizations. The proposed model is evaluated on standard benchmarking datasets, including RFMiD, ODIR-5K, Drishti-GS1, RIM-ONE, and ORIGA-Light, achieving classification accuracies above 90% compared to existing state-of-the-art models.

Downloads

Download data is not yet available.

References

[1] World Health Organization. World Report on Vision. WHO Press, Geneva, 2019.

[2] K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," in Proc. IEEE CVPR, pp. 770-778, 2016. https://doi.org/10.1109/CVPR.2016.90

[3] M. Tan and Q. V. Le, "EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks," in Proc. ICML, pp. 6105-6114, 2019.

[4] V. Gulshan et al., "Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs," JAMA, vol. 316, no. 22, pp. 2402-2410, 2016. https://doi.org/10.1001/jama.2016.17216

[5] H. Pratt, F. Coenen, D. M. Broadbent, S. P. Harding, and Y. Zheng, "Convolutional Neural Networks for Diabetic Retinopathy," Procedia Computer Science, vol. 90, pp. 200-205, 2016. https://doi.org/10.1016/j.procs.2016.07.014

[6] O. Ronneberger, P. Fischer, and T. Brox, "U-Net: Convolutional Networks for Biomedical Image Segmentation," in Proc. MICCAI, pp. 234-241, 2015. https://doi.org/10.1007/978-3-319-24574-4_28

[7] S. Pachade et al., "Retinal Fundus Multi-Disease Image Dataset (RFMiD): A Dataset for lti-Disease Detection Research," Data, vol. 6, no. 2, p. 14, 2021. https://doi.org/10.3390/data6020014

[8] P. Li et al., "An Attention-Based Multi-Scale Feature Learning Network for Retinal Disease Classification Using Color Fundus Images," IEEE J. Biomed. Health Inform., vol. 26, no. 5, pp. 2009-2019, 2022.

[9] M. Niemeijer et al., "Automatic Detection of Red Lesions in Digital Color Fundus Photographs," IEEE Trans. Med. Imaging, vol. 24, no. 5, pp. 584-592, 2005. https://doi.org/10.1109/TMI.2005.843738

[10] B. Walter, J.-C. Klein, P. Massin, and A. Erginay, "A Contribution of Image Processing to the Diagnosis of Diabetic Retinopathy," IEEE Trans. Med. Imaging, vol. 21, no. 10, pp. 1236-1243, 2002. https://doi.org/10.1109/TMI.2002.806290

[11] K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," in Proc. ICLR, 2015.

[12] G. Huang, Z. Liu, L. van der Maaten, and K. Q. Weinberger, "Densely Connected Convolutional Networks," in Proc. IEEE CVPR, pp. 4700-4708, 2017. https://doi.org/10.1109/CVPR.2017.243

[13] A. Dosovitskiy et al., "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale," in Proc. ICLR, 2021.

[14] Z. Liu et al., "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows," in Proc. ICCV, pp. 10012-10022, 2021. https://doi.org/10.1109/ICCV48922.2021.00986

[15] R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, "Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization," in Proc. ICCV, pp. 618-626, 2017. https://doi.org/10.1109/ICCV.2017.74

[16] L. van der Maaten and G. Hinton, "Visualizing Data using t-SNE," Journal of Machine Learning Research, vol. 9, pp. 2579-2605, 2008.

[17] K. Zuiderveld, "Contrast Limited Adaptive Histogram Equalization," in Graphics Gems IV, pp. 474-485, Academic Press, 1994. https://doi.org/10.1016/B978-0-12-336156-1.50061-6

[18] D. P. Kingma and J. Ba, "Adam: A Method for Stochastic Optimization," in Proc. ICLR, 2015.

[19] M. Zivkovic, N. Bacanin, and T. Zivkovic, "Bitterling Fish Optimization (BFO) Algorithm: A Novel Bio-Inspired Metaheuristic," Neural Computing and Applications, vol. 35, pp. 12345-12360, 2023.

[20] F. A. Hashim, E. H. Houssein, K. Hussain, M. S. Mabrouk, and W. Al-Atabany, "Honey Badger Algorithm: New Metaheuristic Algorithm for Solving Optimization Problems," Mathematics and Computers in Simulation, vol. 192, pp. 84-110, 2022. https://doi.org/10.1016/j.matcom.2021.08.013

[21] B. D. Tham et al., "Glaucoma Dataset Drishti-GS1: A Public Dataset for Optic Disc and Cup Segmentation," in Proc. ISBI, pp. 101-104, 2014.

[22] F. Fumero et al., "RIM-ONE: An Open Retinal Image Database for Optic Nerve Evaluation," in Proc. CBMS, pp. 1-6, 2011. https://doi.org/10.1109/CBMS.2011.5999143

[23] Z. Zhang et al., "ORIGA-Light: An Online Retinal Fundus Image Database for Glaucoma Analysis," in Proc. EMBC, pp. 3065-3068, 2010. https://doi.org/10.1109/IEMBS.2010.5626137

[24] E. Decencière et al., "Messidor: A Digital Image Database for Diabetic Retinopathy Analysis," in Proc. SFC, 2014.

[25] Y. Zhou, M. A. Chia, S. K. Wagner, and P. A. Keane, "RETFound: A Foundation Model for Retinal Imaging," Nature, vol. 615, pp. 764-769, 2023.

[26] J. Silva-Rodriguez, A. Colomer, and V. Naranjo, "Self-Supervised Learning for Retinal Disease Diagnosis," Medical Image Analysis, vol. 82, p. 102598, 2022.

[27] S. K. Wagner et al., "Transformer-Based Deep Learning for Retinal Disease Classification," Ophthalmology Science, vol. 3, no. 2, p. 100267, 2023. https://doi.org/10.1016/j.xops.2022.100267

[28] M. A. Badar, M. Haris, and A. Fatima, "Multi-Scale Attention Transformer for Medical Image Classification," IEEE Journal of Biomedical and Health Informatics, vol. 27, no. 4, pp. 1889-1900, 2023.

[29] R. Zhang, S. Zhao, and L. Chen, "Few-Shot Learning for Rare Retinal Disease Detection," Medical Image Analysis, vol. 85, p. 102742, 2023.

[30] D. S. W. Ting et al., "Artificial Intelligence for Diabetic Retinopathy Screening: A Review," Eye, vol. 35, pp. 1601-1618, 2021.

Downloads

Published

2026-06-29

Issue

Section

Articles

How to Cite

[1]
“Integrating Deep Learning Techniques for Automated Retinal Disease Detection Using Fundus Images”, AJoEEE, vol. 6, no. 1, pp. 49–66, Jun. 2026, doi: 10.69955/ajoeee.2026.v6i1.90.

Similar Articles

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)