RESEARCH ON HYBRID ALGORITHMS FOR DIAGNOSING EYE DISEASES

Mualliflar

  • Sayyora Iskandarova Muallif
  • Feruza Iskandarova Muallif
  • Seitjan Eraliyev Muallif

{$ Etel}:

Ocular Disease Diagnosis, Deep Learning, Hybrid Neural Networks, EfficientNet, DenseNet, Fundus Imaging, Multi-class Classification, Small Dataset Learning

Abstrak

To develop and rigorously evaluate a novel hybrid deep learning framework for simultaneous diagnosis of four critical ocular conditions, more precisely: cataract, diabetic retinopathy, glaucoma, and normal fundus - using a relatively small but balanced dataset of fundus images. The study addresses the challenge of achieving high diagnostic accuracy with limited data through architectural innovation and optimized training protocols. We propose a parallel hybrid convolutional neural network that integrates EfficientNetB3 (for global contextual feature extraction) and DenseNet121 (for local detailed feature extraction). The model processes dual-resolution inputs (300×300 and 224×224 pixels) simultaneously. A novel two-phase training strategy was implemented: Phase 1 (10 epochs) with frozen ImageNet-pre-trained backbones to train only the newly added classification heads, followed by Phase 2 (15 epochs) with selective fine-tuning of upper layers. The model incorporated label smoothing (ε=0.05), L2 regularization, and dropout to combat overfitting. The dataset comprised 3,200 curated fundus images (800 per class), split into training (2,560), validation (320), and test (320) sets. The hybrid model achieved a peak validation accuracy of 92.19% and a test accuracy of 91.87%, significantly outperforming standalone EfficientNetB3 and DenseNet121 models (p<0.001, McNemar's test). Diabetic retinopathy was detected with nearperfect precision (98.75%), while cataract, glaucoma, and normal classes showed robust and balanced performance. The proposed parallel hybrid architecture, combined with a disciplined twophase training regimen, successfully overcomes the limitations of small medical datasets. It effectively leverages complementary feature hierarchies from two state-of-the-art networks, establishing a new benchmark for multi-class ocular disease diagnosis. This work demonstrates that architectural synergy and meticulous training design can yield clinically relevant accuracy without requiring prohibitively large datasets.

##plugins.themes.default.displayStats.downloads##

##plugins.themes.default.displayStats.noStats##

Bibliografik havolalar

[1] Flaxman, S. R., Bourne, R. R. A., Resnikoff, S., et al. (2017). Global causes of blindness and distance vision impairment 1990–2020: a systematic review and metaanalysis. *The Lancet Global Health, 5*(12), e1221-e1234.

[2] Burton, M. J., Ramke, J., Marques, A. P., et al. (2021). The Lancet Global Health Commission on Global Eye Health: vision beyond 2020. *The Lancet Global Health, 9*(4), e489-e551.

[3] Esteva, A., Robicquet, A., Ramsundar, B., et al. (2019). A guide to deep learning in healthcare. *Nature Medicine, 25*(1), 24-29.

[4] Abramoff, M. D., Lou, Y., Erginay, A., et al. (2016). Improved automated detection of diabetic retinopathy on a publicly available dataset through integration of deep learning. *Investigative Ophthalmology & Visual Science, 57*(13), 5200-5206.

[5] Tan, M., & Le, Q. V. (2019). EfficientNet: Rethinking model scaling forconvolutional neural networks. In *International Conference on Machine Learning* (pp. 6105-6114). PMLR.

[6] Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). Densely connected convolutional networks. In *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition* (pp. 4700-4708).

[7] Gulshan, V., Peng, L., Coram, M., et al. (2016). Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. *JAMA, 316*(22), 2402-2410.

[8] Li, Z., He, Y., Keel, S., et al. (2018). Efficacy of a deep learning system for detecting glaucomatous optic neuropathy based on color fundus photographs. *Ophthalmology, 125*(8), 1199-1206.

[9] Zhang, L., Li, J., Han, H., et al. (2017). Automatic cataract diagnosis by imagebased interpretability. In *2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC)* (pp. 1230-1235). IEEE.

[10] Li, X., Hu, X., Yu, L., et al. (2020). CANet: Cross-disease attention network for joint diabetic retinopathy and diabetic macular edema grading. *IEEE Transactions on Medical Imaging, 39*(5), 1483-1493.

[11] Wang, X., Peng, Y., Lu, L., et al. (2017). ChestX-ray8: Hospital-scale chest X-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. In *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition* (pp. 2097-2106).

[12] Chen, Y., Li, J., Xiao, H., et al. (2017). Dual path networks. *Advances in Neural Information Processing Systems, 30*.

[13] Choi, J., Chun, D., Kim, H., & Lee, H. J. (2019). Gaussian YOLOv3: An accurate and fast object detector using localization uncertainty for autonomous driving. In *Proceedings of the IEEE/CVF International Conference on Computer Vision* (pp.502-511).

[14] Tschandl, P., Rosendahl, C., & Kittler, H. (2018). The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. *Scientific Data, 5*(1), 1-9.

[15] Wang, L., Lin, Z. Q., & Wong, A. (2020). Covid-net: A tailored deep convolutional neural network design for detection of covid-19 cases from chest x-ray images. *Scientific Reports, 10*(1), 19549.

[16] Guo, S., Wang, K., Kang, H., et al. (2019). BTS-DSN: Deeply supervised neural network with short connections for retinal vessel segmentation. *International Journal of Medical Informatics, 126*, 105-113.

[17] Szegedy, C., Vanhoucke, V., Ioffe, S., et al. (2016). Rethinking the inception architecture for computer vision. In *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition* (pp. 2818-2826).

[18] Yosinski, J., Clune, J., Bengio, Y., & Lipson, H. (2014). How transferable are features in deep neural networks. *Advances in Neural Information Processing Systems, 27*.

[19] EyePACS. (2015). Diabetic Retinopathy Detection. Kaggle. https://www.kaggle.com/c/diabetic-retinopathy-detection

[20] Pachade, S., Porwal, P., Thulkar, D., et al. (2021). Retinal Fundus Multi-Disease Image Dataset (RFMiD): A dataset for multi-disease detection research. *Data, 6*(2), 14.

[21] Selvaraju, R. R., Cogswell, M., Das, A., et al. (2017). Grad-cam: Visual explanations from deep networks via gradient-based localization. In *Proceedings of the IEEE International Conference on Computer Vision* (pp. 618-626).

Nashr qilingan

2026-02-16

Nashr

Bo'lim

Technical Sciences

##plugins.generic.recommendBySimilarity.heading##

{$ start} - {$ tugatish} {$ to'liq} dan

##plugins.generic.recommendBySimilarity.advancedSearchIntro##