TY - JOUR
T1 - A Low Complexity Efficient Deep Learning Model for Automated Retinal Disease Diagnosis
AU - Chowa, Sadia Sultana
AU - Bhuiyan, Md Rahad Islam
AU - Payel, Israt Jahan
AU - Karim, Asif
AU - Khan, Inam Ullah
AU - Montaha, Sidratul
AU - Hasan, Md Zahid
AU - Jonkman, Mirjam
AU - Azam, Sami
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/3/31
Y1 - 2025/3/31
N2 - The identification and early treatment of retinal disease can help to prevent loss of vision. Early diagnosis allows a greater range of treatment options and results in better outcomes. Optical coherence tomography (OCT) is a technology used by ophthalmologists to detect and diagnose certain eye conditions. In this paper, human retinal OCT images are classified into four classes using deep learning. Several image preprocessing techniques are employed to enhance the image quality. An augmentation technique, called generative adversarial network (GAN), is utilized in the Drusen and DME classes to address data imbalance issues, resulting in a total of 130,649 images. A lightweight optimized compact convolutional transformers (OCCT) model is developed by conducting an ablation study on the initial CCT model for categorizing retinal conditions. The proposed OCCT model is compared with two transformer-based models: vision Transformer (ViT) and Swin Transformer. The models are trained and evaluated with 32 × 32 sized images of the GAN-generated enhanced dataset. Additionally, eight transfer learning models are presented with the same input images to compare their performance with the OCCT model. The proposed model’s stability is assessed by decreasing the number of training images and evaluating the performance. The OCCT model’s accuracy is 97.09%, and it outperforms the two transformer models. The result further indicates that the OCCT model sustains its performance, even if the number of images is reduced.
AB - The identification and early treatment of retinal disease can help to prevent loss of vision. Early diagnosis allows a greater range of treatment options and results in better outcomes. Optical coherence tomography (OCT) is a technology used by ophthalmologists to detect and diagnose certain eye conditions. In this paper, human retinal OCT images are classified into four classes using deep learning. Several image preprocessing techniques are employed to enhance the image quality. An augmentation technique, called generative adversarial network (GAN), is utilized in the Drusen and DME classes to address data imbalance issues, resulting in a total of 130,649 images. A lightweight optimized compact convolutional transformers (OCCT) model is developed by conducting an ablation study on the initial CCT model for categorizing retinal conditions. The proposed OCCT model is compared with two transformer-based models: vision Transformer (ViT) and Swin Transformer. The models are trained and evaluated with 32 × 32 sized images of the GAN-generated enhanced dataset. Additionally, eight transfer learning models are presented with the same input images to compare their performance with the OCCT model. The proposed model’s stability is assessed by decreasing the number of training images and evaluating the performance. The OCCT model’s accuracy is 97.09%, and it outperforms the two transformer models. The result further indicates that the OCCT model sustains its performance, even if the number of images is reduced.
KW - Ablation studies
KW - Compact convolutional transformer (CCT)
KW - Generative adversarial network (GAN)
KW - Optical coherence tomography (OCT)
KW - Retinal disease
KW - Transformer model
UR - http://www.scopus.com/inward/record.url?scp=85214094849&partnerID=8YFLogxK
U2 - 10.1007/s41666-024-00182-5
DO - 10.1007/s41666-024-00182-5
M3 - Article
AN - SCOPUS:85214094849
SN - 2509-498X
SP - 1
EP - 40
JO - Journal of Healthcare Informatics Research
JF - Journal of Healthcare Informatics Research
ER -