From Image to Sequence: Exploring Vision Transformers for Optical Coherence Tomography Classification

Science Communicator Platform

Summary: A study found a hybrid AI model detects eye diseases with high accuracy using fewer resources. #EyeHealth #ArtificialIntelligence

Amirali Arbab; Aref Habibi; Hossein Rabbani; Mahnoosh Tajmirriahi

Source: Journal of Medical Signals and Sensors. Published: 2025

Abstract

Background: Optical coherence tomography (OCT) is a pivotal imaging technique for the early detection and management of critical retinal diseases, notably diabetic macular edema and age-related macular degeneration. These conditions are significant global health concerns, affecting millions and leading to vision loss if not diagnosed promptly. Current methods for OCT image classification face two specific challenges: the inherent complexity of retinal structures and considerable variability across OCT datasets.

Methods: This paper introduces a novel hybrid model that integrates convolutional neural networks (CNNs) with a vision transformer (ViT) to overcome these obstacles. CNNs excel at extracting detailed local features, while the ViT is adept at recognizing long-range patterns; combining the two enables a more effective and comprehensive analysis of OCT images.

Results: Our model achieves an accuracy of 99.80% on the OCT2017 dataset. Its standout feature is parameter efficiency: it requires only 6.9 million parameters, significantly fewer than larger, more complex models such as Xception and OpticNet-71.

Conclusion: This efficiency underscores the model's suitability for clinical settings, where computational resources may be limited but high accuracy and rapid diagnosis are imperative.

© 2025 Elsevier B.V., All rights reserved.
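The abstract does not say how the hybrid model hands CNN features to the transformer. As a rough illustration of the "image to sequence" idea in the title, the sketch below shows the generic ViT-style step of cutting a 2D grid into non-overlapping patches and flattening each patch into one token. The function name `image_to_sequence`, the toy 4 x 4 grid, and the patch size are all assumptions made for illustration; none of them come from the paper.

```python
from typing import List

Grid = List[List[float]]


def image_to_sequence(image: Grid, patch: int) -> List[List[float]]:
    """Split an H x W grid into non-overlapping patch x patch tiles and
    flatten each tile, row by row, into one token vector."""
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image sides must be multiples of the patch size")
    tokens = []
    # Walk the tiles left to right, top to bottom, so token order is stable.
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            token = [image[r][c]
                     for r in range(top, top + patch)
                     for c in range(left, left + patch)]
            tokens.append(token)
    return tokens


# Toy 4 x 4 grid standing in for a CNN feature map (hypothetical values).
feature_map = [[float(4 * r + c) for c in range(4)] for r in range(4)]
sequence = image_to_sequence(feature_map, patch=2)
print(len(sequence), len(sequence[0]))
```

In a full model, each token would typically be linearly projected and given a position embedding before entering the transformer layers; whether the paper follows that recipe is not stated in the abstract.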

Related Docs

1. Automatic Classification of Macular Diseases From OCT Images Using CNN Guided With Edge Convolutional Layer, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS (2022)

2. Attention to Lesion: Lesion-Aware Convolutional Neural Network for Retinal Optical Coherence Tomography Image Classification, IEEE Transactions on Medical Imaging (2019)

3. Optimal Deep Learning Architecture for Automated Segmentation of Cysts in OCT Images Using X-Let Transforms, Diagnostics (2023)

Experts (# of related papers)

Hossein Rabbani (18)

Raheleh Kafieh (6)

Other Related Docs

4. A New Computer-Aided Diagnosis Tool Based on Deep Learning Methods for Automatic Detection of Retinal Disorders From OCT Images, International Ophthalmology (2024)

5. Wavelet-Based Convolutional Mixture of Experts Model: An Application to Automatic Diagnosis of Abnormal Macula in Retinal Optical Coherence Tomography Images, Iranian Conference on Machine Vision and Image Processing, MVIP (2017)

6. Wavelet Scattering Transform Application in Classification of Retinal Abnormalities Using OCT Images, Scientific Reports (2023)

7. Automatic Classification of Retinal Optical Coherence Tomography Images With Layer Guided Convolutional Neural Network, IEEE Signal Processing Letters (2019)

8. A New Convolutional Neural Network Based on Combination of Circlets and Wavelets for Macular OCT Classification, Scientific Reports (2023)

9. Application of Deep Dictionary Learning and Predefined Filters for Classification of Retinal Optical Coherence Tomography Images, IEEE Access (2025)

10. Loss-Modified Transformer-Based U-Net for Accurate Segmentation of Fluids in Optical Coherence Tomography Images of Retinal Diseases, Journal of Medical Signals and Sensors (2023)

11. Automatic Classification of Retinal Three-Dimensional Optical Coherence Tomography Images Using Principal Component Analysis Network With Composite Kernels, Journal of Biomedical Optics (2017)

12. Isfahan Artificial Intelligence Event 2023: Macular Pathology Detection Competition, Journal of Medical Signals and Sensors (2025)

13. A Lightweight Mimic Convolutional Auto-Encoder for Denoising Retinal Optical Coherence Tomography Images, IEEE Transactions on Instrumentation and Measurement (2021)

14. Synthetic OCT Data in Challenging Conditions: Three-Dimensional OCT and Presence of Abnormalities, Medical and Biological Engineering and Computing (2022)

15. Convolutional Mixture of Experts Model: A Comparative Study on Automatic Macular Diagnosis in Retinal Optical Coherence Tomography Imaging, Journal of Medical Signals and Sensors (2019)

16. SLO-Net: Enhancing Multiple Sclerosis Diagnosis Beyond Optical Coherence Tomography Using Infrared Reflectance Scanning Laser Ophthalmoscopy Images, Translational Vision Science and Technology (2024)

17. Retinal Optical Coherence Tomography Image Classification With Label Smoothing Generative Adversarial Network, Neurocomputing (2020)

18. Macular OCT Classification Using a Multi-Scale Convolutional Neural Network Ensemble, IEEE Transactions on Medical Imaging (2018)

19. Detection of Retinal Abnormalities in OCT Images Using Wavelet Scattering Network, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS (2022)

20. Retinal Optical Coherence Tomography Image Analysis by a Restricted Boltzmann Machine, Biomedical Optics Express (2022)

Style	Citing Format
MLA	A Arbab AMIRALI, et al. "From Image to Sequence: Exploring Vision Transformers for Optical Coherence Tomography Classification." Journal of Medical Signals and Sensors, vol. 15, no. 6, 2025, pp. -.
APA	A Arbab AMIRALI, A Habibi AREF, H Rabbani HOSSEIN, M Tajmirriahi MAHNOOSH (2025). From Image to Sequence: Exploring Vision Transformers for Optical Coherence Tomography Classification. Journal of Medical Signals and Sensors, 15(6), -.
Chicago	A Arbab AMIRALI, A Habibi AREF, H Rabbani HOSSEIN, M Tajmirriahi MAHNOOSH. "From Image to Sequence: Exploring Vision Transformers for Optical Coherence Tomography Classification." Journal of Medical Signals and Sensors 15, no. 6 (2025): -.
Harvard	A Arbab AMIRALI et al. (2025) 'From Image to Sequence: Exploring Vision Transformers for Optical Coherence Tomography Classification', Journal of Medical Signals and Sensors, 15(6), pp. -.
Vancouver	A Arbab AMIRALI, A Habibi AREF, H Rabbani HOSSEIN, M Tajmirriahi MAHNOOSH. From Image to Sequence: Exploring Vision Transformers for Optical Coherence Tomography Classification. Journal of Medical Signals and Sensors. 2025;15(6):-.
BibTex	@article{ author = {A Arbab AMIRALI and A Habibi AREF and H Rabbani HOSSEIN and M Tajmirriahi MAHNOOSH}, title = {From Image to Sequence: Exploring Vision Transformers for Optical Coherence Tomography Classification}, journal = {Journal of Medical Signals and Sensors}, volume = {15}, number = {6}, pages = {-}, year = {2025} }
RIS	TY - JOUR AU - A Arbab AMIRALI AU - A Habibi AREF AU - H Rabbani HOSSEIN AU - M Tajmirriahi MAHNOOSH TI - From Image to Sequence: Exploring Vision Transformers for Optical Coherence Tomography Classification JO - Journal of Medical Signals and Sensors VL - 15 IS - 6 SP - EP - PY - 2025 ER -
