Transdeeplab: Convolution-Free Transformer-Based Deeplab V3+ For Medical Image Segmentation

Science Communicator Platform

Share By

Transdeeplab: Convolution-Free Transformer-Based Deeplab V3+ For Medical Image Segmentation Publisher

Azad R¹ ; Heidari M² ; Shariatnia M³ ; Aghdam EK⁴ ; Karimijafarbigloo S¹ ; Adeli E⁵ ; Merhof D^{1, 6}

Source: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Published:2022

Abstract

Convolutional neural networks (CNNs) have been the de facto standard in a diverse set of computer vision tasks for many years. Especially, deep neural networks based on seminal architectures such as U-shaped model with skip-connections or atrous convolution with pyramid pooling have been tailored to a wide range of medical image analysis tasks. The main advantage of such architectures is that they are prone to detaining versatile local features. However, as a general consensus, CNNs fail to capture long-range dependencies and spatial correlations due to the intrinsic property of confined receptive field size of convolution operations. Alternatively, Transformer, profiting from global information modeling that stems from the self-attention mechanism, has recently attained remarkable performance in natural language processing and computer vision. Nevertheless, previous studies prove that both local and global features are critical for a deep model in dense prediction, such as segmenting complicated structures with disparate shapes and configurations. This paper proposes TransDeepLab, a novel DeepLab-like pure Transformer for medical image segmentation. Specifically, we exploit hierarchical Swin-Transformer with shifted windows to extend the DeepLabv3 and model the Atrous Spatial Pyramid Pooling (ASPP) module. A thorough search of the relevant literature yielded that we are the first to model the seminal DeepLab model with a pure Transformer-based model. Extensive experiments on various medical image segmentation tasks verify that our approach performs superior or on par with most contemporary works on an amalgamation of Vision Transformer and CNN-based methods, along with a significant reduction of model complexity. The codes and trained models are publicly available at github. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Related Docs

View other Related Docs

1. Deep Attention Network for Identifying Ligand-Protein Binding Sites, Journal of Computational Science (2024)

2. Deep Learning Framework for Prediction of Infection Severity of Covid-19, Frontiers in Medicine (2022)

3. An Efficient Capsule-Based Network for 2D Left Ventricle Segmentation in Echocardiography Images, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society# EMBS (2023)

Experts (# of related papers)

View all Related Experts

Alireza Ahmadian (5)

Parastoo Farnia (3)

Other Related Docs

4. Accurate Automatic Glioma Segmentation in Brain Mri Images Based on Capsnet, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society# EMBS (2021)

5. Efficient Segmentation of Active and Inactive Plaques in Flair-Images Using Deeplabv3plus Se With Efficientnetb0 Backbone in Multiple Sclerosis, Scientific Reports (2024)

6. Deep Learning-Based Techniques in Glioma Brain Tumor Segmentation Using Multi-Parametric Mri: A Review on Clinical Applications and Future Outlooks, Journal of Magnetic Resonance Imaging (2025)

7. Current Trends in Glioma Tumor Segmentation: A Survey of Deep Learning Modules, Physica Medica (2025)

8. Segmentation of Pancreatic Ductal Adenocarcinoma (Pdac) and Surrounding Vessels in Ct Images Using Deep Convolutional Neural Networks and Texture Descriptors, Scientific Reports (2022)

9. Deep Vision Transformers for Prognostic Modeling in Covid-19 Patients Using Large Multi-Institutional Chest Ct Dataset, 2022 IEEE NSS/MIC RTSD - IEEE Nuclear Science Symposium# Medical Imaging Conference and Room Temperature Semiconductor Detector Conference (2022)

10. A Memory-Efficient Deep Framework for Multi-Modal Mri-Based Brain Tumor Segmentation, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society# EMBS (2022)

11. A Hybrid Capsule Network for Automatic 3D Mandible Segmentation Applied in Virtual Surgical Planning, Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society# EMBS (2022)

12. Atb-Net: A Novel Attention-Based Convolutional Neural Network for Predicting Full-Dose From Low-Dose Pet Images, 2021 IEEE Nuclear Science Symposium and Medical Imaging Conference Record# NSS/MIC 2021 and 28th International Symposium on Room-Temperature Semiconductor Detectors# RTSD 2022 (2021)

13. Deep Convolutional Neural Networks for Filtering Out Normal Frames in Reviewing Wireless Capsule Endoscopy Videos, Informatics in Medicine Unlocked (2024)

14. Fusion Strategies for Deep Convolutional Neural Network Representations in Histopathological Image Classification, Journal of Supercomputing (2025)

15. Brain Tumor Segmentation Using Multimodal Mri and Convolutional Neural Network, 2022 30th International Conference on Electrical Engineering# ICEE 2022 (2022)

16. Transfer Learning-Based Automatic Detection of Coronavirus Disease 2019 (Covid-19) From Chest X-Ray Images, Journal of Biomedical Physics and Engineering (2020)

17. A Mask-Guided Attention Deep Learning Model for Covid-19 Diagnosis Based on an Integrated Ct Scan Images Database, IISE Transactions on Healthcare Systems Engineering (2023)

18. Deep Learning-Based Automated Delineation of Head and Neck Malignant Lesions From Pet Images, 2020 IEEE Nuclear Science Symposium and Medical Imaging Conference# NSS/MIC 2020 (2020)

19. An Improved Capsule Network for Glioma Segmentation on Mri Images: A Curriculum Learning Approach, Computers in Biology and Medicine (2022)

20. Concurrent Learning Approach for Estimation of Pelvic Tilt From Anterior–Posterior Radiograph, Bioengineering (2024)

Style	Citing Format
MLA	Azad R, et al.. "Transdeeplab: Convolution-Free Transformer-Based Deeplab V3+ For Medical Image Segmentation." Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 13564 LNCS, no. , 2022, pp. 91-102.
APA	Azad R, Heidari M, Shariatnia M, Aghdam EK, Karimijafarbigloo S, Adeli E, Merhof D (2022). Transdeeplab: Convolution-Free Transformer-Based Deeplab V3+ For Medical Image Segmentation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13564 LNCS(), 91-102.
Chicago	Azad R, Heidari M, Shariatnia M, Aghdam EK, Karimijafarbigloo S, Adeli E, Merhof D. "Transdeeplab: Convolution-Free Transformer-Based Deeplab V3+ For Medical Image Segmentation." Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 13564 LNCS, no. (2022): 91-102.
Harvard	Azad R et al. (2022) 'Transdeeplab: Convolution-Free Transformer-Based Deeplab V3+ For Medical Image Segmentation', Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 13564 LNCS(), pp. 91-102.
Vancouver	Azad R, Heidari M, Shariatnia M, Aghdam EK, Karimijafarbigloo S, Adeli E, et al.. Transdeeplab: Convolution-Free Transformer-Based Deeplab V3+ For Medical Image Segmentation. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2022;13564 LNCS():91-102.
BibTex	@article{ author = {Azad R and Heidari M and Shariatnia M and Aghdam EK and Karimijafarbigloo S and Adeli E and Merhof D}, title = {Transdeeplab: Convolution-Free Transformer-Based Deeplab V3+ For Medical Image Segmentation}, journal = {Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)}, volume = {13564 LNCS}, number = {}, pages = {91-102}, year = {2022} }
RIS	TY - JOUR AU - Azad R AU - Heidari M AU - Shariatnia M AU - Aghdam EK AU - Karimijafarbigloo S AU - Adeli E AU - Merhof D TI - Transdeeplab: Convolution-Free Transformer-Based Deeplab V3+ For Medical Image Segmentation JO - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) VL - 13564 LNCS IS - SP - 91 EP - 102 PY - 2022 ER -

Science Communicator Platform

Authors

Abstract