Medical Image Segmentation Based on Hybrid Deep Learning Techniques
DOI:
https://doi.org/10.31185/wjps.736Keywords:
Image segmentation, U-Net, Transformer encoderAbstract
ABSTRACT: Medical image segmentation is an integral component of computer-aided treatment planning and diagnosis, enabling accurate analysis of abnormalities and anatomical structures. In this paper, a novel U-Net and transformer encoder-based hybrid deep learning architecture is developed to advance precision and speed in medical image segmentation. The U-Net architecture, for its remarkable capability to capture local details, is integrated in synergy with a transformer encoder, whose power is in capturing data's long-range dependencies. The synergy between such capabilities makes the resultant hybrid architecture robust for segmentation in diverse medical imaging modalities. The approach is rigorously compared on publicly available datasets, including brain MRI, Chest X-Rays, and ISIC, and is shown to have better Dice coefficients than current state-of-the-art. The synergy between U-Net and transformer components, in addition to better segmentation, is shown to result in computational savings, thus becoming efficient for clinical application. The result demonstrates the capability of such a hybrid architecture to advance diagnostic precision and contribute to enhanced treatment planning in clinical settings. The work is in alignment with growing literature on using advanced deep learning to overcome challenging medical image analysis problems, ultimately leading to enhanced clinical outcomes through accurate and reliable tools for segmentation.
References
X. Liu, L. Song, S. Liu, and Y. Zhang, “A Review of Deep-Learning-Based Medical Image Segmentation Methods,” Sustainability, vol. 13, no. 3, p. 1224, Jan. 2021, doi: https://doi.org/10.3390/su13031224.
H. Shang, Y. Tong, M. Li, S. Xu, L. Xu, and Z. Cao, “Deep Learning-based U-Mamba Model to Predict Differentiated Gastric Cancer using Radiomics Features from Spleen Segmentation,” Current Medical Imaging Formerly Current Medical Imaging Reviews, vol. 20, Dec. 2024, doi: https://doi.org/10.2174/0115734056349216241118115005.
R. Wang, T. Lei, R. Cui, B. Zhang, H. Meng, and A. K. Nandi, “Medical image segmentation using deep learning: A survey,” IET Image Processing, Jan. 2022, doi: https://doi.org/10.1049/ipr2.12419.
W. Weng and X. Zhu, “INet: Convolutional Networks for Biomedical Image Segmentation,” IEEE Access, vol. 9, pp. 16591–16603, 2021, doi: https://doi.org/10.1109/access.2021.3053408.
A. Dosovitskiy et al., “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,” International Conference on Learning Representations, May 2021, [Online]. Available: https://openreview.net/pdf?id=YicbFdNTTy
J. Chen et al., “TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers,” Medical Image Analysis, pp. 103280–103280, Jul. 2024, doi: https://doi.org/10.1016/j.media.2024.103280.
S. Tong, Z. Zuo, Z. Liu, D. Sun, and T. Zhou, “Hybrid attention mechanism of feature fusion for medical image segmentation,” IET Image Processing, vol. 18, no. 1, pp. 77–87, Sep. 2023, doi: https://doi.org/10.1049/ipr2.12934.
Z. Liu et al., “Swin Transformer: Hierarchical Vision Transformer using Shifted Windows,” 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 9992–10002, Oct. 2021, doi: 10.1109/iccv48922.2021.00986.
X. Lin, L. Yu, K.-T. Cheng, and Z. Yan, “BATFormer: Towards Boundary-Aware Lightweight Transformer for Efficient Medical Image Segmentation,” IEEE journal of biomedical and health informatics, vol. 27, no. 7, pp. 3501–3512, Jul. 2023, doi: https://doi.org/10.1109/jbhi.2023.3266977.
B. H. Menze et al., “The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS),” PMC, Oct. 2015, [Online]. Available: https://dspace.mit.edu/handle/1721.1/110992
“ISIC 2024 - Skin Cancer Detection with 3D-TBP,” @kaggle, 2024. https://www.kaggle.com/competitions/isic-2024-challenge/overview/ (accessed Feb. 27, 2025).
D. S. Kermany et al., “Identifying medical diagnoses and treatable diseases by Image-Based Deep Learning,” Cell, vol. 172, no. 5, pp. 1122-1131.e9, Feb. 2018, doi: 10.1016/j.cell.2018.02.010.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Mohammed Abdulameer Aljanabi , Noor Abd Alrazak Shnain

This work is licensed under a Creative Commons Attribution 4.0 International License.