EfficientUNetViT: Efficient Breast Tumor Segmentation Utilizing UNet Architecture and Pretrained Vision Transformer.

UNet breast cancer depthwise separable convolutional vision transformer

Journal

Bioengineering (Basel, Switzerland)
ISSN: 2306-5354
Titre abrégé: Bioengineering (Basel)
Pays: Switzerland
ID NLM: 101676056

Informations de publication

Date de publication:
21 Sep 2024
Historique:
received: 05 09 2024
accepted: 18 09 2024
medline: 27 9 2024
pubmed: 27 9 2024
entrez: 27 9 2024
Statut: epublish

Résumé

This study introduces a sophisticated neural network structure for segmenting breast tumors. It achieves this by combining a pretrained Vision Transformer (ViT) model with a UNet framework. The UNet architecture, commonly employed for biomedical image segmentation, is further enhanced with depthwise separable convolutional blocks to decrease computational complexity and parameter count, resulting in better efficiency and less overfitting. The ViT, renowned for its robust feature extraction capabilities utilizing self-attention processes, efficiently captures the overall context within images, surpassing the performance of conventional convolutional networks. By using a pretrained ViT as the encoder in our UNet model, we take advantage of its extensive feature representations acquired from extensive datasets, resulting in a major enhancement in the model's ability to generalize and train efficiently. The suggested model has exceptional performance in segmenting breast cancers from medical images, highlighting the advantages of integrating transformer-based encoders with efficient UNet topologies. This hybrid methodology emphasizes the capabilities of transformers in the field of medical image processing and establishes a new standard for accuracy and efficiency in activities related to tumor segmentation.

Identifiants

pubmed: 39329687
pii: bioengineering11090945
doi: 10.3390/bioengineering11090945
pii:
doi:

Types de publication

Journal Article

Langues

eng

Auteurs

Shokofeh Anari (S)

Department of Accounting, Economic and Financial Sciences, Islamic Azad University, South Tehran Branch, Tehran 1584743311, Iran.

Gabriel Gomes de Oliveira (GG)

Poli.TIC-CTI-Renato Archer, Campinas 13069-901, Brazil.

Ramin Ranjbarzadeh (R)

School of Computing, Faculty of Engineering and Computing, Dublin City University, D09 V209 Dublin, Ireland.

Angela Maria Alves (AM)

Poli.TIC-CTI-Renato Archer, Campinas 13069-901, Brazil.

Gabriel Caumo Vaz (GC)

School of Electrical and Computer Engineering, State University of Campinas, Campinas 13083-852, Brazil.

Malika Bendechache (M)

ADAPT Research Centre, School of Computer Science, University of Galway, H91 TK33 Galway, Ireland.

Classifications MeSH