EfficientUNetViT: Efficient Breast Tumor Segmentation Utilizing UNet Architecture and Pretrained Vision Transformer.
UNet
breast cancer
depthwise separable convolutional
vision transformer
Journal
Bioengineering (Basel, Switzerland)
ISSN: 2306-5354
Titre abrégé: Bioengineering (Basel)
Pays: Switzerland
ID NLM: 101676056
Informations de publication
Date de publication:
21 Sep 2024
21 Sep 2024
Historique:
received:
05
09
2024
accepted:
18
09
2024
medline:
27
9
2024
pubmed:
27
9
2024
entrez:
27
9
2024
Statut:
epublish
Résumé
This study introduces a sophisticated neural network structure for segmenting breast tumors. It achieves this by combining a pretrained Vision Transformer (ViT) model with a UNet framework. The UNet architecture, commonly employed for biomedical image segmentation, is further enhanced with depthwise separable convolutional blocks to decrease computational complexity and parameter count, resulting in better efficiency and less overfitting. The ViT, renowned for its robust feature extraction capabilities utilizing self-attention processes, efficiently captures the overall context within images, surpassing the performance of conventional convolutional networks. By using a pretrained ViT as the encoder in our UNet model, we take advantage of its extensive feature representations acquired from extensive datasets, resulting in a major enhancement in the model's ability to generalize and train efficiently. The suggested model has exceptional performance in segmenting breast cancers from medical images, highlighting the advantages of integrating transformer-based encoders with efficient UNet topologies. This hybrid methodology emphasizes the capabilities of transformers in the field of medical image processing and establishes a new standard for accuracy and efficiency in activities related to tumor segmentation.
Identifiants
pubmed: 39329687
pii: bioengineering11090945
doi: 10.3390/bioengineering11090945
pii:
doi:
Types de publication
Journal Article
Langues
eng