Deep Learning for Automatic Segmentation of Oral and Oropharyngeal Cancer Using Narrow Band Imaging: Preliminary Experience in a Clinical Perspective.

deep learning machine learning narrow band imaging neural network oral cancer oropharyngeal cancer segmentation

Journal

Frontiers in oncology
ISSN: 2234-943X
Titre abrégé: Front Oncol
Pays: Switzerland
ID NLM: 101568867

Informations de publication

Date de publication:
2021
Historique:
received: 06 11 2020
accepted: 08 03 2021
entrez: 12 4 2021
pubmed: 13 4 2021
medline: 13 4 2021
Statut: epublish

Résumé

Fully convoluted neural networks (FCNN) applied to video-analysis are of particular interest in the field of head and neck oncology, given that endoscopic examination is a crucial step in diagnosis, staging, and follow-up of patients affected by upper aero-digestive tract cancers. The aim of this study was to test FCNN-based methods for semantic segmentation of squamous cell carcinoma (SCC) of the oral cavity (OC) and oropharynx (OP). Two datasets were retrieved from the institutional registry of a tertiary academic hospital analyzing 34 and 45 NBI endoscopic videos of OC and OP lesions, respectively. The dataset referring to the OC was composed of 110 frames, while 116 frames composed the OP dataset. Three FCNNs (U-Net, U-Net 3, and ResNet) were investigated to segment the neoplastic images. FCNNs performance was evaluated for each tested network and compared to the gold standard, represented by the manual annotation performed by expert clinicians. For FCNN-based segmentation of the OC dataset, the best results in terms of Dice Similarity Coefficient (Dsc) were achieved by ResNet with 5(×2) blocks and 16 filters, with a median value of 0.6559. In FCNN-based segmentation for the OP dataset, the best results in terms of Dsc were achieved by ResNet with 4(×2) blocks and 16 filters, with a median value of 0.7603. All tested FCNNs presented very high values of variance, leading to very low values of minima for all metrics evaluated. FCNNs have promising potential in the analysis and segmentation of OC and OP video-endoscopic images. All tested FCNN architectures demonstrated satisfying outcomes in terms of diagnostic accuracy. The inference time of the processing networks were particularly short, ranging between 14 and 115 ms, thus showing the possibility for real-time application.

Identifiants

pubmed: 33842330
doi: 10.3389/fonc.2021.626602
pmc: PMC8024583
doi:

Types de publication

Journal Article

Langues

eng

Pagination

626602

Informations de copyright

Copyright © 2021 Paderno, Piazza, Del Bon, Lancini, Tanagli, Deganello, Peretti, De Momi, Patrini, Ruperti, Mattos and Moccia.

Déclaration de conflit d'intérêts

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Références

Dig Endosc. 2020 Oct 8;:
pubmed: 33029824
Head Neck. 2020 Sep;42(9):2581-2592
pubmed: 32542892
Acta Otorhinolaryngol Ital. 2008 Apr;28(2):49-54
pubmed: 18669067
Int J Oral Maxillofac Surg. 2010 Mar;39(3):208-13
pubmed: 20185271
Curr Opin Otolaryngol Head Neck Surg. 2021 Apr 1;29(2):143-148
pubmed: 33595977
Dig Endosc. 2020 Jul 26;:
pubmed: 32715508
Otolaryngol Head Neck Surg. 2008 Apr;138(4):446-51
pubmed: 18359352
Laryngoscope. 2021 Apr;131(4):E1156-E1161
pubmed: 32797677
Biomed Opt Express. 2018 Oct 10;9(11):5318-5329
pubmed: 30460130
Laryngoscope. 2019 Feb;129(2):429-434
pubmed: 30229933
J Laryngol Otol. 2011 Mar;125(3):288-96
pubmed: 21054921
Int J Comput Assist Radiol Surg. 2019 Mar;14(3):483-492
pubmed: 30649670
Eur Arch Otorhinolaryngol. 2016 May;273(5):1207-14
pubmed: 26677852
Nat Biomed Eng. 2017 Sep;1(9):691-696
pubmed: 31015666
Laryngoscope. 2020 Nov;130(11):E686-E693
pubmed: 32068890
Curr Opin Otolaryngol Head Neck Surg. 2011 Apr;19(2):67-76
pubmed: 21330924
Clin Otolaryngol. 2019 Sep;44(5):729-735
pubmed: 31074935
Head Neck. 2015 Feb;37(2):215-22
pubmed: 24375619
Laryngoscope. 2018 Nov;128(11):2514-2520
pubmed: 29577322
IEEE J Biomed Health Inform. 2016 Jan;20(1):322-32
pubmed: 25438330
Eur Arch Otorhinolaryngol. 2016 Oct;273(10):3347-53
pubmed: 26879990
Dig Endosc. 2020 Nov;32(7):1057-1065
pubmed: 32064684

Auteurs

Alberto Paderno (A)

Department of Otorhinolaryngology-Head and Neck Surgery, ASST-Spedali Civili of Brescia, University of Brescia, Brescia, Italy.

Cesare Piazza (C)

Department of Otorhinolaryngology-Head and Neck Surgery, ASST-Spedali Civili of Brescia, University of Brescia, Brescia, Italy.

Francesca Del Bon (F)

Department of Otorhinolaryngology-Head and Neck Surgery, ASST-Spedali Civili of Brescia, University of Brescia, Brescia, Italy.

Davide Lancini (D)

Department of Otorhinolaryngology-Head and Neck Surgery, ASST-Spedali Civili of Brescia, University of Brescia, Brescia, Italy.

Stefano Tanagli (S)

Department of Otorhinolaryngology-Head and Neck Surgery, ASST-Spedali Civili of Brescia, University of Brescia, Brescia, Italy.

Alberto Deganello (A)

Department of Otorhinolaryngology-Head and Neck Surgery, ASST-Spedali Civili of Brescia, University of Brescia, Brescia, Italy.

Giorgio Peretti (G)

Department of Otorhinolaryngology-Head and Neck Surgery, IRCCS San Martino Hospital, University of Genoa, Genoa, Italy.

Elena De Momi (E)

Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy.

Ilaria Patrini (I)

Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy.

Michela Ruperti (M)

Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy.

Leonardo S Mattos (LS)

Department of Advanced Robotics, Istituto Italiano di Tecnologia, Genoa, Italy.

Sara Moccia (S)

Department of Advanced Robotics, Istituto Italiano di Tecnologia, Genoa, Italy.
The BioRobotics Institute, Scuola Superiore Sant'Anna, Pisa, Italy.
Department of Excellence in Robotics and AI, Scuola Superiore Sant'Anna, Pisa, Italy.

Classifications MeSH