Automatic Recognition of Laryngoscopic Images Using a Deep-Learning Technique.

Adult Deep Learning / statistics & numerical data Female Humans Image Interpretation, Computer-Assisted / statistics & numerical data Laryngeal Neoplasms / diagnostic imaging Laryngoscopy / methods Male Otolaryngologists / statistics & numerical data Reproducibility of Results Retrospective Studies Sensitivity and Specificity

Deep learning artificial intelligence clinical visual assessment. convolutional neural networks laryngoscopic image

Journal

The Laryngoscope

ISSN: 1531-4995

Titre abrégé: Laryngoscope

Pays: United States

ID NLM: 8607378

Informations de publication

Date de publication:
11 2020

Historique:

received: 30 07 2019

revised: 17 12 2019

accepted: 30 12 2019

pubmed: 19 2 2020

medline: 1 1 2021

entrez: 19 2 2020

Statut: ppublish

Résumé

To develop a deep-learning-based computer-aided diagnosis system for distinguishing laryngeal neoplasms (benign, precancerous lesions, and cancer) and improve the clinician-based accuracy of diagnostic assessments of laryngoscopy findings. Retrospective study. A total of 24,667 laryngoscopy images (normal, vocal nodule, polyps, leukoplakia and malignancy) were collected to develop and test a convolutional neural network (CNN)-based classifier. A comparison between the proposed CNN-based classifier and the clinical visual assessments (CVAs) by 12 otolaryngologists was conducted. In the independent testing dataset, an overall accuracy of 96.24% was achieved; for leukoplakia, benign, malignancy, normal, and vocal nodule, the sensitivity and specificity were 92.8% vs. 98.9%, 97% vs. 99.7%, 89% vs. 99.3%, 99.0% vs. 99.4%, and 97.2% vs. 99.1%, respectively. Furthermore, when compared with CVAs on the randomly selected test dataset, the CNN-based classifier outperformed physicians for most laryngeal conditions, with striking improvements in the ability to distinguish nodules (98% vs. 45%, P < .001), polyps (91% vs. 86%, P < .001), leukoplakia (91% vs. 65%, P < .001), and malignancy (90% vs. 54%, P < .001). The CNN-based classifier can provide a valuable reference for the diagnosis of laryngeal neoplasms during laryngoscopy, especially for distinguishing benign, precancerous, and cancer lesions. NA Laryngoscope, 130:E686-E693, 2020.

Identifiants

DOI: 10.1002/lary.28539 PMID: 32068890

pubmed: 32068890

doi: 10.1002/lary.28539

doi:

Types de publication

Evaluation Study Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

Pagination

E686-E693

Informations de copyright

Références

Alberti P. The history of laryngology: a centennial celebration. Otolaryngol Head Neck Surg 1996;114:345-354.

Stachler R, Francis DO, Schwartz SR, et al. Clinical practice guideline: hoarseness (dysphonia) (update). Otolaryngol Head Neck Surg 2018;158:S1-S42.

Russakovsky O, Deng J, Krause J, et al. ImageNet large scale visual recognition challenge. Int J Comput Vis 2015;115:211-252.

Deng J, Dong W, Socher R, Li L. ImageNet: a large-scale hierarchical image database. Paper presented at: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009); June 20-25, 2009; Miami, Florid, USA. IEEE, 2009.

Pan S, Yang Q. A survey on transfer learning. IEEE Trans Knowl Data Eng 2009;22:1345-1359.

Shen D, Wu G, Suk H. Deep learning in medical image analysis. Ann Rev Biomed Eng 2017;19:221-248.

Litjens G, Kooi T, Bejnordi B, et al. A survey on deep learning in medical image analysis. Med Image Anal 2017;42:60-88.

Zhang JP, Xia Y, Xie Y, Fulham M, Feng DD. Classification of medical images in the biomedical literature by jointly using deep and handcrafted visual features. IEEE J Biomed Health Inform 2017;22:1521-1530.

Esteva A, Kuprel B, Novoa R, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 2017;542:115-118.

Gulshan V, Peng L, Coram M, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 2016;316:2402-2410.

Yu L, Chen H, Dou Q, Qin J, Heng PA. Integrating online and offline three-dimensional deep learning for automated polyp detection in colonoscopy videos. IEEE J Biomed Health Inform 2017;21:65-75.

Bejnordi BE, Veta M, Johannes van Diest P, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast cancer. JAMA 2017;318:2199-2210.

Hirasawa T, Aoyama K, Tanimoto T, et al. Application of artificial intelligence using a convolutional neural network for detecting gastric cancer in endoscopic images. Gastric Cancer 2018;21:653-660.

He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. Paper presented at: IEEE Conference on Computer Vision and Pattern Recognition (CVPR); June 26-July 1, 2016; Las Vegas, NV.

Shin HC, Roth HR, Gao M, et al. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans Med Imaging 2016;35:1285-1298.

Xie S, Tu Z. Holistically-nested edge detection. The IEEE International Conference on Computer Vision (ICCV); December 7-13, 2015; Santiago, Chile.

van der Maaten L, Hinton G. Visualizing data using t-SNE. J Mach Learn Res 2008;9:2579-2605.

Ramprasaath RS, Michael C, Abhishek D, et al. Grad-CAM: visual explanations from deep networks via gradient-based localization. Int J Comput Vis 2020;128:336-359.

Cheng CT, Ho TY, Lee TY, et al. Application of a deep learning algorithm for detection and visualization of hip fractures on plain pelvic radiographs. Eur Radiol 2019;29:5469-5477.

Witt DR, Chen H, Mielens JD, et al. Detection of chronic laryngitis due to laryngopharyngeal reflux using color and texture analysis of laryngoscopic images. J Voice 2014;28:98-105.

Verikas A, Gelzinis A, Bacauskiene M, Uloza V. Integrating global and local analysis of color, texture and geometrical information for categorizing laryngeal images. Int J Pattern Recognit Artifici Intel 2006;20:1187-1205.

Ilgner JF, Palm C, Schutz AG, Spitzer K, Westhofen M, Lehmann TM. Colour texture analysis for quantitative laryngoscopy. Acta Otolaryngol 2003;123:730-734.