Low-light image enhancement of high-speed endoscopic videos using a convolutional neural network.


Journal

Medical & biological engineering & computing
ISSN: 1741-0444
Titre abrégé: Med Biol Eng Comput
Pays: United States
ID NLM: 7704869

Informations de publication

Date de publication:
Jul 2019
Historique:
received: 30 10 2018
accepted: 20 02 2019
pubmed: 23 3 2019
medline: 29 1 2020
entrez: 23 3 2019
Statut: ppublish

Résumé

Laryngeal endoscopy is one of the primary diagnostic tools for laryngeal disorders. The main techniques are videostroboscopy and lately high-speed video endoscopy. Unfortunately, due to the restricting anatomy of the larynx and technical limitations of the recording equipment, many videos suffer from insufficient illumination, which complicates clinical examination and analysis. This work presents an approach to enhance low-light images from high-speed video endoscopy using a convolutional neural network. We introduce a new technique to generate realistically darkened training samples using Perlin noise. Extensive data augmentation is employed to cope with the limited training data allowing training with just 55 videos. The approach is compared against four state-of-the-art low-light enhancement methods and statistically significantly outperforms each on a no-reference (NIQE) and two full-reference (PSNR, SSIM) image quality metrics. The presented approach can be run on consumer-grade hardware and is thereby directly applicable in a clinical context. It is likely transferable to similar techniques such as videostroboscopy. Graphical Abstract The basic setup for training and employing an improved fully convolutional U-Net neural network to predict a brightness map used to enhance the lighting of ill-lit endoscopic high-speed videos - Artificially darkened training data are created using Perlin noise to allow region-specific darkening.

Identifiants

pubmed: 30900057
doi: 10.1007/s11517-019-01965-4
pii: 10.1007/s11517-019-01965-4
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1451-1463

Subventions

Organisme : Deutsche Forschungsgemeinschaft
ID : 323308998 under grant no. DO1247/8-1 and BO4399/2-1

Références

Med Biol Eng Comput. 2001 May;39(3):273-8
pubmed: 11465879
IEEE Trans Image Process. 2004 Apr;13(4):600-12
pubmed: 15376593
Folia Phoniatr Logop. 2006;58(3):175-85
pubmed: 16636565
Med Image Anal. 2007 Aug;11(4):400-13
pubmed: 17544839
Folia Phoniatr Logop. 2008;60(1):33-44
pubmed: 18057909
Ann Otol Rhinol Laryngol. 2008 Jun;117(6):413-24
pubmed: 18646437
IEEE Trans Image Process. 2009 Sep;18(9):1921-35
pubmed: 19403363
J Acoust Soc Am. 2011 Jan;129(1):326-39
pubmed: 21303014
IEEE Trans Image Process. 2011 Dec;20(12):3431-41
pubmed: 21609884
J Acoust Soc Am. 2011 Dec;130(6):3999-4009
pubmed: 22225054
Laryngoscope. 2012 Jul;122(7):1582-8
pubmed: 22544473
Laryngoscope. 2012 Nov;122(11):2511-8
pubmed: 22965771
Curr Opin Otolaryngol Head Neck Surg. 2012 Dec;20(6):466-71
pubmed: 23000735
Am J Speech Lang Pathol. 2013 May;22(2):212-26
pubmed: 23184134
IEEE Trans Image Process. 2013 Sep;22(9):3538-48
pubmed: 23661319
IEEE Trans Image Process. 2013 Dec;22(12):5372-84
pubmed: 24108715
Laryngoscope. 2014 Oct;124(10):2359-62
pubmed: 24782443
IEEE Trans Biomed Eng. 2015 Mar;62(3):795-806
pubmed: 25350912
J Acoust Soc Am. 2014 Dec;136(6):3290
pubmed: 25480074
IEEE Trans Med Imaging. 2016 Jul;35(7):1615-24
pubmed: 26829782
IEEE Trans Med Imaging. 2016 May;35(5):1285-98
pubmed: 26886976
IEEE Trans Med Imaging. 2016 May;35(5):1299-1312
pubmed: 26978662
IEEE Trans Image Process. 2017 Feb;26(2):982-993
pubmed: 28113318
J Voice. 2017 Sep;31(5):594-600
pubmed: 28416083
Med Biol Eng Comput. 2017 Dec;55(12):2123-2141
pubmed: 28550413
IEEE Trans Image Process. 2017 Sep;26(9):4509-4522
pubmed: 28641250
Med Image Anal. 2017 Dec;42:60-88
pubmed: 28778026
Nat Hum Behav. 2018 Jan;2(1):6-10
pubmed: 30980045
J Opt Soc Am. 1971 Jan;61(1):1-11
pubmed: 5541571
J Voice. 1996 Jun;10(2):201-5
pubmed: 8734395

Auteurs

Pablo Gómez (P)

Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany. pablo.gomez@uk-erlangen.de.

Marion Semmler (M)

Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany.

Anne Schützenberger (A)

Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany.

Christopher Bohr (C)

ENT Department, University Hospital Regensburg, University Regensburg, Franz-Josef-Strauß-Allee 11, 93053, Regensburg, Germany.

Michael Döllinger (M)

Division of Phoniatrics and Pediatric Audiology, Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg, Waldstraße 1, 91054, Erlangen, Germany.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH