Automated cooling tower detection through deep learning for Legionnaires' disease outbreak investigations: a model development and validation study.


Journal

The Lancet. Digital health
ISSN: 2589-7500
Titre abrégé: Lancet Digit Health
Pays: England
ID NLM: 101751302

Informations de publication

Date de publication:
Jul 2024
Historique:
received: 23 02 2023
revised: 10 04 2024
accepted: 01 05 2024
medline: 22 6 2024
pubmed: 22 6 2024
entrez: 21 6 2024
Statut: ppublish

Résumé

Cooling towers containing Legionella spp are a high-risk source of Legionnaires' disease outbreaks. Manually locating cooling towers from aerial imagery during outbreak investigations requires expertise, is labour intensive, and can be prone to errors. We aimed to train a deep learning computer vision model to automatically detect cooling towers that are aerially visible. Between Jan 1 and 31, 2021, we extracted satellite view images of Philadelphia (PN, USA) and New York state (NY, USA) from Google Maps and annotated cooling towers to create training datasets. We augmented training data with synthetic data and model-assisted labelling of additional cities. Using 2051 images containing 7292 cooling towers, we trained a two-stage model using YOLOv5, a model that detects objects in images, and EfficientNet-b5, a model that classifies images. We assessed the primary outcomes of sensitivity and positive predictive value (PPV) of the model against manual labelling on test datasets of 548 images, including from two cities not seen in training (Boston [MA, USA] and Athens [GA, USA]). We compared the search speed of the model with that of manual searching by four epidemiologists. The model identified visible cooling towers with 95·1% sensitivity (95% CI 94·0-96·1) and a PPV of 90·1% (95% CI 90·0-90·2) in New York City and Philadelphia. In Boston, sensitivity was 91·6% (89·2-93·7) and PPV was 80·8% (80·5-81·2). In Athens, sensitivity was 86·9% (75·8-94·2) and PPV was 85·5% (84·2-86·7). For an area of New York City encompassing 45 blocks (0·26 square miles), the model searched more than 600 times faster (7·6 s; 351 potential cooling towers identified) than did human investigators (mean 83·75 min [SD 29·5]; mean 310·8 cooling towers [42·2]). The model could be used to accelerate investigation and source control during outbreaks of Legionnaires' disease through the identification of cooling towers from aerial imagery, potentially preventing additional disease spread. The model has already been used by public health teams for outbreak investigations and to initialise cooling tower registries, which are considered best practice for preventing and responding to outbreaks of Legionnaires' disease. None.

Sections du résumé

BACKGROUND BACKGROUND
Cooling towers containing Legionella spp are a high-risk source of Legionnaires' disease outbreaks. Manually locating cooling towers from aerial imagery during outbreak investigations requires expertise, is labour intensive, and can be prone to errors. We aimed to train a deep learning computer vision model to automatically detect cooling towers that are aerially visible.
METHODS METHODS
Between Jan 1 and 31, 2021, we extracted satellite view images of Philadelphia (PN, USA) and New York state (NY, USA) from Google Maps and annotated cooling towers to create training datasets. We augmented training data with synthetic data and model-assisted labelling of additional cities. Using 2051 images containing 7292 cooling towers, we trained a two-stage model using YOLOv5, a model that detects objects in images, and EfficientNet-b5, a model that classifies images. We assessed the primary outcomes of sensitivity and positive predictive value (PPV) of the model against manual labelling on test datasets of 548 images, including from two cities not seen in training (Boston [MA, USA] and Athens [GA, USA]). We compared the search speed of the model with that of manual searching by four epidemiologists.
FINDINGS RESULTS
The model identified visible cooling towers with 95·1% sensitivity (95% CI 94·0-96·1) and a PPV of 90·1% (95% CI 90·0-90·2) in New York City and Philadelphia. In Boston, sensitivity was 91·6% (89·2-93·7) and PPV was 80·8% (80·5-81·2). In Athens, sensitivity was 86·9% (75·8-94·2) and PPV was 85·5% (84·2-86·7). For an area of New York City encompassing 45 blocks (0·26 square miles), the model searched more than 600 times faster (7·6 s; 351 potential cooling towers identified) than did human investigators (mean 83·75 min [SD 29·5]; mean 310·8 cooling towers [42·2]).
INTERPRETATION CONCLUSIONS
The model could be used to accelerate investigation and source control during outbreaks of Legionnaires' disease through the identification of cooling towers from aerial imagery, potentially preventing additional disease spread. The model has already been used by public health teams for outbreak investigations and to initialise cooling tower registries, which are considered best practice for preventing and responding to outbreaks of Legionnaires' disease.
FUNDING BACKGROUND
None.

Identifiants

pubmed: 38906615
pii: S2589-7500(24)00094-3
doi: 10.1016/S2589-7500(24)00094-3
pii:
doi:

Types de publication

Journal Article Validation Study

Langues

eng

Sous-ensembles de citation

IM

Pagination

e500-e506

Informations de copyright

Copyright © 2024 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY-NC-ND 4.0 license. Published by Elsevier Ltd.. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of interests FN owns stock in Google and Amazon. All other authors declare no competing interests. The findings and conclusions of this study are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.

Auteurs

Karen K Wong (KK)

Centers for Disease Control and Prevention, Atlanta, GA, USA; University of California, Berkeley, CA, USA. Electronic address: kwong.cdc@gmail.com.

Thaddeus Segura (T)

University of California, Berkeley, CA, USA.

Gunnar Mein (G)

University of California, Berkeley, CA, USA.

Jia Lu (J)

University of California, Berkeley, CA, USA.

Elizabeth J Hannapel (EJ)

Centers for Disease Control and Prevention, Atlanta, GA, USA.

Jasen M Kunz (JM)

Centers for Disease Control and Prevention, Atlanta, GA, USA.

Troy Ritter (T)

Centers for Disease Control and Prevention, Atlanta, GA, USA.

Jessica C Smith (JC)

Centers for Disease Control and Prevention, Atlanta, GA, USA.

Alberto Todeschini (A)

University of California, Berkeley, CA, USA.

Fred Nugen (F)

University of California, Berkeley, CA, USA.

Chris Edens (C)

Centers for Disease Control and Prevention, Atlanta, GA, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH