Validation of a Machine Learning Model to Predict Childhood Lead Poisoning.

Child, Preschool; Female; Health Care Rationing; Humans; Lead Poisoning / diagnosis; Logistic Models; Machine Learning; Male; Preventive Health Services / methods; Resource Allocation; Risk Assessment / methods; Sensitivity and Specificity; United States

Journal

JAMA Network Open

ISSN: 2574-3805

Abbreviated title: JAMA Netw Open

Country: United States

NLM ID: 101729235

Publication information

Publication date: 1 September 2020

History:

entrez: 16 September 2020

pubmed: 17 September 2020

medline: 7 January 2021

Status: epublish

Abstract

Importance: Childhood lead poisoning causes irreversible neurobehavioral deficits, but current practice is secondary prevention.

Objective: To validate a machine learning (random forest) prediction model of elevated blood lead levels (EBLLs) by comparison with a parsimonious logistic regression.

Design, Setting, and Participants: This prognostic study for temporal validation of multivariable prediction models used data from the Women, Infants, and Children (WIC) program of the Chicago Department of Public Health. Participants included a development cohort of children born from January 1, 2007, to December 31, 2012, and a validation WIC cohort born from January 1 to December 31, 2013. Blood lead levels were measured until December 31, 2018. Data were analyzed from January 1 to October 31, 2019.

Exposures: Blood lead level test results; lead investigation findings; housing characteristics, permits, and violations; and demographic variables.

Main Outcomes and Measures: Incident EBLL (≥6 μg/dL). Models were assessed using the area under the receiver operating characteristic curve (AUC) and confusion matrix metrics (positive predictive value, sensitivity, and specificity) at various thresholds.

Results: Among 6812 children in the WIC validation cohort, 3451 (50.7%) were female, 3057 (44.9%) were Hispanic, 2804 (41.2%) were non-Hispanic Black, 458 (6.7%) were non-Hispanic White, and 442 (6.5%) were Asian (mean [SD] age, 5.5 [0.3] years). The median year of housing construction was 1919 (interquartile range, 1903-1948). Random forest AUC was 0.69 compared with 0.64 for logistic regression (difference, 0.05; 95% CI, 0.02-0.08). When predicting the 5% of children at highest risk to have EBLLs, random forest and logistic regression models had positive predictive values of 15.5% and 7.8%, respectively (difference, 7.7%; 95% CI, 3.7%-11.3%), sensitivity of 16.2% and 8.1%, respectively (difference, 8.1%; 95% CI, 3.9%-11.7%), and specificity of 95.5% and 95.1%, respectively (difference, 0.4%; 95% CI, 0.0%-0.7%).

Conclusions and Relevance: The machine learning model outperformed regression in predicting childhood lead poisoning, especially in identifying children at highest risk. Such a model could be used to target the allocation of lead poisoning prevention resources to these children.
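The evaluation measures named in the abstract (AUC, plus positive predictive value, sensitivity, and specificity when the top fraction of children by predicted risk is flagged) can be illustrated with a minimal sketch. This is not the authors' code: the function names `auc` and `metrics_at_top_fraction` are hypothetical, the scores and labels are synthetic, and the example flags the top 20% only so that the tiny data set yields whole numbers.

```python
from typing import Dict, Sequence


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AUC as the share of (positive, negative) pairs in which the positive
    case has the higher score; tied pairs count as half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def metrics_at_top_fraction(
    scores: Sequence[float], labels: Sequence[int], fraction: float
) -> Dict[str, float]:
    """Flag the `fraction` of cases with the highest scores, then compute
    confusion matrix metrics for that flagged group."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative label")
    n_flagged = max(1, round(fraction * len(scores)))
    # Indices sorted from highest to lowest predicted risk.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    flagged = order[:n_flagged]
    tp = sum(1 for i in flagged if labels[i] == 1)  # flagged, truly elevated
    fp = n_flagged - tp                             # flagged, not elevated
    fn = n_pos - tp                                 # missed elevated cases
    tn = n_neg - fp                                 # correctly not flagged
    return {
        "ppv": tp / n_flagged,
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Synthetic example: 10 children, 3 of whom later have an elevated level.
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
labels = [1, 0, 1, 0, 0, 1, 0, 0, 0, 0]

print(auc(scores, labels))
print(metrics_at_top_fraction(scores, labels, fraction=0.2))
```

Fixing the flagged fraction rather than a probability cutoff mirrors the resource allocation framing of the abstract: the number of children who can receive preventive services is set in advance, and the models are compared on how many truly at-risk children fall inside that group.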

Identifiers

DOI: 10.1001/jamanetworkopen.2020.12734 PMID: 32936296 PMC: PMC7495240

pii: 2770650

Publication types

Journal Article; Research Support, Non-U.S. Gov't; Validation Study

Languages

eng

Pagination

e2012734

Authors

Eric Potash (E)

Rayid Ghani (R)

Joe Walsh (J)

Emile Jorgensen (E)

Cortland Lohff (C)

Nik Prachand (N)

Raed Mansour (R)
