A better performing algorithm for identification of implausible growth data from longitudinal pediatric medical records.


Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
06 Aug 2024
Historique:
received: 07 02 2024
accepted: 01 08 2024
medline: 7 8 2024
pubmed: 7 8 2024
entrez: 6 8 2024
Statut: epublish

Résumé

Tracking trajectories of body size in children provides insight into chronic disease risk. One measure of pediatric body size is body mass index (BMI), a function of height and weight. Errors in measuring height or weight may lead to incorrect assessment of BMI. Yet childhood measures of height and weight extracted from electronic medical records often include values which seem biologically implausible in the context of a growth trajectory. Removing biologically implausible values reduces noise in the data, and thus increases the ease of modeling associations between exposures and childhood BMI trajectories, or between childhood BMI trajectories and subsequent health conditions. We developed open-source algorithms (available on github) for detecting and removing biologically implausible values in pediatric trajectories of height and weight. A Monte Carlo simulation experiment compared the sensitivity, specificity and speed of our algorithms to three published algorithms. The comparator algorithms were selected because they used trajectory information, had open-source code, and had published verification studies. Simulation inputs were derived from longitudinal epidemiological cohorts. Our algorithms had higher specificity, with similar sensitivity and speed, when compared to the three published algorithms. The results suggest that our algorithms should be adopted for cleaning longitudinal pediatric growth data.

Identifiants

pubmed: 39107468
doi: 10.1038/s41598-024-69161-5
pii: 10.1038/s41598-024-69161-5
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

18276

Subventions

Organisme : NIGMS NIH HHS
ID : 5R01GM121081-08
Pays : United States
Organisme : NIH HHS
ID : R01DK068001
Pays : United States

Informations de copyright

© 2024. The Author(s).

Références

Daymont, C. et al. Automated identification of implausible values in growth data from pediatric electronic health records. J. Am. Med. Inform. Assoc. 24, 1080–1087 (2017).
doi: 10.1093/jamia/ocx037 pubmed: 28453637 pmcid: 7651915
Shi, J., Korsiak, J. & Roth, D. E. New approach for the identification of implausible values and outliers in longitudinal childhood anthropometric data. Ann. Epidemiol. 28, 204-211.e3 (2018).
doi: 10.1016/j.annepidem.2018.01.007 pubmed: 29398298 pmcid: 5840491
Phan, H. T. T. et al. Automated data cleaning of paediatric anthropometric data from longitudinal electronic health records: Protocol and application to a large patient cohort. Sci. Rep. 10, 10164 (2020).
doi: 10.1038/s41598-020-66925-7 pubmed: 32576940 pmcid: 7311482
Lawman, H. G. et al. Comparing methods for identifying biologically implausible values in height, weight, and body mass index among youth. Am. J. Epidemiol. 182, 359–365 (2015).
doi: 10.1093/aje/kwv057 pubmed: 26182944 pmcid: 4528955
Centers for Disease Control. Growth Charts - 2000 CDC Growth Charts - United States. https://www.cdc.gov/growthcharts/cdc_charts.htm (2022).
Hockett, C. W., Harrall, K. K., Glueck, D. H. & Dabelea, D. M. Exposure to gestational diabetes and BMI trajectories through adolescence: Exploring Perinatal Outcomes in Children study. J. Clin. Endocrinol. Metab. https://doi.org/10.1210/clinem/dgad278 (2023).
doi: 10.1210/clinem/dgad278 pubmed: 37200149 pmcid: 10583996
Bekelman, T. A. et al. Adherence to index-based dietary patterns in childhood and BMI trajectory during the transition to adolescence: The EPOCH study. Int. J. Obes. (London) 45, 2439–2446 (2021).
doi: 10.1038/s41366-021-00917-z
Moore, B. F., Harrall, K. K., Sauder, K. A., Glueck, D. H. & Dabelea, D. Neonatal adiposity and childhood obesity. Pediatrics 146, e20200737 (2020).
doi: 10.1542/peds.2020-0737 pubmed: 32796097
Hockett, C. W. et al. Persistent effects of in utero overnutrition on offspring adiposity: The Exploring Perinatal Outcomes among Children (EPOCH) study. Diabetologia 62, 2017–2024 (2019).
doi: 10.1007/s00125-019-04981-z pubmed: 31444527 pmcid: 7593989
Kim, C., Harrall, K. K., Glueck, D. H. & Dabelea, D. Sex steroids and adiposity in a prospective observational cohort of youth. Obes. Sci. Pract. 7, 432–440 (2021).
doi: 10.1002/osp4.510 pubmed: 34401201 pmcid: 8346372
Cohen, C. C. et al. Body composition trajectories from birth to 5 years and hepatic fat in early childhood. Am. J. Clin. Nutr. 116, 1010–1018 (2022).
doi: 10.1093/ajcn/nqac168 pubmed: 36055960 pmcid: 9535524
World Health Organization Expert Committee. Physical status: The use and interpretation of anthropometry. Report of a WHO Expert Committee. World Health Organ. Tech. Rep. Ser. 854, 1–452 (1995).
Field, A. E. et al. Relation between dieting and weight change among preadolescents and adolescents. Pediatrics 112, 900–906 (2003).
doi: 10.1542/peds.112.4.900 pubmed: 14523184
Lobstein, T. J., James, W. P. T. & Cole, T. J. Increasing levels of excess weight among children in England. Int. J. Obes. 27, 1136–1138 (2003).
doi: 10.1038/sj.ijo.0802324
National Health and Nutrition Examination Survey. NHANES 2001–2002: Body measures data documentation, codebook, and frequencies. https://wwwn.cdc.gov/nchs/nhanes/2001-2002/BMX_B.htm (2004).
Conde, W. L. & Monteiro, C. A. Body mass index cutoff points for evaluation of nutritional status in Brazilian children and adolescents. J. Pediatr. (Rio J) 82, 266–272 (2006).
doi: 10.2223/JPED.1502 pubmed: 16858504
Smith, N. et al. Body weight and height data in electronic medical records of children. Int. J. Pediatr. Obes. 5, 237–242 (2010).
doi: 10.3109/17477160903268308 pubmed: 19961272
Youth Risk Behavior Surveillance System. 2013 YRBS data user’s guide. https://www.cdc.gov/healthyyouth/data/yrbs/files/2013/pdf/yrbs_2013_national_user_guide.pdf (2012).
Centers for Disease Control and Prevention. Cut-offs to define outliers in the 2000 CDC growth charts (2014).
Lo, J. C. et al. Prevalence of obesity and extreme obesity in children aged 3–5 years. Pediatr. Obes. 9, 167–175 (2014).
doi: 10.1111/j.2047-6310.2013.00154.x pubmed: 23677690
Kim, J. et al. Incidence and remission rates of overweight among children aged 5 to 13 years in a district-wide school surveillance system. Am. J. Public Health 95, 1588–1594 (2005).
doi: 10.2105/AJPH.2004.054015 pubmed: 16051932 pmcid: 1449402
Sturm, R. & Datar, A. Body mass index in elementary school children, metropolitan area food prices and food outlet density. Public Health 119, 1059–1068 (2005).
doi: 10.1016/j.puhe.2005.05.007 pubmed: 16140349
Lawman, H. G. et al. Trends in relative weight over one year in low-income urban youth. Obesity (Silver Spring) 23, 436–442 (2015).
doi: 10.1002/oby.20928 pubmed: 25354706
Yang, S. & Hutcheon, J. A. Identifying outliers and implausible values in growth trajectory data. Ann. Epidemiol. 26, 77-80.e2 (2016).
doi: 10.1016/j.annepidem.2015.10.002 pubmed: 26590476
Preece, M. A. & Baines, M. J. A new family of mathematical models describing the human growth curve. Ann. Hum. Biol. 5, 1–24 (1978).
doi: 10.1080/03014467800002601 pubmed: 646321
Cole, T. J., Donaldson, M. D. C. & Ben-Shlomo, Y. SITAR—a useful instrument for growth curve analysis. Int. J. Epidemiol. 39, 1558–1566 (2010).
doi: 10.1093/ije/dyq115 pubmed: 20647267 pmcid: 2992626
SAS Institute Inc. SAS® 9.4 Language Reference: Concepts 6th edn. (SAS Institute Inc., 2016).
R Core Team. R: A Language and Environment for Statistical Computing. https://www.r-project.org (2023).
Crume, T. L. et al. Association of exposure to diabetes in utero with adiposity and fat distribution in a multiethnic population of youth: the Exploring Perinatal Outcomes among Children (EPOCH) Study. Diabetologia 54, 87–92 (2011).
doi: 10.1007/s00125-010-1925-3 pubmed: 20953862
Carroll, R. J., Ruppert, D., Stefanski, L. A. & Crainiceanu, C. M. Measurement Error in Nonlinear Models: A Modern Perspective (CRC Press, 2006).
doi: 10.1201/9781420010138
The WHO Child Growth Standards. https://www.who.int/tools/child-growth-standards/standards .
Freedman, D. S. et al. Validity of the WHO cutoffs for biologically implausible values of weight, height, and BMI in children and adolescents in NHANES from 1999 through 2012. Am. J. Clin. Nutr. 102, 1000–1006 (2015).
doi: 10.3945/ajcn.115.115576 pubmed: 26377160
Free Software Foundation. The GNU Public License. https://www.gnu.org/licenses/gpl-3.0.en.html (2023).
Gillman, M. W. & Blaisdell, C. J. Environmental influences on Child Health Outcomes, a Research Program of the National Institutes of Health. Curr. Opinion Pediatr. 30, 260 (2018).
doi: 10.1097/MOP.0000000000000600 pubmed: 29356702
Vrijheid, M. et al. The human early-life exposome (HELIX): Project rationale and design. Environ. Health Perspect. 122, 535–544 (2014).
doi: 10.1289/ehp.1307204 pubmed: 24610234 pmcid: 4048258
Morris, T. P., White, I. R. & Crowther, M. J. Using simulation studies to evaluate statistical methods. Stat. Med. 38, 2074–2102 (2019).
doi: 10.1002/sim.8086 pubmed: 30652356 pmcid: 6492164

Auteurs

Kylie K Harrall (KK)

Department of Health Outcomes and Biomedical Informatics, University of Florida School of Medicine, Gainesville, FL, USA. KylieHarrall@ufl.edu.
Lifecourse Epidemiology of Adiposity and Diabetes Center, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. KylieHarrall@ufl.edu.

Sarah M Bird (SM)

Lifecourse Epidemiology of Adiposity and Diabetes Center, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
Department of Biostatistics, Colorado School of Public Health, Anschutz Medical Campus, Aurora, CO, USA.

Keith E Muller (KE)

Department of Health Outcomes and Biomedical Informatics, University of Florida School of Medicine, Gainesville, FL, USA.

Lauren A Vanderlinden (LA)

Deparment of Epidemiology, Colorado School of Public Health, Anschutz Medical Campus, Aurora, CO, USA.

Maya E Payton (ME)

Urban Institute, Washington, DC, USA.

Anna Bellatorre (A)

Lifecourse Epidemiology of Adiposity and Diabetes Center, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Dana Dabelea (D)

Lifecourse Epidemiology of Adiposity and Diabetes Center, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
Deparment of Epidemiology, Colorado School of Public Health, Anschutz Medical Campus, Aurora, CO, USA.
Department of Pediatrics, University of Colorado School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Deborah H Glueck (DH)

Lifecourse Epidemiology of Adiposity and Diabetes Center, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
Department of Pediatrics, University of Colorado School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH