Asthma in farm children is more determined by genetic polymorphisms and in non-farm children by environmental factors.

childhood asthma environment farming genome-wide association studies machine learning penalized regression random forest risk prediction single-nucleotide polymorphisms statistical learning

Journal

Pediatric allergy and immunology : official publication of the European Society of Pediatric Allergy and Immunology
ISSN: 1399-3038
Titre abrégé: Pediatr Allergy Immunol
Pays: England
ID NLM: 9106718

Informations de publication

Date de publication:
02 2021
Historique:
received: 27 05 2020
revised: 22 09 2020
accepted: 23 09 2020
pubmed: 1 10 2020
medline: 19 8 2021
entrez: 30 9 2020
Statut: ppublish

Résumé

The asthma syndrome is influenced by hereditary and environmental factors. With the example of farm exposure, we study whether genetic and environmental factors interact for asthma. Statistical learning approaches based on penalized regression and decision trees were used to predict asthma in the GABRIELA study with 850 cases (9% farm children) and 857 controls (14% farm children). Single-nucleotide polymorphisms (SNPs) were selected from a genome-wide dataset based on a literature search or by statistical selection techniques. Prediction was assessed by receiver operating characteristics (ROC) curves and validated in the PASTURE cohort. Prediction by family history of asthma and atopy yielded an area under the ROC curve (AUC) of 0.62 [0.57-0.66] in the random forest machine learning approach. By adding information on demographics (sex and age) and 26 environmental exposure variables, the quality of prediction significantly improved (AUC = 0.65 [0.61-0.70]). In farm children, however, environmental variables did not improve prediction quality. Rather SNPs related to IL33 and RAD50 contributed significantly to the prediction of asthma (AUC = 0.70 [0.62-0.78]). Asthma in farm children is more likely predicted by other factors as compared to non-farm children though in both forms, family history may integrate environmental exposure, genotype and degree of penetrance.

Sections du résumé

BACKGROUND
The asthma syndrome is influenced by hereditary and environmental factors. With the example of farm exposure, we study whether genetic and environmental factors interact for asthma.
METHODS
Statistical learning approaches based on penalized regression and decision trees were used to predict asthma in the GABRIELA study with 850 cases (9% farm children) and 857 controls (14% farm children). Single-nucleotide polymorphisms (SNPs) were selected from a genome-wide dataset based on a literature search or by statistical selection techniques. Prediction was assessed by receiver operating characteristics (ROC) curves and validated in the PASTURE cohort.
RESULTS
Prediction by family history of asthma and atopy yielded an area under the ROC curve (AUC) of 0.62 [0.57-0.66] in the random forest machine learning approach. By adding information on demographics (sex and age) and 26 environmental exposure variables, the quality of prediction significantly improved (AUC = 0.65 [0.61-0.70]). In farm children, however, environmental variables did not improve prediction quality. Rather SNPs related to IL33 and RAD50 contributed significantly to the prediction of asthma (AUC = 0.70 [0.62-0.78]).
CONCLUSIONS
Asthma in farm children is more likely predicted by other factors as compared to non-farm children though in both forms, family history may integrate environmental exposure, genotype and degree of penetrance.

Identifiants

pubmed: 32997854
doi: 10.1111/pai.13385
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

295-304

Informations de copyright

© 2020 The Authors. Pediatric Allergy and Immunology published by European Academy of Allergy and Clinical Immunology and John Wiley & Sons Ltd.

Références

Moffatt MF, Gut IG, Demenais F, et al. A large-scale, consortium-based genomewide association study of asthma. N Engl J Med. 2010;363(13):1211-1221.
Duffy DL, Martin NG, Battistutta D, Hopper JL, Mathews JD. Genetics of asthma and hay fever in Australian twins. Am Rev Respir Dis. 1990;142(6 Pt 1):1351-1358.
Loss GJ, Depner M, Hose AJ, et al. The early development of wheeze. Environmental determinants and genetic susceptibility at 17q21. Am J Respir Crit Care Med. 2016;193(8):889-897.
Dudbridge F, Gusnanto A. Estimation of significance thresholds for genomewide association scans. Genet Epidemiol. 2008;32(3):227-234.
Ege MJ, Strachan DP, Cookson WO, et al. Gene-environment interaction for childhood asthma and exposure to farming in Central Europe. J Allergy Clin Immunol. 2011;127(1):138-144, 144 e131-134.
Genuneit J, Buchele G, Waser M, et al. The GABRIEL advanced surveys: study design, participation and evaluation of bias. Paediatr Perin Epidemiol. 2011;25(5):436-447.
Weiland SK, von Mutius E, Hirsch T, et al. Prevalence of respiratory and atopic disorders among children in the East and West of Germany five years after unification. Eur Respir J. 1999;14(4):862-870.
Li Y, Willer CJ, Ding J, Scheet P, Abecasis GR. MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet Epidemiol. 2010;34(8):816-834.
Genomes Project C, Abecasis GR, Auton A, et al. An integrated map of genetic variation from 1,092 human genomes. Nature. 2012;491(7422):56-65.
Wu J, Pfeiffer RM, Gail MH. Strategies for developing prediction models from genome-wide association studies. Genet Epidemiol. 2013;37(8):768-777.
MacArthur J, Bowler E, Cerezo M, et al. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res. 2017;45(D1):D896-D901.
R Core Team. R: A Language and Environment for Statistical Computing [Computer Program]. Vienna, Austria: R Foundation for Statistical Computing; 2016.
Buuren SV, Groothuis-Oudshoorn K. mice: multivariate imputation by chained equations inR. J Stat Softw. 2011;45(3):1-67.
Hastie T, Tibshirani R, Friedman J. Overview of Supervised Learning. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. New York, NY: Springer; 2009:9-41.
Boulesteix AL, De Bin R, Jiang X, Fuchs M. IPF-LASSO: integrative L1-penalized regression with penalty factors for prediction based on multi-omics data. Comput Math Methods Med. 2017;2017:7691937.
Breiman L. Random forests. Mach Learn. 2001;45(1):5-32.
Fawcett T. An introduction to ROC analysis. Pattern Recognit Lett. 2006;27(8):861-874.
Ober C. Asthma genetics in the post-GWAS Era. Ann Am Thor Soc. 2016;13(Suppl 1):S85-S90.
Pasaniuc B, Price AL. Dissecting the genetics of complex traits using summary association statistics. Nat Rev Genet. 2017;18(2):117-127.
Boyle EA, Li YI, Pritchard JK. An expanded view of complex traits: from polygenic to omnigenic. Cell. 2017;169(7):1177-1186.
Spycher BD, Henderson J, Granell R, et al. Genome-wide prediction of childhood asthma and related phenotypes in a longitudinal birth cohort. J Allergy Clin Immunol. 2012;130(2):503-509 e507.
Torous W, Valkanov R. Boundaries of Predictability: Noisy Predictive Regressions. Los Angeles, CA: Anderson Graduate School of Management, UCLA; 2000.
Backhed F, Roswall J, Peng Y, et al. Dynamics and stabilization of the human gut microbiome during the first year of life. Cell Host Microbe. 2015;17(5):690-703.
Ege MJ. Asthma and prenatal inflammation. Am J Respir Crit Care Med. 2017;195(5):546-548.
Bonnelykke K, Sleiman P, Nielsen K, et al. A genome-wide association study identifies CDHR3 as a susceptibility locus for early childhood asthma with severe exacerbations. Nat Genet. 2014;46(1):51-55.
Schroder PC, Casaca VI, Illi S, et al. IL-33 polymorphisms are associated with increased risk of hay fever and reduced regulatory T cells in a birth cohort. Pediatr Allergy Immunol. 2016;27(7):687-695.
Li X, Howard TD, Zheng SL, et al. Genome-wide association study of asthma identifies RAD50-IL13 and HLA-DR/DQ regions. J Allergy Clin Immunol. 2010;125(2):328-335 e311.
Weidinger S, Willis-Owen SA, Kamatani Y, et al. A genome-wide association study of atopic dermatitis identifies loci with overlapping effects on asthma and psoriasis. Hum Mole Genet. 2013;22(23):4841-4856.
Sehmi R, Smith SG, Kjarsgaard M, et al. Role of local eosinophilopoietic processes in the development of airway eosinophilia in prednisone-dependent severe asthma. Clin Exp Allergy. 2016;46(6):793-802.
Depner M, Fuchs O, Genuneit J, et al. Clinical and epidemiologic phenotypes of childhood asthma. Am J Respir Crit Care Med. 2014;189(2):129-138.
Caliskan M, Bochkov YA, Kreiner-Moller E, et al. Rhinovirus wheezing illness and genetic risk of childhood-onset asthma. N Engl J Med. 2013;368(15):1398-1407.
Eder W, Ege MJ, von Mutius E. The asthma epidemic. N Engl J Med. 2006;355(21):2226-2235.

Auteurs

Norbert Krautenbacher (N)

Helmholtz Zentrum München, German Research Center for Environmental Health, Institute of Computational Biology, Neuherberg, Germany.
Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, Technische Universität München, Garching, Germany.

Michael Kabesch (M)

University Children's Hospital Regensburg (KUNO), Regensburg, Germany.
Clinic for Pediatric Pneumology and Neonatology, Hannover Medical School, Hannover, Germany.
The German Center for Lung Research (DZL), Germany.

Elisabeth Horak (E)

Department of Pediatrics and Adolescents, Innsbruck Medical University, Innsbruck, Austria.

Charlotte Braun-Fahrländer (C)

Swiss Tropical and Public Health Institute Basel, Basel, Switzerland.
University of Basel, Basel, Switzerland.

Jon Genuneit (J)

Institute of Epidemiology and Medical Biometry, Ulm University, Ulm, Germany.
Pediatric Epidemiology, Department of Pediatrics, Medical Faculty, Leipzig University, Leipzig, Germany.

Andrzej Boznanski (A)

Wroclaw Medical University, Wroclaw, Poland.

Erika von Mutius (E)

The German Center for Lung Research (DZL), Germany.
Dr von Hauner Children's Hospital, LMU Munich, Munich, Germany.
Helmholtz Zentrum München, German Research Center for Environmental Health, Institute of Asthma and Allergy Prevention, Neuherberg, Germany.

Fabian Theis (F)

Helmholtz Zentrum München, German Research Center for Environmental Health, Institute of Computational Biology, Neuherberg, Germany.
Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, Technische Universität München, Garching, Germany.

Christiane Fuchs (C)

Helmholtz Zentrum München, German Research Center for Environmental Health, Institute of Computational Biology, Neuherberg, Germany.
Center for Mathematics, Chair of Mathematical Modeling of Biological Systems, Technische Universität München, Garching, Germany.
Department of Business Administration and Economics, Bielefeld University, Bielefeld, Germany.

Markus J Ege (MJ)

The German Center for Lung Research (DZL), Germany.
Dr von Hauner Children's Hospital, LMU Munich, Munich, Germany.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH