A Bayesian network approach incorporating imputation of missing data enables exploratory analysis of complex causal biological relationships.


Journal

PLoS genetics
ISSN: 1553-7404
Titre abrégé: PLoS Genet
Pays: United States
ID NLM: 101239074

Informations de publication

Date de publication:
09 2021
Historique:
received: 09 02 2021
accepted: 07 09 2021
revised: 11 10 2021
pubmed: 30 9 2021
medline: 15 12 2021
entrez: 29 9 2021
Statut: epublish

Résumé

Bayesian networks can be used to identify possible causal relationships between variables based on their conditional dependencies and independencies, which can be particularly useful in complex biological scenarios with many measured variables. Here we propose two improvements to an existing method for Bayesian network analysis, designed to increase the power to detect potential causal relationships between variables (including potentially a mixture of both discrete and continuous variables). Our first improvement relates to the treatment of missing data. When there is missing data, the standard approach is to remove every individual with any missing data before performing analysis. This can be wasteful and undesirable when there are many individuals with missing data, perhaps with only one or a few variables missing. This motivates the use of imputation. We present a new imputation method that uses a version of nearest neighbour imputation, whereby missing data from one individual is replaced with data from another individual, their nearest neighbour. For each individual with missing data, the subsets of variables to be used to select the nearest neighbour are chosen by sampling without replacement the complete data and estimating a best fit Bayesian network. We show that this approach leads to marked improvements in the recall and precision of directed edges in the final network identified, and we illustrate the approach through application to data from a recent study investigating the causal relationship between methylation and gene expression in early inflammatory arthritis patients. We also describe a second improvement in the form of a pseudo-Bayesian approach for upweighting certain network edges, which can be useful when there is prior evidence concerning their directions.

Identifiants

pubmed: 34587167
doi: 10.1371/journal.pgen.1009811
pii: PGENETICS-D-21-00184
pmc: PMC8504979
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e1009811

Subventions

Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 102858/Z/13/Z
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/P020941/1
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 219424/Z/19/Z
Pays : United Kingdom

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

BMC Med Inform Decis Mak. 2016 Jul 25;16 Suppl 3:74
pubmed: 27454392
Genet Mol Res. 2016 Aug 19;15(3):
pubmed: 27706596
Artif Intell Med. 2013 Mar;57(3):207-17
pubmed: 23395009
PLoS Genet. 2020 Mar 2;16(3):e1008198
pubmed: 32119656
Sci Rep. 2017 Feb 24;7:43434
pubmed: 28233834
BMC Genet. 2018 Sep 17;19(Suppl 1):74
pubmed: 30255779
Genet Epidemiol. 2017 Nov;41(7):577-586
pubmed: 28691305
BMC Genet. 2009 May 27;10:23
pubmed: 19473544
J Allergy Clin Immunol. 2020 May;145(5):1438-1451
pubmed: 31945409
Int J Methods Psychiatr Res. 2011 Mar;20(1):40-9
pubmed: 21499542

Auteurs

Richard Howey (R)

Population Health Sciences Institute, Newcastle University, Newcastle upon Tyne, United Kingdom.

Alexander D Clark (AD)

Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, United Kingdom.

Najib Naamane (N)

Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, United Kingdom.

Louise N Reynard (LN)

Biosciences Institute, Newcastle University, Newcastle upon Tyne, United Kingdom.

Arthur G Pratt (AG)

Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, United Kingdom.
Musculoskeletal Services Directorate, Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, United Kingdom.

Heather J Cordell (HJ)

Population Health Sciences Institute, Newcastle University, Newcastle upon Tyne, United Kingdom.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH