Testing the impact of trait prevalence priors in Bayesian-based genetic prediction modeling of human appearance traits.


Journal

Forensic science international. Genetics
ISSN: 1878-0326
Titre abrégé: Forensic Sci Int Genet
Pays: Netherlands
ID NLM: 101317016

Informations de publication

Date de publication:
01 2021
Historique:
received: 15 04 2020
revised: 09 09 2020
accepted: 25 10 2020
pubmed: 2 12 2020
medline: 6 7 2021
entrez: 1 12 2020
Statut: ppublish

Résumé

The prediction of appearance traits by use of solely genetic information has become an established approach and a number of statistical prediction models have already been developed for this purpose. However, given limited knowledge on appearance genetics, currently available models are incomplete and do not include all causal genetic variants as predictors. Therefore such prediction models may benefit from the inclusion of additional information that acts as a proxy for this unknown genetic background. Use of priors, possibly informed by trait category prevalence values in biogeographic ancestry groups, in a Bayesian framework may thus improve the prediction accuracy of previously predicted externally visible characteristics, but has not been investigated as of yet. In this study, we assessed the impact of using trait prevalence-informed priors on the prediction performance in Bayesian models for eye, hair and skin color as well as hair structure and freckles in comparison to the respective prior-free models. Those prior-free models were either similarly defined either very close to the already established ones by using a reduced predictive marker set. However, these differences in the number of the predictive markers should not affect significantly our main outcomes. We observed that such priors often had a strong effect on the prediction performance, but to varying degrees between different traits and also different trait categories, with some categories barely showing an effect. While we found potential for improving the prediction accuracy of many of the appearance trait categories tested by using priors, our analyses also showed that misspecification of those prior values often severely diminished the accuracy compared to the respective prior-free approach. This emphasizes the importance of accurate specification of prevalence-informed priors in Bayesian prediction modeling of appearance traits. However, the existing literature knowledge on spatial prevalence is sparse for most appearance traits, including those investigated here. Due to the limitations in appearance trait prevalence knowledge, our results render the use of trait prevalence-informed priors in DNA-based appearance trait prediction currently infeasible.

Identifiants

pubmed: 33260052
pii: S1872-4973(20)30184-8
doi: 10.1016/j.fsigen.2020.102412
pii:
doi:

Substances chimiques

Genetic Markers 0
DNA 9007-49-2

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

102412

Subventions

Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : Medical Research Council
Pays : United Kingdom
Organisme : Department of Health
Pays : United Kingdom

Informations de copyright

Copyright © 2020 The Author(s). Published by Elsevier B.V. All rights reserved.

Auteurs

Maria-Alexandra Katsara (MA)

Cologne Center for Genomics, University of Cologne, Cologne, Germany.

Wojciech Branicki (W)

Malopolska Centre of Biotechnology, Jagiellonian University, Kraków, Poland; Central Forensic Laboratory of the Police, Warsaw, Poland.

Ewelina Pośpiech (E)

Malopolska Centre of Biotechnology, Jagiellonian University, Kraków, Poland.

Pirro Hysi (P)

Department of Twin Research & Genetic Epidemiology, St Thomas Hospital, Campus, Kings College London (KCL), London, UK.

Susan Walsh (S)

Department of Biology, Indiana University Purdue University Indianapolis (IUPUI), Indianapolis, IN, USA.

Manfred Kayser (M)

Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.

Michael Nothnagel (M)

Cologne Center for Genomics, University of Cologne, Cologne, Germany; University Hospital Cologne, Cologne, Germany. Electronic address: michael.nothnagel@uni-koeln.de.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH