Population Bias in Polygenic Risk Prediction Models for Coronary Artery Disease.


Journal

Circulation. Genomic and precision medicine
ISSN: 2574-8300
Titre abrégé: Circ Genom Precis Med
Pays: United States
ID NLM: 101714113

Informations de publication

Date de publication:
12 2020
Historique:
pubmed: 11 11 2020
medline: 27 10 2021
entrez: 10 11 2020
Statut: ppublish

Résumé

Individual risk prediction based on genome-wide polygenic risk scores (PRSs) using millions of genetic variants has attracted much attention. It is under debate whether PRS models can be applied-without loss of precision-to populations of similar ethnic but different geographic background than the one the scores were trained on. Here, we examine how PRS trained in population-specific but European data sets perform in other European subpopulations in distinguishing between coronary artery disease patients and healthy individuals. We use data from UK and Estonian biobanks (UKB, EB) as well as case-control data from the German population (DE) to develop and evaluate PRS in the same and different populations. PRSs have the highest performance in their corresponding population testing data sets, whereas their performance significantly drops if applied to testing data sets from different European populations. Models trained on DE data revealed area under the curves in independent testing sets in DE: 0.6752, EB: 0.6156, and UKB: 0.5989; trained on EB and tested on EB: 0.6565, DE: 0.5407, and UKB: 0.6043; trained on UKB and tested on UKB: 0.6133, DE: 0.5143, and EB: 0.6049. This result has a direct impact on the clinical usability of PRS for risk prediction models using PRS: a population effect must be kept in mind when applying risk estimation models, which are based on additional genetic information even for individuals from different European populations of the same ethnicity.

Sections du résumé

BACKGROUND
Individual risk prediction based on genome-wide polygenic risk scores (PRSs) using millions of genetic variants has attracted much attention. It is under debate whether PRS models can be applied-without loss of precision-to populations of similar ethnic but different geographic background than the one the scores were trained on. Here, we examine how PRS trained in population-specific but European data sets perform in other European subpopulations in distinguishing between coronary artery disease patients and healthy individuals.
METHODS
We use data from UK and Estonian biobanks (UKB, EB) as well as case-control data from the German population (DE) to develop and evaluate PRS in the same and different populations.
RESULTS
PRSs have the highest performance in their corresponding population testing data sets, whereas their performance significantly drops if applied to testing data sets from different European populations. Models trained on DE data revealed area under the curves in independent testing sets in DE: 0.6752, EB: 0.6156, and UKB: 0.5989; trained on EB and tested on EB: 0.6565, DE: 0.5407, and UKB: 0.6043; trained on UKB and tested on UKB: 0.6133, DE: 0.5143, and EB: 0.6049.
CONCLUSIONS
This result has a direct impact on the clinical usability of PRS for risk prediction models using PRS: a population effect must be kept in mind when applying risk estimation models, which are based on additional genetic information even for individuals from different European populations of the same ethnicity.

Identifiants

pubmed: 33170024
doi: 10.1161/CIRCGEN.120.002932
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e002932

Auteurs

Damian Gola (D)

Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck (D.G., I.R.K.).
German Centre for Cardiovascular Research, Partner Site Hamburg/Kiel/Lübeck (D.G., J.E., I.R.K.).

Jeanette Erdmann (J)

German Centre for Cardiovascular Research, Partner Site Hamburg/Kiel/Lübeck (D.G., J.E., I.R.K.).
Institute for Cardiogenetics, Universität zu Lübeck, Germany (J.E.).

Kristi Läll (K)

Estonian Genome Centre, Institute of Genomics, University of Tartu, Estonia (K.L., R.M.).

Reedik Mägi (R)

Estonian Genome Centre, Institute of Genomics, University of Tartu, Estonia (K.L., R.M.).

Bertram Müller-Myhsok (B)

Department of Translational Research in Psychiatry, Max Planck Institute of Psychiatry, Munich Cluster of Systems Neurology, SyNergy, Germany (B.M.-M.).
Institute of Translational Medicine, University of Liverpool, United Kingdom (B.M.-M.).

Heribert Schunkert (H)

Deutsches Herzzentrum München, Technische Universität München, German Centre for Cardiovascular Research, Partner Site München, Germany (H.S.).

Inke R König (IR)

Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck (D.G., I.R.K.).
German Centre for Cardiovascular Research, Partner Site Hamburg/Kiel/Lübeck (D.G., J.E., I.R.K.).

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH