Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction.


Journal

American journal of human genetics
ISSN: 1537-6605
Abbreviated title: Am J Hum Genet
Country: United States
NLM ID: 0370475

Publication information

Publication date:
03 06 2021
History:
received: 04 01 2021
accepted: 20 04 2021
pubmed: 9 5 2021
medline: 29 6 2021
entrez: 8 5 2021
Status: ppublish

Abstract

The accuracy of polygenic risk scores (PRSs) to predict complex diseases increases with the training sample size. PRSs are generally derived based on summary statistics from large meta-analyses of multiple genome-wide association studies (GWASs). However, it is now common for researchers to have access to large individual-level data as well, such as the UK Biobank data. To the best of our knowledge, it has not yet been explored how best to combine both types of data (summary statistics and individual-level data) to optimize polygenic prediction. The most widely used approach to combine data is the meta-analysis of GWAS summary statistics (meta-GWAS), but we show that it does not always provide the most accurate PRS. Through simulations and using 12 real case-control and quantitative traits from both iPSYCH and UK Biobank along with external GWAS summary statistics, we compare meta-GWAS with two alternative data-combining approaches, stacked clumping and thresholding (SCT) and meta-PRS. We find that, when large individual-level data are available, the linear combination of PRSs (meta-PRS) is both a simple alternative to meta-GWAS and often more accurate.
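
The contrast between meta-GWAS and meta-PRS can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract only names the approaches, so the sketch assumes that meta-GWAS is a fixed-effect inverse-variance-weighted combination of per-variant effect sizes, and that the meta-PRS weights are fitted by ordinary least squares on a held-out set of individuals. All function and variable names are hypothetical, and SCT is not shown.

```python
import numpy as np


def meta_gwas(beta_a, se_a, beta_b, se_b):
    """Combine per-variant effects from two GWASs (assumed fixed-effect,
    inverse-variance-weighted meta-analysis)."""
    beta_a, se_a = np.asarray(beta_a, float), np.asarray(se_a, float)
    beta_b, se_b = np.asarray(beta_b, float), np.asarray(se_b, float)
    w_a = 1.0 / np.square(se_a)  # precision of study A
    w_b = 1.0 / np.square(se_b)  # precision of study B
    beta = (w_a * beta_a + w_b * beta_b) / (w_a + w_b)
    se = np.sqrt(1.0 / (w_a + w_b))
    return beta, se


def meta_prs_weights(prs_ext, prs_int, y):
    """Fit weights for a linear combination of two PRSs by least squares
    (assumption: weights are learned on held-out individuals)."""
    prs_ext = np.asarray(prs_ext, float)
    prs_int = np.asarray(prs_int, float)
    X = np.column_stack([np.ones(len(prs_ext)), prs_ext, prs_int])
    coef, *_ = np.linalg.lstsq(X, np.asarray(y, float), rcond=None)
    return coef  # [intercept, weight of external PRS, weight of internal PRS]


def meta_prs(prs_ext, prs_int, coef):
    """Apply the fitted linear combination to new individuals."""
    return coef[0] + coef[1] * np.asarray(prs_ext, float) \
        + coef[2] * np.asarray(prs_int, float)
```

In this sketch, meta-GWAS merges the two data sources at the level of variant effects before any score is built, whereas meta-PRS builds one score per source and merges them at the level of individuals.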

Identifiers

pubmed: 33964208
pii: S0002-9297(21)00145-2
doi: 10.1016/j.ajhg.2021.04.014
pmc: PMC8206385

Publication types

Journal Article
Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

1001-1011

Grants

Agency: Medical Research Council
ID: MC_PC_17228
Country: United Kingdom
Agency: Medical Research Council
ID: MC_QA137853
Country: United Kingdom

Copyright information

Copyright © 2021 The Authors. Published by Elsevier Inc. All rights reserved.

Authors

Clara Albiñana (C)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark. Electronic address: albinanaclara@gmail.com.

Jakob Grove (J)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Department of Biomedicine and Center for Integrative Sequencing, iSEQ, Aarhus University, 8000 Aarhus C, Denmark; Center for Genomics and Personalized Medicine, CGPM, Aarhus University, 8000 Aarhus C, Denmark; Bioinformatics Research Centre, Aarhus University, 8000 Aarhus C, Denmark.

John J McGrath (JJ)

National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark; Queensland Centre for Mental Health Research, The Park Centre for Mental Health, Brisbane, QLD 4076, Australia; Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia.

Esben Agerbo (E)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark.

Naomi R Wray (NR)

Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD 4072, Australia; Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia.

Cynthia M Bulik (CM)

Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA; Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, 171 77 Stockholm, Sweden; Department of Nutrition, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA.

Merete Nordentoft (M)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Copenhagen University Hospital, Mental Health Centre Copenhagen Mental Health Services in the Capital Region of Denmark, 2100 Copenhagen Ø, Denmark; Department of Clinical Medicine, University of Copenhagen, 2200 Copenhagen N, Denmark.

David M Hougaard (DM)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Center for Neonatal Screening, Department for Congenital Disorders, Statens Serum Institut, 2300 Copenhagen S, Denmark.

Thomas Werge (T)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Institute of Biological Psychiatry, MHC Sct. Hans, Mental Health Services Copenhagen, 4000 Roskilde, Denmark; Department of Clinical Medicine, University of Copenhagen, 2200 Copenhagen N, Denmark; Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, 1350 Copenhagen K, Denmark.

Anders D Børglum (AD)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; Department of Biomedicine and Center for Integrative Sequencing, iSEQ, Aarhus University, 8000 Aarhus C, Denmark; Center for Genomics and Personalized Medicine, CGPM, Aarhus University, 8000 Aarhus C, Denmark.

Preben Bo Mortensen (PB)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark.

Florian Privé (F)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark.

Bjarni J Vilhjálmsson (BJ)

The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, 8210 Aarhus V, Denmark; National Centre for Register-Based Research, Aarhus University, 8210 Aarhus V, Denmark; Bioinformatics Research Centre, Aarhus University, 8000 Aarhus C, Denmark. Electronic address: bjv@econ.au.dk.
