Leveraging both individual-level genetic data and GWAS summary statistics increases polygenic prediction.
PRS
complex traits
genetic prediction
meta-analysis
polygenic risk scores
psychiatric disorders
Journal
American journal of human genetics
ISSN: 1537-6605
Titre abrégé: Am J Hum Genet
Pays: United States
ID NLM: 0370475
Informations de publication
Date de publication:
03 06 2021
03 06 2021
Historique:
received:
04
01
2021
accepted:
20
04
2021
pubmed:
9
5
2021
medline:
29
6
2021
entrez:
8
5
2021
Statut:
ppublish
Résumé
The accuracy of polygenic risk scores (PRSs) to predict complex diseases increases with the training sample size. PRSs are generally derived based on summary statistics from large meta-analyses of multiple genome-wide association studies (GWASs). However, it is now common for researchers to have access to large individual-level data as well, such as the UK Biobank data. To the best of our knowledge, it has not yet been explored how best to combine both types of data (summary statistics and individual-level data) to optimize polygenic prediction. The most widely used approach to combine data is the meta-analysis of GWAS summary statistics (meta-GWAS), but we show that it does not always provide the most accurate PRS. Through simulations and using 12 real case-control and quantitative traits from both iPSYCH and UK Biobank along with external GWAS summary statistics, we compare meta-GWAS with two alternative data-combining approaches, stacked clumping and thresholding (SCT) and meta-PRS. We find that, when large individual-level data are available, the linear combination of PRSs (meta-PRS) is both a simple alternative to meta-GWAS and often more accurate.
Identifiants
pubmed: 33964208
pii: S0002-9297(21)00145-2
doi: 10.1016/j.ajhg.2021.04.014
pmc: PMC8206385
pii:
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1001-1011Subventions
Organisme : Medical Research Council
ID : MC_PC_17228
Pays : United Kingdom
Organisme : Medical Research Council
ID : MC_QA137853
Pays : United Kingdom
Informations de copyright
Copyright © 2021 The Authors. Published by Elsevier Inc. All rights reserved.
Références
Am J Hum Genet. 2011 Mar 11;88(3):294-305
pubmed: 21376301
Psychol Med. 2019 Dec;49(16):2646-2656
pubmed: 31559935
Nat Genet. 2019 Mar;51(3):431-444
pubmed: 30804558
Nat Commun. 2019 Nov 8;10(1):5086
pubmed: 31704910
Mol Psychiatry. 2018 May;23(5):1368-1374
pubmed: 28785111
Nucleic Acids Res. 2019 Jan 8;47(D1):D1005-D1012
pubmed: 30445434
Bioinformatics. 2010 Sep 1;26(17):2190-1
pubmed: 20616382
Genet Epidemiol. 2017 Dec;41(8):811-823
pubmed: 29110330
Nat Genet. 2019 Aug;51(8):1207-1214
pubmed: 31308545
Genet Epidemiol. 2017 Sep;41(6):469-480
pubmed: 28480976
Nat Hum Behav. 2019 May;3(5):513-525
pubmed: 30962613
Nat Genet. 2021 Apr;53(4):420-425
pubmed: 33692568
Genome Res. 2014 Sep;24(9):1550-7
pubmed: 24963154
Nat Genet. 2014 Nov;46(11):1173-86
pubmed: 25282103
Nat Genet. 2015 Mar;47(3):284-90
pubmed: 25642633
BMC Bioinformatics. 2012 May 10;13:88
pubmed: 22574887
Diabetes. 2017 Nov;66(11):2888-2902
pubmed: 28566273
Am J Hum Genet. 2020 May 7;106(5):679-693
pubmed: 32330416
PLoS One. 2008;3(10):e3395
pubmed: 18852893
Nat Commun. 2018 Mar 7;9(1):989
pubmed: 29515099
Bioinformatics. 2015 May 1;31(9):1466-8
pubmed: 25550326
PLoS Genet. 2013;9(2):e1003264
pubmed: 23408905
J Dairy Sci. 2012 Jul;95(7):4114-29
pubmed: 22720968
Nat Genet. 2015 Mar;47(3):291-5
pubmed: 25642630
Nat Genet. 2015 Nov;47(11):1236-41
pubmed: 26414676
Am J Hum Genet. 2019 Dec 5;105(6):1213-1221
pubmed: 31761295
Nat Genet. 2018 Jul;50(7):906-908
pubmed: 29892013
Nat Neurosci. 2019 Mar;22(3):353-361
pubmed: 30692689
Bioinformatics. 2020 Feb 1;36(3):930-933
pubmed: 31393554
Nat Genet. 2018 May;50(5):668-681
pubmed: 29700475
Nat Genet. 2019 Jan;51(1):63-75
pubmed: 30478444
Curr Opin Psychol. 2019 Jun;27:77-81
pubmed: 30339992
Nature. 2014 Jul 24;511(7510):421-7
pubmed: 25056061
Bioinformatics. 2020 Dec 16;:
pubmed: 33326037
Am J Hum Genet. 2020 Jul 2;107(1):46-59
pubmed: 32470373
Nat Genet. 2019 Apr;51(4):584-591
pubmed: 30926966
Am J Hum Genet. 2011 Jan 7;88(1):76-82
pubmed: 21167468
Contemp Clin Trials. 2018 Nov;74:61-69
pubmed: 30287268
Nat Genet. 2015 Oct;47(10):1121-1130
pubmed: 26343387
Nature. 2018 Oct;562(7726):203-209
pubmed: 30305743
J Am Coll Cardiol. 2018 Oct 16;72(16):1883-1893
pubmed: 30309464
PLoS Genet. 2021 May 4;17(5):e1009021
pubmed: 33945532
Bioinformatics. 2018 Aug 15;34(16):2781-2787
pubmed: 29617937
J Child Psychol Psychiatry. 2014 Oct;55(10):1068-87
pubmed: 25132410
Nat Genet. 2019 May;51(5):793-803
pubmed: 31043756
Nat Commun. 2019 Apr 16;10(1):1776
pubmed: 30992449
Am J Hum Genet. 2007 Sep;81(3):559-75
pubmed: 17701901
Nat Genet. 2018 Sep;50(9):1219-1224
pubmed: 30104762
Nat Commun. 2018 Apr 16;9(1):1470
pubmed: 29662059
Am J Hum Genet. 2015 Oct 1;97(4):576-92
pubmed: 26430803
Mol Psychiatry. 2018 Jan;23(1):6-14
pubmed: 28924187
Nat Commun. 2018 Oct 19;9(1):4361
pubmed: 30341297
Nature. 2017 Nov 2;551(7678):92-94
pubmed: 29059683
Bioinformatics. 2010 Nov 15;26(22):2867-73
pubmed: 20926424
Genetics. 2019 May;212(1):65-74
pubmed: 30808621
Bioinformatics. 2020 Aug 15;36(16):4449-4457
pubmed: 32415959
Nature. 2015 Feb 12;518(7538):197-206
pubmed: 25673413
Genetica. 2009 Jun;136(2):245-57
pubmed: 18704696
Int J Epidemiol. 2020 Aug 1;49(4):1397-1403
pubmed: 31967640