Machine learning optimized polygenic scores for blood cell traits identify sex-specific trajectories and genetic correlations with disease.
Blood cell trait
Disease assocations
Machine learning
Method
Polygenic score
Population stratification
Journal
Cell genomics
ISSN: 2666-979X
Titre abrégé: Cell Genom
Pays: United States
ID NLM: 9918284260106676
Informations de publication
Date de publication:
12 Jan 2022
12 Jan 2022
Historique:
received:
20
10
2020
revised:
24
08
2021
accepted:
13
12
2021
entrez:
24
1
2022
pubmed:
25
1
2022
medline:
25
1
2022
Statut:
epublish
Résumé
Genetic association studies for blood cell traits, which are key indicators of health and immune function, have identified several hundred associations and defined a complex polygenic architecture. Polygenic scores (PGSs) for blood cell traits have potential clinical utility in disease risk prediction and prevention, but designing PGS remains challenging and the optimal methods are unclear. To address this, we evaluated the relative performance of 6 methods to develop PGS for 26 blood cell traits, including a standard method of pruning and thresholding (P + T) and 5 learning methods: LDpred2, elastic net (EN), Bayesian ridge (BR), multilayer perceptron (MLP) and convolutional neural network (CNN). We evaluated these optimized PGSs on blood cell trait data from UK Biobank and INTERVAL. We find that PGSs designed using common machine learning methods EN and BR show improved prediction of blood cell traits and consistently outperform other methods. Our analyses suggest EN/BR as the top choices for PGS construction, showing improved performance for 25 blood cell traits in the external validation, with correlations with the directly measured traits increasing by 10%-23%. Ten PGSs showed significant statistical interaction with sex, and sex-specific PGS stratification showed that all of them had substantial variation in the trajectories of blood cell traits with age. Genetic correlations between the PGSs for blood cell traits and common human diseases identified well-known as well as new associations. We develop machine learning-optimized PGS for blood cell traits, demonstrate their relationships with sex, age, and disease, and make these publicly available as a resource.
Identifiants
pubmed: 35072137
doi: 10.1016/j.xgen.2021.100086
pii: S2666-979X(21)00107-5
pmc: PMC8758502
doi:
Types de publication
Journal Article
Langues
eng
Pagination
NoneSubventions
Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : Medical Research Council
ID : MC_PC_17228
Pays : United Kingdom
Organisme : Medical Research Council
ID : MC_QA137853
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/L003120/1
Pays : United Kingdom
Informations de copyright
© 2021 The Author(s).
Déclaration de conflit d'intérêts
P.A. is a full-time employee of Regeneron Pharmaceuticals. A.S.B. has received grants (outside of this work) from AstraZeneca, Bayer, Biogen, BioMarin, Bioverativ, Merck, Novartis, Regeneron, and Sanofi. J.D. reports grants, personal fees, and non-financial support from Merck Sharp & Dohme (MSD); grants, personal fees, and non-financial support from Novartis; grants from Pfizer; and grants from AstraZeneca outside the submitted work. J.D. sits on the International Cardiovascular and Metabolic Advisory Board for Novartis (since 2010), serves on the Steering Committee of UK Biobank (since 2011), is an MRC International Advisory Group (ING) member, London (since 2013), an MRC High Throughput Science ‘Omics Panel Member, London (since 2013), a Scientific Advisory Committee member for Sanofi (since 2013), an International Cardiovascular and Metabolism Research and Development Portfolio Committee member for Novartis, and was a member of the Astra Zeneca Genomics Advisory Board (2018).
Références
Nat Genet. 2013 Jan;45(1):25-33
pubmed: 23202125
J Allergy Clin Immunol. 2007 Jun;119(6):1303-10; quiz 1311-2
pubmed: 17481712
Nat Genet. 2015 Sep;47(9):979-986
pubmed: 26192919
Genet Epidemiol. 2014 May;38(4):281-90
pubmed: 24676807
Gigascience. 2015 Feb 25;4:7
pubmed: 25722852
Nat Rev Genet. 2019 Mar;20(3):173-190
pubmed: 30581192
Genet Sel Evol. 2020 Feb 24;52(1):12
pubmed: 32093611
Nat Genet. 2015 Nov;47(11):1228-35
pubmed: 26414678
Cell. 2018 Jun 14;173(7):1705-1715.e16
pubmed: 29906448
Nat Genet. 2021 Apr;53(4):420-425
pubmed: 33692568
Nat Rev Genet. 2016 Jul;17(7):392-406
pubmed: 27140283
Nat Metab. 2021 Nov;3(11):1476-1483
pubmed: 34750571
J Gastroenterol Hepatol. 2018 Dec;33(12):1984-1989
pubmed: 29889986
Genetics. 2018 Nov;210(3):809-819
pubmed: 30171033
Nature. 2015 Oct 1;526(7571):82-90
pubmed: 26367797
PLoS Genet. 2014 Feb 13;10(2):e1004137
pubmed: 24550740
Nat Genet. 2015 Mar;47(3):291-5
pubmed: 25642630
Diabetes Care. 2019 Feb;42(2):200-207
pubmed: 30655379
Nat Genet. 2018 Jan;50(1):42-53
pubmed: 29273806
Clin Interv Aging. 2019 May 08;14:849-857
pubmed: 31190771
Nature. 2014 Feb 20;506(7488):376-81
pubmed: 24390342
Bioinformatics. 2020 Dec 16;:
pubmed: 33326037
Cell. 2016 Nov 17;167(5):1415-1429.e19
pubmed: 27863252
PLoS Genet. 2009 Oct;5(10):e1000678
pubmed: 19816555
Psychiatry Res. 2019 Jan;271:131-135
pubmed: 30472508
Lancet Respir Med. 2015 May;3(5):355-66
pubmed: 25736990
Genome Biol. 2017 Apr 11;18(1):67
pubmed: 28395661
P T. 2016 Dec;41(12):765-768
pubmed: 27990078
Annu Rev Immunol. 2018 Apr 26;36:489-517
pubmed: 29400998
Nat Genet. 2015 Oct;47(10):1121-1130
pubmed: 26343387
PLoS Genet. 2014 Nov 13;10(11):e1004754
pubmed: 25393026
Nature. 2018 Oct;562(7726):203-209
pubmed: 30305743
J Am Coll Cardiol. 2018 Oct 16;72(16):1883-1893
pubmed: 30309464
Nat Genet. 2017 Dec;49(12):1752-1757
pubmed: 29083406
Genet Epidemiol. 2013 Feb;37(2):184-95
pubmed: 23203348
Hum Mol Genet. 2019 Nov 21;28(R2):R133-R142
pubmed: 31363735
Cell. 2020 Sep 3;182(5):1214-1231.e11
pubmed: 32888494
J Exp Biol. 2009 Nov;212(Pt 21):3387-93
pubmed: 19837879
Cell. 2016 Nov 17;167(5):1167-1169
pubmed: 27863238
PLoS Genet. 2020 Oct 23;16(10):e1009141
pubmed: 33095761
Mol Psychiatry. 2020 Apr;25(4):761-775
pubmed: 30214039
Nat Genet. 2016 Nov;48(11):1443-1448
pubmed: 27694958
Am J Hum Genet. 2015 Oct 1;97(4):576-92
pubmed: 26430803
Trials. 2014 Sep 17;15:363
pubmed: 25230735
PLoS Med. 2015 Mar 31;12(3):e1001779
pubmed: 25826379
PLoS One. 2016 Sep 22;11(9):e0162953
pubmed: 27657912
Int J Lab Hematol. 2013 Jun;35(3):254-61
pubmed: 23590652
Nat Rev Genet. 2018 Sep;19(9):581-590
pubmed: 29789686
Eur Heart J. 2016 Nov 14;37(43):3267-3278
pubmed: 27655226
Planta. 2018 Nov;248(5):1307-1318
pubmed: 30101399
Hum Mol Genet. 2009 Sep 15;18(18):3525-31
pubmed: 19553258