Machine Learning Prediction of Biomarkers from SNPs and of Disease Risk from Biomarkers in the UK Biobank.
Adult
Atherosclerosis / blood
Biological Specimen Banks
Biomarkers / blood
Calcium / blood
Cardiovascular Diseases / blood
Female
Heart Disease Risk Factors
Hemoglobins / genetics
Humans
Lipoprotein(a) / blood
Lipoproteins, HDL / blood
Lipoproteins, LDL / blood
Machine Learning
Male
Middle Aged
Multifactorial Inheritance / genetics
Risk Assessment
United Kingdom / epidemiology
United States / epidemiology
atherosclerotic cardiovascular disease
biomarkers
disease risk
machine learning
polygenic scores
Journal
Genes
ISSN: 2073-4425
Abbreviated title: Genes (Basel)
Country: Switzerland
NLM ID: 101551097
Publication information
Publication date:
2021-06-29
History:
received: 2021-03-30
revised: 2021-06-22
accepted: 2021-06-23
entrez: 2021-07-02
pubmed: 2021-07-03
medline: 2022-02-03
Status:
epublish
Abstract
We use UK Biobank data to train predictors of 65 blood and urine markers, such as HDL, LDL, lipoprotein A, and glycated haemoglobin, from SNP genotype. For example, our Polygenic Score (PGS) predictor correlates ∼0.76 with lipoprotein A level, which is highly heritable and an independent risk factor for heart disease. This may be the most accurate genomic prediction of a quantitative trait yet produced (specifically, for European ancestry groups). We also train predictors of common disease risk using blood and urine biomarkers alone (no DNA information); we call these predictors biomarker risk scores (BMRS). Individuals at high risk (e.g., odds ratio of >5× the population average) can be identified for conditions such as coronary artery disease (AUC ∼0.75), diabetes (AUC ∼0.95), hypertension, liver and kidney problems, and cancer using biomarkers alone. Our atherosclerotic cardiovascular disease (ASCVD) predictor uses ∼10 biomarkers and performs in UKB evaluation as well as or better than the American College of Cardiology ASCVD Risk Estimator, which uses quite different inputs (age, diagnostic history, BMI, smoking status, statin usage, etc.). We compare polygenic risk scores (PRS; risk conditional on genotype) for common diseases with the risk predictors that result from composing the learned functions BMRS and PGS, i.e., applying the BMRS predictors to the PGS output.
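The last sentence of the abstract describes a composed predictor: genotype is first mapped to predicted biomarker levels (PGS), and those predictions are then fed to a biomarker-based risk model (BMRS). The sketch below illustrates that chaining and a rank-based AUC on toy inputs. It is only an illustration under stated assumptions: the linear form for PGS, the logistic form for BMRS, every numeric weight, and the function names `pgs`, `bmrs`, `composed_risk`, and `auc` are hypothetical and are not taken from the paper.

```python
import math


def pgs(genotype, weights, intercept=0.0):
    """Polygenic score, assumed here to be linear in SNP allele counts (0, 1, or 2)."""
    return intercept + sum(w * g for w, g in zip(weights, genotype))


def bmrs(biomarkers, coefs, intercept=0.0):
    """Biomarker risk score, assumed here to be logistic in the biomarker levels."""
    z = intercept + sum(c * b for c, b in zip(coefs, biomarkers))
    return 1.0 / (1.0 + math.exp(-z))


def composed_risk(genotype, pgs_models, bmrs_coefs, bmrs_intercept=0.0):
    """Apply BMRS to PGS-predicted biomarkers: risk = BMRS(PGS(genotype))."""
    predicted = [pgs(genotype, w, b) for (w, b) in pgs_models]
    return bmrs(predicted, bmrs_coefs, bmrs_intercept)


def auc(scores, labels):
    """AUC as the probability that a random case outranks a random control.

    Ties between a case and a control count as half a win.
    """
    cases = [s for s, y in zip(scores, labels) if y == 1]
    controls = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((c > k) + 0.5 * (c == k) for c in cases for k in controls)
    return wins / (len(cases) * len(controls))


# Toy, made-up numbers: 3 SNPs predicting 2 biomarkers, then 1 disease risk.
pgs_models = [([0.4, -0.2, 0.1], 1.0), ([0.0, 0.3, 0.3], 0.5)]  # (weights, intercept)
bmrs_coefs = [1.2, -0.8]
genotype = [2, 0, 1]

risk = composed_risk(genotype, pgs_models, bmrs_coefs, bmrs_intercept=-1.0)
print(f"composed risk for the toy genotype: {risk:.3f}")
```

In this framing, a PRS would be trained directly from genotype to disease status, whereas the composed predictor reuses two separately trained stages; comparing the two on held-out individuals (for example with `auc`) is the kind of comparison the abstract refers to.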
Identifiers
pubmed: 34209487
pii: genes12070991
doi: 10.3390/genes12070991
pmc: PMC8308062
Chemical substances
Biomarkers: 0
Hemoglobins: 0
Lipoprotein(a): 0
Lipoproteins, HDL: 0
Lipoproteins, LDL: 0
Calcium: SY7Q814VUP
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Grants
Agency: Medical Research Council
ID: MC_PC_17228
Country: United Kingdom
Agency: Medical Research Council
ID: MC_QA137853
Country: United Kingdom