Predicting venous thromboembolism risk from exomes in the Critical Assessment of Genome Interpretation (CAGI) challenges.
exomes
machine learning
phenotype prediction
prediction challenge
venous thromboembolism
Journal
Human mutation
ISSN: 1098-1004
Titre abrégé: Hum Mutat
Pays: United States
ID NLM: 9215429
Informations de publication
Date de publication:
09 2019
09 2019
Historique:
received:
31
01
2019
revised:
07
05
2019
accepted:
27
05
2019
pubmed:
30
5
2019
medline:
14
3
2020
entrez:
30
5
2019
Statut:
ppublish
Résumé
Genetics play a key role in venous thromboembolism (VTE) risk, however established risk factors in European populations do not translate to individuals of African descent because of the differences in allele frequencies between populations. As part of the fifth iteration of the Critical Assessment of Genome Interpretation, participants were asked to predict VTE status in exome data from African American subjects. Participants were provided with 103 unlabeled exomes from patients treated with warfarin for non-VTE causes or VTE and asked to predict which disease each subject had been treated for. Given the lack of training data, many participants opted to use unsupervised machine learning methods, clustering the exomes by variation in genes known to be associated with VTE. The best performing method using only VTE related genes achieved an area under the ROC curve of 0.65. Here, we discuss the range of methods used in the prediction of VTE from sequence data and explore some of the difficulties of conducting a challenge with known confounders. In addition, we show that an existing genetic risk score for VTE that was developed in European subjects works well in African Americans.
Identifiants
pubmed: 31140652
doi: 10.1002/humu.23825
pmc: PMC7047641
mid: NIHMS1068896
doi:
Substances chimiques
Warfarin
5Q7ZVV76EI
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Langues
eng
Sous-ensembles de citation
IM
Pagination
1314-1320Subventions
Organisme : National Institue of General Medical Sciences
ID : GM115486
Pays : International
Organisme : NIH Office of Extramural Research
ID : NIH R13 HG006650
Pays : International
Organisme : NIGMS NIH HHS
ID : R01 GM102365
Pays : United States
Organisme : NHGRI NIH HHS
ID : R13 HG006650
Pays : United States
Organisme : NHGRI NIH HHS
ID : U41 HG007346
Pays : United States
Organisme : NIGMS NIH HHS
ID : GM115486
Pays : United States
Organisme : NIH Office of Extramural Research
ID : NIH U41 HG007346
Pays : International
Organisme : NIH Office of Extramural Research
ID : HG007346
Pays : International
Organisme : NIGMS NIH HHS
ID : R24 GM061374
Pays : United States
Organisme : NIH Office of Extramural Research
ID : HG006650
Pays : International
Organisme : National Institute of Health (NIH)
ID : T32LM012409
Pays : International
Organisme : NLM NIH HHS
ID : T32 LM012409
Pays : United States
Organisme : National Institue of Health (NIH)
ID : T32LM012409
Pays : International
Informations de copyright
© 2019 Wiley Periodicals, Inc.
Références
Bioinformatics. 2008 Oct 15;24(20):2397-8
pubmed: 18757876
Br J Haematol. 2008 Nov;143(3):321-35
pubmed: 18710381
Hum Mutat. 2017 Sep;38(9):1182-1192
pubmed: 28634997
J Thromb Haemost. 2011 Oct;9(10):1877-82
pubmed: 21797965
Br J Haematol. 2009 Aug;146(4):369-83
pubmed: 19552721
Prim Care. 2004 Sep;31(3):685-709, xi
pubmed: 15331254
Nucleic Acids Res. 2017 Jan 4;45(D1):D833-D839
pubmed: 27924018
J Am Heart Assoc. 2014 Oct 23;3(5):e001060
pubmed: 25341889
J Thromb Haemost. 2009 Jul;7 Suppl 1:301-4
pubmed: 19630821
Blood. 2014 Oct 2;124(14):2298-305
pubmed: 25079360
J Thromb Haemost. 2016 Jan;14(1):83-8
pubmed: 26565658
Mol Genet Genomic Med. 2016 Jun 21;4(5):513-20
pubmed: 27652279
J Thromb Haemost. 2003 Jan;1(1):80-7
pubmed: 12871543