Exploring Integrative Analysis Using the BioMedical Evidence Graph.
Journal
JCO clinical cancer informatics
ISSN: 2473-4276
Titre abrégé: JCO Clin Cancer Inform
Pays: United States
ID NLM: 101708809
Informations de publication
Date de publication:
02 2020
02 2020
Historique:
entrez:
26
2
2020
pubmed:
26
2
2020
medline:
9
3
2021
Statut:
ppublish
Résumé
The analysis of cancer biology data involves extremely heterogeneous data sets, including information from RNA sequencing, genome-wide copy number, DNA methylation data reporting on epigenetic regulation, somatic mutations from whole-exome or whole-genome analyses, pathology estimates from imaging sections or subtyping, drug response or other treatment outcomes, and various other clinical and phenotypic measurements. Bringing these different resources into a common framework, with a data model that allows for complex relationships as well as dense vectors of features, will unlock integrated data set analysis. We introduce the BioMedical Evidence Graph (BMEG), a graph database and query engine for discovery and analysis of cancer biology. The BMEG is unique from other biologic data graphs in that sample-level molecular and clinical information is connected to reference knowledge bases. It combines gene expression and mutation data with drug-response experiments, pathway information databases, and literature-derived associations. The construction of the BMEG has resulted in a graph containing > 41 million vertices and 57 million edges. The BMEG system provides a graph query-based application programming interface to enable analysis, with client code available for Python, Javascript, and R, and a server online at bmeg.io. Using this system, we have demonstrated several forms of cross-data set analysis to show the utility of the system. The BMEG is an evolving resource dedicated to enabling integrative analysis. We have demonstrated queries on the system that illustrate mutation significance analysis, drug-response machine learning, patient-level knowledge-base queries, and pathway level analysis. We have compared the resulting graph to other available integrated graph systems and demonstrated the former is unique in the scale of the graph and the type of data it makes available.
Identifiants
pubmed: 32097025
doi: 10.1200/CCI.19.00110
pmc: PMC7049249
doi:
Substances chimiques
Antineoplastic Agents
0
Biomarkers, Tumor
0
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
147-159Subventions
Organisme : NIGMS NIH HHS
ID : R01 GM109031
Pays : United States
Références
Genome Biol. 2016 Jun 06;17(1):122
pubmed: 27268795
Sci Rep. 2016 Dec 16;6:39259
pubmed: 27982081
Nucleic Acids Res. 2014 Jan;42(Database issue):D1098-106
pubmed: 24234439
Cell. 2013 Aug 29;154(5):1151-1161
pubmed: 23993102
Nucleic Acids Res. 2014 Jan;42(Database issue):D222-30
pubmed: 24288371
Nucleic Acids Res. 2013 Jan;41(Database issue):D955-61
pubmed: 23180760
Nat Methods. 2016 Sep 29;13(10):806-7
pubmed: 27684579
Nucleic Acids Res. 2018 Jan 4;46(D1):D994-D1002
pubmed: 30053271
Nucleic Acids Res. 2018 Jan 4;46(D1):D649-D655
pubmed: 29145629
Nucleic Acids Res. 2017 Jan 4;45(D1):D712-D722
pubmed: 27899636
Nucleic Acids Res. 2017 Jan 4;45(D1):D972-D978
pubmed: 27651457
Nat Biotechnol. 2014 Dec;32(12):1202-12
pubmed: 24880487
Database (Oxford). 2011 Nov 26;2011:bar052
pubmed: 22120663
Nucleic Acids Res. 2019 Jan 8;47(D1):D427-D432
pubmed: 30357350
Nucleic Acids Res. 2010 Jan;38(Database issue):D787-91
pubmed: 19933260
J Biomed Inform. 2008 Oct;41(5):706-16
pubmed: 18472304
Nucleic Acids Res. 2016 Jan 4;44(D1):D1202-13
pubmed: 26400175
Nature. 2019 May;569(7757):503-508
pubmed: 31068700
Nucleic Acids Res. 2019 Jan 8;47(D1):D559-D563
pubmed: 30357367
BMC Syst Biol. 2013 Feb 21;7:15
pubmed: 23433509
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D449-51
pubmed: 14681454
Genome Biol. 2010 Jan 12;11(1):R3
pubmed: 20067622
Genome Biol. 2005;6(1):R2
pubmed: 15642094
Cell Syst. 2018 Mar 28;6(3):271-281.e7
pubmed: 29596782
Genomics Inform. 2017 Mar;15(1):19-27
pubmed: 28416946
Genome Biol. 2011;12(4):R41
pubmed: 21527027
Nature. 2011 Feb 10;470(7333):163-5
pubmed: 21307913
Nucleic Acids Res. 2015 Jan;43(Database issue):D512-20
pubmed: 25514926
Nucleic Acids Res. 2017 Jan 4;45(D1):D183-D189
pubmed: 27899595
Bioinformatics. 2011 Jun 15;27(12):1739-40
pubmed: 21546393
Annu Rev Pharmacol Toxicol. 2014;54:9-26
pubmed: 24016212
Nucleic Acids Res. 2014 Jan;42(Database issue):D358-63
pubmed: 24234451
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D535-9
pubmed: 16381927
Nucleic Acids Res. 2018 Jan 4;46(D1):D1068-D1073
pubmed: 29156001
Nat Chem Biol. 2016 Feb;12(2):109-16
pubmed: 26656090
Nucleic Acids Res. 2009 Jan;37(Database issue):D767-72
pubmed: 18988627
Nucleic Acids Res. 2003 Jan 1;31(1):248-50
pubmed: 12519993
Nucleic Acids Res. 2002 Jan 1;30(1):38-41
pubmed: 11752248
Nucleic Acids Res. 2019 Jan 8;47(D1):D923-D929
pubmed: 30260411
Nat Genet. 2000 May;25(1):25-9
pubmed: 10802651
Bioinformatics. 2013 Dec 15;29(24):3107-8
pubmed: 24135261
Nucleic Acids Res. 2014 Jan;42(Database issue):D1083-90
pubmed: 24214965
PLoS Biol. 2008 Jul 22;6(7):e184
pubmed: 18651794
Nat Biotechnol. 2013 May;31(5):419-25
pubmed: 23455439
BMC Syst Biol. 2018 Nov 20;12(Suppl 5):98
pubmed: 30458802
Nucleic Acids Res. 2011 Jan;39(Database issue):D685-90
pubmed: 21071392
Nat Genet. 2013 Jun;45(6):580-5
pubmed: 23715323
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50
pubmed: 16199517
Nucleic Acids Res. 2011 Jan;39(Database issue):D800-6
pubmed: 21045057
Clin Lung Cancer. 2014 Jan;15(1):21-51
pubmed: 24377743
Nat Rev Drug Discov. 2011 Aug 01;10(8):579-90
pubmed: 21804595
Nucleic Acids Res. 2009 Jan;37(Database issue):D674-9
pubmed: 18832364
Cell Rep. 2019 Dec 10;29(11):3367-3373.e4
pubmed: 31825821
Nature. 2019 Jan;565(7738):E5-E6
pubmed: 30559381
BioData Min. 2016 Jul 25;9:23
pubmed: 27462371
Nat Genet. 2014 Apr;46(4):318-319
pubmed: 24675517