Identifying multi-hit carcinogenic gene combinations: Scaling up a weighted set cover algorithm using compressed binary matrix representation on a GPU.
Algorithms
Antineoplastic Combined Chemotherapy Protocols
/ pharmacology
Biomarkers, Tumor
/ antagonists & inhibitors
Carcinogenesis
/ genetics
Computational Biology
/ instrumentation
Computer Graphics
Datasets as Topic
Humans
Molecular Targeted Therapy
/ methods
Mutation
Neoplasms
/ drug therapy
Oligonucleotide Array Sequence Analysis
/ instrumentation
Precision Medicine
/ methods
Time Factors
Journal
Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288
Informations de publication
Date de publication:
06 02 2020
06 02 2020
Historique:
received:
29
08
2019
accepted:
20
01
2020
entrez:
8
2
2020
pubmed:
8
2
2020
medline:
21
11
2020
Statut:
epublish
Résumé
Despite decades of research, effective treatments for most cancers remain elusive. One reason is that different instances of cancer result from different combinations of multiple genetic mutations (hits). Therefore, treatments that may be effective in some cases are not effective in others. We previously developed an algorithm for identifying combinations of carcinogenic genes with mutations (multi-hit combinations), which could suggest a likely cause for individual instances of cancer. Most cancers are estimated to require three or more hits. However, the computational complexity of the algorithm scales exponentially with the number of hits, making it impractical for identifying combinations of more than two hits. To identify combinations of greater than two hits, we used a compressed binary matrix representation, and optimized the algorithm for parallel execution on an NVIDIA V100 graphics processing unit (GPU). With these enhancements, the optimized GPU implementation was on average an estimated 12,144 times faster than the original integer matrix based CPU implementation, for the 3-hit algorithm, allowing us to identify 3-hit combinations. The 3-hit combinations identified using a training set were able to differentiate between tumor and normal samples in a separate test set with 90% overall sensitivity and 93% overall specificity. We illustrate how the distribution of mutations in tumor and normal samples in the multi-hit gene combinations can suggest potential driver mutations for further investigation. With experimental validation, these combinations may provide insight into the etiology of cancer and a rational basis for targeted combination therapy.
Identifiants
pubmed: 32029803
doi: 10.1038/s41598-020-58785-y
pii: 10.1038/s41598-020-58785-y
pmc: PMC7005272
doi:
Substances chimiques
Biomarkers, Tumor
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
2022Références
PLoS One. 2012;7(12):e51352
pubmed: 23251504
Science. 2015 May 22;348(6237):880-6
pubmed: 25999502
Br J Cancer. 1954 Mar;8(1):1-12
pubmed: 13172380
Br J Cancer. 1953 Mar;7(1):68-72
pubmed: 13051507
Brief Bioinform. 2016 Jul;17(4):642-56
pubmed: 26307061
Sci Rep. 2017 Feb 24;7:43169
pubmed: 28233799
Nature. 2013 Jul 11;499(7457):214-218
pubmed: 23770567
PLoS Genet. 2016 Oct 10;12(10):e1006193
pubmed: 27723796
BMC Bioinformatics. 2018 Jun 5;19(1):214
pubmed: 29871594
Bioinformatics. 2020 Feb 1;36(3):928-929
pubmed: 31393560
Sci Rep. 2017 Dec;7(1):14
pubmed: 28144039
Oncogene. 2010 Feb 18;29(7):949-56
pubmed: 19881536
Clin Cancer Res. 1996 Sep;2(9):1571-5
pubmed: 9816335
Science. 2018 Nov 23;362(6417):911-917
pubmed: 30337457
Curr Opin Oncol. 2018 Jan;30(1):23-29
pubmed: 29076966
Cell. 2018 Apr 5;173(2):371-385.e18
pubmed: 29625053
Breast Cancer Res Treat. 2005 May;91(2):121-4
pubmed: 15868439
Sci Transl Med. 2012 Mar 28;4(127):127rv4
pubmed: 22461645
Br J Cancer. 1969 Jun;23(2):313-28
pubmed: 5788039
BMC Bioinformatics. 2013 Aug 10;14:244
pubmed: 23937229
JAMA. 2017 Jun 20;317(23):2402-2416
pubmed: 28632866
Science. 2013 Mar 29;339(6127):1546-58
pubmed: 23539594
Ecancermedicalscience. 2015 May 21;9:541
pubmed: 26082798
JAMA Netw Open. 2019 Jun 5;2(6):e196520
pubmed: 31251381
Tumour Biol. 1994;15(4):223-9
pubmed: 7939171
Proc Natl Acad Sci U S A. 2015 Jan 6;112(1):118-23
pubmed: 25535351
Cancer Res. 2016 Oct 1;76(19):5605-5609
pubmed: 27550448
Nature. 2014 Jan 23;505(7484):495-501
pubmed: 24390350
J Mol Graph Model. 2010 Jun;28(8):904-10
pubmed: 20452792
CA Cancer J Clin. 2019 Jan;69(1):7-34
pubmed: 30620402
J Pathol. 2010 May;221(1):49-56
pubmed: 20229506
Cold Spring Harb Perspect Med. 2017 Apr 3;7(4):
pubmed: 28270529
Cancer Res. 2007 Aug 15;67(16):7547-50
pubmed: 17699756
JAMA. 2016 Jul 5;316(1):93-4
pubmed: 27380347
Bioinformatics. 2016 Sep 1;32(17):i736-i745
pubmed: 27587696
Mol Genet Genomic Med. 2019 Aug;7(8):e866
pubmed: 31332964
BMC Bioinformatics. 2016 Nov 8;17(Suppl 12):341
pubmed: 28185561
PLoS Comput Biol. 2019 Mar 7;15(3):e1006881
pubmed: 30845172
Nat Genet. 2016 Oct;48(10):1288-94
pubmed: 27618449
Sci Rep. 2015 Oct 19;5:15199
pubmed: 26477325
Proc Natl Acad Sci U S A. 1971 Apr;68(4):820-3
pubmed: 5279523
Nature. 2016 Apr 14;532(7598):162-4
pubmed: 27075078
Cancer Discov. 2012 May;2(5):401-4
pubmed: 22588877
Nat Biotechnol. 2012 Jul 10;30(7):679-92
pubmed: 22781697
Annu Rev Pathol. 2015;10:25-50
pubmed: 25340638
Cancer Res. 2009 Mar 15;69(6):2349-57
pubmed: 19276375
Pflugers Arch. 2019 May;471(5):673-682
pubmed: 30919088
Clin Genet. 2018 Dec;94(6):512-520
pubmed: 30182498
BMC Med Genet. 2019 Oct 29;20(1):166
pubmed: 31664938
Nat Rev Cancer. 2017 Apr;17(4):239-253
pubmed: 28256574
Nat Genet. 2013 Oct;45(10):1113-20
pubmed: 24071849
Eur J Gynaecol Oncol. 1996;17(6):487-92
pubmed: 8971524
Proc Natl Acad Sci U S A. 2002 Nov 12;99(23):15095-100
pubmed: 12415112
Cancer Biol Ther. 2013 Mar;14(3):246-53
pubmed: 23291982
Oncol Lett. 2019 Dec;18(6):6431-6442
pubmed: 31807166
J Hum Genet. 2017 Apr;62(5):569-573
pubmed: 27928161
Nat Cell Biol. 2007 May;9(5):573-80
pubmed: 17417627
J Gen Physiol. 2015 Nov;146(5):399-410
pubmed: 26503721
Math Biosci. 2003 Jun;183(2):111-34
pubmed: 12711407
Bioinformatics. 2013 Sep 15;29(18):2238-44
pubmed: 23884480
Neurol Genet. 2017 Dec 11;3(6):e206
pubmed: 29264397
J Mol Diagn. 2014 Jan;16(1):75-88
pubmed: 24211364
Ann Clin Transl Neurol. 2019 Jul;6(7):1311-1318
pubmed: 31353864
Bioinformatics. 2014 Sep 1;30(17):i572-8
pubmed: 25161249
Sci Rep. 2019 Jan 30;9(1):1005
pubmed: 30700767
Oncogene. 1995 Oct 5;11(7):1217-28
pubmed: 7478541
Genome Res. 2012 Aug;22(8):1589-98
pubmed: 22759861
Neurol Genet. 2017 Dec 11;3(6):e198
pubmed: 29264390
BMC Bioinformatics. 2014 Sep 19;15:308
pubmed: 25236784
Clin Transl Med. 2018 Mar 1;7(1):11
pubmed: 29541939