Benchmarking machine learning methods for synthetic lethality prediction in cancer.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
20 Oct 2024
Historique:
received: 27 11 2023
accepted: 23 09 2024
medline: 21 10 2024
pubmed: 21 10 2024
entrez: 20 10 2024
Statut: epublish

Résumé

Synthetic lethality (SL) is a gold mine of anticancer drug targets, exposing cancer-specific dependencies of cellular survival. To complement resource-intensive experimental screening, many machine learning methods for SL prediction have emerged recently. However, a comprehensive benchmarking is lacking. This study systematically benchmarks 12 recent machine learning methods for SL prediction, assessing their performance across diverse data splitting scenarios, negative sample ratios, and negative sampling techniques, on both classification and ranking tasks. We observe that all the methods can perform significantly better by improving data quality, e.g., excluding computationally derived SLs from training and sampling negative labels based on gene expression. Among the methods, SLMGAE performs the best. Furthermore, the methods have limitations in realistic scenarios such as cold-start independent tests and context-specific SLs. These results, together with source code and datasets made freely available, provide guidance for selecting suitable methods and developing more powerful techniques for SL virtual screening.

Identifiants

pubmed: 39428397
doi: 10.1038/s41467-024-52900-7
pii: 10.1038/s41467-024-52900-7
doi:

Substances chimiques

Antineoplastic Agents 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

9058

Informations de copyright

© 2024. The Author(s).

Références

Bridges, C. B. The origin of variations in sexual and sex-limited characters. Am. Nat. 56, 51–63 (1922).
doi: 10.1086/279847
Dobzhansky, T. Genetics of natural populations. XIII. Recombination and variability in populations of Drosophila pseudoobscura. Genetics 31, 269–290 (1946).
pubmed: 20985721 pmcid: 1209328 doi: 10.1093/genetics/31.3.269
Hartwell, L. H., Szankasi, P., Roberts, C. J., Murray, A. W. & Friend, S. H. Integrating genetic approaches into the discovery of anticancer drugs. Science 278, 1064–1068 (1997).
pubmed: 9353181 doi: 10.1126/science.278.5340.1064
Kaelin, W. G. Choosing anticancer drug targets in the postgenomic era. J. Clin. Investig. 104, 1503–1506 (1999).
pubmed: 10587513 pmcid: 409873 doi: 10.1172/JCI8888
Lord, C. J. & Ashworth, A. PARP inhibitors: synthetic lethality in the clinic. Science 355, 1152–1158 (2017).
pubmed: 28302823 pmcid: 6175050 doi: 10.1126/science.aam7344
Satoh, M. S. & Lindahl, T. Role of poly(ADP-ribose) formation in DNA repair. Nature 356, 356–358 (1992).
pubmed: 1549180 doi: 10.1038/356356a0
De Vos, M., Schreiber, V. & Dantzer, F. The diverse roles and clinical relevance of PARPs in DNA damage repair: Current state of the art. Biochem. Pharmacol. 84, 137–146 (2012).
pubmed: 22469522 doi: 10.1016/j.bcp.2012.03.018
Krishnakumar, R. & Kraus, W. L. The PARP side of the nucleus: molecular actions, physiological outcomes, and clinical targets. Mol. Cell 39, 8–24 (2010).
pubmed: 20603072 pmcid: 2923840 doi: 10.1016/j.molcel.2010.06.017
Bryant, H. E. et al. Specific killing of BRCA2-deficient tumours with inhibitors of poly(ADP-ribose) polymerase (vol 434, pg 913, 2005). Nature 447, 346–346 (2007).
doi: 10.1038/nature05789
Farago, A. F. et al. Combination olaparib and temozolomide in relapsed small cell lung cancer. Cancer Discov. 9, 1372–1387 (2019).
Moore, K. et al. Maintenance olaparib in patients with newly diagnosed advanced ovarian cancer. N. Engl. J. Med. 379, 2495–2505 (2018).
pubmed: 30345884 doi: 10.1056/NEJMoa1810858
Fong, P. C. et al. Inhibition of poly(ADP-ribose) polymerase in tumors from BRCA mutation carriers. N. Engl. J. Med. 361, 123–134 (2009).
pubmed: 19553641 doi: 10.1056/NEJMoa0900212
Liu, L. et al. Synthetic lethality-based identification of targets for anticancer drugs in the human signaling network. Sci. Rep. 8, 8440 (2018).
pubmed: 29855504 pmcid: 5981615 doi: 10.1038/s41598-018-26783-w
Setten, R. L., Rossi, J. J. & Han, S.-P. The current state and future directions of RNAi-based therapeutics. Nat. Rev. Drug Discov. 18, 421–446 (2019).
pubmed: 30846871 doi: 10.1038/s41573-019-0017-4
Behan, F. M. et al. Prioritization of cancer therapeutic targets using CRISPR-Cas9 screens. Nature 568, 511–516 (2019).
pubmed: 30971826 doi: 10.1038/s41586-019-1103-9
Topatana, W. et al. Advances in synthetic lethality for cancer therapy: cellular mechanism and clinical translation. J. Hematol. Oncol. 13, 1–22 (2020).
doi: 10.1186/s13045-020-00956-5
Horlbeck, M. A. et al. Mapping the genetic landscape of human cells. Cell 174, 953–967.e22 (2018).
pubmed: 30033366 pmcid: 6426455 doi: 10.1016/j.cell.2018.06.010
Wang, J. et al. Computational methods, databases and tools for synthetic lethality prediction. Brief. Bioinform. 23, bbac106 (2022).
pubmed: 35352098 pmcid: 9116379 doi: 10.1093/bib/bbac106
Jerby-Arnon, L. et al. Predicting cancer-specific vulnerability via data-driven detection of synthetic lethality. Cell 158, 1199–1209 (2014).
pubmed: 25171417 doi: 10.1016/j.cell.2014.07.027
Lee, J. S. et al. Harnessing synthetic lethality to predict the response to cancer treatment. Nat. Commun. 9, 2546 (2018).
pubmed: 29959327 pmcid: 6026173 doi: 10.1038/s41467-018-04647-1
Sinha, S. et al. Systematic discovery of mutation-specific synthetic lethals by mining pan-cancer human primary tumor data. Nat. Commun. 8, 15580 (2017).
pubmed: 28561042 pmcid: 5460027 doi: 10.1038/ncomms15580
Yang, C. et al. Mapping the landscape of synthetic lethal interactions in liver cancer. Theranostics 11, 9038–9053 (2021).
pubmed: 34522226 pmcid: 8419043 doi: 10.7150/thno.63416
De Kegel, B., Quinn, N., Thompson, N. A., Adams, D. J. & Ryan, C. J. Comprehensive prediction of robust synthetic lethality between paralog pairs in cancer cell lines. Cell Syst. 12, 1144–+ (2021).
pubmed: 34529928 doi: 10.1016/j.cels.2021.08.006
Benfatto, S. et al. Uncovering cancer vulnerabilities by machine learning prediction of synthetic lethality. Mol. Cancer 20, 111 (2021).
pubmed: 34454516 pmcid: 8401190 doi: 10.1186/s12943-021-01405-8
Li, J. et al. Identification of synthetic lethality based on a functional network by using machine learning algorithms. J. Cell. Biochem. 120, 405–416 (2019).
pubmed: 30125975 doi: 10.1002/jcb.27395
Tang, S. et al. Synthetic lethal gene pairs: experimental approaches and predictive models. Front. Genet. 13, 961611 (2022).
pubmed: 36531238 pmcid: 9751344 doi: 10.3389/fgene.2022.961611
Wang, J. et al. SynLethDB 2.0: A web-based knowledge graph database on synthetic lethality for novel anticancer drug discovery. Database. 2022, baac030 (2022).
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat. Genet. 25, 25–9 (2000).
pubmed: 10802651 pmcid: 3037419 doi: 10.1038/75556
Oughtred, R. et al. The BioGRID database: a comprehensive biomedical resource of curated protein, genetic, and chemical interactions. Protein Sci. 30, 187–200 (2021).
pubmed: 33070389 doi: 10.1002/pro.3978
Kanehisa, M., Furumichi, M., Sato, Y., Kawashima, M. & Ishiguro-Watanabe, M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 51, D587–D592 (2022).
Liu, Y., Wu, M., Liu, C., Li, X.-L. & Zheng, J. SL
pubmed: 30969932 doi: 10.1109/TCBB.2019.2909908
Liany, H., Jeyasekharan, A. & Rajan, V. Predicting synthetic lethal interactions using heterogeneous data sources. Bioinformatics 36, 2209–2216 (2020).
pubmed: 31782759 doi: 10.1093/bioinformatics/btz893
Huang, J., Wu, M., Lu, F., Ou-Yang, L. & Zhu, Z. Predicting synthetic lethal interactions in human cancers using graph regularized self-representative matrix factorization. BMC Bioinform. 20, 1–8 (2019).
doi: 10.1186/s12859-019-3197-3
Cai, R., Chen, X., Fang, Y., Wu, M. & Hao, Y. Dual-dropout graph convolutional network for predicting synthetic lethality in human cancers. Bioinformatics 36, 4458–4465 (2020).
pubmed: 32221609 doi: 10.1093/bioinformatics/btaa211
Long, Y. et al. Graph contextualized attention network for predicting synthetic lethality in human cancers. Bioinformatics 37, 2432–2440 (2021).
pubmed: 33609108 doi: 10.1093/bioinformatics/btab110
Hao, Z. et al. Prediction of synthetic lethal interactions in human cancers using multi-view graph auto-encoder. IEEE J. Biomed. Health Inform. 25, 4041–4051 (2021).
pubmed: 33974548 doi: 10.1109/JBHI.2021.3079302
Lai, M. et al. Predicting synthetic lethality in human cancers via multi-graph ensemble neural network (IEEE, 2021).
Long, Y. et al. Pre-training graph neural networks for link prediction in biomedical networks. Bioinformatics 38, 2254–2262 (2022).
pubmed: 35171981 doi: 10.1093/bioinformatics/btac100
Wang, S. et al. KG4SL: knowledge graph neural network for synthetic lethality prediction in human cancers. Bioinformatics 37, i418–i425 (2021).
pubmed: 34252965 pmcid: 8336442 doi: 10.1093/bioinformatics/btab271
Zhu, Y., Zhou, Y., Liu, Y., Wang, X. & Li, J. SLGNN: synthetic lethality prediction in human cancers based on factor-aware knowledge graph neural network. Bioinformatics 39, btad015 (2023).
pubmed: 36645245 pmcid: 9907046 doi: 10.1093/bioinformatics/btad015
Liu, X. et al. PiLSL: pairwise interaction learning-based graph neural network for synthetic lethality prediction in human cancers. Bioinformatics 38, ii106–ii112 (2022).
pubmed: 36124788 doi: 10.1093/bioinformatics/btac476
Wang, S. et al. NSF4SL: negative-sample-free contrastive learning for ranking synthetic lethal partner genes in human cancers. Bioinformatics 38, ii13–ii19 (2022).
pubmed: 36124790 doi: 10.1093/bioinformatics/btac462
Gillespie, M. et al. The reactome pathway knowledgebase 2022. Nucleic Acids Res. 50, D687–D692 (2022).
pubmed: 34788843 doi: 10.1093/nar/gkab1028
Zhang, K., Wu, M., Liu, Y., Feng, Y. & Zheng, J. KR4SL: knowledge graph reasoning for explainable prediction of synthetic lethality. Bioinformatics 39, i158–i167 (2023).
pubmed: 37387166 pmcid: 10311291 doi: 10.1093/bioinformatics/btad261
Fan, K., Tang, S., Gökbağ, B., Cheng, L. & Li, L. Multi-view graph convolutional network for cancer cell-specific synthetic lethality prediction. Front. Genet. 13, 1103092 (2022).
Tepeli, Y. I., Seale, C. & Gonçalves, J. P. ELISL: early-late integrated synthetic lethality prediction in cancer. Bioinformatics 40, btad764 (2024).
pubmed: 38113447 doi: 10.1093/bioinformatics/btad764
Shen, J. P. et al. Combinatorial CRISPR-Cas9 screens for de novo mapping of genetic interactions. Nat. Methods 14, 573–576 (2017).
pubmed: 28319113 pmcid: 5449203 doi: 10.1038/nmeth.4225
Han, K. et al. Synergistic drug combinations for cancer identified in a CRISPR screen for pairwise genetic interactions. Nat. Biotechnol. 35, 463–474 (2017).
pubmed: 28319085 pmcid: 5557292 doi: 10.1038/nbt.3834
Najm, F. J. et al. Orthologous CRISPR-Cas9 enzymes for combinatorial genetic screens. Nat. Biotechnol. 36, 179–189 (2018).
pubmed: 29251726 doi: 10.1038/nbt.4048
Zhao, D. et al. Combinatorial CRISPR-Cas9 metabolic screens reveal critical redox control points dependent on the KEAP1-NRF2 regulatory axis. Mol. Cell 69, 699–708.e7 (2018).
pubmed: 29452643 pmcid: 5819357 doi: 10.1016/j.molcel.2018.01.017
Ma, M., Na, S. & Wang, H. AEGCN: an autoencoder-constrained graph convolutional network. Neurocomputing 432, 21–31 (2021).
doi: 10.1016/j.neucom.2020.12.061
Li, Q., Han, Z. & Wu, X.-m. Deeper insights into graph convolutional networks for semi-supervised learning. in Proc. of the AAAI Conference on Artificial Intelligence 32 (2018).
Ito, T. et al. Paralog knockout profiling identifies DUSP4 and DUSP6 as a digenic dependence in MAPK pathway-driven cancers. Nat. Genet. 53, 1664–1672 (2021).
pubmed: 34857952 doi: 10.1038/s41588-021-00967-z
Parrish, P. C. R. et al. Discovery of synthetic lethal and tumor suppressor paralog pairs in the human genome. Cell Rep. 36, 109597 (2021).
pubmed: 34469736 pmcid: 8534300 doi: 10.1016/j.celrep.2021.109597
Thompson, N. A. et al. Combinatorial CRISPR screen identifies fitness effects of gene paralogues. Nat. Commun. 12, 1302 (2021).
pubmed: 33637726 pmcid: 7910459 doi: 10.1038/s41467-021-21478-9
Vidigal, J. A. & Ventura, A. Rapid and efficient one-step generation of paired gRNA CRISPR-Cas9 libraries. Nat. Commun. 6, 8083 (2015).
pubmed: 26278926 doi: 10.1038/ncomms9083
Zhang, B. et al. The tumor therapy landscape of synthetic lethality. Nat. Commun. 12, 1275 (2021).
pubmed: 33627666 pmcid: 7904840 doi: 10.1038/s41467-021-21544-2
Srivatsa, S. et al. Discovery of synthetic lethal interactions from large-scale pan-cancer perturbation screens. Nat. Commun. 13, 7748 (2022).
pubmed: 36517508 pmcid: 9751287 doi: 10.1038/s41467-022-35378-z
Reid, R. J. D. et al. A synthetic dosage lethal genetic interaction between CKS1B and PLK1 is conserved in yeast and human cancer cells. Genetics 204, 807–819 (2016).
pubmed: 27558135 pmcid: 5068864 doi: 10.1534/genetics.116.190231
O’Neil, N. J., Bailey, M. L. & Hieter, P. Synthetic lethality and cancer. Nat. Rev. Genet. 18, 613–623 (2017).
pubmed: 28649135 doi: 10.1038/nrg.2017.47
Muller, F. L., Aquilanti, E. A. & Depinho, R. A. Collateral lethality: a new therapeutic strategy in oncology. Trends Cancer 1, 161–173 (2015).
pubmed: 26870836 pmcid: 4746004 doi: 10.1016/j.trecan.2015.10.002
Dey, P. et al. Genomic deletion of malic enzyme 2 confers collateral lethality in pancreatic cancer. Nature 542, 119–123 (2017).
pubmed: 28099419 pmcid: 5398413 doi: 10.1038/nature21052
Li, S. et al. Development of synthetic lethality in cancer: molecular and cellular classification. Signal Transduct. Target. Ther. 5, 241 (2020).
pubmed: 33077733 pmcid: 7573576 doi: 10.1038/s41392-020-00358-6
Seal, R. L. et al. Genenames.org: the HGNC resources in 2023. Nucleic Acids Res. 51, D1003–D1009 (2022).
Cunningham, F. et al. Ensembl 2022. Nucleic Acids Res. 50, D988–D995 (2022).
pubmed: 34791404 doi: 10.1093/nar/gkab1049
Chua, H. N., Sung, W.-K. & Wong, L. Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions. Bioinformatics 22, 1623–1630 (2006).
pubmed: 16632496 doi: 10.1093/bioinformatics/btl145
Tsitsiridis, G. et al. CORUM: The comprehensive resource of mammalian protein complexes-2022. Nucleic Acids Res. 51, D539–D545 (2022).
Yu, G. Gene ontology semantic similarity analysis using GOSemSim. in Methods in Molecular Biology 2117, 207–215 (2020).
Tsherniak, A. et al. Defining a cancer dependency map. Cell 170, 564–576.e16 (2017).
pubmed: 28753430 pmcid: 5667678 doi: 10.1016/j.cell.2017.06.010
The UniProt Consortium. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 49, D480–D489 (2021).
doi: 10.1093/nar/gkaa1100
Feng, Y. et al. Benchmarking machine learning methods for synthetic lethality prediction in cancer. Zenodo repository, https://zenodo.org/records/13691648 (2024).

Auteurs

Yimiao Feng (Y)

School of Information Science and Technology, ShanghaiTech University, Shanghai, China.
Lingang Laboratory, Shanghai, China.

Yahui Long (Y)

Bioformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore.

He Wang (H)

School of Information Science and Technology, ShanghaiTech University, Shanghai, China.

Yang Ouyang (Y)

School of Information Science and Technology, ShanghaiTech University, Shanghai, China.

Quan Li (Q)

School of Information Science and Technology, ShanghaiTech University, Shanghai, China.

Min Wu (M)

Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore. wumin@i2r.a-star.edu.sg.

Jie Zheng (J)

School of Information Science and Technology, ShanghaiTech University, Shanghai, China. zhengjie@shanghaitech.edu.cn.
Shanghai Engineering Research Center of Intelligent Vision and Imaging, Shanghai, China. zhengjie@shanghaitech.edu.cn.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH