Comprehensive interrogation of gene lists from genome-scale cancer screens with oncoEnrichR.

cancer relevance gene set analysis genome-scale screening hit prioritization target discovery

Journal

International journal of cancer
ISSN: 1097-0215
Titre abrégé: Int J Cancer
Pays: United States
ID NLM: 0042124

Informations de publication

Date de publication:
15 11 2023
Historique:
revised: 19 06 2023
received: 01 03 2023
accepted: 04 07 2023
medline: 19 9 2023
pubmed: 8 8 2023
entrez: 8 8 2023
Statut: ppublish

Résumé

Genome-scale screening experiments in cancer produce long lists of candidate genes that require extensive interpretation for biological insight and prioritization for follow-up studies. Interrogation of gene lists frequently represents a significant and time-consuming undertaking, in which experimental biologists typically combine results from a variety of bioinformatics resources in an attempt to portray and understand cancer relevance. As a means to simplify and strengthen the support for this endeavor, we have developed oncoEnrichR, a flexible bioinformatics tool that allows cancer researchers to comprehensively interrogate a given gene list along multiple facets of cancer relevance. oncoEnrichR differs from general gene set analysis frameworks through the integration of an extensive set of prior knowledge specifically relevant for cancer, including ranked gene-tumor type associations, literature-supported proto-oncogene and tumor suppressor gene annotations, target druggability data, regulatory interactions, synthetic lethality predictions, as well as prognostic associations, gene aberrations and co-expression patterns across tumor types. The software produces a structured and user-friendly analysis report as its main output, where versions of all underlying data resources are explicitly logged, the latter being a critical component for reproducible science. We demonstrate the usefulness of oncoEnrichR through interrogation of two candidate lists from proteomic and CRISPR screens. oncoEnrichR is freely available as a web-based service hosted by the Galaxy platform (https://oncotools.elixir.no), and can also be accessed as a stand-alone R package (https://github.com/sigven/oncoEnrichR).

Identifiants

pubmed: 37551617
doi: 10.1002/ijc.34666
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1819-1828

Informations de copyright

© 2023 The Authors. International Journal of Cancer published by John Wiley & Sons Ltd on behalf of UICC.

Références

Bock C, Datlinger P, Chardon F, et al. High-content CRISPR screening. Nat Rev Methods Primers. 2022;2:1-23.
Sharifi Tabar M, Francis H, Yeo D, Bailey CG, Rasko JEJ. Mapping oncogenic protein interactions for precision medicine. Int J Cancer. 2022;151:7-19.
Cieślik M, Chinnaiyan AM. Cancer transcriptome profiling at the juncture of clinical translation. Nat Rev Genet. 2018;19:93-109.
Zhou Y, Zhou B, Pache L, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10:1523.
Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44-57.
Chen EY, Tan CM, Kou Y, et al. Enrichr: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinform. 2013;14:128.
Safran M, Dalah I, Alexander J, et al. GeneCards version 3: the human gene integrator. Database. 2010;2010:baq020.
von Mering C, Jensen LJ, Snel B, et al. STRING: known and predicted protein-protein associations, integrated and transferred across organisms. Nucleic Acids Res. 2005;33:D433-D437.
Mostafavi S, Ray D, Warde-Farley D, Grouios C, Morris Q. GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function. Genome Biol. 2008;9(Suppl 1):S4.
Cerami E, Gao J, Dogrusoz U, et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012;2:401-404.
Tang Z, Kang B, Li C, Chen T, Zhang Z. GEPIA2: an enhanced web server for large-scale expression profiling and interactive analysis. Nucleic Acids Res. 2019;47:W556-W560.
The Cancer Genome Atlas Research Network, Weinstein JN, Collisson EA, et al. The cancer genome atlas pan-cancer analysis project. Nat Genet. 2013;45:1113-1120.
Ochoa D, Hercules A, Carmona M, et al. Open targets platform: supporting systematic drug-target identification and prioritisation. Nucleic Acids Res. 2021;49:D1302-D1310.
Malone J, Holloway E, Adamusiak T, et al. Modeling sample variables with an experimental factor ontology. Bioinformatics. 2010;26:1112-1118.
Kundra R, Zhang H, Sheridan R, et al. OncoTree: A cancer classification system for precision oncology. JCO Clin Cancer Inform. 2021;5:221-230.
Afgan E, Baker D, Batut B, et al. The galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. 2018;46:W537-W544.
Sondka Z, Bamford S, Cole CG, Ward SA, Dunham I, Forbes SA. The COSMIC cancer gene census: describing genetic dysfunction across all human cancers. Nat Rev Cancer. 2018;18:696-705.
Repana D, Nulsen J, Dressler L, et al. The network of cancer genes (NCG): a comprehensive catalogue of known and candidate cancer genes from cancer sequencing screens. Genome Biol. 2019;20:1.
Lever J, Zhao EY, Grewal J, Jones MR, Jones SJM. CancerMine: a literature-mined resource for drivers, oncogenes and tumor suppressors in cancer. Nat Methods. 2019;16:505-507.
Martínez-Jiménez F, Muiños F, Sentís I, et al. A compendium of mutational cancer driver genes. Nat Rev Cancer. 2020;20:555-572.
Dwane L, Behan FM, Gonçalves E, et al. Project score database: a resource for investigating cancer cell dependencies and prioritizing therapeutic targets. Nucleic Acids Res. 2020;49:D1365-D1372.
de Kegel B, Quinn N, Thompson NA, Adams DJ, Ryan CJ. Comprehensive prediction of robust synthetic lethality between paralog pairs in cancer cell lines. Cell Syst. 2021;12:1144-59.e6.
Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284-287.
Stark C, Breitkreutz B-J, Reguly T, Boucher L, Breitkreutz A, Tyers M. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006;34:D535-D539.
Tsitsiridis G, Steinkamp R, Giurgiu M, et al. CORUM: the comprehensive resource of mammalian protein complexes-2022. Nucleic Acids Res. 2022;51:D539-D545.
Meldal BHM, Bye-A-Jee H, Gajdoš L, et al. Complex portal 2018: extended content and enhanced visualization tools for macromolecular complexes. Nucleic Acids Res. 2018;47:D550-D558.
Vinayagam A, Hu Y, Kulkarni M, et al. Protein complex-based analysis framework for high-throughput data sets. Sci Signal. 2013;6:rs5.
Drew K, Wallingford JB, Marcotte EM. hu.MAP 2.0: integration of over 15,000 proteomic experiments builds a global compendium of human multiprotein assemblies. Mol Syst Biol. 2021;17:e10016.
Jin S, Guerrero-Juarez CF, Zhang L, et al. Inference and analysis of cell-cell communication using CellChat. Nat Commun. 2021;12:1088.
Garcia-Alonso L, Holland CH, Ibrahim MM, Turei D, Saez-Rodriguez J. Benchmark and integration of resources for the estimation of human transcription factor activities. Genome Res. 2019;29:1363-1375.
Jain A, Tuteja G. TissueEnrich: tissue-specific gene enrichment analysis. Bioinformatics. 2019;35:1966-1967.
Uhlen M, Oksvold P, Fagerberg L, et al. Towards a knowledge-based human protein atlas. Nat Biotechnol. 2010;28:1248-1250.
Smith JC, Sheltzer JM. Genome-wide identification and analysis of prognostic features in human cancers. Cell Rep. 2022;38:110569.
Gogleva A, Polychronopoulos D, Pfeifer M, et al. Knowledge graph-based recommendation framework identifies drivers of resistance in EGFR mutant non-small cell lung cancer. Nat Commun. 2022;13:1667.
Kostas M, Haugsten EM, Zhen Y, et al. Protein tyrosine phosphatase receptor type G (PTPRG) controls fibroblast growth factor receptor (FGFR) 1 activity and influences sensitivity to FGFR kinase inhibitors. Mol Cell Proteomics. 2018;17:850-870.
Ahmad I, Iwata T, Leung HY. Mechanisms of FGFR-mediated carcinogenesis. Biochim Biophys Acta. 2012;1823:850-860.
Tanner Y, Grose RP. Dysregulated FGF signalling in neoplastic disorders. Semin Cell Dev Biol. 2016;53:126-135.
Gentleman RC, Carey VJ, Bates DM, et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004;5:R80.
Dingar D, Kalkat M, Chan P-K, et al. BioID identifies novel c-MYC interacting partners in cultured cells and xenograft tumors. J Proteome. 2015;118:95-111.

Auteurs

Sigve Nakken (S)

Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway.
Centre for Cancer Cell Reprogramming, Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway.
Centre for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway.

Sveinung Gundersen (S)

Centre for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway.

Fabian L M Bernal (FLM)

University Center for Information Technology, University of Oslo, Oslo, Norway.

Dimitris Polychronopoulos (D)

Ochre Bio Ltd, Oxford, UK.

Eivind Hovig (E)

Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway.
Centre for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway.

Jørgen Wesche (J)

Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway.
Centre for Cancer Cell Reprogramming, Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH