A heuristic algorithm solving the mutual-exclusivity-sorting problem.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 01 2023
Historique:
received: 04 07 2022
revised: 14 11 2022
accepted: 11 01 2023
entrez: 20 1 2023
pubmed: 21 1 2023
medline: 25 1 2023
Statut: ppublish

Résumé

Binary (or Boolean) matrices provide a common effective data representation adopted in several domains of computational biology, especially for investigating cancer and other human diseases. For instance, they are used to summarize genetic aberrations-copy number alterations or mutations-observed in cancer patient cohorts, effectively highlighting combinatorial relations among them. One of these is the tendency for two or more genes not to be co-mutated in the same sample or patient, i.e. a mutual-exclusivity trend. Exploiting this principle has allowed identifying new cancer driver protein-interaction networks and has been proposed to design effective combinatorial anti-cancer therapies rationally. Several tools exist to identify and statistically assess mutual-exclusive cancer-driver genomic events. However, these tools need to be equipped with robust/efficient methods to sort rows and columns of a binary matrix to visually highlight possible mutual-exclusivity trends. Here, we formalize the mutual-exclusivity-sorting problem and present MutExMatSorting: an R package implementing a computationally efficient algorithm able to sort rows and columns of a binary matrix to highlight mutual-exclusivity patterns. Particularly, our algorithm minimizes the extent of collective vertical overlap between consecutive non-zero entries across rows while maximizing the number of adjacent non-zero entries in the same row. Here, we demonstrate that existing tools for mutual-exclusivity analysis are suboptimal according to these criteria and are outperformed by MutExMatSorting. https://github.com/AleVin1995/MutExMatSorting. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 36669133
pii: 6986128
doi: 10.1093/bioinformatics/btad016
pmc: PMC9857977
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2023. Published by Oxford University Press.

Références

Lung Cancer. 2019 Apr;130:50-58
pubmed: 30885352
Cancer Res. 2011 Nov 1;71(21):6718-27
pubmed: 21900401
Nat Commun. 2019 Aug 5;10(1):3512
pubmed: 31383865
Nat Rev Genet. 2013 Oct;14(10):703-18
pubmed: 24022702
Bioinformatics. 2014 Sep 1;30(17):i617-23
pubmed: 25161255
Front Oncol. 2021 Oct 04;11:745187
pubmed: 34671561
Cell Syst. 2020 May 20;10(5):424-432.e6
pubmed: 32437684
Nat Genet. 2013 Oct;45(10):1113-20
pubmed: 24071849
Nat Rev Drug Discov. 2017 Oct;16(10):736
pubmed: 28935909
Nucleic Acids Res. 2020 Jan 8;48(D1):D1145-D1152
pubmed: 31686107
Sci Rep. 2018 Apr 30;8(1):6713
pubmed: 29713020
Nat Rev Genet. 2017 Oct;18(10):613-623
pubmed: 28649135
Oncogene. 2016 Jun 16;35(24):3209-16
pubmed: 26477306
Science. 2017 Mar 17;355(6330):1152-1158
pubmed: 28302823
Biol Direct. 2015 Oct 01;10:57
pubmed: 26427375
BMC Bioinformatics. 2016 Dec 20;17(1):542
pubmed: 27998275
Proc Natl Acad Sci U S A. 2014 Feb 25;111(8):3128-33
pubmed: 24520176
Database (Oxford). 2011 Sep 19;2011:bar026
pubmed: 21930502
Nat Med. 2014 Mar;20(3):251-4
pubmed: 24562383
Mod Pathol. 2014 Feb;27(2):255-61
pubmed: 23887303
Nat Commun. 2019 Dec 20;10(1):5817
pubmed: 31862961
Nature. 2019 Apr;568(7753):511-516
pubmed: 30971826
Mutagenesis. 2015 Nov;30(6):763-70
pubmed: 26443852
Genome Res. 2012 Feb;22(2):375-85
pubmed: 21653252
Cell Cycle. 2017;16(24):2313-2314
pubmed: 29279027
PLoS Genet. 2021 Dec 23;17(12):e1009986
pubmed: 34941867
Nature. 2020 Feb;578(7793):94-101
pubmed: 32025018
Database (Oxford). 2022 May 13;2022:
pubmed: 35562840
Cancer Discov. 2012 May;2(5):401-4
pubmed: 22588877
Nat Genet. 2017 Dec;49(12):1779-1784
pubmed: 29083409
Curr Protoc Bioinformatics. 2013 Mar;Chapter 8:8.17.1-8.17.12
pubmed: 23504936
Genome Res. 2012 Feb;22(2):398-406
pubmed: 21908773
Anal Chim Acta. 2021 Jan 2;1141:144-162
pubmed: 33248648
Cell. 2017 Jul 27;170(3):564-576.e16
pubmed: 28753430
Nucleic Acids Res. 2021 Jan 8;49(D1):D1365-D1372
pubmed: 33068406
Trends Cancer. 2021 Sep;7(9):823-836
pubmed: 34031014
Drug Discov Today. 2014 Apr;19(4):433-40
pubmed: 24183925
Curr Opin Neurol. 2020 Apr;33(2):249-254
pubmed: 32073441
Cell. 2016 Jul 28;166(3):740-754
pubmed: 27397505

Auteurs

Alessandro Vinceti (A)

Computational Biology Research Centre, Human Technopole, 20157 Milano, Italy.

Lucia Trastulla (L)

Computational Biology Research Centre, Human Technopole, 20157 Milano, Italy.

Umberto Perron (U)

Computational Biology Research Centre, Human Technopole, 20157 Milano, Italy.

Andrea Raiconi (A)

Institute for Applied Mathematics "Mauro Picone", National Research Council (IAC-CNR), 80131 Napoli, Italy.

Francesco Iorio (F)

Computational Biology Research Centre, Human Technopole, 20157 Milano, Italy.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH