A novel method for data fusion over entity-relation graphs and its application to protein-protein interaction prediction.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
25 Aug 2021
Historique:
received: 26 10 2020
revised: 14 01 2021
accepted: 04 02 2021
medline: 10 2 2021
pubmed: 10 2 2021
entrez: 9 2 2021
Statut: ppublish

Résumé

Modern bioinformatics is facing increasingly complex problems to solve, and we are indeed rapidly approaching an era in which the ability to seamlessly integrate heterogeneous sources of information will be crucial for the scientific progress. Here, we present a novel non-linear data fusion framework that generalizes the conventional matrix factorization paradigm allowing inference over arbitrary entity-relation graphs, and we applied it to the prediction of protein-protein interactions (PPIs). Improving our knowledge of PPI networks at the proteome scale is indeed crucial to understand protein function, physiological and disease states and cell life in general. We devised three data fusion-based models for the proteome-level prediction of PPIs, and we show that our method outperforms state of the art approaches on common benchmarks. Moreover, we investigate its predictions on newly published PPIs, showing that this new data has a clear shift in its underlying distributions and we thus train and test our models on this extended dataset. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 33560405
pii: 6131674
doi: 10.1093/bioinformatics/btab092
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

2275-2281

Subventions

Organisme : Research Foundation - Flanders

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Auteurs

Daniele Raimondi (D)

ESAT-STADIUS, KU Leuven, 3001 Leuven, Belgium.

Jaak Simm (J)

ESAT-STADIUS, KU Leuven, 3001 Leuven, Belgium.

Adam Arany (A)

ESAT-STADIUS, KU Leuven, 3001 Leuven, Belgium.

Yves Moreau (Y)

ESAT-STADIUS, KU Leuven, 3001 Leuven, Belgium.

Classifications MeSH