Latent network-based representations for large-scale gene expression data analysis.


Journal

BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194

Informations de publication

Date de publication:
04 Feb 2019
Historique:
received: 26 05 2018
accepted: 09 11 2018
entrez: 6 2 2019
pubmed: 6 2 2019
medline: 5 3 2019
Statut: epublish

Résumé

With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high-throughput biological data while taking into account its biological context. In gene expression data analysis, genes have widely been considered as independent components. However, a systemic view shows that they act synergistically in living cells, forming functional complexes and more generally a biological system. In this paper, we propose LATNET, a signal transformation framework that, starting from an initial large-scale gene expression data, allows to generate new representations based on latent network-based relationships between the genes. LATNET aims to leverage system level relations between the genes as an underlying hidden structure to derive the new transformed latent signals. We present a concrete implementation of our framework, based on a gene regulatory network structure and two signal transformation approaches, to quantify latent network-based activity of regulators, as well as gene perturbation signals. The new gene/regulator signals are at the level of each sample of the input data and, thus, could directly be used instead of the initial expression signals for major bioinformatics analysis, including diagnosis and personalized medicine. Multiple patterns could be hidden or weakly observed in expression data. LATNET helps in uncovering latent signals that could emphasize hidden patterns based on the relations between the genes and, thus, enhancing the performance of gene expression-based analysis algorithms. We use LATNET for the analysis of real-world gene expression data of bladder cancer and we show the efficiency of our transformation framework as compared to using the initial expression data.

Sections du résumé

BACKGROUND BACKGROUND
With the recent advancements in high-throughput experimental procedures, biologists are gathering huge quantities of data. A main priority in bioinformatics and computational biology is to provide system level analytical tools capable of meeting an ever-growing production of high-throughput biological data while taking into account its biological context. In gene expression data analysis, genes have widely been considered as independent components. However, a systemic view shows that they act synergistically in living cells, forming functional complexes and more generally a biological system.
RESULTS RESULTS
In this paper, we propose LATNET, a signal transformation framework that, starting from an initial large-scale gene expression data, allows to generate new representations based on latent network-based relationships between the genes. LATNET aims to leverage system level relations between the genes as an underlying hidden structure to derive the new transformed latent signals. We present a concrete implementation of our framework, based on a gene regulatory network structure and two signal transformation approaches, to quantify latent network-based activity of regulators, as well as gene perturbation signals. The new gene/regulator signals are at the level of each sample of the input data and, thus, could directly be used instead of the initial expression signals for major bioinformatics analysis, including diagnosis and personalized medicine.
CONCLUSION CONCLUSIONS
Multiple patterns could be hidden or weakly observed in expression data. LATNET helps in uncovering latent signals that could emphasize hidden patterns based on the relations between the genes and, thus, enhancing the performance of gene expression-based analysis algorithms. We use LATNET for the analysis of real-world gene expression data of bladder cancer and we show the efficiency of our transformation framework as compared to using the initial expression data.

Identifiants

pubmed: 30717663
doi: 10.1186/s12859-018-2481-y
pii: 10.1186/s12859-018-2481-y
pmc: PMC7394327
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

466

Subventions

Organisme : Institut National de la Santé et de la Recherche Médicale (FR)
ID : BIO2015-04

Références

Sci Transl Med. 2014 Jul 9;6(244):244ra91
pubmed: 25009231
N Engl J Med. 2006 Aug 10;355(6):560-9
pubmed: 16899776
BMC Bioinformatics. 2006 Mar 20;7 Suppl 1:S7
pubmed: 16723010
Bioinformatics. 2015 Sep 15;31(18):3066-8
pubmed: 25979476
Nucleic Acids Res. 2015 Jan;43(Database issue):D1113-6
pubmed: 25361974
Nucleic Acids Res. 2018 Jan 4;46(D1):D380-D386
pubmed: 29087512
BMC Bioinformatics. 2005 Nov 09;6:269
pubmed: 16280084
Nat Genet. 2006 Dec;38(12):1386-96
pubmed: 17099711
Cell. 2015 Oct 8;163(2):506-19
pubmed: 26451490
Proc Natl Acad Sci U S A. 2004 Mar 23;101(12):4164-9
pubmed: 15016911
Nucleic Acids Res. 2002 Jan 1;30(1):207-10
pubmed: 11752295
Clin Cancer Res. 2012 Jun 15;18(12):3377-86
pubmed: 22553347
Brief Bioinform. 2016 Jul;17(4):628-41
pubmed: 26969681
BMC Genomics. 2017 Sep 11;18(1):712
pubmed: 28893186
Brief Bioinform. 2018 Jul 20;19(4):575-592
pubmed: 28077403
IEEE Trans Nanobioscience. 2014 Jun;13(2):97-103
pubmed: 24771593
Adv Protein Chem Struct Biol. 2017;107:223-273
pubmed: 28215225
Proc Natl Acad Sci U S A. 2000 Aug 29;97(18):10101-6
pubmed: 10963673
BMC Genomics. 2013 Oct 07;14:688
pubmed: 24093424
Nucleic Acids Res. 2016 Jan 4;44(D1):D567-73
pubmed: 26582928
Bioinformatics. 2007 Sep 15;23(18):2407-14
pubmed: 17720703
PLoS Biol. 2007 Jan;5(1):e8
pubmed: 17214507
BMC Syst Biol. 2015;9 Suppl 6:S6
pubmed: 26679516
PLoS Comput Biol. 2009 Oct;5(10):e1000543
pubmed: 19876380
BMC Proc. 2008 Dec 17;2 Suppl 4:S4
pubmed: 19091051

Auteurs

Wajdi Dhifli (W)

University of Lille, 42, rue Paul Duez, Lille, 59000, France.

Julia Puig (J)

University of Lille, 42, rue Paul Duez, Lille, 59000, France.

Aurélien Dispot (A)

University of Lille, 42, rue Paul Duez, Lille, 59000, France.

Mohamed Elati (M)

University of Lille, 42, rue Paul Duez, Lille, 59000, France. mohamed.elati@univ-lille.fr.
UMR 8030 ; Génomique Métabolique / Laboratoire iSSB ; CEA-CNRS-UEVE, Genopole campus 1, 5 rue Henri Desbruères, Évry, 91030 Cedex, France. mohamed.elati@univ-lille.fr.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH