A Bayesian data fusion based approach for learning genome-wide transcriptional regulatory networks.
Bayesian networks
Genomic transcriptional networks
Hybrid structure learning algorithm
omics-data fusion
Journal
BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194
Informations de publication
Date de publication:
29 May 2020
29 May 2020
Historique:
received:
09
01
2020
accepted:
22
04
2020
entrez:
31
5
2020
pubmed:
31
5
2020
medline:
24
7
2020
Statut:
epublish
Résumé
Reverse engineering of transcriptional regulatory networks (TRN) from genomics data has always represented a computational challenge in System Biology. The major issue is modeling the complex crosstalk among transcription factors (TFs) and their target genes, with a method able to handle both the high number of interacting variables and the noise in the available heterogeneous experimental sources of information. In this work, we propose a data fusion approach that exploits the integration of complementary omics-data as prior knowledge within a Bayesian framework, in order to learn and model large-scale transcriptional networks. We develop a hybrid structure-learning algorithm able to jointly combine TFs ChIP-Sequencing data and gene expression compendia to reconstruct TRNs in a genome-wide perspective. Applying our method to high-throughput data, we verified its ability to deal with the complexity of a genomic TRN, providing a snapshot of the synergistic TFs regulatory activity. Given the noisy nature of data-driven prior knowledge, which potentially contains incorrect information, we also tested the method's robustness to false priors on a benchmark dataset, comparing the proposed approach to other regulatory network reconstruction algorithms. We demonstrated the effectiveness of our framework by evaluating structural commonalities of our learned genomic network with other existing networks inferred by different DNA binding information-based methods. This Bayesian omics-data fusion based methodology allows to gain a genome-wide picture of the transcriptional interplay, helping to unravel key hierarchical transcriptional interactions, which could be subsequently investigated, and it represents a promising learning approach suitable for multi-layered genomic data integration, given its robustness to noisy sources and its tailored framework for handling high dimensional data.
Sections du résumé
BACKGROUND
BACKGROUND
Reverse engineering of transcriptional regulatory networks (TRN) from genomics data has always represented a computational challenge in System Biology. The major issue is modeling the complex crosstalk among transcription factors (TFs) and their target genes, with a method able to handle both the high number of interacting variables and the noise in the available heterogeneous experimental sources of information.
RESULTS
RESULTS
In this work, we propose a data fusion approach that exploits the integration of complementary omics-data as prior knowledge within a Bayesian framework, in order to learn and model large-scale transcriptional networks. We develop a hybrid structure-learning algorithm able to jointly combine TFs ChIP-Sequencing data and gene expression compendia to reconstruct TRNs in a genome-wide perspective. Applying our method to high-throughput data, we verified its ability to deal with the complexity of a genomic TRN, providing a snapshot of the synergistic TFs regulatory activity. Given the noisy nature of data-driven prior knowledge, which potentially contains incorrect information, we also tested the method's robustness to false priors on a benchmark dataset, comparing the proposed approach to other regulatory network reconstruction algorithms. We demonstrated the effectiveness of our framework by evaluating structural commonalities of our learned genomic network with other existing networks inferred by different DNA binding information-based methods.
CONCLUSIONS
CONCLUSIONS
This Bayesian omics-data fusion based methodology allows to gain a genome-wide picture of the transcriptional interplay, helping to unravel key hierarchical transcriptional interactions, which could be subsequently investigated, and it represents a promising learning approach suitable for multi-layered genomic data integration, given its robustness to noisy sources and its tailored framework for handling high dimensional data.
Identifiants
pubmed: 32471360
doi: 10.1186/s12859-020-3510-1
pii: 10.1186/s12859-020-3510-1
pmc: PMC7257163
doi:
Substances chimiques
Transcription Factors
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
219Subventions
Organisme : Fondazione Cariplo and the Regione Lombardia
ID : 2013-0387
Références
Leukemia. 2017 Nov;31(11):2315-2325
pubmed: 28216661
Semin Hematol. 2017 Apr;54(2):81-86
pubmed: 28637621
J Bioinform Comput Biol. 2004 Mar;2(1):77-98
pubmed: 15272434
Mol Cell Biol. 2004 Mar;24(5):1870-83
pubmed: 14966269
Nature. 2016 Jun 08;534(7607):341-6
pubmed: 27281222
J Cell Biol. 2017 Nov 6;216(11):3535-3549
pubmed: 28887438
Proc Natl Acad Sci U S A. 2014 Jan 21;111(3):996-1001
pubmed: 24335803
Cancer Sci. 2015 Jul;106(7):797-802
pubmed: 25940801
Nucleic Acids Res. 2014 Feb;42(4):2099-111
pubmed: 24243859
Oncol Rep. 2018 Sep;40(3):1601-1613
pubmed: 29956795
Mol Cancer Res. 2018 May;16(5):791-804
pubmed: 29483235
Genes Dev. 2006 Sep 1;20(17):2397-409
pubmed: 16951254
BMC Bioinformatics. 2006 Mar 20;7 Suppl 1:S7
pubmed: 16723010
Wiley Interdiscip Rev Syst Biol Med. 2012 Jul-Aug;4(4):311-25
pubmed: 22246697
Cell. 2012 Sep 14;150(6):1274-86
pubmed: 22959076
Cancer Res. 2018 Apr 15;78(8):1890-1897
pubmed: 29618461
Dev Cell. 2016 Mar 7;36(5):572-87
pubmed: 26923725
Biostatistics. 2003 Apr;4(2):249-64
pubmed: 12925520
Stat Appl Genet Mol Biol. 2007;6:Article15
pubmed: 17542777
Curr Opin Genet Dev. 2017 Apr;43:110-119
pubmed: 28359978
Adv Immunol. 2013;117:1-38
pubmed: 23611284
Pac Symp Biocomput. 2005;:459-70
pubmed: 15759651
Stem Cell Res Ther. 2011 Feb 10;2(1):6
pubmed: 21345252
BMC Syst Biol. 2015 Nov 20;9:84
pubmed: 26589494
Bioinformatics. 2013 Apr 15;29(8):1060-7
pubmed: 23525069
Blood. 2008 Dec 15;112(13):4862-73
pubmed: 18840712
J Biol Chem. 2012 Sep 7;287(37):31342-8
pubmed: 22822070
Nucleic Acids Res. 2014 Jan;42(Database issue):D161-6
pubmed: 24170807
Front Genet. 2012 Feb 03;3:8
pubmed: 22408642
Interface Focus. 2011 Dec 6;1(6):857-70
pubmed: 23226586
Genome Res. 2011 Mar;21(3):456-64
pubmed: 21106903
Nature. 2012 Sep 6;489(7414):91-100
pubmed: 22955619
Nat Methods. 2016 Apr;13(4):303-9
pubmed: 26901649
Leukemia. 2010 Jul;24(7):1249-57
pubmed: 20520638
Nucleic Acids Res. 2012 Jan;40(Database issue):D700-5
pubmed: 22110037
Leukemia. 2006 Jun;20(6):1028-34
pubmed: 16617318
J Biomed Inform. 2008 Dec;41(6):914-26
pubmed: 18337190
Pac Symp Biocomput. 2001;:422-33
pubmed: 11262961
Mol Cell. 2007 Jul 6;27(1):107-19
pubmed: 17612494
J Med Signals Sens. 2012 Jan;2(1):61-70
pubmed: 23493097
J Cell Biol. 2017 Nov 6;216(11):3429-3431
pubmed: 29066607
Nature. 2012 Sep 6;489(7414):57-74
pubmed: 22955616
High Throughput. 2018 Oct 26;7(4):
pubmed: 30373182
Br J Haematol. 2008 Sep;142(5):802-7
pubmed: 18573112
Blood. 2013 Aug 15;122(7):1293-304
pubmed: 23836560
Wiley Interdiscip Rev Syst Biol Med. 2017 May;9(3):
pubmed: 28093886
J Cancer. 2017 Sep 16;8(16):3318-3330
pubmed: 29158805
Comput Biol Med. 2014 May;48:55-65
pubmed: 24637147
Curr Opin Genet Dev. 2016 Apr;37:101-108
pubmed: 26950762
Exp Hematol Oncol. 2014 May 08;3:13
pubmed: 24904756
Expert Opin Ther Targets. 2017 Mar;21(3):319-331
pubmed: 28076698
Pac Symp Biocomput. 2002;:437-49
pubmed: 11928497
PLoS Comput Biol. 2013;9(11):e1003342
pubmed: 24278002
Bioinformatics. 2016 Jul 15;32(14):2233-5
pubmed: 27153652
Genome Biol. 2008;9(9):R137
pubmed: 18798982
Dev Biol. 2006 Jun 15;294(2):525-40
pubmed: 16626682
Nucleic Acids Res. 2019 Jan 8;47(D1):D529-D541
pubmed: 30476227
Curr Genomics. 2015 Feb;16(1):3-22
pubmed: 25937810
J Exp Med. 2013 Jan 14;210(1):71-84
pubmed: 23267012
Am J Clin Exp Urol. 2015 Apr 25;3(1):13-23
pubmed: 26069883
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613
pubmed: 30476243
Cell. 2017 Feb 9;168(4):629-643
pubmed: 28187285
J Oncol. 2011;2011:798592
pubmed: 21436996
J Biol Chem. 2017 Nov 17;292(46):18924-18936
pubmed: 28900037
Genome Res. 2011 Mar;21(3):447-55
pubmed: 21106904
Blood. 2016 Aug 4;128(5):638-49
pubmed: 27301860
Dev Cell. 2011 May 17;20(5):597-609
pubmed: 21571218
Genome Res. 2014 Dec;24(12):1945-62
pubmed: 25319994
Oncogene. 2002 May 13;21(21):3414-21
pubmed: 12032779
Mol Biol Cell. 1998 Dec;9(12):3273-97
pubmed: 9843569
Nucleic Acids Res. 2016 Jan 4;44(D1):D133-43
pubmed: 26527724
Biosystems. 2009 Apr;96(1):86-103
pubmed: 19150482
Mol Aspects Med. 2014 Dec;40:1-116
pubmed: 25010388
Genome Res. 2011 Oct;21(10):1659-71
pubmed: 21795386
Mol Cell Biol. 1993 Feb;13(2):841-51
pubmed: 8423806
Nat Methods. 2012 Jul 15;9(8):796-804
pubmed: 22796662
J Biomed Inform. 2015 Feb;53:27-35
pubmed: 25181467
PLoS Comput Biol. 2006 Dec 29;2(12):e174
pubmed: 17196032
Cancer Cell. 2012 Aug 14;22(2):209-21
pubmed: 22897851
Science. 2004 Feb 6;303(5659):799-805
pubmed: 14764868
J Biol Chem. 2006 Dec 8;281(49):37345-52
pubmed: 16963445
Front Genet. 2017 Jun 16;8:84
pubmed: 28670325
Am J Transl Res. 2016 May 15;8(5):2265-74
pubmed: 27347333
Acta Naturae. 2018 Jan-Mar;10(1):15-23
pubmed: 29713515
EMBO J. 2002 Jun 17;21(12):3039-50
pubmed: 12065417
Mol Syst Biol. 2007;3:78
pubmed: 17299415