Supervised Learning of Gene Regulatory Networks.

gene expression profiles gene regulatory networks supervised learning support vector machine

Journal

Current protocols in plant biology
ISSN: 2379-8068
Titre abrégé: Curr Protoc Plant Biol
Pays: United States
ID NLM: 101685882

Informations de publication

Date de publication:
06 2020
Historique:
entrez: 25 3 2020
pubmed: 25 3 2020
medline: 4 9 2020
Statut: ppublish

Résumé

Identifying the entirety of gene regulatory interactions in a biological system offers the possibility to determine the key molecular factors that affect important traits on the level of cells, tissues, and whole organisms. Despite the development of experimental approaches and technologies for identification of direct binding of transcription factors (TFs) to promoter regions of downstream target genes, computational approaches that utilize large compendia of transcriptomics data are still the predominant methods used to predict direct downstream targets of TFs, and thus reconstruct genome-wide gene-regulatory networks (GRNs). These approaches can broadly be categorized into unsupervised and supervised, based on whether data about known, experimentally verified gene-regulatory interactions are used in the process of reconstructing the underlying GRN. Here, we first describe the generic steps of supervised approaches for GRN reconstruction, since they have been recently shown to result in improved accuracy of the resulting networks? We also illustrate how they can be used with data from model organisms to obtain more accurate prediction of gene regulatory interactions. © 2020 The Authors. Basic Protocol 1: Construction of features used in supervised learning of gene regulatory interactions Basic Protocol 2: Learning the non-interacting TF-gene pairs Basic Protocol 3: Learning a classifier for gene regulatory interactions.

Identifiants

pubmed: 32207875
doi: 10.1002/cppb.20106
doi:

Substances chimiques

Transcription Factors 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e20106

Informations de copyright

© 2020 The Authors.

Références

Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20, 273-297. doi: 10.1007/BF00994018.
Gama-Castro, S., Salgado, H., Santos-Zavaleta, A., Ledezma-Tejeida, D., Muñiz-Rascado, L., García-Sotelo, J. S., … Collado-Vides, J. (2016). RegulonDB version 9.0: High-level integration of gene regulation, coexpression, motif clustering and beyond. Nucleic Acids Research, 44, D133-D143. doi: 10.1093/nar/gkv1156.
Haury, A., Mordelet, F., Vera-Licona, P., & Vert, J. (2012). TIGRESS: Trustful inference of gene regulation using stability selection. BMC Systems Biology, 6, 145. doi: 10.1186/1752-0509-6-145.
Huynh-Thu, V. A., Irrthum, A., Wehenkel, L., & Geurts, P. (2009). Inferring regulatory networks from expression data using tree-based methods. PloS One, 5(9), pii: e12776. doi: 10.1371/journal.pone.0012776.
Maetschke, S. R., Madhamshettiwar, P. B., Davis, M. J., & Ragan, M. A. (2014). Supervised, semi-supervised and unsupervised inference of gene regulatory networks. Brief Bioinformatics, 15(2), 195-211. doi: 10.1093/bib/bbt034.
Marbach, D., Costello, J. C., Küffner, R., Vega, N., Prill, R. J., Camacho, D. M., … Stolovitzky, G. (2012). Wisdom of crowds for robust gene network inference. Nature Methods, 9(8), 796-804. doi: 10.1038/nmeth.2016.
Margolin, A. A., Nemenman, I., Basso, K., Wiggins, C., Stolovitzky, G., Dalla Favera, R., & Califano, A. (2006). ARACNE: An algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinformatics, 7, S7. doi: 10.1186/1471-2105-7-S1-S7.
Meyer, P., Kontos, K., Lafitte, F., & Bontempi, G. (2007). Information-theoretic inference of large transcriptional regulatory networks. EURASIP Journal on Bioinformatics and Systems Biology, 2007, 79879. doi: 10.1155/2007/79879.
Mordelet, F., & Vert, J. P. (2008). SIRENE: Supervised inference of regulatory networks. Bioinformatics, 24, 76-82.
Ni, Y., Aghamirzaie, D., Elmarakeby, H., Collakova, E., Li, S., Grene, R., & Heath, L. (2016). A machine learning approach to predict gene regulatory networks in seed development in Arabidopsis. Frontiers in Plant Science, 7, 1936. doi: 10.3389/fpls.2016.01936.
Omranian, N., Eloundou-Mbebi, J. M. O., Mueller-Roeber, B., & Nikoloski, Z. (2016). Gene regulatory network inference using fused LASSO on multiple data sets. Scientific Reports, 6, 20533. doi: 10.1038/srep20533.
Petralia, F., Wang, P., Yang, J., & Tu, Z. (2015). Integrative random forest for gene regulatory network inference. Bioinformatics, 31(12), i197-i205. doi: 10.1093/bioinformatics/btv268.
Razaghi-Moghadam, Z., & Nikoloski, Z. Supervised learning of gene regulatory networks based on graph distance profiles of transcriptomics data. Nature Systems Biology and Applications. Submitted for publication.
Teixeira, M. C., Monteiro, P. T., Palma, M., Costa, C., Godinho, C. P., Pais, P., … Sá-Correia, I. (2018). YEASTRACT: An upgraded database for the analysis of transcription regulatory networks in Saccharomyces cerevisiae. Nucleic Acids Research, 46(D1), D348-D353. doi: 10.1093/nar/gkx842.
Vert, J. P. (2010). Reconstruction of biological networks by supervised machine learning approaches. Elements of Computational Systems Biology, 165-188.
Yilmaz, A., Mejia-Guerra, M. K., Kurz, K., Liang, X., Welch, L., & Grotewold, E. (2011). AGRIS: The Arabidopsis gene regulatory information server, an update. Nucleic Acids Research, 39, D1118-D1122. doi: 10.1093/nar/gkq1120.

Auteurs

Zahra Razaghi-Moghadam (Z)

Systems Biology and Mathematical Modelling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany.

Zoran Nikoloski (Z)

Systems Biology and Mathematical Modelling Group, Max Planck Institute of Molecular Plant Physiology, Potsdam, Germany.
Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany.

Articles similaires

Humans Colorectal Neoplasms Biomarkers, Tumor Prognosis Gene Expression Regulation, Neoplastic
Animals Lung India Sheep Transcriptome
Triticum Transcription Factors Gene Expression Regulation, Plant Plant Proteins Salt Stress

Classifications MeSH