Application of GeneCloudOmics: Transcriptomic Data Analytics for Synthetic Biology.


Journal

Methods in molecular biology (Clifton, N.J.)
ISSN: 1940-6029
Titre abrégé: Methods Mol Biol
Pays: United States
ID NLM: 9214969

Informations de publication

Date de publication:
2023
Historique:
entrez: 13 10 2022
pubmed: 14 10 2022
medline: 18 10 2022
Statut: ppublish

Résumé

Research in synthetic biology and metabolic engineering require a deep understanding on the function and regulation of complex pathway genes. This can be achieved through gene expression profiling which quantifies the transcriptome-wide expression under any condition, such as a cell development stage, mutant, disease, or treatment with a drug. The expression profiling is usually done using high-throughput techniques such as RNA sequencing (RNA-Seq) or microarray. Although both methods are based on different technical approaches, they provide quantitative measures of the expression levels of thousands of genes. The expression levels of the genes are compared under different conditions to identify the differentially expressed genes (DEGs), the genes with different expression levels under different conditions. DEGs, usually involving thousands in number, are then investigated using bioinformatics and data analytic tools to infer and compare their functional roles between conditions. Dealing with such large datasets, therefore, requires intensive data processing and analyses to ensure its quality and produce results that are statistically sound. Thus, there is a need for deep statistical and bioinformatics knowledge to deal with high-throughput gene expression data. This represents a barrier for wet biologists with limited computational, programming, and data analytic skills that prevent them from getting the full potential of the data. In this chapter, we present a step-by-step protocol to perform transcriptome analysis using GeneCloudOmics, a cloud-based web server that provides an end-to-end platform for high-throughput gene expression analysis.

Identifiants

pubmed: 36227547
doi: 10.1007/978-1-0716-2617-7_12
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

221-263

Informations de copyright

© 2023. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.

Références

Dasgupta A, Chowdhury N, De RK (2020) Metabolic pathway engineering: perspectives and applications. Comput Methods Prog Biomed 192:105436. https://doi.org/10.1016/J.CMPB.2020.105436
doi: 10.1016/J.CMPB.2020.105436
Erb TJ, Jones PR, Bar-Even A (2017) Synthetic metabolism: metabolic engineering meets enzyme design. Curr Opin Chem Biol 37:56–62. https://doi.org/10.1016/J.CBPA.2016.12.023
doi: 10.1016/J.CBPA.2016.12.023 pubmed: 28152442 pmcid: 7610756
McCarty NS, Ledesma-Amaro R (2019) Synthetic biology tools to engineer microbial communities for biotechnology. Trends Biotechnol 37:181–197. https://doi.org/10.1016/J.TIBTECH.2018.11.002
doi: 10.1016/J.TIBTECH.2018.11.002 pubmed: 30497870 pmcid: 6340809
Tinafar A, Jaenes K, Pardee K (2019) Synthetic biology goes cell-free. BMC Biol 17:1–14. https://doi.org/10.1186/S12915-019-0685-X
doi: 10.1186/S12915-019-0685-X
Soliman S, El-Keblawy A, Mosa KA et al (2018) Understanding the phytohormones biosynthetic pathways for developing engineered environmental stress-tolerant crops. In: Biotechnologies of crop improvement, vol 2. Springer, Cham, pp 417–450. https://doi.org/10.1007/978-3-319-90650-8_15
doi: 10.1007/978-3-319-90650-8_15
Mosa KA, Saadoun I, Kumar K et al (2016) Potential biotechnological strategies for the cleanup of heavy metals and metalloids. Front Plant Sci 7:303
doi: 10.3389/fpls.2016.00303
Mao N, Cubillos-Ruiz A, Cameron DE, Collins JJ (2018) Probiotic strains detect and suppress cholera in mice. Sci Transl Med 10:2586. https://doi.org/10.1126/SCITRANSLMED.AAO2586/SUPPL_FILE/AAO2586_TABLE_S1.ZIP
doi: 10.1126/SCITRANSLMED.AAO2586/SUPPL_FILE/AAO2586_TABLE_S1.ZIP
Siciliano V, Diandreth B, Monel B et al (2018) Engineering modular intracellular protein sensor-actuator devices. Nat Commun 9(1):1–7. https://doi.org/10.1038/s41467-018-03984-5
doi: 10.1038/s41467-018-03984-5
Fossati E, Ekins A, Narcross L et al (2014) Reconstitution of a 10-gene pathway for synthesis of the plant alkaloid dihydrosanguinarine in Saccharomyces cerevisiae. Nat Commun 51(5):1–11. https://doi.org/10.1038/ncomms4283
doi: 10.1038/ncomms4283
Nielsen J, Keasling JD (2016) Engineering cellular metabolism. Cell 164:1185–1197. https://doi.org/10.1016/J.CELL.2016.02.004
doi: 10.1016/J.CELL.2016.02.004 pubmed: 26967285
Wagner TE, Becraft JR, Bodner K et al (2018) Small-molecule-based regulation of RNA-delivered circuits in mammalian cells. Nat Chem Biol 1411(14):1043–1050. https://doi.org/10.1038/s41589-018-0146-9
doi: 10.1038/s41589-018-0146-9
Cho JH, Collins JJ, Wong WW (2018) Universal chimeric antigen receptors for multiplexed and logical control of T cell responses. Cell 173:1426–1438, e11. https://doi.org/10.1016/J.CELL.2018.03.038
doi: 10.1016/J.CELL.2018.03.038 pubmed: 29706540 pmcid: 5984158
Kulkarni R (2016) Metabolic engineering: biological art of producing useful chemicals. Indian Acad Sci 21:233–237
García-Granados R, Lerma-Escalera JA, Morones-Ramírez JR (2019) Metabolic engineering and synthetic biology: synergies, future, and challenges. Front Bioeng Biotechnol 7:36. https://doi.org/10.3389/fbioe.2019.00036
doi: 10.3389/fbioe.2019.00036 pubmed: 30886847 pmcid: 6409320
Comba S, Arabolaza A, Gramajo H (2012) Emerging engineering principles for yield improvement in microbial cell design. Comput Struct Biotechnol J 3:e201210016
doi: 10.5936/csbj.201210016
Helmy M, Smith D, Selvarajoo K (2020) Systems biology approaches integrated with artificial intelligence for optimized metabolic engineering. Elsevier B.V.
doi: 10.1016/j.mec.2020.e00149
Cheng JK, Alper HS (2016) Transcriptomics-guided design of synthetic promoters for a mammalian system. ACS Synth Biol 5:1455–1465. https://doi.org/10.1021/ACSSYNBIO.6B00075/SUPPL_FILE/SB6B00075_SI_001.PDF
doi: 10.1021/ACSSYNBIO.6B00075/SUPPL_FILE/SB6B00075_SI_001.PDF pubmed: 27268512
El-Metwally S, Ouda OM, Helmy M (2014) Next generation sequencing technologies and challenges in sequence assembly, 1st edn. Springer, New York
doi: 10.1007/978-1-4939-0715-1
El Karoui M, Hoyos-Flight M, Fletcher L (2019) Future trends in synthetic biology—a report. Front Bioeng Biotechnol 7:175. https://doi.org/10.3389/FBIOE.2019.00175/BIBTEX
doi: 10.3389/FBIOE.2019.00175/BIBTEX pubmed: 31448268 pmcid: 6692427
Khalil AS, Collins JJ (2010) Synthetic biology: applications come of age. Nat Rev Genet 115(11):367–379. https://doi.org/10.1038/nrg2775
doi: 10.1038/nrg2775
Brooks SM, Alper HS (2021) Applications, challenges, and needs for employing synthetic biology beyond the lab. Nat Commun 121(12):1–16. https://doi.org/10.1038/s41467-021-21740-0
doi: 10.1038/s41467-021-21740-0
Flores Bueso Y, Tangney M (2017) Synthetic biology in the driving seat of the bioeconomy. Trends Biotechnol 35:373–378. https://doi.org/10.1016/J.TIBTECH.2017.02.002
doi: 10.1016/J.TIBTECH.2017.02.002 pubmed: 28249675
Kwon SW, Paari KA, Malaviya A, Jang YS (2020) Synthetic biology tools for genome and transcriptome engineering of solventogenic clostridium. Front Bioeng Biotechnol 8:282. https://doi.org/10.3389/FBIOE.2020.00282
doi: 10.3389/FBIOE.2020.00282 pubmed: 32363182 pmcid: 7181999
Hrdlickova R, Toloue M, Tian B (2017) RNA-Seq methods for transcriptome analysis. Wiley Interdiscip Rev RNA 8(1):e1364. https://doi.org/10.1002/WRNA.1364
doi: 10.1002/WRNA.1364
Niazian M (2019) Application of genetics and biotechnology for improving medicinal plants. Planta 249:953–973. https://doi.org/10.1007/S00425-019-03099-1
doi: 10.1007/S00425-019-03099-1 pubmed: 30715560
Wright RC, Nemhauser J (2019) Plant Synthetic Biology: Quantifying the “Known Unknowns” and Discovering the “Unknown Unknowns.”. Plant Physiol 179:885. https://doi.org/10.1104/PP.18.01222
doi: 10.1104/PP.18.01222 pubmed: 30630870 pmcid: 6393784
Poliner E, Farré EM, Benning C (2018) Advanced genetic tools enable synthetic biology in the oleaginous microalgae Nannochloropsis sp. Plant Cell Rep 37:1383–1399. https://doi.org/10.1007/S00299-018-2270-0
doi: 10.1007/S00299-018-2270-0 pubmed: 29511798
Helmy M, Agrawal R, Ali J et al (2021) GeneCloudOmics: a data analytic cloud platform for high-throughput gene expression analysis. Front Bioinf 2021:63. https://doi.org/10.3389/FBINF.2021.693836
doi: 10.3389/FBINF.2021.693836
McDermaid A, Monier B, Zhao J et al (2019) Interpretation of differential gene expression results of RNA-seq data: review and integration. Brief Bioinform 20:2044–2054
doi: 10.1093/bib/bby067
Mantione KJ, Kream RM, Kuzelova H et al (2014) Comparing bioinformatic gene expression profiling methods: microarray and RNA-Seq. Med Sci Monit Basic Res 20:138–142. https://doi.org/10.12659/MSMBR.892101
doi: 10.12659/MSMBR.892101 pubmed: 25149683 pmcid: 4152252
Zou Y, Bui TT, Selvarajoo K (2019) ABioTrans: a biostatistical tool for transcriptomics analysis. Front Genet 10:499. https://doi.org/10.3389/fgene.2019.00499
doi: 10.3389/fgene.2019.00499 pubmed: 31214245 pmcid: 6555198
Kim S, Park J, Jeon BW et al (2021) Chemical control of receptor kinase signaling by rapamycin-induced dimerization. Mol Plant 14:1379–1390. https://doi.org/10.1016/J.MOLP.2021.05.006
doi: 10.1016/J.MOLP.2021.05.006 pubmed: 33964457
Schultheiss SJ (2011) Ten simple rules for providing a scientific web resource. PLoS Comput Biol 7:e1001126
doi: 10.1371/journal.pcbi.1001126
Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15:550. https://doi.org/10.1186/s13059-014-0550-8
doi: 10.1186/s13059-014-0550-8 pubmed: 25516281 pmcid: 4302049
Tarazona S, Furió-Tarí P, Turrà D et al (2015) Data quality aware analysis of differential expression in RNA-seq with NOISeq R/Bioc package. Nucleic Acids Res 43:e140. https://doi.org/10.1093/nar/gkv711
doi: 10.1093/nar/gkv711 pubmed: 26184878 pmcid: 4666377
Robinson MD, McCarthy DJ, Smyth GK (2009) edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26:139–140. https://doi.org/10.1093/bioinformatics/btp616
doi: 10.1093/bioinformatics/btp616 pubmed: 19910308 pmcid: 2796818
Raychaudhuri S, Stuart JM, Altman RB (2000) Principal components analysis to summarize microarray experiments: application to sporulation time series. Pac Symp Biocomput 2000:455–466. https://doi.org/10.1142/9789814447331_0043
doi: 10.1142/9789814447331_0043
Piras V, Tomita M, Selvarajoo K (2014) Transcriptome-wide variability in single embryonic development cells. Sci Rep 4:1–9. https://doi.org/10.1038/srep07137
doi: 10.1038/srep07137
Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J 27:379–423. https://doi.org/10.1002/j.1538-7305.1948.tb01338.x
doi: 10.1002/j.1538-7305.1948.tb01338.x
Breiman L (1996) Bagging predictors. Mach Learn 24:123–140
doi: 10.1007/BF00058655
Cutler A, Cutler DR, Stevens JR (2012) Random forests. In: Ensemble machine learning. Springer, Boston, pp 157–175. https://doi.org/10.1007/978-1-4419-9326-7_5
doi: 10.1007/978-1-4419-9326-7_5
Villmann T, Bauer HU (1998) Applications of the growing self-organizing map. Neurocomputing 21:91–100. https://doi.org/10.1016/S0925-2312(98)00037-X
doi: 10.1016/S0925-2312(98)00037-X
Cieslak MC, Castelfranco AM, Roncalli V et al (2020) t-Distributed Stochastic Neighbor Embedding (t-SNE): a tool for eco-physiological transcriptomic analysis. Mar Genomics 51:100723. https://doi.org/10.1016/j.margen.2019.100723
doi: 10.1016/j.margen.2019.100723 pubmed: 31784353
Bateman A (2019) UniProt: a worldwide hub of protein knowledge. Nucleic Acids Res 47:D506–D515. https://doi.org/10.1093/nar/gky1049
doi: 10.1093/nar/gky1049
Reimand J, Isserlin R, Voisin V et al (2019) Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap. Nat Protoc 142(14):482–517. https://doi.org/10.1038/s41596-018-0103-9
doi: 10.1038/s41596-018-0103-9
Raudvere U, Kolberg L, Kuzmin I et al (2019) G:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res 47:W191–W198. https://doi.org/10.1093/nar/gkz369
doi: 10.1093/nar/gkz369 pubmed: 31066453 pmcid: 6602461
Franz M, Lopes CT, Huck G et al (2015) Cytoscape.js: a graph theory library for visualisation and analysis. Bioinformatics 32:btv557. https://doi.org/10.1093/bioinformatics/btv557
doi: 10.1093/bioinformatics/btv557
Giurgiu M, Reinhard J, Brauner B et al (2019) CORUM: the comprehensive resource of mammalian protein complexes - 2019. Nucleic Acids Res 47:D559–D563. https://doi.org/10.1093/nar/gky973
doi: 10.1093/nar/gky973 pubmed: 30357367
Vella D, Zoppis I, Mauri G et al (2017) From protein-protein interactions to protein co-expression networks: a new perspective to evaluate large-scale proteomic data. EURASIP J Bioinforma Syst Biol 2017:6
doi: 10.1186/s13637-017-0059-z
Franz M, Rodriguez H, Lopes C et al (2018) GeneMANIA update 2018. Nucleic Acids Res 46:W60–W64. https://doi.org/10.1093/nar/gky311
doi: 10.1093/nar/gky311 pubmed: 29912392 pmcid: 6030815
Kyte J, Doolittle RF (1982) A simple method for displaying the hydropathic character of a protein. J Mol Biol 157:105–132. https://doi.org/10.1016/0022-2836(82)90515-0
doi: 10.1016/0022-2836(82)90515-0 pubmed: 7108955
Amberger JS, Bocchini CA, Scott AF, Hamosh A (2019) OMIM.org: leveraging knowledge across phenotype-gene relationships. Nucleic Acids Res 47:D1038–D1043. https://doi.org/10.1093/nar/gky1151
doi: 10.1093/nar/gky1151 pubmed: 30445645
Hatos A, Hajdu-Soltész B, Monzon AM et al (2020) DisProt: intrinsic protein disorder annotation in 2020. Nucleic Acids Res 48:D269–D276. https://doi.org/10.1093/nar/gkz975
doi: 10.1093/nar/gkz975 pubmed: 31713636
Piñero J, Ramírez-Anguita JM, Saüch-Pitarch J et al (2020) The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res 48:D845–D855. https://doi.org/10.1093/nar/gkz1021
doi: 10.1093/nar/gkz1021 pubmed: 31680165

Auteurs

Mohamed Helmy (M)

Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore. mohamed_helmy@bii.a-star.edu.sg.
Department of Computer Science, Lakehead University, Thunder Bay, ON, Canada. mohamed_helmy@bii.a-star.edu.sg.

Kumar Selvarajoo (K)

Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore.
Singapore Institute of Food and Biotechnology Innovation, Agency for Science, Technology and Research (A*STAR), Singapore, Singapore.
NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore.
School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Drought Resistance Gene Expression Profiling Gene Expression Regulation, Plant Gossypium Multigene Family
Coal Metagenome Phylogeny Bacteria Genome, Bacterial

Classifications MeSH