Interactive and Reproducible Workflows for Exploring and Modeling RNA-seq Data with pcaExplorer, Ideal, and GeneTonic.

RNA-seq data visualization functional interpretation interactive data analysis reproducible research

Journal

Current protocols
ISSN: 2691-1299
Titre abrégé: Curr Protoc
Pays: United States
ID NLM: 101773894

Informations de publication

Date de publication:
Apr 2022
Historique:
entrez: 25 4 2022
pubmed: 26 4 2022
medline: 28 4 2022
Statut: ppublish

Résumé

The generation and interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA-seq) can be a complex task. While raw data quality control, alignment, and quantification can be streamlined via efficient algorithms that can deliver the preprocessed expression matrix, a common bottleneck in the analysis of such large datasets is the subsequent in-depth, iterative processes of data exploration, statistical testing, visualization, and interpretation. Specific tools for these workflow steps are available but require a level of technical expertise which might be prohibitive for life and clinical scientists, who are left with essential pieces of information distributed among different tabular and list formats. Our protocols are centered on the joint use of our Bioconductor packages (pcaExplorer, ideal, GeneTonic) for interactive and reproducible workflows. All our packages provide an interactive and accessible experience via Shiny web applications, while still documenting the steps performed with RMarkdown as a framework to guarantee the reproducibility of the analyses, reducing the overall time to generate insights from the data at hand. These protocols guide readers through the essential steps of Exploratory Data Analysis, statistical testing, and functional enrichment analyses, followed by integration and contextualization of results. In our packages, the core elements are linked together in interactive widgets that make drill-down tasks efficient by viewing the data at a level of increased detail. Thanks to their interoperability with essential classes and gold-standard pipelines implemented in the open-source Bioconductor project and community, these protocols will permit complex tasks in RNA-seq data analysis, combining interactivity and reproducibility for following modern best scientific practices and helping to streamline the discovery process for transcriptome data. © 2022 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Exploratory Data Analysis with pcaExplorer Basic Protocol 2: Differential Expression Analysis with ideal Basic Protocol 3: Interpretation of RNA-seq results with GeneTonic Support Protocol: Downloading and installing pcaExplorer, ideal, and GeneTonic Alternate Protocol: Using functions from pcaExplorer, ideal, and GeneTonic in custom analyses.

Identifiants

pubmed: 35467799
doi: 10.1002/cpz1.411
doi:

Substances chimiques

RNA 63231-63-0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e411

Informations de copyright

© 2022 The Authors. Current Protocols published by Wiley Periodicals LLC.

Auteurs

Annekathrin Ludt (A)

Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), Division Statistical Genomics and Bioinformatics, University Medical Center of the Johannes Gutenberg University Mainz, Mainz, Germany.

Arsenij Ustjanzew (A)

Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), Division Statistical Genomics and Bioinformatics, University Medical Center of the Johannes Gutenberg University Mainz, Mainz, Germany.

Harald Binder (H)

Institute of Medical Biometry and Statistics (IMBI), Faculty of Medicine and Medical Center, University of Freiburg, Freiburg im Breisgau, Germany.

Konstantin Strauch (K)

Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), Division Statistical Genomics and Bioinformatics, University Medical Center of the Johannes Gutenberg University Mainz, Mainz, Germany.

Federico Marini (F)

Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), Division Statistical Genomics and Bioinformatics, University Medical Center of the Johannes Gutenberg University Mainz, Mainz, Germany.
Center for Thrombosis and Hemostasis Mainz (CTH), University Medical Center of the Johannes Gutenberg University Mainz, Mainz, Germany.

Articles similaires

Humans Middle Aged Female Male Surveys and Questionnaires
Adolescent Child Female Humans Male
Animals Lung India Sheep Transcriptome

Classifications MeSH