NASA GeneLab RNA-seq consensus pipeline: standardized processing of short-read RNA-seq data.

Omics Space Sciences

Journal

iScience
ISSN: 2589-0042
Titre abrégé: iScience
Pays: United States
ID NLM: 101724038

Informations de publication

Date de publication:
23 Apr 2021
Historique:
received: 08 09 2020
revised: 30 10 2020
accepted: 23 03 2021
entrez: 19 4 2021
pubmed: 20 4 2021
medline: 20 4 2021
Statut: epublish

Résumé

With the development of transcriptomic technologies, we are able to quantify precise changes in gene expression profiles from astronauts and other organisms exposed to spaceflight. Members of NASA GeneLab and GeneLab-associated analysis working groups (AWGs) have developed a consensus pipeline for analyzing short-read RNA-sequencing data from spaceflight-associated experiments. The pipeline includes quality control, read trimming, mapping, and gene quantification steps, culminating in the detection of differentially expressed genes. This data analysis pipeline and the results of its execution using data submitted to GeneLab are now all publicly available through the GeneLab database. We present here the full details and rationale for the construction of this pipeline in order to promote transparency, reproducibility, and reusability of pipeline data; to provide a template for data processing of future spaceflight-relevant datasets; and to encourage cross-analysis of data from other databases with the data available in GeneLab.

Identifiants

pubmed: 33870146
doi: 10.1016/j.isci.2021.102361
pii: S2589-0042(21)00329-1
pmc: PMC8044432
doi:

Types de publication

Journal Article

Langues

eng

Pagination

102361

Subventions

Organisme : NCI NIH HHS
ID : R50 CA243876
Pays : United States

Déclaration de conflit d'intérêts

The authors declare no competing interests.

Références

Patterns (N Y). 2020 Nov 25;1(9):100148
pubmed: 33336201
Nucleic Acids Res. 2019 Jul 2;47(W1):W199-W205
pubmed: 31114916
RNA. 2016 Jun;22(6):839-51
pubmed: 27022035
BMC Genomics. 2011 Jun 06;12:293
pubmed: 21645359
Nature. 2020 Jul;583(7818):693-698
pubmed: 32728248
F1000Res. 2015 Dec 30;4:1521
pubmed: 26925227
BMC Bioinformatics. 2011 Aug 04;12:323
pubmed: 21816040
Bioinformatics. 2013 Jan 1;29(1):15-21
pubmed: 23104886
Genome Res. 2011 Sep;21(9):1543-51
pubmed: 21816910
Genome Res. 2003 Sep;13(9):2129-41
pubmed: 12952881
Nucleic Acids Res. 2013 Apr;41(8):4378-91
pubmed: 23444143
Nucleic Acids Res. 2021 Jan 8;49(D1):D1515-D1522
pubmed: 33080015
Nat Genet. 2012 Jan 27;44(2):121-6
pubmed: 22281772
Bioinformatics. 2009 Jul 15;25(14):1754-60
pubmed: 19451168
Genome Biol. 2016 Apr 23;17:74
pubmed: 27107712
Cell Rep. 2020 Dec 8;33(10):108441
pubmed: 33242404
BMC Genomics. 2018 Jul 3;19(1):510
pubmed: 29969991
BMC Bioinformatics. 2011 Dec 17;12:480
pubmed: 22177264
ACM BCB. 2015 Sep;2015:462-471
pubmed: 27583310
Genome Biol. 2009;10(3):R25
pubmed: 19261174
Genome Biol. 2014;15(12):550
pubmed: 25516281
Nat Biotechnol. 2016 May;34(5):525-7
pubmed: 27043002
iScience. 2020 Nov 25;23(12):101733
pubmed: 33376967
Genome Biol. 2016 Dec 13;17(1):256
pubmed: 27964738
Nucleic Acids Res. 2013 Jan;41(Database issue):D377-86
pubmed: 23193289
Bioinformatics. 2016 Oct 1;32(19):3047-8
pubmed: 27312411
Genome Biol. 2004;5(10):R80
pubmed: 15461798
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50
pubmed: 16199517
J Pers Med. 2019 Apr 03;9(2):
pubmed: 30987214
PLoS One. 2017 Dec 21;12(12):e0190152
pubmed: 29267363
Sci Rep. 2017 Dec 21;7(1):18022
pubmed: 29269933
Nucleic Acids Res. 2019 Jan 8;47(D1):D607-D613
pubmed: 30476243
BMC Bioinformatics. 2016 Feb 25;17:103
pubmed: 26911985
Nat Commun. 2014 Sep 25;5:5125
pubmed: 25254650
Genome Biol. 2013 Jul 03;14(7):405
pubmed: 23822731
Genome Biol. 2019 Oct 9;20(1):203
pubmed: 31597578
F1000Res. 2016 Jun 17;5:1408
pubmed: 27441086
Genome Res. 2017 Mar;27(3):491-499
pubmed: 28100584
Nat Methods. 2015 Feb;12(2):115-21
pubmed: 25633503
Genome Biol. 2016 Jan 26;17:13
pubmed: 26813401
Int J Mol Sci. 2020 Mar 03;21(5):
pubmed: 32138290
Nat Methods. 2017 Apr;14(4):417-419
pubmed: 28263959
Nat Biotechnol. 2014 Sep;32(9):896-902
pubmed: 25150836
Nat Methods. 2017 Feb;14(2):135-139
pubmed: 27941783
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W305-11
pubmed: 19465376
Bioinformatics. 2010 Sep 15;26(18):2354-6
pubmed: 20679334

Auteurs

Eliah G Overbey (EG)

Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.

Amanda M Saravia-Butler (AM)

Logyx, LLC, Mountain View, CA 94043, USA.
Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA.

Zhe Zhang (Z)

Department of Biomedical and Health Informatics, The Children's Hospital of Philadelphia, University of Pennsylvania, Philadelphia, PA 19104, USA.

Komal S Rathi (KS)

Department of Biomedical and Health Informatics, The Children's Hospital of Philadelphia, University of Pennsylvania, Philadelphia, PA 19104, USA.

Homer Fogle (H)

The Bionetics Corporation, NASA Ames Research Center, Moffett Field, CA 94035, USA.
Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA.

Willian A da Silveira (WA)

Institute for Global Food Security (IGFS) & School of Biological Sciences, Queen's University Belfast, Belfast, UK.

Richard J Barker (RJ)

Department of Botany, University of Wisconsin, Madison, WI 53706, USA.

Joseph J Bass (JJ)

MRC Versus Arthritis Centre for Musculoskeletal Ageing Research, Royal Derby Hospital, University of Nottingham & National Institute for Health Research Nottingham Biomedical Research Centre, Derby DE22 3DT, UK.

Afshin Beheshti (A)

KBR, NASA Ames Research Center, Moffett Field, CA 94035, USA.
Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.

Daniel C Berrios (DC)

Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA.

Elizabeth A Blaber (EA)

Center for Biotechnology and Interdisciplinary Studies, Department of Biomedical Engineering, Rensselaer Polytechnic Institute, Troy, NY 12180, USA.

Egle Cekanaviciute (E)

Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA.

Helio A Costa (HA)

Departments of Pathology, and of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA 94305, USA.

Laurence B Davin (LB)

Institute of Biological Chemistry, Washington State University, Pullman, WA 99164, USA.

Kathleen M Fisch (KM)

Center for Computational Biology & Bioinformatics, Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA.

Samrawit G Gebre (SG)

Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA.
KBR, NASA Ames Research Center, Moffett Field, CA 94035, USA.

Matthew Geniza (M)

Phylos Bioscience, Portland, OR 97214, USA.

Rachel Gilbert (R)

NASA Postdoctoral Program, Universities Space Research Association, NASA Ames Research Center, Moffett Field, CA 94035, USA.

Simon Gilroy (S)

Department of Botany, University of Wisconsin, Madison, WI 53706, USA.

Gary Hardiman (G)

Institute for Global Food Security (IGFS) & School of Biological Sciences, Queen's University Belfast, Belfast, UK.
Medical University of South Carolina, Charleston, SC, USA.

Raúl Herranz (R)

Centro de Investigaciones Biológicas Margarita Salas (CSIC), Ramiro de Maeztu 9, 28040 Madrid, Spain.

Yared H Kidane (YH)

Center for Pediatric Bone Biology and Translational Research, Texas Scottish Rite Hospital for Children, 2222 Welborn St., Dallas, TX 75219, USA.

Colin P S Kruse (CPS)

Los Alamos National Laboratory, Bioscience Division, Los Alamos, NM 87545, USA.

Michael D Lee (MD)

Exobiology Branch, NASA Ames Research Center, Mountain View, CA 94035, USA.
Blue Marble Space Institute of Science, Seattle, WA 98154, USA.

Ted Liefeld (T)

Department of Medicine, University of California San Diego, San Diego, CA 92093, USA.

Norman G Lewis (NG)

Institute of Biological Chemistry, Washington State University, Pullman, WA 99164, USA.

J Tyson McDonald (JT)

Department of Radiation Medicine, Georgetown University Medical Center, Washington, DC 20007, USA.

Robert Meller (R)

Department of Neurobiology and Pharmacology, Morehouse School of Medicine, Atlanta, GA 30310, USA.

Tejaswini Mishra (T)

Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA.

Imara Y Perera (IY)

Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC 27695, USA.

Shayoni Ray (S)

NGM Biopharmaceuticals, South San Francisco, CA 94080, USA.

Sigrid S Reinsch (SS)

Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA.

Sara Brin Rosenthal (SB)

Center for Computational Biology & Bioinformatics, Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA.

Michael Strong (M)

National Jewish Health, Center for Genes, Environment, and Health, 1400 Jackson Street, Denver, CO 80206, USA.

Nathaniel J Szewczyk (NJ)

Ohio Musculoskeletal and Neurological Institute and Department of Biomedical Sciences, Ohio University, Athens, OH 43147, USA.

Candice G T Tahimic (CGT)

Department of Biology, University of North Florida, Jacksonville, FL 32224, USA.

Deanne M Taylor (DM)

Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia and the Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.

Joshua P Vandenbrink (JP)

Department of Biology, Louisiana Tech University, Ruston, LA 71272, USA.

Alicia Villacampa (A)

Centro de Investigaciones Biológicas Margarita Salas (CSIC), Ramiro de Maeztu 9, 28040 Madrid, Spain.

Silvio Weging (S)

Institute of Computer Science, Martin-Luther University Halle-Wittenberg, Von-Seckendorff-Platz 1, Halle 06120, Germany.

Chris Wolverton (C)

Department of Botany and Microbiology, Ohio Wesleyan University, Delaware, OH, USA.

Sarah E Wyatt (SE)

Department of Environmental and Plant Biology, Ohio University, Athens, OH 45701, USA.
Interdisciplinary Program in Molecular and Cellular Biology, Ohio University, Athens, OH 45701, USA.

Luis Zea (L)

BioServe Space Technologies, Aerospace Engineering Sciences Department, University of Colorado Boulder, Boulder 80303 USA.

Sylvain V Costes (SV)

Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA.

Jonathan M Galazka (JM)

Space Biosciences Division, NASA Ames Research Center, Moffett Field, CA 94035, USA.

Classifications MeSH