Dataset of 143 metagenome-assembled genomes from the Arctic and Atlantic Oceans, including 21 for eukaryotic organisms.

Genome resolved metagenomics MAGs Marine algae Phycology Phytoplankton Polar microbiomes

Journal

Data in brief
ISSN: 2352-3409
Titre abrégé: Data Brief
Pays: Netherlands
ID NLM: 101654995

Informations de publication

Date de publication:
Apr 2023
Historique:
received: 19 08 2022
revised: 18 01 2023
accepted: 09 02 2023
entrez: 7 3 2023
pubmed: 8 3 2023
medline: 8 3 2023
Statut: epublish

Résumé

This article presents metagenome-assembled genomes (MAGs) for both eukaryotic and prokaryotic organisms originating from the Arctic and Atlantic oceans, along with gene prediction and functional annotation for MAGs from both domains. Eleven samples from the chlorophyll-a maximum layer of the surface ocean were collected during two cruises in 2012; six from the Arctic in June-July on ARK-XXVII/1 (PS80), and five from the Atlantic in November on ANT-XXIX/1 (PS81). Sequencing and assembly was carried out by the Joint Genome Institute (JGI), who provide annotation of the assembled sequences, and 122 MAGs for prokaryotic organisms. A subsequent binning process identified 21 MAGs for eukaryotic organisms, mostly identified as Mamiellophyceae or Bacillariophyceae. The data for each MAG includes sequences in FASTA format, and tables of functional annotation of genes. For eukaryotic MAGs, transcript and protein sequences for predicted genes are available. A spreadsheet is provided summarising quality measures and taxonomic classifications for each MAG. These data provide draft genomes for uncultured marine microbes, including some of the first MAGs for polar eukaryotes, and can provide reference genetic data for these environments, or used in genomics-based comparison between environments.

Identifiants

pubmed: 36879606
doi: 10.1016/j.dib.2023.108990
pii: S2352-3409(23)00108-7
pmc: PMC9984783
doi:

Banques de données

figshare
['10.6084/m9.figshare.c.5017517.v4']

Types de publication

Journal Article

Langues

eng

Pagination

108990

Informations de copyright

© 2023 The Author(s).

Déclaration de conflit d'intérêts

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Références

Microbiome. 2022 Apr 28;10(1):67
pubmed: 35484634
ISME J. 2017 Jun;11(6):1372-1385
pubmed: 28267153
Nat Biotechnol. 2016 May;34(5):525-7
pubmed: 27043002
Nucleic Acids Res. 2011 Jul;39(Web Server issue):W29-37
pubmed: 21593126
Bioinformatics. 2011 Aug 1;27(15):2159-60
pubmed: 21697132
Genome Res. 2011 Mar;21(3):487-93
pubmed: 21209072
Bioinformatics. 2015 Oct 1;31(19):3210-2
pubmed: 26059717
PLoS Biol. 2014 Jun 24;12(6):e1001889
pubmed: 24959919
Genome Res. 2018 Apr;28(4):569-580
pubmed: 29496730
Nucleic Acids Res. 1998 Feb 15;26(4):1107-15
pubmed: 9461475
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402
pubmed: 9254694
Nucleic Acids Res. 2007 Jan;35(Database issue):D237-40
pubmed: 17135202
Bioinformatics. 2013 Nov 15;29(22):2933-5
pubmed: 24008419
Nat Biotechnol. 2017 Nov;35(11):1026-1028
pubmed: 29035372
Bioinformatics. 2015 May 15;31(10):1674-6
pubmed: 25609793
Bioinformatics. 2014 May 1;30(9):1236-40
pubmed: 24451626
Microbiome. 2020 Apr 3;8(1):48
pubmed: 32245390
BMC Bioinformatics. 2011 Dec 22;12:491
pubmed: 22192575
Genome Res. 2008 Dec;18(12):1979-90
pubmed: 18757608
J Comput Biol. 2012 May;19(5):455-77
pubmed: 22506599
Nucleic Acids Res. 2010 Nov;38(20):e191
pubmed: 20805240
Stand Genomic Sci. 2016 Feb 24;11:17
pubmed: 26918089
Nat Commun. 2021 Sep 16;12(1):5483
pubmed: 34531387
Nucleic Acids Res. 2019 Jan 8;47(D1):D666-D677
pubmed: 30289528
Genome Res. 2015 Jul;25(7):1043-55
pubmed: 25977477
BMC Bioinformatics. 2010 Mar 08;11:119
pubmed: 20211023
DNA Res. 2008 Dec;15(6):387-96
pubmed: 18940874
Genome Biol. 2020 Sep 10;21(1):244
pubmed: 32912302
Bioinformatics. 2019 Nov 15;:
pubmed: 31730192
Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45
pubmed: 26553804
Nat Microbiol. 2018 Jul;3(7):804-813
pubmed: 29891866
PeerJ. 2015 Aug 27;3:e1165
pubmed: 26336640

Auteurs

Anthony Duncan (A)

School of Computing Sciences, University of East Anglia, Norwich Research Park, Norwich, NR47TJ, UK.

Kerrie Barry (K)

US Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.

Chris Daum (C)

US Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.

Emiley Eloe-Fadrosh (E)

US Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.

Simon Roux (S)

US Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.

Katrin Schmidt (K)

School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich, NR47TJ, UK.

Susannah G Tringe (SG)

US Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.

Klaus U Valentin (KU)

Alfred-Wegener Institute for Polar and Marine Research, Am Handelshafen 12, 27570, Bremerhaven, Germany.

Neha Varghese (N)

US Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.

Asaf Salamov (A)

US Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.

Igor V Grigoriev (IV)

US Department of Energy Joint Genome Institute, 1 Cyclotron Road, Berkeley, CA, 94720, USA.

Richard M Leggett (RM)

Earlham Institute, Norwich Research Park, Norwich, NR4 7UG, UK.

Vincent Moulton (V)

School of Computing Sciences, University of East Anglia, Norwich Research Park, Norwich, NR47TJ, UK.

Thomas Mock (T)

School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich, NR47TJ, UK.

Classifications MeSH