A draft genome sequence of the elusive giant squid, Architeuthis dux.
Animals
Biological Evolution
Chromatography, Liquid
Computational Biology
/ methods
DNA Transposable Elements
Decapodiformes
/ genetics
Gene Expression Profiling
Genome
Genomics
/ methods
Molecular Sequence Annotation
Multigene Family
RNA, Untranslated
Tandem Mass Spectrometry
Transcriptome
Whole Genome Sequencing
cephalopod
genome assembly
invertebrate
Journal
GigaScience
ISSN: 2047-217X
Titre abrégé: Gigascience
Pays: United States
ID NLM: 101596872
Informations de publication
Date de publication:
01 01 2020
01 01 2020
Historique:
received:
24
06
2019
revised:
27
10
2019
accepted:
05
12
2019
entrez:
17
1
2020
pubmed:
17
1
2020
medline:
8
10
2020
Statut:
ppublish
Résumé
The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusc with a circumglobal distribution in the deep ocean, except in the high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study. Thus, having a genome assembled for this deep-sea-dwelling species will allow several pending evolutionary questions to be unlocked. We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome. This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.
Sections du résumé
BACKGROUND
The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusc with a circumglobal distribution in the deep ocean, except in the high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study. Thus, having a genome assembled for this deep-sea-dwelling species will allow several pending evolutionary questions to be unlocked.
FINDINGS
We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
CONCLUSIONS
This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.
Identifiants
pubmed: 31942620
pii: 5697198
doi: 10.1093/gigascience/giz152
pmc: PMC6962438
pii:
doi:
Substances chimiques
DNA Transposable Elements
0
RNA, Untranslated
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : Wellcome Trust
ID : WT108749/Z/15/Z
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/N020146/1
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/M009122/1
Pays : United Kingdom
Informations de copyright
© The Author(s) 2020. Published by Oxford University Press.
Références
Genome Res. 2002 Aug;12(8):1269-76
pubmed: 12176934
Commun Integr Biol. 2011 Jan;4(1):26-9
pubmed: 21509172
Anal Biochem. 1976 May 7;72:248-54
pubmed: 942051
J Dev Biol. 2016 Mar 19;4(1):null
pubmed: 29615580
Cell. 2010 Oct 29;143(3):343-53
pubmed: 21029858
Nucleic Acids Res. 2013 Jan;41(Database issue):D226-32
pubmed: 23125362
Science. 2004 Jan 9;303(5655):235-8
pubmed: 14716016
BMC Bioinformatics. 2004 May 14;5:59
pubmed: 15144565
Mob DNA. 2015 Jun 02;6:11
pubmed: 26045719
Cell. 2017 Apr 6;169(2):191-202.e11
pubmed: 28388405
Evodevo. 2016 May 10;7:11
pubmed: 27168931
Nature. 2015 Aug 13;524(7564):220-4
pubmed: 26268193
Nucleic Acids Res. 2011 Jan;39(Database issue):D141-5
pubmed: 21062808
Mar Biotechnol (NY). 2009 Nov-Dec;11(6):686-98
pubmed: 19221839
Nucleic Acids Res. 2009 Apr;37(7):e52
pubmed: 19264800
Sci Data. 2019 Apr 1;6(1):13
pubmed: 30931949
Gigascience. 2018 Nov 1;7(11):
pubmed: 30256935
Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7
pubmed: 25392425
Science. 2009 Nov 20;326(5956):1112-5
pubmed: 19965430
Genome Biol Evol. 2016 Jan 21;8(2):403-10
pubmed: 26802115
PLoS One. 2008;3(12):e4004
pubmed: 19104667
PLoS One. 2012;7(2):e30619
pubmed: 22312429
Proc Natl Acad Sci U S A. 2019 Feb 19;116(8):3030-3035
pubmed: 30635418
Mol Ecol. 2019 Mar;28(6):1537-1549
pubmed: 30003608
Nucleic Acids Res. 1997 Mar 1;25(5):955-64
pubmed: 9023104
Electrophoresis. 1988 Jun;9(6):255-62
pubmed: 2466658
PeerJ. 2015 Jan 13;3:e715
pubmed: 25649000
Bioinformatics. 2005 Jun;21 Suppl 1:i351-8
pubmed: 15961478
Genome Res. 2007 Jan;17(1):117-25
pubmed: 17151342
Proc Biol Sci. 2013 Mar 20;280(1759):20130273
pubmed: 23516246
Development. 2013 Aug;140(16):3297-302
pubmed: 23900538
Nucleic Acids Res. 2005 Nov 28;33(20):6494-506
pubmed: 16314312
Nucleic Acids Res. 2009 Jan;37(Database issue):D93-7
pubmed: 18984615
Mar Pollut Bull. 2008 Aug;56(8):1498-500
pubmed: 18501382
Bioinformatics. 2015 Oct 1;31(19):3210-2
pubmed: 26059717
Bioinformatics. 2014 Jan 1;30(1):31-7
pubmed: 23732276
Genes Dev. 1997 Dec 15;11(24):3286-305
pubmed: 9407023
Ann Rev Mar Sci. 2013;5:393-420
pubmed: 22809177
Mol Biol Evol. 2010 Jul;27(7):1645-58
pubmed: 20176615
Curr Biol. 2004 Mar 9;14(5):354-62
pubmed: 15028209
PLoS One. 2011;6(8):e23501
pubmed: 21876754
Nature. 2013 Jan 24;493(7433):526-31
pubmed: 23254933
BMC Genomics. 2017 Nov 16;18(1):884
pubmed: 29145825
Nature. 2001 Feb 15;409(6822):860-921
pubmed: 11237011
PLoS One. 2012;7(11):e47768
pubmed: 23185243
Curr Biol. 2016 May 23;26(10):R406-7
pubmed: 27218844
Dev Biol. 2010 Aug 1;344(1):7-15
pubmed: 20435029
Mar Environ Res. 2008 Aug;66(2):278-87
pubmed: 18514304
Nat Protoc. 2013 Aug;8(8):1494-512
pubmed: 23845962
Proc Biol Sci. 2012 Oct 22;279(1745):4243-52
pubmed: 22896651
Bioinformatics. 2015 Jul 1;31(13):2199-201
pubmed: 25701574
Mol Phylogenet Evol. 2006 Sep;40(3):872-9
pubmed: 16759883
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500