A draft genome sequence of the elusive giant squid, Architeuthis dux.


Journal

GigaScience
ISSN: 2047-217X
Titre abrégé: Gigascience
Pays: United States
ID NLM: 101596872

Informations de publication

Date de publication:
01 01 2020
Historique:
received: 24 06 2019
revised: 27 10 2019
accepted: 05 12 2019
entrez: 17 1 2020
pubmed: 17 1 2020
medline: 8 10 2020
Statut: ppublish

Résumé

The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusc with a circumglobal distribution in the deep ocean, except in the high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study. Thus, having a genome assembled for this deep-sea-dwelling species will allow several pending evolutionary questions to be unlocked. We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome. This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.

Sections du résumé

BACKGROUND
The giant squid (Architeuthis dux; Steenstrup, 1857) is an enigmatic giant mollusc with a circumglobal distribution in the deep ocean, except in the high Arctic and Antarctic waters. The elusiveness of the species makes it difficult to study. Thus, having a genome assembled for this deep-sea-dwelling species will allow several pending evolutionary questions to be unlocked.
FINDINGS
We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads, and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb, and a scaffold N50 of 4.8 Mb. We also present an alternative assembly including 27 Gb raw reads generated using the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from 3 different tissue types from 3 other species of squid (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and the genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
CONCLUSIONS
This annotated draft genome of A. dux provides a critical resource to investigate the unique traits of this species, including its gigantism and key adaptations to deep-sea environments.

Identifiants

pubmed: 31942620
pii: 5697198
doi: 10.1093/gigascience/giz152
pmc: PMC6962438
pii:
doi:

Substances chimiques

DNA Transposable Elements 0
RNA, Untranslated 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : Wellcome Trust
ID : WT108749/Z/15/Z
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/N020146/1
Pays : United Kingdom
Organisme : Biotechnology and Biological Sciences Research Council
ID : BB/M009122/1
Pays : United Kingdom

Informations de copyright

© The Author(s) 2020. Published by Oxford University Press.

Références

Genome Res. 2002 Aug;12(8):1269-76
pubmed: 12176934
Commun Integr Biol. 2011 Jan;4(1):26-9
pubmed: 21509172
Anal Biochem. 1976 May 7;72:248-54
pubmed: 942051
J Dev Biol. 2016 Mar 19;4(1):null
pubmed: 29615580
Cell. 2010 Oct 29;143(3):343-53
pubmed: 21029858
Nucleic Acids Res. 2013 Jan;41(Database issue):D226-32
pubmed: 23125362
Science. 2004 Jan 9;303(5655):235-8
pubmed: 14716016
BMC Bioinformatics. 2004 May 14;5:59
pubmed: 15144565
Mob DNA. 2015 Jun 02;6:11
pubmed: 26045719
Cell. 2017 Apr 6;169(2):191-202.e11
pubmed: 28388405
Evodevo. 2016 May 10;7:11
pubmed: 27168931
Nature. 2015 Aug 13;524(7564):220-4
pubmed: 26268193
Nucleic Acids Res. 2011 Jan;39(Database issue):D141-5
pubmed: 21062808
Mar Biotechnol (NY). 2009 Nov-Dec;11(6):686-98
pubmed: 19221839
Nucleic Acids Res. 2009 Apr;37(7):e52
pubmed: 19264800
Sci Data. 2019 Apr 1;6(1):13
pubmed: 30931949
Gigascience. 2018 Nov 1;7(11):
pubmed: 30256935
Nucleic Acids Res. 2015 Jan;43(Database issue):D130-7
pubmed: 25392425
Science. 2009 Nov 20;326(5956):1112-5
pubmed: 19965430
Genome Biol Evol. 2016 Jan 21;8(2):403-10
pubmed: 26802115
PLoS One. 2008;3(12):e4004
pubmed: 19104667
PLoS One. 2012;7(2):e30619
pubmed: 22312429
Proc Natl Acad Sci U S A. 2019 Feb 19;116(8):3030-3035
pubmed: 30635418
Mol Ecol. 2019 Mar;28(6):1537-1549
pubmed: 30003608
Nucleic Acids Res. 1997 Mar 1;25(5):955-64
pubmed: 9023104
Electrophoresis. 1988 Jun;9(6):255-62
pubmed: 2466658
PeerJ. 2015 Jan 13;3:e715
pubmed: 25649000
Bioinformatics. 2005 Jun;21 Suppl 1:i351-8
pubmed: 15961478
Genome Res. 2007 Jan;17(1):117-25
pubmed: 17151342
Proc Biol Sci. 2013 Mar 20;280(1759):20130273
pubmed: 23516246
Development. 2013 Aug;140(16):3297-302
pubmed: 23900538
Nucleic Acids Res. 2005 Nov 28;33(20):6494-506
pubmed: 16314312
Nucleic Acids Res. 2009 Jan;37(Database issue):D93-7
pubmed: 18984615
Mar Pollut Bull. 2008 Aug;56(8):1498-500
pubmed: 18501382
Bioinformatics. 2015 Oct 1;31(19):3210-2
pubmed: 26059717
Bioinformatics. 2014 Jan 1;30(1):31-7
pubmed: 23732276
Genes Dev. 1997 Dec 15;11(24):3286-305
pubmed: 9407023
Ann Rev Mar Sci. 2013;5:393-420
pubmed: 22809177
Mol Biol Evol. 2010 Jul;27(7):1645-58
pubmed: 20176615
Curr Biol. 2004 Mar 9;14(5):354-62
pubmed: 15028209
PLoS One. 2011;6(8):e23501
pubmed: 21876754
Nature. 2013 Jan 24;493(7433):526-31
pubmed: 23254933
BMC Genomics. 2017 Nov 16;18(1):884
pubmed: 29145825
Nature. 2001 Feb 15;409(6822):860-921
pubmed: 11237011
PLoS One. 2012;7(11):e47768
pubmed: 23185243
Curr Biol. 2016 May 23;26(10):R406-7
pubmed: 27218844
Dev Biol. 2010 Aug 1;344(1):7-15
pubmed: 20435029
Mar Environ Res. 2008 Aug;66(2):278-87
pubmed: 18514304
Nat Protoc. 2013 Aug;8(8):1494-512
pubmed: 23845962
Proc Biol Sci. 2012 Oct 22;279(1745):4243-52
pubmed: 22896651
Bioinformatics. 2015 Jul 1;31(13):2199-201
pubmed: 25701574
Mol Phylogenet Evol. 2006 Sep;40(3):872-9
pubmed: 16759883
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500

Auteurs

Rute R da Fonseca (RR)

Center for Macroecology, Evolution and Climate (CMEC), GLOBE Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark.
The Bioinformatics Centre, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark.

Alvarina Couto (A)

Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo 36310, Spain.

Andre M Machado (AM)

CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal.

Brona Brejova (B)

Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Mlynská dolina, 842 48 Bratislava, Slovak Republic.

Carolin B Albertin (CB)

Eugene Bell Center for Regenerative Biology and Tissue Engineering, Marine Biological Laboratory, Woods Hole, MA 02543, USA.

Filipe Silva (F)

CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal.
Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007, Porto, Portugal.

Paul Gardner (P)

Department of Biochemistry, University of Otago, 710 Cumberland Street, North Dunedin, Dunedin 9016, New Zealand.

Tobias Baril (T)

Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE, UK.

Alex Hayward (A)

Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE, UK.

Alexandre Campos (A)

CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal.

Ângela M Ribeiro (ÂM)

CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal.

Inigo Barrio-Hernandez (I)

Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark.

Henk-Jan Hoving (HJ)

GEOMAR Helmholtz Centre for Ocean Research Kiel,Wischhofstraße 1-3, 24148 Kiel, Germany.

Ricardo Tafur-Jimenez (R)

Instituto del Mar del Perú, Esq. Gamarra y Gral. Valle, Chucuito Apartado 22, Callao, Peru.

Chong Chu (C)

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Barbara Frazão (B)

CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal.
IPMA, Fitoplâncton Lab, Rua C do Aeroporto, 1749-077, Lisboa, Portugal.

Bent Petersen (B)

Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Batu 3 1/2, Butik Air Nasi, 08100 Bedong, Kedah, Malaysia.
Evolutionary Genomics Section, Globe Institute, University of Copenhagen,Øster Farimagsgade 5, 1353 Copenhagen, Denmark.

Fernando Peñaloza (F)

Unidad de Investigación Biomédica en Cáncer, Instituto Nacional de Cancerología-Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, Mexico City, México.

Francesco Musacchia (F)

Genomic Medicine, Telethon Institute of Genetics and Medicine, Via Campi Flegrei, 34, 80078 Pozzuoli, Naples, Italy.

Graham C Alexander (GC)

GCB Sequencing and Genomic Technologies Shared Resource, Duke University CIEMAS, 101 Science Drive, Durham, NC 27708, USA.

Hugo Osório (H)

i3S-Instituto de Investigação e Inovação em Saúde, Universidade do Porto, Rua Alfredo Allen 208, 4200-135 Porto, Portugal.
IPATIMUP-Institute of Molecular Pathology and Immunology, University of Porto, Rua Júlio Amaral de Carvalho 45, 4200-135 Porto, Portugal.
Faculty of Medicine of the University of Porto, Alameda Prof. Hernani Monteiro, 4200-319 Porto, Portugal.

Inger Winkelmann (I)

Section for GeoGenetics, GLOBE Institute, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen, Denmark.

Oleg Simakov (O)

Department of Molecular Evolution and Development, University of Vienna, Althanstrasse 14 (UZA1), A-1090 Vienna, Austria.

Simon Rasmussen (S)

Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen, Denmark.

M Ziaur Rahman (MZ)

Bioinformatics Solutions Inc, 470 Weber St N Suite 204, Waterloo, ON N2L 6J2, Canada.

Davide Pisani (D)

School of Biological Sciences and School of Earth Sciences, University of Bristol, 24 Tyndall Avenue, Bristol, BS8 1TG, UK.

Jakob Vinther (J)

School of Biological Sciences and School of Earth Sciences, University of Bristol, 24 Tyndall Avenue, Bristol, BS8 1TG, UK.

Erich Jarvis (E)

Howard Hughes Medical Institute, 4000 Jones Bridge Rd, Chevy Chase, MD 20815, USA.
The Rockefeller University, 1230 York Ave, New York, NY 10065, USA.

Guojie Zhang (G)

Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark.
China National Genebank, BGI-Shenzhen, Shenzhen 518083, China.
State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, 32 Jiaochang Donglu Kunming, Yunnan 650223, China.
CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, 32 Jiaochang Donglu Kunming, Yunnan 650223, China.

Jan M Strugnell (JM)

Centre for Sustainable Tropical Fisheries & Aquaculture, James Cook University, Townsville, Douglas QLD 4814, Australia.
Department of Ecology, Environment and Evolution, School of Life Sciences, La Trobe University, Melbourne Victoria 3086, Australia.

L Filipe C Castro (LFC)

CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal.
Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007, Porto, Portugal.

Olivier Fedrigo (O)

The Rockefeller University, 1230 York Ave, New York, NY 10065, USA.

Mateus Patricio (M)

European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK.

Qiye Li (Q)

BGI-Shenzhen, Shenzhen, China.

Sara Rocha (S)

Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo 36310, Spain.
Biomedical Research Center (CINBIO), University of Vigo, Campus Universitario Lagoas-Marcosende, 36310 Vigo, Spain.

Agostinho Antunes (A)

CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros de Leixões, Av. General Norton de Matos, 4450'208 Matosinhos, Portugal.
Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007, Porto, Portugal.

Yufeng Wu (Y)

Department of Computer Science and Engineering, University of Connecticut, Storrs, CT 06269, USA.

Bin Ma (B)

School of Computer Science, University of Waterloo, 200 University Ave W, Waterloo, ON N2L 3G1, Canada.

Remo Sanges (R)

Area of Neuroscience, Scuola Internazionale Superiore di Studi Avanzati (SISSA), Via Bonomea 265, 34136 Trieste, Italy.
Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Napoli, Italy.

Tomas Vinar (T)

Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Mlynská dolina, 842 48 Bratislava, Slovak Republic.

Blagoy Blagoev (B)

Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark.

Thomas Sicheritz-Ponten (T)

Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Batu 3 1/2, Butik Air Nasi, 08100 Bedong, Kedah, Malaysia.
Evolutionary Genomics Section, Globe Institute, University of Copenhagen,Øster Farimagsgade 5, 1353 Copenhagen, Denmark.

Rasmus Nielsen (R)

Section for GeoGenetics, GLOBE Institute, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen, Denmark.
Departments of Integrative Biology and Statistics, University of California, 3040 Valley Life Sciences, Berkeley, CA 94720-3200, USA.

M Thomas P Gilbert (MTP)

Section for GeoGenetics, GLOBE Institute, University of Copenhagen, Øster Voldgade 5-7, 1350 Copenhagen, Denmark.
Norwegian University of Science and Technology, University Museum, Høgskolering 1, 7491 Trondheim, Norway.

Articles similaires

Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice
Animals Tail Swine Behavior, Animal Animal Husbandry

Classifications MeSH