Chromosome-level genome assembly, annotation, and phylogenomics of the gooseneck barnacle Pollicipes pollicipes.
Pollicipes
annotation
assembly
barnacle
crustacea
genome
larval evolution
phylogeny
Journal
GigaScience
ISSN: 2047-217X
Abbreviated title: Gigascience
Country: United States
NLM ID: 101596872
Publication information
Publication date: 12 March 2022
History:
received: 5 November 2021
revised: 9 January 2022
accepted: 11 February 2022
entrez: 12 March 2022
pubmed: 13 March 2022
medline: 5 April 2022
Status:
ppublish
Abstract
BACKGROUND
The barnacles are a group of >2,000 species that have fascinated biologists, including Darwin, for centuries. Their lifestyles are extremely diverse, from free-swimming larvae to sessile adults, and even root-like endoparasites. Barnacles also cause hundreds of millions of dollars of losses annually due to biofouling. However, genomic resources for crustaceans, and barnacles in particular, are lacking.
RESULTS
Using 62× Pacific Biosciences coverage, 189× Illumina whole-genome sequencing coverage, 203× Hi-C coverage, and 69× CHi-C coverage, we produced a chromosome-level genome assembly of the gooseneck barnacle Pollicipes pollicipes. The P. pollicipes genome is 770 Mb long and its assembly is one of the most contiguous and complete crustacean genomes available, with a scaffold N50 of 47 Mb and 90.5% of the BUSCO Arthropoda gene set. Using the genome annotation produced here along with transcriptomes of 13 other barnacle species, we completed phylogenomic analyses on an alignment of nearly 2 million amino acids. Contrary to previous studies, our phylogenies suggest that the Pollicipedomorpha is monophyletic and sister to the Balanomorpha, which alters our understanding of barnacle larval evolution and suggests homoplasy in a number of naupliar characters. We also compared transcriptomes of P. pollicipes nauplius larvae and adults and found that nearly one-half of the genes in the genome are differentially expressed, highlighting the vastly different transcriptomes of larval and adult gooseneck barnacles. Annotation of the genes with KEGG and GO terms reveals that these stages exhibit many differences including cuticle binding, chitin binding, microtubule motor activity, and membrane adhesion.
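The scaffold N50 quoted above is a contiguity statistic: the length of the scaffold at which the running total, taken from longest to shortest, first reaches half of the total assembly length. The following is a minimal Python sketch of that standard definition, not the authors' pipeline; the function name `n50` and the toy scaffold lengths are hypothetical and chosen only for illustration.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold (or contig) lengths.

    Sort lengths from longest to shortest and return the length at which
    the cumulative sum first reaches at least half of the total length.
    """
    if not lengths:
        raise ValueError("need at least one scaffold length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly (arbitrary units), not data from the paper.
toy_scaffolds = [50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # total is 150, half is 75; 50 + 40 = 90 >= 75, so prints 40
```

A larger N50 relative to genome size indicates that more of the assembly sits in long scaffolds, which is why the value is reported alongside BUSCO completeness.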
CONCLUSION
This study provides high-quality genomic resources for a key group of crustaceans. This is especially valuable given the roles P. pollicipes plays in European fisheries, as a sentinel species for coastal ecosystems, and as a model for studying barnacle adhesion, as well as its key position in the barnacle tree of life. The combination of genomic, phylogenetic, and transcriptomic analyses here provides valuable insights into the evolution and development of barnacles.
Identifiers
pubmed: 35277961
pii: 6547680
doi: 10.1093/gigascience/giac021
pmc: PMC8917513
Publication types
Journal Article
Research Support, Non-U.S. Gov't
Languages
eng
Citation subsets
IM
Copyright information
© The Author(s) 2022. Published by Oxford University Press GigaScience.