De novo assembly of a chromosome-level reference genome for the California Scrub-Jay, Aphelocoma californica.
CCGP
California Conservation Genomics Project
N50
Omni-C
PacBio
long-read sequencing
Journal
The Journal of heredity
ISSN: 1465-7333
Titre abrégé: J Hered
Pays: United States
ID NLM: 0375373
Informations de publication
Date de publication:
15 Nov 2023
15 Nov 2023
Historique:
received:
04
04
2023
accepted:
27
07
2023
medline:
16
11
2023
pubmed:
17
8
2023
entrez:
17
8
2023
Statut:
ppublish
Résumé
We announce the assembly of the first de novo reference genome for the California Scrub-Jay (Aphelocoma californica). The genus Aphelocoma comprises four currently recognized species including many locally adapted populations across Mesoamerica and North America. Intensive study of Aphelocoma has revealed novel insights into the evolutionary mechanisms driving diversification in natural systems. Additional insights into the evolutionary history of this group will require continued development of high-quality, publicly available genomic resources. We extracted high molecular weight genomic DNA from a female California Scrub-Jay from northern California and generated PacBio HiFi long-read data and Omni-C chromatin conformation capture data. We used these data to generate a de novo partially phased diploid genome assembly, consisting of two pseudo-haplotypes, and scaffolded them using inferred physical proximity information from the Omni-C data. The more complete pseudo-haplotype assembly (arbitrarily designated "Haplotype 1") is 1.35 Gb in total length, highly contiguous (contig N50 = 11.53 Mb), and highly complete (BUSCO completeness score = 97%), with comparable scaffold sizes to chromosome-level avian reference genomes (scaffold N50 = 66.14 Mb). Our California Scrub-Jay assembly is highly syntenic with the New Caledonian Crow reference genome despite ~10 million years of divergence, highlighting the temporal stability of the avian genome. This high-quality reference genome represents a leap forward in publicly available genomic resources for Aphelocoma, and the family Corvidae more broadly. Future work using Aphelocoma as a model for understanding the evolutionary forces generating and maintaining biodiversity across phylogenetic scales can now benefit from a highly contiguous, in-group reference genome.
Identifiants
pubmed: 37589384
pii: 7243675
doi: 10.1093/jhered/esad047
pmc: PMC10650945
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
669-680Subventions
Organisme : NIH HHS
ID : S10 OD010786
Pays : United States
Organisme : NIH HHS
ID : S10 OD018174
Pays : United States
Informations de copyright
© The American Genetic Association. 2023.
Références
Trends Ecol Evol. 2010 May;25(5):283-91
pubmed: 20363047
Proc Biol Sci. 2009 Jan 7;276(1654):55-61
pubmed: 18765340
Nat Commun. 2018 Jan 15;9(1):189
pubmed: 29335486
Ecol Evol. 2022 Nov 30;12(12):e9517
pubmed: 36466137
PLoS Genet. 2017 Aug 3;13(8):e1006911
pubmed: 28771477
Genome Biol. 2020 Sep 14;21(1):245
pubmed: 32928274
BMC Res Notes. 2019 Jul 24;12(1):456
pubmed: 31340859
Mol Biol Evol. 2021 Sep 27;38(10):4647-4654
pubmed: 34320186
Syst Biol. 2022 Oct 12;71(6):1453-1470
pubmed: 35552760
Nature. 2021 Apr;592(7856):737-746
pubmed: 33911273
Mol Ecol. 2008 May;17(10):2505-21
pubmed: 18430143
Nature. 2020 Nov;587(7833):252-257
pubmed: 33177665
Bioinformatics. 2020 Jan 1;36(1):311-316
pubmed: 31290943
J Hered. 2022 Nov 30;113(6):597-603
pubmed: 36048626
Genome Biol. 2018 Aug 24;19(1):125
pubmed: 30143029
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
J Hered. 2023 Aug 23;114(5):549-560
pubmed: 37395718
Syst Biol. 2022 Oct 12;71(6):1423-1439
pubmed: 35703981
Ann N Y Acad Sci. 2017 Feb;1389(1):164-185
pubmed: 27997700
Curr Protoc Bioinformatics. 2004 May;Chapter 4:Unit 4.10
pubmed: 18428725
BMC Genomics. 2022 Feb 22;23(1):157
pubmed: 35193521
Chromosome Res. 2023 Jan 20;31(1):2
pubmed: 36662301
BMC Genomics. 2017 Jul 12;18(1):527
pubmed: 28701198
Mol Ecol. 2022 May;31(10):2830-2846
pubmed: 35315161
Evolution. 2011 Jan;65(1):184-202
pubmed: 20681982
Gigascience. 2017 Oct 1;6(10):1-16
pubmed: 29020750
BMC Evol Biol. 2014 Jun 17;14:135
pubmed: 24938753
Nat Commun. 2020 Mar 18;11(1):1432
pubmed: 32188846
Curr Biol. 2016 Nov 7;26(21):2974-2979
pubmed: 27746026
Mol Ecol Resour. 2020 Jul;20(4):892-905
pubmed: 32243090
Chromosome Res. 2010 Jul;18(5):543-53
pubmed: 20535633
G3 (Bethesda). 2020 Apr 9;10(4):1361-1374
pubmed: 32071071
G3 (Bethesda). 2020 Sep 2;10(9):3047-3060
pubmed: 32601059
PLoS Comput Biol. 2019 Aug 21;15(8):e1007273
pubmed: 31433799
Nat Biotechnol. 2022 Sep;40(9):1332-1335
pubmed: 35332338
Genome Res. 2009 Mar;19(3):470-80
pubmed: 19204328
Bioinformatics. 2013 Apr 15;29(8):1072-5
pubmed: 23422339
Proc Biol Sci. 2021 Jan 27;288(1943):20201805
pubmed: 33468000
J Hered. 2022 Nov 30;113(6):577-588
pubmed: 35395669