De novo assembly of a chromosome-level reference genome for the California Scrub-Jay, Aphelocoma californica.

CCGP California Conservation Genomics Project N50 Omni-C PacBio long-read sequencing

Journal

The Journal of heredity
ISSN: 1465-7333
Titre abrégé: J Hered
Pays: United States
ID NLM: 0375373

Informations de publication

Date de publication:
15 Nov 2023
Historique:
received: 04 04 2023
accepted: 27 07 2023
medline: 16 11 2023
pubmed: 17 8 2023
entrez: 17 8 2023
Statut: ppublish

Résumé

We announce the assembly of the first de novo reference genome for the California Scrub-Jay (Aphelocoma californica). The genus Aphelocoma comprises four currently recognized species including many locally adapted populations across Mesoamerica and North America. Intensive study of Aphelocoma has revealed novel insights into the evolutionary mechanisms driving diversification in natural systems. Additional insights into the evolutionary history of this group will require continued development of high-quality, publicly available genomic resources. We extracted high molecular weight genomic DNA from a female California Scrub-Jay from northern California and generated PacBio HiFi long-read data and Omni-C chromatin conformation capture data. We used these data to generate a de novo partially phased diploid genome assembly, consisting of two pseudo-haplotypes, and scaffolded them using inferred physical proximity information from the Omni-C data. The more complete pseudo-haplotype assembly (arbitrarily designated "Haplotype 1") is 1.35 Gb in total length, highly contiguous (contig N50 = 11.53 Mb), and highly complete (BUSCO completeness score = 97%), with comparable scaffold sizes to chromosome-level avian reference genomes (scaffold N50 = 66.14 Mb). Our California Scrub-Jay assembly is highly syntenic with the New Caledonian Crow reference genome despite ~10 million years of divergence, highlighting the temporal stability of the avian genome. This high-quality reference genome represents a leap forward in publicly available genomic resources for Aphelocoma, and the family Corvidae more broadly. Future work using Aphelocoma as a model for understanding the evolutionary forces generating and maintaining biodiversity across phylogenetic scales can now benefit from a highly contiguous, in-group reference genome.

Identifiants

pubmed: 37589384
pii: 7243675
doi: 10.1093/jhered/esad047
pmc: PMC10650945
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

669-680

Subventions

Organisme : NIH HHS
ID : S10 OD010786
Pays : United States
Organisme : NIH HHS
ID : S10 OD018174
Pays : United States

Informations de copyright

© The American Genetic Association. 2023.

Références

Trends Ecol Evol. 2010 May;25(5):283-91
pubmed: 20363047
Proc Biol Sci. 2009 Jan 7;276(1654):55-61
pubmed: 18765340
Nat Commun. 2018 Jan 15;9(1):189
pubmed: 29335486
Ecol Evol. 2022 Nov 30;12(12):e9517
pubmed: 36466137
PLoS Genet. 2017 Aug 3;13(8):e1006911
pubmed: 28771477
Genome Biol. 2020 Sep 14;21(1):245
pubmed: 32928274
BMC Res Notes. 2019 Jul 24;12(1):456
pubmed: 31340859
Mol Biol Evol. 2021 Sep 27;38(10):4647-4654
pubmed: 34320186
Syst Biol. 2022 Oct 12;71(6):1453-1470
pubmed: 35552760
Nature. 2021 Apr;592(7856):737-746
pubmed: 33911273
Mol Ecol. 2008 May;17(10):2505-21
pubmed: 18430143
Nature. 2020 Nov;587(7833):252-257
pubmed: 33177665
Bioinformatics. 2020 Jan 1;36(1):311-316
pubmed: 31290943
J Hered. 2022 Nov 30;113(6):597-603
pubmed: 36048626
Genome Biol. 2018 Aug 24;19(1):125
pubmed: 30143029
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
J Hered. 2023 Aug 23;114(5):549-560
pubmed: 37395718
Syst Biol. 2022 Oct 12;71(6):1423-1439
pubmed: 35703981
Ann N Y Acad Sci. 2017 Feb;1389(1):164-185
pubmed: 27997700
Curr Protoc Bioinformatics. 2004 May;Chapter 4:Unit 4.10
pubmed: 18428725
BMC Genomics. 2022 Feb 22;23(1):157
pubmed: 35193521
Chromosome Res. 2023 Jan 20;31(1):2
pubmed: 36662301
BMC Genomics. 2017 Jul 12;18(1):527
pubmed: 28701198
Mol Ecol. 2022 May;31(10):2830-2846
pubmed: 35315161
Evolution. 2011 Jan;65(1):184-202
pubmed: 20681982
Gigascience. 2017 Oct 1;6(10):1-16
pubmed: 29020750
BMC Evol Biol. 2014 Jun 17;14:135
pubmed: 24938753
Nat Commun. 2020 Mar 18;11(1):1432
pubmed: 32188846
Curr Biol. 2016 Nov 7;26(21):2974-2979
pubmed: 27746026
Mol Ecol Resour. 2020 Jul;20(4):892-905
pubmed: 32243090
Chromosome Res. 2010 Jul;18(5):543-53
pubmed: 20535633
G3 (Bethesda). 2020 Apr 9;10(4):1361-1374
pubmed: 32071071
G3 (Bethesda). 2020 Sep 2;10(9):3047-3060
pubmed: 32601059
PLoS Comput Biol. 2019 Aug 21;15(8):e1007273
pubmed: 31433799
Nat Biotechnol. 2022 Sep;40(9):1332-1335
pubmed: 35332338
Genome Res. 2009 Mar;19(3):470-80
pubmed: 19204328
Bioinformatics. 2013 Apr 15;29(8):1072-5
pubmed: 23422339
Proc Biol Sci. 2021 Jan 27;288(1943):20201805
pubmed: 33468000
J Hered. 2022 Nov 30;113(6):577-588
pubmed: 35395669

Auteurs

Devon A DeRaad (DA)

Department of Ecology and Evolutionary Biology and Biodiversity Institute, University of Kansas, Lawrence, KS, United States.

Merly Escalona (M)

Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, United States.

Phred M Benham (PM)

Museum of Vertebrate Zoology, University of California, Berkeley, Berkeley, CA, United States.
Department of Integrative Biology, University of California, Berkeley, Berkeley, CA, United States.

Mohan P A Marimuthu (MPA)

DNA Technologies and Expression Analysis Core Laboratory, Genome Center, University of California-Davis, Davis, CA, United States.

Ruta M Sahasrabudhe (RM)

DNA Technologies and Expression Analysis Core Laboratory, Genome Center, University of California-Davis, Davis, CA, United States.

Oanh Nguyen (O)

DNA Technologies and Expression Analysis Core Laboratory, Genome Center, University of California-Davis, Davis, CA, United States.

Noravit Chumchim (N)

DNA Technologies and Expression Analysis Core Laboratory, Genome Center, University of California-Davis, Davis, CA, United States.

Eric Beraut (E)

Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA, United States.

Colin W Fairbairn (CW)

Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA, United States.

William Seligmann (W)

Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA, United States.

Rauri C K Bowie (RCK)

Museum of Vertebrate Zoology, University of California, Berkeley, Berkeley, CA, United States.
Department of Integrative Biology, University of California, Berkeley, Berkeley, CA, United States.

Carla Cicero (C)

Museum of Vertebrate Zoology, University of California, Berkeley, Berkeley, CA, United States.

John E McCormack (JE)

Moore Laboratory of Zoology, Occidental College, Los Angeles, CA, United States.

Robert K Wayne (RK)

Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, United States.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH