A chromosome-level genome assembly of the forestry pest Coronaproctus castanopsis.
Journal
Scientific data
ISSN: 2052-4463
Abbreviated title: Sci Data
Country: England
NLM ID: 101640192
Publication information
Publication date:
17 Feb 2024
History:
received: 17 Oct 2023
accepted: 22 Jan 2024
medline: 19 Feb 2024
pubmed: 18 Feb 2024
entrez: 17 Feb 2024
Status:
epublish
Abstract
As an important forestry pest, Coronaproctus castanopsis (Monophlebidae) has caused serious damage to the globally valuable Gutianshan ecosystem, China. In this study, we assembled the first chromosome-level genome of C. castanopsis, from a female specimen, by combining BGI reads, HiFi long reads and Hi-C data. The assembled genome size is 700.81 Mb, with a scaffold N50 of 273.84 Mb and a contig N50 of 12.37 Mb. Hi-C scaffolding assigned 98.32% (689.03 Mb) of the C. castanopsis genome to three chromosomes. The BUSCO analysis (n = 1,367) showed a completeness of 91.2%, comprising 89.2% single-copy BUSCOs and 2.0% multicopy BUSCOs. The mapping ratios of BGI, second-generation RNA, third-generation RNA and HiFi reads are 97.84%, 96.15%, 97.96%, and 99.33%, respectively. We also identified 64.97% (455.3 Mb) repetitive elements, 1,373 non-coding RNAs and 10,542 protein-coding genes. This study provides a high-quality genome of C. castanopsis and adds valuable molecular data for scale insects.
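The N50 statistic and the percentages quoted in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the helper names `n50` and `percent` and the toy length list are hypothetical, and only the megabase and percentage figures are taken from the abstract.

```python
def n50(lengths):
    """Return the N50 of a list of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


def percent(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


# Hypothetical toy assembly (not data from the study).
toy_lengths = [40, 30, 20, 10]
toy_n50 = n50(toy_lengths)

# Figures quoted in the abstract, in Mb.
genome_mb = 700.81
anchored_mb = 689.03
repeats_mb = 455.3

anchored_pct = percent(anchored_mb, genome_mb)
repeat_pct = percent(repeats_mb, genome_mb)

# BUSCO completeness is the sum of single-copy and multicopy fractions.
busco_complete = round(89.2 + 2.0, 1)

print(toy_n50, anchored_pct, repeat_pct, busco_complete)
```

In the toy list, the two longest sequences (40 and 30) are the first to reach half of the total of 100, so the N50 is 30. The recomputed percentages agree with the values reported in the abstract.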
Identifiers
pubmed: 38368451
doi: 10.1038/s41597-024-03016-6
pii: 10.1038/s41597-024-03016-6
pmc: PMC10874433
Chemical substances
RNA
63231-63-0
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Pagination
218
Copyright information
© 2024. The Author(s).