Single-fly genome assemblies fill major phylogenomic gaps across the Drosophilidae Tree of Life.


Journal

PLoS biology
ISSN: 1545-7885
Titre abrégé: PLoS Biol
Pays: United States
ID NLM: 101183755

Informations de publication

Date de publication:
Jul 2024
Historique:
received: 06 11 2023
accepted: 03 06 2024
medline: 18 7 2024
pubmed: 18 7 2024
entrez: 18 7 2024
Statut: epublish

Résumé

Long-read sequencing is driving rapid progress in genome assembly across all major groups of life, including species of the family Drosophilidae, a longtime model system for genetics, genomics, and evolution. We previously developed a cost-effective hybrid Oxford Nanopore (ONT) long-read and Illumina short-read sequencing approach and used it to assemble 101 drosophilid genomes from laboratory cultures, greatly increasing the number of genome assemblies for this taxonomic group. The next major challenge is to address the laboratory culture bias in taxon sampling by sequencing genomes of species that cannot easily be reared in the lab. Here, we build upon our previous methods to perform amplification-free ONT sequencing of single wild flies obtained either directly from the field or from ethanol-preserved specimens in museum collections, greatly improving the representation of lesser studied drosophilid taxa in whole-genome data. Using Illumina Novaseq X Plus and ONT P2 sequencers with R10.4.1 chemistry, we set a new benchmark for inexpensive hybrid genome assembly at US $150 per genome while assembling genomes from as little as 35 ng of genomic DNA from a single fly. We present 183 new genome assemblies for 179 species as a resource for drosophilid systematics, phylogenetics, and comparative genomics. Of these genomes, 62 are from pooled lab strains and 121 from single adult flies. Despite the sample limitations of working with small insects, most single-fly diploid assemblies are comparable in contiguity (>1 Mb contig N50), completeness (>98% complete dipteran BUSCOs), and accuracy (>QV40 genome-wide with ONT R10.4.1) to assemblies from inbred lines. We present a well-resolved multi-locus phylogeny for 360 drosophilid and 4 outgroup species encompassing all publicly available (as of August 2023) genomes for this group. Finally, we present a Progressive Cactus whole-genome, reference-free alignment built from a subset of 298 suitably high-quality drosophilid genomes. The new assemblies and alignment, along with updated laboratory protocols and computational pipelines, are released as an open resource and as a tool for studying evolution at the scale of an entire insect family.

Identifiants

pubmed: 39024225
doi: 10.1371/journal.pbio.3002697
pii: PBIOLOGY-D-23-02916
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e3002697

Informations de copyright

Copyright: © 2024 Kim et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Auteurs

Bernard Y Kim (BY)

Department of Biology, Stanford University, Stanford, California, United States of America.

Hannah R Gellert (HR)

Department of Biology, Stanford University, Stanford, California, United States of America.

Samuel H Church (SH)

Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut United States of America.

Anton Suvorov (A)

Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, United States of America.

Sean S Anderson (SS)

Department of Biology, University of North Carolina Chapel Hill, Chapel Hill, North Carolina, United States of America.

Olga Barmina (O)

Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America.

Sofia G Beskid (SG)

Department of Biology, Stanford University, Stanford, California, United States of America.

Aaron A Comeault (AA)

School of Environmental and Natural Sciences, Bangor University, Bangor, United Kingdom.

K Nicole Crown (KN)

Department of Biology, Case Western Reserve University, Cleveland, Ohio, United States of America.

Sarah E Diamond (SE)

Department of Biology, Case Western Reserve University, Cleveland, Ohio, United States of America.

Steve Dorus (S)

Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, New York, United States of America.

Takako Fujichika (T)

Department of Biological Sciences, Tokyo Metropolitan University, Tokyo, Japan.

James A Hemker (JA)

Department of Developmental Biology, Stanford University, Stanford, California, United States of America.

Jan Hrcek (J)

Institute of Entomology, Biology Centre, Czech Academy of Sciences, České Budějovice, Czech Republic.

Maaria Kankare (M)

Department of Biological and Environmental Science, University of Jyväskylä, Jyväskylä, Finland.

Toru Katoh (T)

Department of Biological Sciences, Hokkaido University, Sapporo, Japan.

Karl N Magnacca (KN)

Hawaii Invertebrate Program, Division of Forestry & Wildlife, Honolulu, Hawaii, United States of America.

Ryan A Martin (RA)

Department of Biology, Case Western Reserve University, Cleveland, Ohio, United States of America.

Teruyuki Matsunaga (T)

Department of Complexity Science and Engineering, The University of Tokyo, Tokyo, Japan.

Matthew J Medeiros (MJ)

Pacific Biosciences Research Center, University of Hawai'i, Mānoa, Hawaii, United States of America.

Danny E Miller (DE)

Division of Genetic Medicine, Department of Pediatrics; Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, United States of America.

Scott Pitnick (S)

Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, New York, United States of America.

Michele Schiffer (M)

Daintree Rainforest Observatory, James Cook University, Townsville, Australia.

Sara Simoni (S)

Department of Biology, Stanford University, Stanford, California, United States of America.

Tessa E Steenwinkel (TE)

Baylor College of Medicine, Houston, Texas, United States of America.

Zeeshan A Syed (ZA)

Center for Reproductive Evolution, Department of Biology, Syracuse University, Syracuse, New York, United States of America.

Aya Takahashi (A)

Department of Biological Sciences, Tokyo Metropolitan University, Tokyo, Japan.

Kevin H-C Wei (KH)

Department of Zoology, The University of British Columbia, Vancouver, Canada.

Tsuya Yokoyama (T)

Department of Biology, Stanford University, Stanford, California, United States of America.

Michael B Eisen (MB)

Department of Cell and Molecular Biology, University of California Berkeley, Berkeley, California, United States of America.
Howard Hughes Medical Institute, University of California Berkeley, Berkeley, California, United States of America.

Artyom Kopp (A)

Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America.

Daniel Matute (D)

Department of Biology, University of North Carolina Chapel Hill, Chapel Hill, North Carolina, United States of America.

Darren J Obbard (DJ)

Institute of Ecology and Evolution, University of Edinburgh, Edinburgh, United Kingdom.

Patrick M O'Grady (PM)

Department of Entomology, Cornell University, Ithaca, New York, United States of America.

Donald K Price (DK)

School of Life Sciences, University of Nevada Las Vegas, Las Vegas, Nevada, United States of America.

Masanori J Toda (MJ)

Hokkaido University Museum, Hokkaido University, Sapporo, Japan.

Thomas Werner (T)

Department of Biological Sciences, Michigan Technological University, Houghton, Michigan, United States of America.

Dmitri A Petrov (DA)

Department of Biology, Stanford University, Stanford, California, United States of America.
CZ Biohub, Investigator, San Francisco, California, United States of America.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice

Classifications MeSH