Single-fly assemblies fill major phylogenomic gaps across the Drosophilidae Tree of Life.


Journal

bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187

Informations de publication

Date de publication:
02 Oct 2023
Historique:
pubmed: 24 10 2023
medline: 24 10 2023
entrez: 24 10 2023
Statut: epublish

Résumé

Long-read sequencing is driving rapid progress in genome assembly across all major groups of life, including species of the family Drosophilidae, a longtime model system for genetics, genomics, and evolution. We previously developed a cost-effective hybrid Oxford Nanopore (ONT) long-read and Illumina short-read sequencing approach and used it to assemble 101 drosophilid genomes from laboratory cultures, greatly increasing the number of genome assemblies for this taxonomic group. The next major challenge is to address the laboratory culture bias in taxon sampling by sequencing genomes of species that cannot easily be reared in the lab. Here, we build upon our previous methods to perform amplification-free ONT sequencing of single wild flies obtained either directly from the field or from ethanol-preserved specimens in museum collections, greatly improving the representation of lesser studied drosophilid taxa in whole-genome data. Using Illumina Novaseq X Plus and ONT P2 sequencers with R10.4.1 chemistry, we set a new benchmark for inexpensive hybrid genome assembly at US $150 per genome while assembling genomes from as little as 35 ng of genomic DNA from a single fly. We present 183 new genome assemblies for 179 species as a resource for drosophilid systematics, phylogenetics, and comparative genomics. Of these genomes, 62 are from pooled lab strains and 121 from single adult flies. Despite the sample limitations of working with small insects, most single-fly diploid assemblies are comparable in contiguity (>1Mb contig N50), completeness (>98% complete dipteran BUSCOs), and accuracy (>QV40 genome-wide with ONT R10.4.1) to assemblies from inbred lines. We present a well-resolved multi-locus phylogeny for 360 drosophilid and 4 outgroup species encompassing all publicly available (as of August 2023) genomes for this group. Finally, we present a Progressive Cactus whole-genome, reference-free alignment built from a subset of 298 suitably high-quality drosophilid genomes. The new assemblies and alignment, along with updated laboratory protocols and computational pipelines, are released as an open resource and as a tool for studying evolution at the scale of an entire insect family.

Identifiants

pubmed: 37873137
doi: 10.1101/2023.10.02.560517
pmc: PMC10592941
pii:
doi:

Types de publication

Preprint

Langues

eng

Subventions

Organisme : NIGMS NIH HHS
ID : R35 GM122592
Pays : United States
Organisme : NIGMS NIH HHS
ID : R35 GM148244
Pays : United States

Auteurs

Bernard Y Kim (BY)

Department of Biology, Stanford University, USA.

Hannah R Gellert (HR)

Department of Biology, Stanford University, USA.

Samuel H Church (SH)

Department of Ecology and Evolutionary Biology, Yale University, USA.

Anton Suvorov (A)

Department of Biological Sciences, Virginia Tech, USA.

Sean S Anderson (SS)

Department of Biology, University of North Carolina Chapel Hill, USA.

Olga Barmina (O)

Department of Evolution and Ecology, University of California Davis, USA.

Sofia G Beskid (SG)

Department of Biology, Stanford University, USA.

Aaron A Comeault (AA)

School of Environmental and Natural Sciences, Bangor University, UK.

K Nicole Crown (KN)

Department of Biology, Case Western Reserve University, USA.

Sarah E Diamond (SE)

Department of Biology, Case Western Reserve University, USA.

Steve Dorus (S)

Center for Reproductive Evolution, Department of Biology, Syracuse University, USA.

Takako Fujichika (T)

Department of Biological Sciences, Tokyo Metropolitan University, Japan.

James A Hemker (JA)

Department of Developmental Biology, Stanford University, USA.

Jan Hrcek (J)

Institute of Entomology, Biology Centre, Czech Academy of Sciences, Czechia.

Maaria Kankare (M)

Department of Biological and Environmental Science, University of Jyväskylä, Finland.

Toru Katoh (T)

Department of Biological Sciences, Hokkaido University, Japan.

Karl N Magnacca (KN)

Hawaii Invertebrate Program, Division of Forestry & Wildlife, State of Hawaii, USA.

Ryan A Martin (RA)

Department of Biology, Case Western Reserve University, USA.

Teruyuki Matsunaga (T)

Department of Complexity Science and Engineering, The University of Tokyo, Japan.

Matthew J Medeiros (MJ)

Pacific Biosciences Research Center, University of Hawai'i, Mānoa, USA.

Danny E Miller (DE)

Division of Genetic Medicine, Department of Pediatrics; Department of Laboratory Medicine and Pathology, University of Washington, USA.

Scott Pitnick (S)

Center for Reproductive Evolution, Department of Biology, Syracuse University, USA.

Sara Simoni (S)

Department of Biology, Stanford University, USA.

Tessa E Steenwinkel (TE)

Baylor College of Medicine, USA.

Michele Schiffer (M)

Daintree Rainforest Observatory, James Cook University, Australia.

Zeeshan A Syed (ZA)

Center for Reproductive Evolution, Department of Biology, Syracuse University, USA.

Aya Takahashi (A)

Department of Biological Sciences, Tokyo Metropolitan University, Japan.

Kevin H-C Wei (KH)

Department of Zoology, The University of British Columbia.

Tsuya Yokoyama (T)

Department of Biology, Stanford University, USA.

Michael B Eisen (MB)

Department of Cell and Molecular Biology, University of California Berkeley, United States.
Howard Hughes Medical Institute,University of California Berkeley, United States.

Artyom Kopp (A)

Department of Evolution and Ecology, University of California Davis, USA.

Daniel Matute (D)

Department of Biology, University of North Carolina Chapel Hill, USA.

Darren J Obbard (DJ)

Institute of Ecology and Evolution, University of Edinburgh, UK.

Patrick M O'Grady (PM)

Department of Entomology, Cornell University, USA.

Donald K Price (DK)

School of Life Sciences, University of Nevada Las Vegas, USA.

Masanori J Toda (MJ)

Hokkaido University Museum, Hokkaido University, Japan.

Thomas Werner (T)

Department of Biological Sciences, Michigan Technological University, USA.

Dmitri A Petrov (DA)

Department of Biology, Stanford University, USA.
CZ Biohub, Investigator.

Classifications MeSH