A high-quality blue whale genome, segmental duplications, and historical demography.


Journal

Molecular biology and evolution
ISSN: 1537-1719
Titre abrégé: Mol Biol Evol
Pays: United States
ID NLM: 8501455

Informations de publication

Date de publication:
20 Feb 2024
Historique:
received: 08 03 2023
revised: 11 01 2024
accepted: 22 01 2024
medline: 20 2 2024
pubmed: 20 2 2024
entrez: 20 2 2024
Statut: aheadofprint

Résumé

The blue whale, Balaenoptera musculus, is the largest animal known to have ever existed, making it an important case study in longevity and resistance to cancer. To further this and other blue whale-related research, we report a reference-quality, long-read-based genome assembly of this fascinating species. We assembled the genome from PacBio long reads and utilized Illumina/10X, optical maps, and Hi-C data for scaffolding, polishing, and manual curation. We also provided long read RNA-seq data to facilitate the annotation of the assembly by NCBI and Ensembl. Additionally, we annotated both haplotypes using TOGA and measured the genome size by flow cytometry. We then compared the blue whale genome with other cetaceans and artiodactyls, including vaquita (Phocoena sinus), the world's smallest cetacean, to investigate blue whale's unique biological traits. We found a dramatic amplification of several genes in the blue whale genome resulting from a recent burst in segmental duplications, though the possible connection between this amplification and giant body size requires further study. We also discovered sites in the insulin-like growth factor-1 gene correlated with body size in cetaceans. Finally, using our assembly to examine the heterozygosity and historical demography of Pacific and Atlantic blue whale populations, we found that the genomes of both populations are highly heterozygous and that their genetic isolation dates to the last interglacial period. Taken together, these results indicate how a high-quality, annotated blue whale genome will serve as an important resource for biology, evolution, and conservation research.

Identifiants

pubmed: 38376487
pii: 7611405
doi: 10.1093/molbev/msae036
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.

Auteurs

Yury V Bukhman (YV)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Phillip A Morin (PA)

Southwest Fisheries Science Center, National Oceanic and Atmospheric Administration (NOAA), 8901 La Jolla Shores Dr., La Jolla, CA 92037, USA.

Susanne Meyer (S)

Neuroscience Research Institute, University of California, Santa Barbara, CA, USA.

Li-Fang Chu (LF)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.
Department of Comparative Biology and Experimental Medicine, University of Calgary, Calgary, Canada.

Jeff K Jacobsen (JK)

V.E. Enterprises, Arcata CA, USA.

Jessica Antosiewicz-Bourget (J)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Daniel Mamott (D)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Maylie Gonzales (M)

Neuroscience Research Institute, University of California, Santa Barbara, CA, USA.

Cara Argus (C)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Jennifer Bolin (J)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Mark E Berres (ME)

University of Wisconsin Biotechnology Center, Bioinformatics Resource Center, 425 Henry Mall, Madison, WI 53706, USA.

Olivier Fedrigo (O)

Vertebrate Genome Lab, The Rockefeller University, 1230 York Avenue New York, NY 10065, USA.

John Steill (J)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Scott A Swanson (SA)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Peng Jiang (P)

Center for Gene Regulation in Health and Disease (GRHD), Cleveland State University, 2121 Euclid Ave, Cleveland, OH, USA.
Department of Biological, Geological and Environmental Sciences, Cleveland State University, 2121 Euclid Ave, Cleveland, OH, USA.
Center for RNA Science and Therapeutics, School of Medicine, Case Western Reserve University, Cleveland, OH, USA.

Arang Rhie (A)

Genome Informatics Section, National Human Genome Research Institute, 49 Convent Dr, Bethesda, MD 20892, USA.

Giulio Formenti (G)

Laboratory of Neurogenetics of Language, The Rockefeller University/HHMI, 1230 York Avenue, New York, NY 10065, USA.

Adam M Phillippy (AM)

Genome Informatics Section, National Human Genome Research Institute, 49 Convent Dr, Bethesda, MD 20892, USA.

Robert S Harris (RS)

Department of Biology, Pennsylvania State University, 508 Wartik Labs, University Park, PA 16802, USA.

Jonathan M D Wood (JMD)

Tree of Life, Wellcome Sanger Institute, Cambridge CB10 1SA, UK.

Kerstin Howe (K)

Tree of Life, Wellcome Sanger Institute, Cambridge CB10 1SA, UK.

Bogdan M Kirilenko (BM)

LOEWE Centre for Translational Biodiversity Genomics, Senckenberganlage 25, 60325 Frankfurt, Germany.
Senckenberg Research Institute, Senckenberganlage 25, 60325 Frankfurt, Germany.
Institute of Cell Biology and Neuroscience, Faculty of Biosciences, Goethe University Frankfurt, Max-von-Laue-Str. 9, 60438 Frankfurt, Germany.

Chetan Munegowda (C)

LOEWE Centre for Translational Biodiversity Genomics, Senckenberganlage 25, 60325 Frankfurt, Germany.
Senckenberg Research Institute, Senckenberganlage 25, 60325 Frankfurt, Germany.
Institute of Cell Biology and Neuroscience, Faculty of Biosciences, Goethe University Frankfurt, Max-von-Laue-Str. 9, 60438 Frankfurt, Germany.

Michael Hiller (M)

LOEWE Centre for Translational Biodiversity Genomics, Senckenberganlage 25, 60325 Frankfurt, Germany.
Senckenberg Research Institute, Senckenberganlage 25, 60325 Frankfurt, Germany.
Institute of Cell Biology and Neuroscience, Faculty of Biosciences, Goethe University Frankfurt, Max-von-Laue-Str. 9, 60438 Frankfurt, Germany.

Aashish Jain (A)

Department of Computer Science, Purdue University, 249 S. Martin Jischke Dr., West Lafayette, IN 47907, USA.

Daisuke Kihara (D)

Department of Computer Science, Purdue University, 249 S. Martin Jischke Dr., West Lafayette, IN 47907, USA.
Department of Biological Sciences, Purdue University, 249 S. Martin Jischke Dr., West Lafayette, IN 47907, USA.

J Spencer Johnston (JS)

Department of Entomology, Texas A&M University, College Station, TX, 77843, USA.

Alexander Ionkov (A)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Kalpana Raja (K)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Huishi Toh (H)

Neuroscience Research Institute, University of California, Santa Barbara, CA, USA.

Aimee Lang (A)

Southwest Fisheries Science Center, National Oceanic and Atmospheric Administration (NOAA), 8901 La Jolla Shores Dr., La Jolla, CA 92037, USA.
Ocean Associates, Inc., Arlington, VA, USA.

Magnus Wolf (M)

Institute for Evolution and Biodiversity (IEB), University of Muenster, Huefferstr.1 48149 Muenster, Germany.
Senckenberg Biodiversity and Climate Research Centre (BiK-F), Georg-Voigt-Strasse 14-16, Frankfurt am Main, Germany.

Erich D Jarvis (ED)

Vertebrate Genome Lab, The Rockefeller University, 1230 York Avenue New York, NY 10065, USA.
Laboratory of Neurogenetics of Language, The Rockefeller University/HHMI, 1230 York Avenue, New York, NY 10065, USA.

James A Thomson (JA)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.
Department of Molecular, Cellular and Developmental Biology, University of California Santa Barbara, Santa Barbara, CA 93106, USA.
Department of Cell and Regenerative Biology, University of Wisconsin School of Medicine and Public Health, Madison, WI 53726, USA.

Mark J P Chaisson (MJP)

Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, 1050 Childs Way RRI 408H, Los Angeles, CA 90089, USA.

Ron Stewart (R)

Regenerative Biology, Morgridge Institute for Research, 330 N. Orchard St., Madison, WI 53715, USA.

Classifications MeSH