The variation and evolution of complete human centromeres.


Journal

bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187

Informations de publication

Date de publication:
30 May 2023
Historique:
pubmed: 3 7 2023
medline: 3 7 2023
entrez: 3 7 2023
Statut: epublish

Résumé

We completely sequenced and assembled all centromeres from a second human genome and used two reference sets to benchmark genetic, epigenetic, and evolutionary variation within centromeres from a diversity panel of humans and apes. We find that centromere single-nucleotide variation can increase by up to 4.1-fold relative to other genomic regions, with the caveat that up to 45.8% of centromeric sequence, on average, cannot be reliably aligned with current methods due to the emergence of new α-satellite higher-order repeat (HOR) structures and two to threefold differences in the length of the centromeres. The extent to which this occurs differs depending on the chromosome and haplotype. Comparing the two sets of complete human centromeres, we find that eight harbor distinctly different α-satellite HOR array structures and four contain novel α-satellite HOR variants in high abundance. DNA methylation and CENP-A chromatin immunoprecipitation experiments show that 26% of the centromeres differ in their kinetochore position by at least 500 kbp-a property not readily associated with novel α-satellite HORs. To understand evolutionary change, we selected six chromosomes and sequenced and assembled 31 orthologous centromeres from the common chimpanzee, orangutan, and macaque genomes. Comparative analyses reveal nearly complete turnover of α-satellite HORs, but with idiosyncratic changes in structure characteristic to each species. Phylogenetic reconstruction of human haplotypes supports limited to no recombination between the p- and q-arms of human chromosomes and reveals that novel α-satellite HORs share a monophyletic origin, providing a strategy to estimate the rate of saltatory amplification and mutation of human centromeric DNA.

Identifiants

pubmed: 37398417
doi: 10.1101/2023.05.30.542849
pmc: PMC10312506
pii:
doi:

Types de publication

Preprint

Langues

eng

Subventions

Organisme : NIGMS NIH HHS
ID : K99 GM147352
Pays : United States
Organisme : NCI NIH HHS
ID : R01 CA266339
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG010169
Pays : United States

Commentaires et corrections

Type : UpdateIn

Déclaration de conflit d'intérêts

COMPETING INTERESTS SN is now an employee of Oxford Nanopore Technologies, Inc.; SK has received travel funds to speak at events hosted by Oxford Nanopore Technologies, Inc.; EEE is a scientific advisory board member of Variant Bio, Inc.

Auteurs

Glennis A Logsdon (GA)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Allison N Rozanski (AN)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Fedor Ryabov (F)

Masters Program in National Research University Higher School of Economics, Moscow, Russia.

Tamara Potapova (T)

Stowers Institute for Medical Research, Kansas City, MO, USA.

Valery A Shepelev (VA)

Institute of Molecular Genetics, Moscow, Russia; Currently retired.

Yafei Mao (Y)

Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China.

Mikko Rautiainen (M)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.

Sergey Koren (S)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.

Sergey Nurk (S)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.

David Porubsky (D)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Julian K Lucas (JK)

Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, USA.
UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA.

Kendra Hoekzema (K)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Katherine M Munson (KM)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Jennifer L Gerton (JL)

Stowers Institute for Medical Research, Kansas City, MO, USA.

Adam M Phillippy (AM)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.

Ivan A Alexandrov (IA)

Department of Human Molecular Genetics and Biochemistry, Tel Aviv University, Tel Aviv, Israel.
Department of Anatomy and Anthropology, Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel.
Dan David Center for Human Evolution and Biohistory Research, Tel Aviv University, Tel Aviv, Israel.

Evan E Eichler (EE)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.

Classifications MeSH