Structurally divergent and recurrently mutated regions of primate genomes.


Journal

bioRxiv : the preprint server for biology
Abbreviated title: bioRxiv
Country: United States
NLM ID: 101680187

Publication information

Publication date:
07 Mar 2023
History:
pubmed: 23 Mar 2023
medline: 23 Mar 2023
entrez: 22 Mar 2023
Status: epublish

Abstract

To better understand the pattern of primate genome structural variation, we sequenced and assembled using multiple long-read sequencing technologies the genomes of eight nonhuman primate species, including New World monkeys (owl monkey and marmoset), Old World monkey (macaque), Asian apes (orangutan and gibbon), and African ape lineages (gorilla, bonobo, and chimpanzee). Compared to the human genome, we identified 1,338,997 lineage-specific fixed structural variants (SVs) disrupting 1,561 protein-coding genes and 136,932 regulatory elements, including the most complete set of human-specific fixed differences. Across 50 million years of primate evolution, we estimate that 819.47 Mbp or ~27% of the genome has been affected by SVs based on analysis of these primate lineages. We identify 1,607 structurally divergent regions (SDRs) wherein recurrent structural variation contributes to creating SV hotspots where genes are recurrently lost (

Identifiers

pubmed: 36945442
doi: 10.1101/2023.03.07.531415
pmc: PMC10028934

Publication types

Preprint

Languages

eng

Grants

Agency: NHGRI NIH HHS
ID: U41 HG010972
Country: United States
Agency: NIAID NIH HHS
ID: R01 AI137011
Country: United States
Agency: NHGRI NIH HHS
ID: R01 HG010169
Country: United States
Agency: NHGRI NIH HHS
ID: U24 HG009081
Country: United States
Agency: NIGMS NIH HHS
ID: K99 GM147352
Country: United States
Agency: NHGRI NIH HHS
ID: R01 HG010485
Country: United States
Agency: NHGRI NIH HHS
ID: R01 HG002385
Country: United States
Agency: NIH HHS
ID: P51 OD011092
Country: United States
Agency: NHGRI NIH HHS
ID: U01 HG010961
Country: United States
Agency: NIDA NIH HHS
ID: DP1 DA046108
Country: United States

Conflict of interest statement

Competing interests: E.E.E. is a scientific advisory board (SAB) member of Variant Bio, Inc. The other authors declare no competing interests.

Authors

Yafei Mao (Y)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China.

William T Harvey (WT)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

David Porubsky (D)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Katherine M Munson (KM)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Kendra Hoekzema (K)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Alexandra P Lewis (AP)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Peter A Audano (PA)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Allison Rozanski (A)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Xiangyu Yang (X)

Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China.

Shilong Zhang (S)

Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China.

David S Gordon (DS)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.

Xiaoxi Wei (X)

Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China.

Glennis A Logsdon (GA)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Marina Haukness (M)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA.

Philip C Dishuck (PC)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Hyeonsoo Jeong (H)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.

Ricardo Del Rosario (R)

McGovern Institute for Brain Research, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA.
Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Vanessa L Bauer (VL)

BioFrontiers Institute, Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder, CO, USA.

Will T Fattor (WT)

BioFrontiers Institute, Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder, CO, USA.

Gregory K Wilkerson (GK)

Department of Veterinary Sciences, Michale E. Keeling Center for Comparative Medicine and Research, The University of Texas MD Anderson Cancer Center, Bastrop, TX, USA.
Department of Clinical Sciences, North Carolina State University, Raleigh, NC, USA.

Qing Lu (Q)

Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, China.

Benedict Paten (B)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA.

Guoping Feng (G)

McGovern Institute for Brain Research, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA.
Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Sara L Sawyer (SL)

BioFrontiers Institute, Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder, CO, USA.

Wesley C Warren (WC)

Department of Animal Sciences, Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.
Department of Surgery, School of Medicine, University of Missouri, Columbia, MO, USA.
Institute of Data Science and Informatics, University of Missouri, Columbia, MO, USA.

Lucia Carbone (L)

Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, OR, USA.
Division of Genetics, Oregon National Primate Research Center, Beaverton, OR, USA.
Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, OR, USA.
Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR, USA.

Evan E Eichler (EE)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA.
Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.

MeSH classifications