Structural variation across 138,134 samples in the TOPMed consortium.

NGS Population Structural Variants TOPMed

Journal

bioRxiv : the preprint server for biology
Abbreviated title: bioRxiv
Country: United States
NLM ID: 101680187

Publication information

Publication date:
25 Jan 2023
History:
pubmed: 8 2 2023
medline: 8 2 2023
entrez: 7 2 2023
Status: epublish

Abstract

Ever-larger structural variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) of 50 bp or longer across the autosomes and the X chromosome from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference, which result in high variant quality and >90% allele concordance compared to long-read de novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and various important cardiometabolic and hematologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SVs on disease development and progression.

Identifiers

pubmed: 36747810
doi: 10.1101/2023.01.25.525428
pmc: PMC9900832

Publication types

Preprint

Languages

eng

Grants

Agency: NHLBI NIH HHS
ID: P01 HL132825
Country: United States
Agency: NHLBI NIH HHS
ID: R01 HL105756
Country: United States
Agency: NHLBI NIH HHS
ID: R01 HL123915
Country: United States
Agency: NHLBI NIH HHS
ID: R01 HL155742
Country: United States

Authors

Goo Jun (G)

Human Genetics Center, School of Public Health, University of Texas Health Science Center at Houston.

Adam C English (AC)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Ginger A Metcalf (GA)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Jianzhi Yang (J)

University of Southern California, Los Angeles, CA, USA.

Mark JP Chaisson (MJ)

University of Southern California, Los Angeles, CA, USA.

Nathan Pankratz (N)

University of Minnesota, Minneapolis, MN, USA.

Vipin K Menon (VK)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

William J Salerno (WJ)

Regeneron Genetics Center.

Olga Krasheninina (O)

Regeneron Genetics Center.

Albert V Smith (AV)

Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI.

John A Lane (JA)

Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI.

Tom Blackwell (T)

Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI.

Hyun Min Kang (HM)

Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI.

Sejal Salvi (S)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Qingchang Meng (Q)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Hua Shen (H)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Divya Pasham (D)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Sravya Bhamidipati (S)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Kavya Kottapalli (K)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Donna K Arnett (DK)

Department of Epidemiology, University of Kentucky College of Public Health.

Allison Ashley-Koch (A)

Department of Medicine, Duke University Medical Center, Durham, NC.
Duke Molecular Physiology Institute, Duke University Medical Center, Durham, NC.

Paul L Auer (PL)

Division of Biostatistics and Cancer Center, Medical College of Wisconsin, Milwaukee, WI.

Kathleen M Beutel (KM)

University of Minnesota, Minneapolis, MN, USA.

Joshua C Bis (JC)

Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA.

John Blangero (J)

Department of Human Genetics and South Texas Diabetes and Obesity Institute, University of Texas, Rio Grande Valley School of Medicine, Brownsville, TX.

Donald W Bowden (DW)

Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA.

Jennifer A Brody (JA)

Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA.

Brian E Cade (BE)

Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA.

Yii-Der Ida Chen (YI)

Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center.
The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA.

Michael H Cho (MH)

Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA.

Joanne E Curran (JE)

Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA.

Myriam Fornage (M)

Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX.

Barry I Freedman (BI)

Department of Internal Medicine, Section on Nephrology, Wake Forest School of Medicine, Winston-Salem, NC, USA.

Tasha Fingerlin (T)

Center for Genes, Environment and Health, National Jewish Health, 1400 Jackson St., Denver, CO, 80206, USA.

Bruce D Gelb (BD)

Mindich Child Health and Development Institute and the Departments of Pediatrics and Genetics & Genomic Sciences, Icahn School of Medicine at Mount Sinai.

Lifang Hou (L)

Northwestern University, Chicago, IL.

Yi-Jen Hung (YJ)

Institute of Preventive Medicine, National Defense Medical Center, Taiwan.

John P Kane (JP)

Cardiovascular Research Institute, University of California, San Francisco.

Robert Kaplan (R)

Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA.

Wonji Kim (W)

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA.

Ruth J F Loos (RJF)

The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY.

Gregory M Marcus (GM)

Division of Cardiology, University of California, San Francisco, CA.

Rasika A Mathias (RA)

Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD.

Stephen T McGarvey (ST)

Department of Epidemiology, International Health Institute and Department of Anthropology, Brown University.

Courtney Montgomery (C)

Genes and Human Disease Research Program, Oklahoma Medical Research Foundation.

Take Naseri (T)

Ministry of Health, Government of Samoa, Apia, Samoa.

S Mehdi Nouraie (SM)

Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213.

Michael H Preuss (MH)

The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY.

Nicholette D Palmer (ND)

Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA.

Patricia A Peyser (PA)

Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI USA.

Laura M Raffield (LM)

Department of Genetics, University of North Carolina at Chapel Hill.

Aakrosh Ratan (A)

Center for Public Health Genomics, University of Virginia, Charlottesville, VA USA.

Susan Redline (S)

Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA.

Sefuiva Reupena (S)

Lutia i Puava ae Mapu i Fagalele, Apia, Samoa 663030.

Jerome I Rotter (JI)

The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA.
Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands.

Stephen S Rich (SS)

Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI USA.

Michiel Rienstra (M)

Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands.

Ingo Ruczinski (I)

Department of Biostatistics, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD, USA.

Vijay G Sankaran (VG)

Division of Hematology/Oncology, Boston Children's Hospital and Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115.
Broad Institute of MIT and Harvard, Cambridge, MA 02142.

David A Schwartz (DA)

University of Colorado Department of Medicine.

Christine E Seidman (CE)

Department of Genetics, Harvard Medical School.
Cardiovascular Division, Brigham & Women's Hospital, Harvard University.
Howard Hughes Medical Institute, Harvard University.

Jonathan G Seidman (JG)

Department of Genetics, Harvard Medical School.

Edwin K Silverman (EK)

Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA.

Jennifer A Smith (JA)

Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213.

Adrienne Stilp (A)

Department of Biostatistics, University of Washington, Seattle, WA.

Kent D Taylor (KD)

The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA USA.
Center for Public Health Genomics, University of Virginia, Charlottesville, VA USA.

Marilyn J Telen (MJ)

Department of Medicine, Duke University Medical Center, Durham, NC.

Scott T Weiss (ST)

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA.

L Keoki Williams (LK)

Center for Individualized and Genomic Medicine Research (CIGMA), Department of Internal Medicine, Henry Ford Health System, Detroit, Michigan, United States of America.

Baojun Wu (B)

Center for Individualized and Genomic Medicine Research (CIGMA), Department of Internal Medicine, Henry Ford Health System, Detroit, Michigan, United States of America.

Lisa R Yanek (LR)

The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY.

Yingze Zhang (Y)

Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15213.

Jessica Lasky-Su (J)

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA.

Marie Claude Gingras (MC)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Susan K Dutcher (SK)

Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110, USA.

Evan E Eichler (EE)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, USA.
Howard Hughes Medical Institute, University of Washington, Seattle, Washington, USA.

Stacey Gabriel (S)

Broad Institute of MIT and Harvard, Cambridge, MA 02142.

Soren Germer (S)

New York Genome Center, New York, NY, USA.

Ryan Kim (R)

Psomagen, Inc., Rockville, Maryland, USA.

Karine A Viaud-Martinez (KA)

Illumina Laboratory Services, Illumina, Inc., San Diego.

Deborah A Nickerson (DA)

Department of Genome Sciences, University of Washington, Seattle, WA 98195.

James Luo (J)

National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA.

Alex Reiner (A)

Department of Epidemiology, University of Washington, Seattle, WA 98109, USA.

Richard A Gibbs (RA)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Eric Boerwinkle (E)

Human Genetics Center, School of Public Health, University of Texas Health Science Center at Houston.
Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.

Goncalo Abecasis (G)

Regeneron Genetics Center.
Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI.

Fritz J Sedlazeck (FJ)

Baylor College of Medicine Human Genome Sequencing Center, Houston, TX, USA.
Department of Computer Science, Rice University, 6100 Main Street, Houston, TX, 77005, USA.

MeSH classifications