Robust, flexible, and scalable tests for Hardy-Weinberg equilibrium across diverse ancestries.


Journal

Genetics
ISSN: 1943-2631
Titre abrégé: Genetics
Pays: United States
ID NLM: 0374636

Informations de publication

Date de publication:
17 05 2021
Historique:
received: 24 11 2020
accepted: 03 02 2021
pubmed: 16 3 2021
medline: 19 2 2022
entrez: 15 3 2021
Statut: ppublish

Résumé

Traditional Hardy-Weinberg equilibrium (HWE) tests (the χ2 test and the exact test) have long been used as a metric for evaluating genotype quality, as technical artifacts leading to incorrect genotype calls often can be identified as deviations from HWE. However, in data sets composed of individuals from diverse ancestries, HWE can be violated even without genotyping error, complicating the use of HWE testing to assess genotype data quality. In this manuscript, we present the Robust Unified Test for HWE (RUTH) to test for HWE while accounting for population structure and genotype uncertainty, and to evaluate the impact of population heterogeneity and genotype uncertainty on the standard HWE tests and alternative methods using simulated and real sequence data sets. Our results demonstrate that ignoring population structure or genotype uncertainty in HWE tests can inflate false-positive rates by many orders of magnitude. Our evaluations demonstrate different tradeoffs between false positives and statistical power across the methods, with RUTH consistently among the best across all evaluations. RUTH is implemented as a practical and scalable software tool to rapidly perform HWE tests across millions of markers and hundreds of thousands of individuals while supporting standard VCF/BCF formats. RUTH is publicly available at https://www.github.com/statgen/ruth.

Identifiants

pubmed: 33720349
pii: 6171183
doi: 10.1093/genetics/iyab044
pmc: PMC8128395
pii:
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : NIGMS NIH HHS
ID : U54 GM104938
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL120393
Pays : United States
Organisme : NHLBI NIH HHS
ID : U01 HL120393
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH105653
Pays : United States
Organisme : NHLBI NIH HHS
ID : HHSN268201800001C
Pays : United States
Organisme : NHLBI NIH HHS
ID : K01 HL129039
Pays : United States
Organisme : NIDA NIH HHS
ID : R01 DA037904
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL117626
Pays : United States
Organisme : NHLBI NIH HHS
ID : P01 HL132825
Pays : United States
Organisme : NHLBI NIH HHS
ID : K01 HL135405
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL142711
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL113326
Pays : United States
Organisme : NHLBI NIH HHS
ID : P01 HL045522
Pays : United States
Organisme : NIAID NIH HHS
ID : R01 AI132476
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG009976
Pays : United States
Organisme : NIDDK NIH HHS
ID : P30 DK020572
Pays : United States
Organisme : NHLBI NIH HHS
ID : R03 HL154284
Pays : United States
Organisme : NHLBI NIH HHS
ID : U01 HL137182
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG007022
Pays : United States
Organisme : NCI NIH HHS
ID : U01 CA182913
Pays : United States
Organisme : NHLBI NIH HHS
ID : U01 HL117626
Pays : United States

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Références

Nat Rev Genet. 2011 Jun;12(6):443-51
pubmed: 21587300
Nat Genet. 2006 Jan;38(1):86-92
pubmed: 16468122
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Nat Genet. 2006 Aug;38(8):904-9
pubmed: 16862161
Nat Methods. 2013 Jan;10(1):5-6
pubmed: 23269371
Genome Res. 2020 Feb;30(2):185-194
pubmed: 31980570
Genetica. 1995;96(1-2):3-12
pubmed: 7607457
Genet Epidemiol. 2011 Nov;35(7):671-8
pubmed: 21818775
Genetics. 2019 Nov;213(3):759-770
pubmed: 31537622
Science. 1908 Jul 10;28(706):49-50
pubmed: 17779291
Bioinformatics. 2016 Mar 1;32(5):713-21
pubmed: 26545820
Mol Ecol Resour. 2019 Sep;19(5):1144-1152
pubmed: 30977299
Brief Bioinform. 2013 Mar;14(2):144-61
pubmed: 22908213
Nat Genet. 2016 Feb;48(2):134-43
pubmed: 26691988
Science. 2008 Feb 22;319(5866):1100-4
pubmed: 18292342
J Hered. 2015 Jan-Feb;106(1):1-19
pubmed: 25425676
Nature. 2010 Sep 2;467(7311):52-8
pubmed: 20811451
Genetics. 2008 Nov;180(3):1609-16
pubmed: 18791257
Mol Ecol. 2002 Jul;11(7):1157-64
pubmed: 12074723
Am J Hum Genet. 1998 Nov;63(5):1531-40
pubmed: 9867708
Genet Epidemiol. 2008 Nov;32(7):589-99
pubmed: 18449919
Genome Res. 1998 Mar;8(3):186-94
pubmed: 9521922
Am J Hum Genet. 2012 Nov 2;91(5):839-48
pubmed: 23103226
Bioinformatics. 2011 Aug 1;27(15):2156-8
pubmed: 21653522
Am J Hum Genet. 2005 May;76(5):887-93
pubmed: 15789306
Stat Appl Genet Mol Biol. 2010;9:Article 13
pubmed: 20196748
G3 (Bethesda). 2019 Aug 8;9(8):2447-2461
pubmed: 31151998
Nature. 2018 Oct;562(7726):203-209
pubmed: 30305743
Genet Epidemiol. 2010 Sep;34(6):591-602
pubmed: 20718045
Theor Popul Biol. 2003 May;63(3):221-30
pubmed: 12689793
Stat Appl Genet Mol Biol. 2013 Aug;12(4):433-48
pubmed: 23934608
Science. 2002 Dec 20;298(5602):2381-5
pubmed: 12493913
Am J Hum Genet. 2007 Sep;81(3):559-75
pubmed: 17701901
Nat Genet. 2012 May 20;44(6):725-31
pubmed: 22610118
Nature. 2021 Feb;590(7845):290-299
pubmed: 33568819
Nature. 2015 Feb 12;518(7538):197-206
pubmed: 25673413
Am J Hum Genet. 2017 Jul 6;101(1):37-49
pubmed: 28602423

Auteurs

Alan M Kwong (AM)

Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.

Thomas W Blackwell (TW)

Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.

Jonathon LeFaive (J)

Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.

Mariza de Andrade (M)

Mayo Clinic, Rochester, MN 55905, USA.

John Barnard (J)

Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44106, USA.

Kathleen C Barnes (KC)

Department of Medicine, Anschultz Medical Campus, University of Colorado, Aurora, CO 80045, USA.

John Blangero (J)

Department of Human Genetics, South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX 78520, USA.

Eric Boerwinkle (E)

Department of Epidemiology, Human Genetics Center, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA.
Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA.

Esteban G Burchard (EG)

Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94143, USA.
Department of Medicine, University of California San Francisco, San Francisco, CA 94143, USA.

Brian E Cade (BE)

Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA 02115, USA.
Division of Sleep Medicine, Harvard Medical School, Boston, MA 02115, USA.

Daniel I Chasman (DI)

Division of Preventive Medicine, Brigham and Women's Hospital, Boston, MA 02215, USA.

Han Chen (H)

Department of Epidemiology, Human Genetics Center, Human Genetics and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA.
Center for Precision Health, School of Public Health and School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA.

Matthew P Conomos (MP)

Department of Biostatistics, University of Washington, Seattle, WA 98195, USA.

L Adrienne Cupples (LA)

Department of Biostatistics, Boston University School of Public Health, Boston, MA 02118, USA.
Framingham Heart Study, Framingham, MA 01702, USA.

Patrick T Ellinor (PT)

Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA.
Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA 02124, USA.

Celeste Eng (C)

Department of Medicine, University of California San Francisco, San Francisco, CA 94143, USA.

Yan Gao (Y)

Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS 39216 USA.

Xiuqing Guo (X)

Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute at Harbor-UCLA Medical Center, Torrance, CA 90502, USA.

Marguerite Ryan Irvin (MR)

Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, AL 35294, USA.

Tanika N Kelly (TN)

Department of Epidemiology, Tulane University, New Orleans, LA 70112, USA.

Wonji Kim (W)

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA.

Charles Kooperberg (C)

Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.

Steven A Lubitz (SA)

Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA 02114, USA.
Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, MA 02124, USA.

Angel C Y Mak (ACY)

Department of Medicine, University of California San Francisco, San Francisco, CA 94143, USA.

Ani W Manichaikul (AW)

Department of Public Health Sciences, Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA.

Rasika A Mathias (RA)

GeneSTAR Research Program and Division of Allergy and Clinical Immunology, Department of Medicine, Johns Hopkins University, Baltimore, MD 21205, USA.

May E Montasser (ME)

Division of Endocrinology, Diabetes and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA.

Courtney G Montgomery (CG)

Sarcoidosis Research Unit, Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA.

Solomon Musani (S)

Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS 39216, USA.

Nicholette D Palmer (ND)

Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC 27157, USA.

Gina M Peloso (GM)

Department of Biostatistics, Boston University School of Public Health, Boston, MA 02118, USA.

Dandi Qiao (D)

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA.

Alexander P Reiner (AP)

Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.

Dan M Roden (DM)

Departments of Medicine, Pharmacology, and Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, USA.

M Benjamin Shoemaker (MB)

Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA.

Jennifer A Smith (JA)

Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA.

Nicholas L Smith (NL)

Department of Epidemiology, University of Washington, Seattle, WA 98195, USA.
Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA 98101, USA.
Department of Veterans Affairs, Seattle Epidemiologic Research and Information Center, Office of Research and Development, Seattle, WA 98108, USA.

Jessica Lasky Su (JL)

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA.

Hemant K Tiwari (HK)

Department of Biostatistics, School of Public Health, University of Alabama at Birmingham, Birmingham, AL 35294, USA.

Daniel E Weeks (DE)

Departments of Human Genetics and Biostatistics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15261, USA.

Scott T Weiss (ST)

Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA.

Laura J Scott (LJ)

Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.

Albert V Smith (AV)

Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.

Gonçalo R Abecasis (GR)

Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.

Michael Boehnke (M)

Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.

Hyun Min Kang (HM)

Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH