ArCH: Improving the performance of clonal hematopoiesis variant calling and interpretation.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
14 Mar 2024
Historique:
received: 03 07 2023
revised: 17 01 2024
accepted: 13 03 2024
medline: 15 3 2024
pubmed: 15 3 2024
entrez: 14 3 2024
Statut: aheadofprint

Résumé

The acquisition of somatic mutations in hematopoietic stem and progenitor stem cells (HSPCs) with resultant clonal expansion, termed clonal hematopoiesis (CH), is associated with increased risk of hematologic malignancies and other adverse outcomes. CH is generally present at low allelic fractions, but clonal expansion and acquisition of additional mutations leads to hematologic cancers in a small proportion of individuals. With high depth and high sensitivity sequencing, CH can be detected in most adults and its clonal trajectory mapped over time. However, accurate CH variant calling is challenging due to the difficulty in distinguishing low frequency CH mutations from sequencing artifacts. The lack of well-validated bioinformatic pipelines for CH calling may contribute to lack of reproducibility in studies of CH. ArCH, an Artifact filtering Clonal Hematopoiesis variant calling pipeline for detecting single nucleotide variants (SNVs) and short insertions/deletions (INDELs) by combining the output of four variant calling tools and filtering based on variant characteristics and sequencing error rate estimation. ArCH is an end-to-end cloud-based pipeline optimized to accept a variety of inputs with customizable parameters adaptable to multiple sequencing technologies, research questions, and datasets. Using deep targeted sequencing data generated from six AML patient tumor: normal dilutions, 31 blood samples with orthogonal validation, and 21 blood samples with technical replicates, we show that ArCH improves the sensitivity and positive predictive value of CH variant detection at low allele frequencies compared to standard application of commonly used variant calling approaches. The code for this workflow is available at: https://github.com/kbolton-lab/ArCH. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 38485690
pii: 7629130
doi: 10.1093/bioinformatics/btae121
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press.

Auteurs

Irenaeus C C Chan (ICC)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Alex Panchot (A)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Evelyn Schmidt (E)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Samantha McNulty (S)

Invitae Corporation, San Francisco, California.

Brian J Wiley (BJ)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Jie Liu (J)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Kimberly Turner (K)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Lea Moukarzel (L)

Memorial Sloan Kettering Cancer Center, New York City, New York.

Wendy S W Wong (WSW)

Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland.

Duc Tran (D)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

J Scott Beeler (JS)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Armel Landry Batchi-Bouyou (AL)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Mitchell J Machiela (MJ)

Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland.

Danielle M Karyadi (DM)

Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland.

Benjamin J Krajacich (BJ)

Element BioSciences, San Diego, California.

Junhua Zhao (J)

Element BioSciences, San Diego, California.

Semyon Kruglyak (S)

Element BioSciences, San Diego, California.

Bryan Lajoie (B)

Element BioSciences, San Diego, California.

Shawn Levy (S)

Element BioSciences, San Diego, California.

Minal Patel (M)

Memorial Sloan Kettering Cancer Center, New York City, New York.

Philip W Kantoff (PW)

Memorial Sloan Kettering Cancer Center, New York City, New York.

Christopher E Mason (CE)

Weill Cornell Medical College, New York City, New York.

Daniel C Link (DC)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Todd E Druley (TE)

Mission Bio, San Francisco, California.

Konrad H Stopsack (KH)

Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, Massachusetts.
Clinical and Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts.

Kelly L Bolton (KL)

Washington University in Saint Louis-School of Medicine, Saint Louis, Missouri.

Classifications MeSH