HemoMIPs-Automated analysis and result reporting pipeline for targeted sequencing data.


Journal

PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922

Informations de publication

Date de publication:
06 2020
Historique:
received: 22 01 2020
accepted: 16 05 2020
revised: 16 06 2020
pubmed: 5 6 2020
medline: 22 9 2020
entrez: 5 6 2020
Statut: epublish

Résumé

Targeted sequencing of genomic regions is a cost- and time-efficient approach for screening patient cohorts. We present a fast and efficient workflow to analyze highly imbalanced, targeted next-generation sequencing data generated using molecular inversion probe (MIP) capture. Our Snakemake pipeline performs sample demultiplexing, overlap paired-end merging, alignment, MIP-arm trimming, variant calling, coverage analysis and report generation. Further, we support the analysis of probes specifically designed to capture certain structural variants and can assign sex using Y-chromosome-unique probes. In a user-friendly HTML report, we summarize all these results including covered, incomplete or missing regions, called variants and their predicted effects. We developed and tested our pipeline using the hemophilia A & B MIP design from the "My Life, Our Future" initiative. HemoMIPs is available as an open-source tool on GitHub at: https://github.com/kircherlab/hemoMIPs.

Identifiants

pubmed: 32497118
doi: 10.1371/journal.pcbi.1007956
pii: PCOMPBIOL-D-20-00116
pmc: PMC7297380
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e1007956

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

Bioinformatics. 2014 Sep 15;30(18):2670-2
pubmed: 24867941
Bioinformatics. 2010 Mar 1;26(5):589-95
pubmed: 20080505
Science. 2012 Dec 21;338(6114):1619-22
pubmed: 23160955
Genome Res. 2013 May;23(5):843-54
pubmed: 23382536
Genome Biol. 2016 Jun 06;17(1):122
pubmed: 27268795
Genome Res. 2010 Sep;20(9):1297-303
pubmed: 20644199
Nat Methods. 2009 May;6(5):315-6
pubmed: 19349981
J Infect Dis. 2018 Aug 14;218(6):946-955
pubmed: 29718283
Bioinformatics. 2012 Oct 1;28(19):2520-2
pubmed: 22908215
J Thromb Haemost. 2008 May;6(5):830-6
pubmed: 18284600
Blood Adv. 2017 May 18;1(13):824-834
pubmed: 29296726
Clin Chem. 2017 Feb;63(2):503-512
pubmed: 27974384
Methods Mol Biol. 2017;1492:95-106
pubmed: 27822858
Methods Mol Biol. 2012;840:197-228
pubmed: 22237537

Auteurs

Philip Kleinert (P)

Berlin Institute of Health (BIH), Berlin, Germany.
Charité -Universitätsmedizin Berlin, Berlin, Germany.

Beth Martin (B)

University of Washington, Department of Genome Sciences, Seattle, Washington, United States of America.

Martin Kircher (M)

Berlin Institute of Health (BIH), Berlin, Germany.
Charité -Universitätsmedizin Berlin, Berlin, Germany.
University of Washington, Department of Genome Sciences, Seattle, Washington, United States of America.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH