An efficient genotyper and star-allele caller for pharmacogenomics.


Journal

Genome research
ISSN: 1549-5469
Titre abrégé: Genome Res
Pays: United States
ID NLM: 9518021

Informations de publication

Date de publication:
01 2023
Historique:
received: 11 08 2022
accepted: 12 12 2022
pubmed: 20 1 2023
medline: 4 2 2023
entrez: 19 1 2023
Statut: ppublish

Résumé

High-throughput sequencing provides sufficient means for determining genotypes of clinically important pharmacogenes that can be used to tailor medical decisions to individual patients. However, pharmacogene genotyping, also known as star-allele calling, is a challenging problem that requires accurate copy number calling, structural variation identification, variant calling, and phasing within each pharmacogene copy present in the sample. Here we introduce Aldy 4, a fast and efficient tool for genotyping pharmacogenes that uses combinatorial optimization for accurate star-allele calling across different sequencing technologies. Aldy 4 adds support for long reads and uses a novel phasing model and improved copy number and variant calling models. We compare Aldy 4 against the current state-of-the-art star-allele callers on a large and diverse set of samples and genes sequenced by various sequencing technologies, such as whole-genome and targeted Illumina sequencing, barcoded 10x Genomics, and Pacific Biosciences (PacBio) HiFi. We show that Aldy 4 is the most accurate star-allele caller with near-perfect accuracy in all evaluated contexts, and hope that Aldy remains an invaluable tool in the clinical toolbox even with the advent of long-read sequencing technologies.

Identifiants

pubmed: 36657977
pii: gr.277075.122
doi: 10.1101/gr.277075.122
pmc: PMC9977157
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't Research Support, N.I.H., Intramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

61-70

Informations de copyright

© 2023 Hari et al.; Published by Cold Spring Harbor Laboratory Press.

Références

Bioinformatics. 2015 Jun 15;31(12):i27-34
pubmed: 26072492
Bioinformatics. 2009 Jul 15;25(14):1754-60
pubmed: 19451168
Cell Syst. 2022 Oct 19;13(10):808-816.e5
pubmed: 36265467
Clin Pharmacol Ther. 2018 Mar;103(3):399-401
pubmed: 29134625
Sci Data. 2020 Nov 17;7(1):399
pubmed: 33203859
Am J Hum Genet. 2021 Oct 7;108(10):1880-1890
pubmed: 34478634
iScience. 2020 Mar 27;23(3):100883
pubmed: 32109676
J Mol Diagn. 2010 Nov;12(6):835-46
pubmed: 20889555
J Mol Diagn. 2022 Jun;24(6):576-585
pubmed: 35452844
J Mol Diagn. 2019 Nov;21(6):1034-1052
pubmed: 31401124
Clin Pharmacol Ther. 2014 Apr;95(4):376-82
pubmed: 24458010
Genet Med. 2019 Feb;21(2):361-372
pubmed: 29875422
Curr Protoc Bioinformatics. 2013;43:11.10.1-11.10.33
pubmed: 25431634
Nat Rev Genet. 2021 Sep;22(9):572-587
pubmed: 34050336
Nat Commun. 2018 Feb 26;9(1):828
pubmed: 29483503
Genome Res. 2017 May;27(5):801-812
pubmed: 27940952
Bioinformatics. 2014 Oct 15;30(20):2843-51
pubmed: 24974202
N Engl J Med. 2010 Jul 22;363(4):301-4
pubmed: 20551152
NPJ Genom Med. 2020 Aug 3;5:30
pubmed: 32789024
Clin Pharmacol Ther. 2007 Sep;82(3):244-8
pubmed: 17700589
Clin Transl Sci. 2021 Jan;14(1):204-213
pubmed: 32931151
Pharmacogenet Genomics. 2012 Sep;22(9):692-4
pubmed: 22868903
Pharmacogenet Genomics. 2009 Oct;19(10):813-22
pubmed: 19741566
Genome Res. 2010 Sep;20(9):1297-303
pubmed: 20644199
Pharmacogenomics J. 2021 Apr;21(2):251-261
pubmed: 33462347
Pharmacogenet Genomics. 2016 Apr;26(4):161-168
pubmed: 26736087
Mayo Clin Proc. 2017 Nov;92(11):1711-1722
pubmed: 29101939
Pharmacogenomics J. 2005;5(1):6-13
pubmed: 15492763
Nat Commun. 2020 Sep 16;11(1):4662
pubmed: 32938926
Res Comput Mol Biol. 2018 Apr;10812:280-282
pubmed: 29888346
Philos Trans R Soc Lond B Biol Sci. 2013 Jan 06;368(1612):20120428
pubmed: 23297351
Nat Genet. 2016 Jul;48(7):811-6
pubmed: 27270109
Nature. 2015 Oct 15;526(7573):343-50
pubmed: 26469045
Clin Pharmacol Ther. 2020 Jan;107(1):203-210
pubmed: 31306493
Int J Mol Sci. 2020 Mar 26;21(7):
pubmed: 32225115
Nat Biotechnol. 2011 Jan;29(1):24-6
pubmed: 21221095
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943
Bioinformatics. 2022 Jan 3;38(2):549-551
pubmed: 34431982
NPJ Genom Med. 2016 Jan 13;1:15007
pubmed: 29263805
Clin Pharmacol Ther. 2021 Sep;110(3):741-749
pubmed: 33492672
J Mol Diagn. 2016 Jan;18(1):109-23
pubmed: 26621101
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
J Mol Diagn. 2022 Apr;24(4):337-350
pubmed: 35134542
Pharmacogenomics J. 2014 Dec;14(6):564-72
pubmed: 24980783

Auteurs

Ananth Hari (A)

Department of Electrical and Computer Engineering, University of Maryland, College Park, Maryland 20742, USA.
Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.

Qinghui Zhou (Q)

Department of Computer Science, University of Victoria, Victoria, British Columbia V8P 5C2, Canada.

Nina Gonzaludo (N)

Pacific Biosciences, Menlo Park, California 94025, USA.

John Harting (J)

Pacific Biosciences, Menlo Park, California 94025, USA.

Stuart A Scott (SA)

Department of Pathology, Stanford University, Palo Alto, California 94304, USA.

Xiang Qin (X)

Baylor College of Medicine Human Genome Sequencing Center, Houston, Texas 77030, USA.

Steve Scherer (S)

Baylor College of Medicine Human Genome Sequencing Center, Houston, Texas 77030, USA.

S Cenk Sahinalp (SC)

Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.

Ibrahim Numanagić (I)

Department of Computer Science, University of Victoria, Victoria, British Columbia V8P 5C2, Canada.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH