Frugal alignment-free identification of FLT3-internal tandem duplications with FiLT3r.


Journal

BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194

Informations de publication

Date de publication:
28 Oct 2022
Historique:
received: 02 06 2022
accepted: 07 10 2022
entrez: 28 10 2022
pubmed: 29 10 2022
medline: 2 11 2022
Statut: epublish

Résumé

Internal tandem duplications in the FLT3 gene, termed FLT3-ITDs, are useful molecular markers in acute myeloid leukemia (AML) for patient risk stratification and follow-up. FLT3-ITDs are increasingly screened through high-throughput sequencing (HTS) raising the need for robust and efficient algorithms. We developed a new algorithm, which performs no alignment and uses little resources, to identify and quantify FLT3-ITDs in HTS data. Our algorithm (FiLT3r) focuses on the k-mers from reads covering FLT3 exons 14 and 15. We show that those k-mers bring enough information to accurately detect, determine the length and quantify FLT3-ITD duplications. We compare the performances of FiLT3r to state-of-the-art alternatives and to fragment analysis, the gold standard method, on a cohort of 185 AML patients sequenced with capture-based HTS. On this dataset FiLT3r is more precise (no false positive nor false negative) than the other software evaluated. We also assess the software on public RNA-Seq data, which confirms the previous results and shows that FiLT3r requires little resources compared to other software. FiLT3r is a free software available at https://gitlab.univ-lille.fr/filt3r/filt3r . The repository also contains a Snakefile to reproduce our experiments. We show that FiLT3r detects FLT3-ITDs better than other software while using less memory and time.

Sections du résumé

BACKGROUND BACKGROUND
Internal tandem duplications in the FLT3 gene, termed FLT3-ITDs, are useful molecular markers in acute myeloid leukemia (AML) for patient risk stratification and follow-up. FLT3-ITDs are increasingly screened through high-throughput sequencing (HTS) raising the need for robust and efficient algorithms. We developed a new algorithm, which performs no alignment and uses little resources, to identify and quantify FLT3-ITDs in HTS data.
RESULTS RESULTS
Our algorithm (FiLT3r) focuses on the k-mers from reads covering FLT3 exons 14 and 15. We show that those k-mers bring enough information to accurately detect, determine the length and quantify FLT3-ITD duplications. We compare the performances of FiLT3r to state-of-the-art alternatives and to fragment analysis, the gold standard method, on a cohort of 185 AML patients sequenced with capture-based HTS. On this dataset FiLT3r is more precise (no false positive nor false negative) than the other software evaluated. We also assess the software on public RNA-Seq data, which confirms the previous results and shows that FiLT3r requires little resources compared to other software.
CONCLUSION CONCLUSIONS
FiLT3r is a free software available at https://gitlab.univ-lille.fr/filt3r/filt3r . The repository also contains a Snakefile to reproduce our experiments. We show that FiLT3r detects FLT3-ITDs better than other software while using less memory and time.

Identifiants

pubmed: 36307762
doi: 10.1186/s12859-022-04983-6
pii: 10.1186/s12859-022-04983-6
pmc: PMC9617311
doi:

Substances chimiques

fms-Like Tyrosine Kinase 3 EC 2.7.10.1
FLT3 protein, human EC 2.7.10.1

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

448

Subventions

Organisme : Institut National Du Cancer
ID : PHRC 2007/1911 and PRTK TRANSLA10-060

Commentaires et corrections

Type : ErratumIn

Informations de copyright

© 2022. The Author(s).

Références

Genome Biol. 2017 Oct 3;18(1):186
pubmed: 28974235
Brief Bioinform. 2016 Jan;17(1):154-79
pubmed: 26026159
Blood. 2001 Sep 15;98(6):1752-9
pubmed: 11535508
Genome Biol. 2013 Mar 28;14(3):R30
pubmed: 23537109
Hematology Am Soc Hematol Educ Program. 2006;:178-84
pubmed: 17124058
Lancet Oncol. 2020 Sep;21(9):1201-1212
pubmed: 32791048
Blood Cancer J. 2015 Aug 14;5:e336
pubmed: 26832846
Leukemia. 2022 Jan;36(1):90-99
pubmed: 34316017
Bioinformatics. 2015 Jan 1;31(1):116-8
pubmed: 25192740
Bioinformatics. 2011 Mar 15;27(6):764-70
pubmed: 21217122
Bioinformatics. 2014 Oct 15;30(20):2959-61
pubmed: 24990603
Nature. 2019 May;569(7757):503-508
pubmed: 31068700
Leukemia. 2019 Feb;33(2):299-312
pubmed: 30651634
J Mol Diagn. 2020 Sep;22(9):1162-1178
pubmed: 32603763
Gigascience. 2020 Aug 1;9(8):
pubmed: 32852038
Leukemia. 2002 Sep;16(9):1699-704
pubmed: 12200684
Blood. 2014 Nov 27;124(23):3441-9
pubmed: 25270908
Curr Drug Targets. 2010 Jul;11(7):781-9
pubmed: 20370649
J Mol Diagn. 2003 May;5(2):96-102
pubmed: 12707374
Bioinformatics. 2012 Feb 15;28(4):593-4
pubmed: 22199392
Gene. 2020 Feb 5;726:144195
pubmed: 31669649
Leuk Res Rep. 2020 Apr 19;13:100198
pubmed: 32368486
Sci Rep. 2019 Aug 21;9(1):12209
pubmed: 31434952
Brief Bioinform. 2021 Sep 2;22(5):
pubmed: 33851200
Bioinformatics. 2009 Nov 1;25(21):2865-71
pubmed: 19561018
Blood. 2017 Jan 26;129(4):424-447
pubmed: 27895058
Curr Hematol Malig Rep. 2020 Feb;15(1):1-8
pubmed: 32034660
Oncotarget. 2018 Jul 10;9(53):30128-30145
pubmed: 30046393
Diagn Pathol. 2016 Jan 22;11:11
pubmed: 26796102
Nat Commun. 2019 May 2;10(1):2037
pubmed: 31048690
Life Sci Alliance. 2019 Aug 19;2(4):
pubmed: 31427380
Blood. 2002 Jun 15;99(12):4326-35
pubmed: 12036858
Leukemia. 2019 Oct;33(10):2535-2539
pubmed: 31089248
N Engl J Med. 2017 Aug 3;377(5):454-464
pubmed: 28644114

Auteurs

Augustin Boudry (A)

Hematology Laboratory, Centre de Biologie Pathologie Génétique, CHU Lille, Lille, France.
U1277 Cancer Heterogeneity Plasticity and Resistance to Therapies (CANTHER), University of Lille, INSERM, Lille, France.

Sasha Darmon (S)

Univ. Lille, CNRS, Centrale Lille, UMR 9189 CRIStAL, F-59000, Lille, France.
ENS Lyon, Lyon, France.

Nicolas Duployez (N)

Hematology Laboratory, Centre de Biologie Pathologie Génétique, CHU Lille, Lille, France.
U1277 Cancer Heterogeneity Plasticity and Resistance to Therapies (CANTHER), University of Lille, INSERM, Lille, France.

Martin Figeac (M)

Univ. Lille, CNRS, Inserm, CHU Lille, Institut Pasteur de Lille, US 41 - UMS 2014 - PLBS, F-59000, Lille, France.

Sandrine Geffroy (S)

Hematology Laboratory, Centre de Biologie Pathologie Génétique, CHU Lille, Lille, France.

Maxime Bucci (M)

Hematology Laboratory, Centre de Biologie Pathologie Génétique, CHU Lille, Lille, France.

Karine Celli-Lebras (K)

Department of Hematology, Saint Louis Hospital, Assistance Publique-Hôpitaux de Paris (AP-HP), Paris, France.

Matthieu Duchmann (M)

INSERM/CNRS UMR 944/7212, Saint-Louis Research Institute, Paris Diderot University, Paris, France.

Romane Joudinaud (R)

Hematology Laboratory, Centre de Biologie Pathologie Génétique, CHU Lille, Lille, France.
U1277 Cancer Heterogeneity Plasticity and Resistance to Therapies (CANTHER), University of Lille, INSERM, Lille, France.

Laurène Fenwarth (L)

Hematology Laboratory, Centre de Biologie Pathologie Génétique, CHU Lille, Lille, France.
U1277 Cancer Heterogeneity Plasticity and Resistance to Therapies (CANTHER), University of Lille, INSERM, Lille, France.

Olivier Nibourel (O)

Hematology Laboratory, Centre de Biologie Pathologie Génétique, CHU Lille, Lille, France.

Laure Goursaud (L)

Hematology Department, CHU LILLE, Lille, France.

Raphael Itzykson (R)

Department of Hematology, Saint Louis Hospital, Assistance Publique-Hôpitaux de Paris (AP-HP), Paris, France.
INSERM/CNRS UMR 944/7212, Saint-Louis Research Institute, Paris Diderot University, Paris, France.

Hervé Dombret (H)

Department of Hematology, Saint Louis Hospital, Assistance Publique-Hôpitaux de Paris (AP-HP), Paris, France.

Mathilde Hunault (M)

Univ Angers, Université de Nantes, CHU Angers, Inserm, CNRS, CRCI2NA, SFR ICAT, F-49000, Angers, France.

Claude Preudhomme (C)

Hematology Laboratory, Centre de Biologie Pathologie Génétique, CHU Lille, Lille, France.
U1277 Cancer Heterogeneity Plasticity and Resistance to Therapies (CANTHER), University of Lille, INSERM, Lille, France.

Mikaël Salson (M)

Univ. Lille, CNRS, Centrale Lille, UMR 9189 CRIStAL, F-59000, Lille, France. mikael.salson@univ-lille.fr.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH