Benchmarking immunoinformatic tools for the analysis of antibody repertoire sequences.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
01 03 2020
01 03 2020
Historique:
received:
21
06
2019
revised:
21
10
2019
accepted:
19
12
2019
pubmed:
25
12
2019
medline:
17
9
2020
entrez:
25
12
2019
Statut:
ppublish
Résumé
Antibody repertoires reveal insights into the biology of the adaptive immune system and empower diagnostics and therapeutics. There are currently multiple tools available for the annotation of antibody sequences. All downstream analyses such as choosing lead drug candidates depend on the correct annotation of these sequences; however, a thorough comparison of the performance of these tools has not been investigated. Here, we benchmark the performance of commonly used immunoinformatic tools, i.e. IMGT/HighV-QUEST, IgBLAST and MiXCR, in terms of reproducibility of annotation output, accuracy and speed using simulated and experimental high-throughput sequencing datasets.We analyzed changes in IMGT reference germline database in the last 10 years in order to assess the reproducibility of the annotation output. We found that only 73/183 (40%) V, D and J human genes were shared between the reference germline sets used by the tools. We found that the annotation results differed between tools. In terms of alignment accuracy, MiXCR had the highest average frequency of gene mishits, 0.02 mishit frequency and IgBLAST the lowest, 0.004 mishit frequency. Reproducibility in the output of complementarity determining three regions (CDR3 amino acids) ranged from 4.3% to 77.6% with preprocessed data. In addition, run time of the tools was assessed: MiXCR was the fastest tool for number of sequences processed per unit of time. These results indicate that immunoinformatic analyses greatly depend on the choice of bioinformatics tool. Our results support informed decision-making to immunoinformaticians based on repertoire composition and sequencing platforms. All tools utilized in the paper are free for academic use. Supplementary data are available at Bioinformatics online.
Identifiants
pubmed: 31873728
pii: 5686386
doi: 10.1093/bioinformatics/btz845
pmc: PMC7075533
doi:
Substances chimiques
Antibodies
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1731-1739Subventions
Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 215840/Z/19/Z
Pays : United Kingdom
Informations de copyright
© The Author(s) 2019. Published by Oxford University Press.
Références
Nat Rev Rheumatol. 2015 Mar;11(3):171-82
pubmed: 25536486
Immunogenetics. 2008 Nov;60(11):669-76
pubmed: 18712520
J Mol Biol. 1997 Nov 7;273(4):927-48
pubmed: 9367782
Immunology. 2018 Jan;153(1):31-41
pubmed: 28898398
Front Immunol. 2017 Nov 01;8:1418
pubmed: 29163494
Nat Rev Rheumatol. 2012 May 31;8(6):317-28
pubmed: 22647780
PLoS One. 2013 Sep 13;8(9):e75294
pubmed: 24058670
Front Immunol. 2018 Sep 28;9:2206
pubmed: 30323809
J Immunol. 2017 Oct 15;199(8):2985-2997
pubmed: 28924003
Front Immunol. 2018 Jul 26;9:1687
pubmed: 30093903
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W34-40
pubmed: 23671333
Nat Biotechnol. 2010 Sep;28(9):965-9
pubmed: 20802495
Proc Natl Acad Sci U S A. 2013 Oct 22;110(43):E4088-97
pubmed: 24106303
Front Immunol. 2018 Mar 08;9:395
pubmed: 29568296
Trends Immunol. 2015 Nov;36(11):738-749
pubmed: 26508293
Methods Mol Biol. 2014;1184:23-55
pubmed: 25048118
Nature. 1983 Apr 14;302(5909):575-81
pubmed: 6300689
Front Immunol. 2018 Oct 16;9:2278
pubmed: 30386328
Bioinformatics. 2010 Dec 15;26(24):3129-30
pubmed: 21036814
Nat Med. 2018 Jul;24(7):1005-1014
pubmed: 29942088
Genome Med. 2015 Nov 20;7:121
pubmed: 26589402
Front Immunol. 2019 Mar 18;10:435
pubmed: 30936866
Bioinformatics. 2015 Oct 1;31(19):3213-5
pubmed: 26007226
Mol Immunol. 2008 Aug;45(14):3832-9
pubmed: 18614234
Nat Methods. 2015 May;12(5):380-1
pubmed: 25924071
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W503-8
pubmed: 18503082
Methods Mol Biol. 2012;882:569-604
pubmed: 22665256
J Immunol. 2017 Feb 15;198(4):1460-1473
pubmed: 28087666
Curr Opin Immunol. 2016 Aug;41:62-67
pubmed: 27343848
Front Immunol. 2014 Feb 05;5:22
pubmed: 24600447
Nat Immunol. 2017 Nov 16;18(12):1274-1278
pubmed: 29144493
Nat Biotechnol. 2013 Feb;31(2):166-9
pubmed: 23334449
Int Rev Immunol. 1992;8(2-3):123-33
pubmed: 1318346
Immunity. 2000 Jul;13(1):37-45
pubmed: 10933393
Bioinformatics. 2019 May 1;35(9):1613-1614
pubmed: 30247621
Genome Med. 2015 May 28;7(1):49
pubmed: 26140055
Nat Immunol. 2014 Feb;15(2):118-27
pubmed: 24448569
Immunol Today. 1997 Nov;18(11):509
pubmed: 9386342
Front Immunol. 2018 Feb 21;9:224
pubmed: 29515569
Philos Trans R Soc Lond B Biol Sci. 2015 Sep 5;370(1676):
pubmed: 26194753