Benchmarking immunoinformatic tools for the analysis of antibody repertoire sequences.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 03 2020
Historique:
received: 21 06 2019
revised: 21 10 2019
accepted: 19 12 2019
pubmed: 25 12 2019
medline: 17 9 2020
entrez: 25 12 2019
Statut: ppublish

Résumé

Antibody repertoires reveal insights into the biology of the adaptive immune system and empower diagnostics and therapeutics. There are currently multiple tools available for the annotation of antibody sequences. All downstream analyses such as choosing lead drug candidates depend on the correct annotation of these sequences; however, a thorough comparison of the performance of these tools has not been investigated. Here, we benchmark the performance of commonly used immunoinformatic tools, i.e. IMGT/HighV-QUEST, IgBLAST and MiXCR, in terms of reproducibility of annotation output, accuracy and speed using simulated and experimental high-throughput sequencing datasets.We analyzed changes in IMGT reference germline database in the last 10 years in order to assess the reproducibility of the annotation output. We found that only 73/183 (40%) V, D and J human genes were shared between the reference germline sets used by the tools. We found that the annotation results differed between tools. In terms of alignment accuracy, MiXCR had the highest average frequency of gene mishits, 0.02 mishit frequency and IgBLAST the lowest, 0.004 mishit frequency. Reproducibility in the output of complementarity determining three regions (CDR3 amino acids) ranged from 4.3% to 77.6% with preprocessed data. In addition, run time of the tools was assessed: MiXCR was the fastest tool for number of sequences processed per unit of time. These results indicate that immunoinformatic analyses greatly depend on the choice of bioinformatics tool. Our results support informed decision-making to immunoinformaticians based on repertoire composition and sequencing platforms. All tools utilized in the paper are free for academic use. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 31873728
pii: 5686386
doi: 10.1093/bioinformatics/btz845
pmc: PMC7075533
doi:

Substances chimiques

Antibodies 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1731-1739

Subventions

Organisme : Wellcome Trust
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 215840/Z/19/Z
Pays : United Kingdom

Informations de copyright

© The Author(s) 2019. Published by Oxford University Press.

Références

Nat Rev Rheumatol. 2015 Mar;11(3):171-82
pubmed: 25536486
Immunogenetics. 2008 Nov;60(11):669-76
pubmed: 18712520
J Mol Biol. 1997 Nov 7;273(4):927-48
pubmed: 9367782
Immunology. 2018 Jan;153(1):31-41
pubmed: 28898398
Front Immunol. 2017 Nov 01;8:1418
pubmed: 29163494
Nat Rev Rheumatol. 2012 May 31;8(6):317-28
pubmed: 22647780
PLoS One. 2013 Sep 13;8(9):e75294
pubmed: 24058670
Front Immunol. 2018 Sep 28;9:2206
pubmed: 30323809
J Immunol. 2017 Oct 15;199(8):2985-2997
pubmed: 28924003
Front Immunol. 2018 Jul 26;9:1687
pubmed: 30093903
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W34-40
pubmed: 23671333
Nat Biotechnol. 2010 Sep;28(9):965-9
pubmed: 20802495
Proc Natl Acad Sci U S A. 2013 Oct 22;110(43):E4088-97
pubmed: 24106303
Front Immunol. 2018 Mar 08;9:395
pubmed: 29568296
Trends Immunol. 2015 Nov;36(11):738-749
pubmed: 26508293
Methods Mol Biol. 2014;1184:23-55
pubmed: 25048118
Nature. 1983 Apr 14;302(5909):575-81
pubmed: 6300689
Front Immunol. 2018 Oct 16;9:2278
pubmed: 30386328
Bioinformatics. 2010 Dec 15;26(24):3129-30
pubmed: 21036814
Nat Med. 2018 Jul;24(7):1005-1014
pubmed: 29942088
Genome Med. 2015 Nov 20;7:121
pubmed: 26589402
Front Immunol. 2019 Mar 18;10:435
pubmed: 30936866
Bioinformatics. 2015 Oct 1;31(19):3213-5
pubmed: 26007226
Mol Immunol. 2008 Aug;45(14):3832-9
pubmed: 18614234
Nat Methods. 2015 May;12(5):380-1
pubmed: 25924071
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W503-8
pubmed: 18503082
Methods Mol Biol. 2012;882:569-604
pubmed: 22665256
J Immunol. 2017 Feb 15;198(4):1460-1473
pubmed: 28087666
Curr Opin Immunol. 2016 Aug;41:62-67
pubmed: 27343848
Front Immunol. 2014 Feb 05;5:22
pubmed: 24600447
Nat Immunol. 2017 Nov 16;18(12):1274-1278
pubmed: 29144493
Nat Biotechnol. 2013 Feb;31(2):166-9
pubmed: 23334449
Int Rev Immunol. 1992;8(2-3):123-33
pubmed: 1318346
Immunity. 2000 Jul;13(1):37-45
pubmed: 10933393
Bioinformatics. 2019 May 1;35(9):1613-1614
pubmed: 30247621
Genome Med. 2015 May 28;7(1):49
pubmed: 26140055
Nat Immunol. 2014 Feb;15(2):118-27
pubmed: 24448569
Immunol Today. 1997 Nov;18(11):509
pubmed: 9386342
Front Immunol. 2018 Feb 21;9:224
pubmed: 29515569
Philos Trans R Soc Lond B Biol Sci. 2015 Sep 5;370(1676):
pubmed: 26194753

Auteurs

Erand Smakaj (E)

Institute of Biomedical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz 4132, Switzerland.

Lmar Babrak (L)

Institute of Biomedical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz 4132, Switzerland.

Mats Ohlin (M)

Department of Immunotechnology, Lund University, Lund 223, Sweden.

Mikhail Shugay (M)

Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow 121205, Russia.

Bryan Briney (B)

Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA.

Deniz Tosoni (D)

Institute of Biomedical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz 4132, Switzerland.

Christopher Galli (C)

Institute of Biomedical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz 4132, Switzerland.

Vendi Grobelsek (V)

Department of Biosystems Science and Engineering, ETH Zurich, Basel 4058, Switzerland.

Igor D'Angelo (I)

One Amgen Center Drive, Amgen, Inc., Therapeutic Discovery/Molecular Engineering, Thousand Oaks, CA 91320, USA.

Branden Olson (B)

Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.
Department of Statistics, University of Washington, Seattle, WA 98195, USA.

Sai Reddy (S)

Department of Biosystems Science and Engineering, ETH Zurich, Basel 4058, Switzerland.

Victor Greiff (V)

Department of Immunology, University of Oslo, Oslo 0372, Norway.

Johannes Trück (J)

Paediatric Immunology, Children's Research Center, University Children's Hospital, University of Zurich, Zurich 8032, Switzerland.

Susanna Marquez (S)

Department of Pathology, Yale School of Medicine, New Haven, CT 06511, USA.

William Lees (W)

Department of Biological Sciences and Institute of Structural and Molecular Biology, Birkbeck College, University of London, London WC1E 7HX, UK.

Enkelejda Miho (E)

Institute of Biomedical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz 4132, Switzerland.
aiNET GmbH, Switzerland Innovation Park Basel Area AG, Basel 4057, Switzerland.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH