Benchmarking antibody clustering methods using sequence, structural, and machine learning similarity measures for antibody discovery applications.

antibodies biologics and biosimilars clustering drug discovery language models (LMs) machine learning

Journal

Frontiers in molecular biosciences
ISSN: 2296-889X
Titre abrégé: Front Mol Biosci
Pays: Switzerland
ID NLM: 101653173

Informations de publication

Date de publication:
2024
Historique:
received: 08 12 2023
accepted: 09 02 2024
medline: 12 4 2024
pubmed: 12 4 2024
entrez: 12 4 2024
Statut: epublish

Résumé

Antibodies are proteins produced by our immune system that have been harnessed as biotherapeutics. The discovery of antibody-based therapeutics relies on analyzing large volumes of diverse sequences coming from phage display or animal immunizations. Identification of suitable therapeutic candidates is achieved by grouping the sequences by their similarity and subsequent selection of a diverse set of antibodies for further tests. Such groupings are typically created using sequence-similarity measures alone. Maximizing diversity in selected candidates is crucial to reducing the number of tests of molecules with near-identical properties. With the advances in structural modeling and machine learning, antibodies can now be grouped across other diversity dimensions, such as predicted paratopes or three-dimensional structures. Here we benchmarked antibody grouping methods using clonotype, sequence, paratope prediction, structure prediction, and embedding information. The results were benchmarked on two tasks: binder detection and epitope mapping. We demonstrate that on binder detection no method appears to outperform the others, while on epitope mapping, clonotype, paratope, and embedding clusterings are top performers. Most importantly, all the methods propose orthogonal groupings, offering more diverse pools of candidates when using multiple methods than any single method alone. To facilitate exploring the diversity of antibodies using different methods, we have created an online tool-CLAP-available at (clap.naturalantibody.com) that allows users to group, contrast, and visualize antibodies using the different grouping methods.

Identifiants

pubmed: 38606289
doi: 10.3389/fmolb.2024.1352508
pii: 1352508
pmc: PMC11008471
doi:

Types de publication

Journal Article

Langues

eng

Pagination

1352508

Informations de copyright

Copyright © 2024 Chomicz, Kończak, Wróbel, Satława, Dudzic, Janusz, Tarkowski, Deszyński, Gawłowski, Kostyn, Orłowski, Klaus, Schulte, Martin, Comeau and Krawczyk.

Déclaration de conflit d'intérêts

Authors DC, JK, SW, TS, PD, BJ, MT, PD, TG, KK were employed by company NaturalAntibody. Authors AK, MO, TK were employed by company Pure Biologics. Author LS was employed by Boehringer Ingelheim Pharma GmbH & Co. KG. Authors KM and SC were employed by Boehringer Ingelheim.

Auteurs

Dawid Chomicz (D)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Jarosław Kończak (J)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Sonia Wróbel (S)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Tadeusz Satława (T)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Paweł Dudzic (P)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Bartosz Janusz (B)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Mateusz Tarkowski (M)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Piotr Deszyński (P)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Tomasz Gawłowski (T)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Anna Kostyn (A)

Pure Biologics, Wrocław, Poland.

Marek Orłowski (M)

Pure Biologics, Wrocław, Poland.
Department of Biochemistry, Molecular Biology and Biotechnology, Faculty of Chemistry, Wrocław University of Science and Technology, Wrocław, Poland.

Tomasz Klaus (T)

Pure Biologics, Wrocław, Poland.

Lukas Schulte (L)

Global Computational Biology & Digital Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach, Germany.

Kyle Martin (K)

Biotherapeutics Discovery, Boehringer Ingelheim, Biberach, Germany.

Stephen R Comeau (SR)

Biotherapeutics Discovery, Boehringer Ingelheim, Biberach, Germany.

Konrad Krawczyk (K)

NaturalAntibody, Szczecin, West Pomeranian, Poland.

Classifications MeSH