TEMPRO: nanobody melting temperature estimation model using protein embeddings.

Antibodies Machine learning Nanobodies Neural networks Proteins Single-domain antibodies Thermostability

Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
17 Aug 2024
Historique:
received: 21 06 2024
accepted: 13 08 2024
medline: 18 8 2024
pubmed: 18 8 2024
entrez: 17 8 2024
Statut: epublish

Résumé

Single-domain antibodies (sdAbs) or nanobodies have received widespread attention due to their small size (~ 15 kDa) and diverse applications in bio-derived therapeutics. As many modern biotechnology breakthroughs are applied to antibody engineering and design, nanobody thermostability or melting temperature (T

Identifiants

pubmed: 39154093
doi: 10.1038/s41598-024-70101-6
pii: 10.1038/s41598-024-70101-6
doi:

Substances chimiques

Single-Domain Antibodies 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

19074

Informations de copyright

© 2024. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.

Références

Henry, K. A. & MacKenzie, C. R. Antigen recognition by single-domain antibodies: Structural latitudes and constraints. MAbs 10(6), 815–826 (2018).
pubmed: 29916758 pmcid: 6260137 doi: 10.1080/19420862.2018.1489633
Wesolowski, J. et al. Single domain antibodies: Promising experimental and therapeutic tools in infection and immunity. Med. Microbiol. Immunol. 198, 157–174 (2009).
pubmed: 19529959 pmcid: 2714450 doi: 10.1007/s00430-009-0116-7
Ventola, C. L. The antibiotic resistance crisis. Pharm. Therap. 40(4), 277–283 (2015).
Gould, I. M. & Bal, A. M. New antibiotic agents in the pipeline and how they can help overcome microbial resistance. Virulence 4(2), 185–191 (2013).
pubmed: 23302792 pmcid: 3654619 doi: 10.4161/viru.22507
McConnell, A. D. et al. A general approach to antibody thermostabilization. MAbs 6(5), 1274–1282 (2014).
pubmed: 25517312 pmcid: 4623350 doi: 10.4161/mabs.29680
Ward, E. S. et al. Binding activities of a repertoire of single immunoglobulin variable domains secreted from Escherichia coli. Nature 341(6242), 544–546 (1989).
pubmed: 2677748 doi: 10.1038/341544a0
Hamers-Casterman, C. et al. Naturally occurring antibodies devoid of light chains. Nature 363(6428), 446–448 (1993).
pubmed: 8502296 doi: 10.1038/363446a0
Ovchinnikov, V. et al. Role of framework mutations and antibody flexibility in the evolution of broadly neutralizing antibodies. Elife 7, 1 (2018).
doi: 10.7554/eLife.33038
Kiguchi, Y. et al. The VH framework region 1 as a target of efficient mutagenesis for generating a variety of affinity-matured scFv mutants. Sci. Rep. 11(1), 8201 (2021).
pubmed: 33859250 pmcid: 8050046 doi: 10.1038/s41598-021-87501-7
Nguyen, V. K. et al. Camel heavy-chain antibodies: Diverse germline V(H)H and specific mechanisms enlarge the antigen-binding repertoire. EMBO J. 19(5), 921–930 (2000).
pubmed: 10698934 pmcid: 305632 doi: 10.1093/emboj/19.5.921
Muyldermans, S. et al. Sequence and structure of VH domain from naturally occurring camel heavy chain immunoglobulins lacking light chains. Protein Eng. 7(9), 1129–1135 (1994).
pubmed: 7831284 doi: 10.1093/protein/7.9.1129
Ding, L. et al. Structural insights into the mechanism of single domain VHH antibody binding to cortisol. FEBS Lett. 593(11), 1248–1256 (2019).
pubmed: 31049949 doi: 10.1002/1873-3468.13398
Rudolph, M. J. et al. Contribution of an unusual CDR2 element of a single domain antibody in ricin toxin binding affinity and neutralizing activity. Protein Eng. Des. Select. 31(7–8), 277–287 (2018).
doi: 10.1093/protein/gzy022
Bever, C. S. et al. VHH antibodies: Emerging reagents for the analysis of environmental chemicals. Anal. Bioanal. Chem. 408(22), 5985–6002 (2016).
pubmed: 27209591 pmcid: 4983233 doi: 10.1007/s00216-016-9585-x
Polonelli, L. et al. Antibody complementarity-determining regions (CDRs) can display differential antimicrobial, antiviral and antitumor activities. PLoS ONE 3(6), e2371 (2008).
pubmed: 18545659 pmcid: 2396520 doi: 10.1371/journal.pone.0002371
Liu, J. L. et al. Thermal stability and refolding capability of shark derived single domain antibodies. Mol. Immunol. 59(2), 194–199 (2014).
pubmed: 24667069 doi: 10.1016/j.molimm.2014.02.014
Kunz, P. et al. The structural basis of nanobody unfolding reversibility and thermoresistance. Sci. Rep. 8(1), 7934 (2018).
pubmed: 29784954 pmcid: 5962586 doi: 10.1038/s41598-018-26338-z
Bekker, G. J., Ma, B. & Kamiya, N. Thermal stability of single-domain antibodies estimated by molecular dynamics simulations. Protein Sci. 28(2), 429–438 (2019).
pubmed: 30394618 doi: 10.1002/pro.3546
Jung, F. et al. DeepSTABp: A deep learning approach for the prediction of thermal protein stability. Int. J. Mol. Sci. 24(8), 7444 (2023).
pubmed: 37108605 pmcid: 10138888 doi: 10.3390/ijms24087444
Li, M. et al. DeepTM: A deep learning algorithm for prediction of melting temperature of thermophilic proteins directly from sequences. Comput. Struct. Biotechnol. J. 21, 5544–5560 (2023).
pubmed: 38034401 pmcid: 10681957 doi: 10.1016/j.csbj.2023.11.006
Yang, Y. et al. ProTstab2 for prediction of protein thermal stabilities. Int. J. Mol. Sci. 23, 18 (2022).
Ku, T. et al. Predicting melting temperature directly from protein sequences. Comput. Biol. Chem. 33(6), 445–450 (2009).
pubmed: 19896904 doi: 10.1016/j.compbiolchem.2009.10.002
Haselbeck, F. et al. Superior protein thermophilicity prediction with protein language model embeddings. NAR Genom. Bioinform. 5(4), 087 (2023).
Outeiral, C. & Deane, C. M. Codon language embeddings provide strong signals for use in protein engineering. Nat. Mach. Intell. 6(2), 170–179 (2024).
doi: 10.1038/s42256-024-00791-0
Valdés-Tresanco, M. S. et al. NbThermo: A new thermostability database for nanobodies. Database 2023, 21 (2023).
doi: 10.1093/database/baad021
Kunz, P. et al. Exploiting sequence and stability information for directing nanobody stability engineering. Biochim. Biophys. Acta Gen. Subj. 1861(9), 2196–2205 (2017).
pubmed: 28642127 doi: 10.1016/j.bbagen.2017.06.014
Osorio, D., Rondón-Villarreal, P. & Torres, R. Peptides: A package for data mining of antimicrobial peptides. R J. 7(1), 4–14 (2015).
doi: 10.32614/RJ-2015-001
Ikai, A. Thermostability and aliphatic index of globular proteins. J. Biochem. 88(6), 1895–1898 (1980).
pubmed: 7462208
Guruprasad, K., Reddy, B. V. & Pandit, M. W. Correlation between stability of a protein and its dipeptide composition: A novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng. 4(2), 155–161 (1990).
pubmed: 2075190 doi: 10.1093/protein/4.2.155
Kyte, J. & Doolittle, R. F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157(1), 105–132 (1982).
pubmed: 7108955 doi: 10.1016/0022-2836(82)90515-0
Bannas, P., Hambach, J. & Koch-Nolte, F. Nanobodies and nanobody-based human heavy chain antibodies as antitumor therapeutics. Front. Immunol. 8, 1 (2017).
doi: 10.3389/fimmu.2017.01603
Bhaskaran, R. & Ponnuswamy, P. K. Positional flexibilities of amino acid residues in globular proteins. Int. J. Peptide Protein Res. 32(4), 241–255 (1988).
doi: 10.1111/j.1399-3011.1988.tb01258.x
Dong, Y.-W. et al. Structural flexibility and protein adaptation to temperature: Molecular dynamics analysis of malate dehydrogenases of marine molluscs. Proc. Natl. Acad. Sci. 115(6), 1274–1279 (2018).
pubmed: 29358381 pmcid: 5819447 doi: 10.1073/pnas.1718910115
Sheriff, S. et al. Influence of solvent accessibility and intermolecular contacts on atomic mobilities in hemerythrins. Proc. Natl. Acad. Sci. 82(4), 1104–1107 (1985).
pubmed: 3856249 pmcid: 397202 doi: 10.1073/pnas.82.4.1104
Sandberg, M. et al. New chemical descriptors relevant for the design of biologically active peptides. A multivariate characterization of 87 amino acids. J. Med. Chem. 41(14), 2481–2491 (1998).
pubmed: 9651153 doi: 10.1021/jm9700575
Tesfaye, D. Y. et al. Targeting conventional dendritic cells to fine-tune antibody responses. Front. Immunol. 10, 1529 (2019).
pubmed: 31333661 pmcid: 6620736 doi: 10.3389/fimmu.2019.01529
Pervez, S. et al. Effect of polarity and differentiation on antibody localization in multicellular tumour spheroid and xenograft models and its potential importance for in vivo immunotargeting. Int. J. Cancer 44(5), 940–947 (1989).
pubmed: 2583872 doi: 10.1002/ijc.2910440532
Wang, Y. et al. Investigation of the small size of nanobodies for a sensitive fluorescence polarization immunoassay for small molecules: 3-Phenoxybenzoic acid, an exposure biomarker of pyrethroid insecticides as a model. J. Agric. Food Chem. 67(41), 11536–11541 (2019).
pubmed: 31589045 pmcid: 7134064 doi: 10.1021/acs.jafc.9b04621
Moore, D. S. Amino acid and peptide net charges: A simple calculational procedure. Biochem. Educ. 13(1), 10–11 (1985).
doi: 10.1016/0307-4412(85)90114-1
Lehninger, A. L. Lehninger Principles of Biochemistry 6th edn. (W.H. Freeman, 2013).
Rabia, L. A. et al. Net charge of antibody complementarity-determining regions is a key predictor of specificity. Protein Eng. Des. Select. 31(11), 409 (2018).
doi: 10.1093/protein/gzz002
Frank, S. A. Specificity and Cross-Reactivity, in Immunology and Evolution of Infectious Disease (Princeton University Press, 2002).
Ghisaidoobe, A. B. & Chung, S. J. Intrinsic tryptophan fluorescence in the detection and analysis of proteins: A focus on Förster resonance energy transfer techniques. Int. J. Mol. Sci. 15(12), 22518–22538 (2014).
pubmed: 25490136 pmcid: 4284722 doi: 10.3390/ijms151222518
Goldman, E. R. et al. Enhancing stability of camelid and shark single domain antibodies: An overview. Front. Immunol. 8, 1 (2017).
doi: 10.3389/fimmu.2017.00865
Meitzler, J. L. et al. Conserved cysteine residues provide a protein-protein interaction surface in dual oxidase (DUOX) proteins. J. Biol. Chem. 288(10), 7147–7157 (2013).
pubmed: 23362256 pmcid: 3591624 doi: 10.1074/jbc.M112.414797
Wilkins, M. R. et al. Protein identification and analysis tools in the ExPASy server. Methods Mol. Biol. 112, 531–552 (1999).
pubmed: 10027275
Simonian, M. H. Spectrophotometric determination of protein concentration. Curr. Protoc. Toxicol. 1, 1–7 (2004).
Maity, H. et al. Comparison of predicted extinction coefficients of monoclonal antibodies with experimental values as measured by the Edelhoch method. Int. J. Biol. Macromol. 77, 260–265 (2015).
pubmed: 25819219 doi: 10.1016/j.ijbiomac.2015.03.027
Holt, L. J. et al. Domain antibodies: Proteins for therapy. Trends Biotechnol. 21(11), 484–490 (2003).
pubmed: 14573361 doi: 10.1016/j.tibtech.2003.08.007
Laimer, J. et al. MAESTRO—Multi agent stability prediction upon point mutations. BMC Bioinform. 16(1), 116 (2015).
doi: 10.1186/s12859-015-0548-6
Høie, M. H. et al. NetSurfP-3.0: Accurate and fast prediction of protein structural features by protein language models and deep learning. Nucleic Acids Res. 50(W1), W510–W515 (2022).
pubmed: 35648435 pmcid: 9252760 doi: 10.1093/nar/gkac439
Cohen, T., Halfon, M. & Schneidman-Duhovny, D. NanoNet: Rapid and accurate end-to-end nanobody modeling by deep learning. Front. Immunol. 13, 958584 (2022).
pubmed: 36032123 pmcid: 9411858 doi: 10.3389/fimmu.2022.958584
Ruffolo, J. A. & Gray, J. J. Fast, accurate antibody structure prediction from deep learning on massive set of natural antibodies. Biophys. J. 121(3), 155–156 (2022).
doi: 10.1016/j.bpj.2021.11.1942
Lin, Z. et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379(6637), 1123–1130 (2023).
pubmed: 36927031 doi: 10.1126/science.ade2574
Wu, R. et al. High-resolution de novo structure prediction from primary sequence. BioRxiv 21, 500999 (2022).
AlQuraishi, M. Machine learning in protein structure prediction. Curr. Opin. Chem. Biol. 65, 1–8 (2021).
pubmed: 34015749 doi: 10.1016/j.cbpa.2021.04.005
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596(7873), 583–589 (2021).
pubmed: 34265844 pmcid: 8371605 doi: 10.1038/s41586-021-03819-2
Valdés-Tresanco, M. S. et al. Structural modeling of nanobodies: A benchmark of state-of-the-art artificial intelligence programs. Molecules 28(10), 3991 (2023).
pubmed: 37241731 pmcid: 10220908 doi: 10.3390/molecules28103991
Honegger, A. & Plückthun, A. Yet another numbering scheme for immunoglobulin variable domains: An automatic modeling and analysis tool. J. Mol. Biol. 309(3), 657–670 (2001).
pubmed: 11397087 doi: 10.1006/jmbi.2001.4662
Dunbar, J. & Deane, C. M. ANARCI: Antigen receptor numbering and receptor classification. Bioinformatics 32(2), 298–300 (2015).
pubmed: 26424857 pmcid: 4708101 doi: 10.1093/bioinformatics/btv552
Apweiler, R. et al. UniProt: The universal protein knowledgebase. Nucleic Acids Res. 32, 115–119 (2004).
doi: 10.1093/nar/gkh131
Pedregosa, F. et al. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (Association for Computing Machinery, 2016).
Breiman, L. Random Forests. Mach. Learn. 45(1), 5–32 (2001).
doi: 10.1023/A:1010933404324
Hearst, M. A. et al. Support vector machines. IEEE Intell. Syst. Appl. 13(4), 18–28 (1998).
doi: 10.1109/5254.708428
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 58(1), 267–288 (1996).
doi: 10.1111/j.2517-6161.1996.tb02080.x
Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986).
doi: 10.1038/323533a0
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521(7553), 436–444 (2015).
pubmed: 26017442 doi: 10.1038/nature14539
Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006).
pubmed: 16873662 doi: 10.1126/science.1127647
Chollet, F. Deep Learning with Python (Simon and Schuster, 2021).
Waskom, M. Seaborn: Statistical data visualization. J. Open Source Softw. 6, 3021 (2021).
doi: 10.21105/joss.03021
Kurgan, L. & Miri Disfani, F. Structural protein descriptors in 1-dimension and their sequence-based predictions. Curr. Protein Peptide Sci. 12(6), 470–489 (2011).
doi: 10.2174/138920311796957711
Singh, H., Singh, S. & Raghava, G. P. Evaluation of protein dihedral angle prediction methods. PLoS ONE 9(8), e105667 (2014).
pubmed: 25166857 pmcid: 4148315 doi: 10.1371/journal.pone.0105667
Jin, B. K. et al. NANOBODIES®: A review of diagnostic and therapeutic applications. Int. J. Mol. Sci. 24, 6 (2023).
doi: 10.3390/ijms24065994
Natesan, R. et al. Heterogeneity in disulfide bond reduction in IgG1 antibodies is governed by solvent accessibility of the cysteines. Antibodies 12(4), 83 (2023).
pubmed: 38131805 pmcid: 10741012 doi: 10.3390/antib12040083
Yin, R. et al. Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants. Protein Sci. 31(8), e4379 (2022).
pubmed: 35900023 pmcid: 9278006 doi: 10.1002/pro.4379
Yin, R. & Pierce, B. G. Evaluation of AlphaFold antibody–antigen modeling with implications for improving predictive accuracy. Protein Sci. 33(1), e4865 (2024).
pubmed: 38073135 pmcid: 10751731 doi: 10.1002/pro.4865
Saerens, D. et al. Disulfide bond introduction for general stabilization of immunoglobulin heavy-chain variable domains. J. Mol. Biol. 377(2), 478–488 (2008).
pubmed: 18262543 doi: 10.1016/j.jmb.2008.01.022
Zabetakis, D. et al. Evaluation of disulfide bond position to enhance the thermal stability of a highly stable single domain antibody. PLoS ONE 9(12), e115405 (2014).
pubmed: 25526640 pmcid: 4272287 doi: 10.1371/journal.pone.0115405
Hussack, G. et al. Engineered single-domain antibodies with high protease resistance and thermal stability. PLoS ONE 6(11), e28218 (2011).
pubmed: 22140551 pmcid: 3227653 doi: 10.1371/journal.pone.0028218
Tabares-da Rosa, S. et al. Competitive selection from single domain antibody libraries allows isolation of high-affinity antihapten antibodies that are not favored in the llama immune response. Anal. Chem. 83(18), 7213–7220 (2011).
pubmed: 21827167 doi: 10.1021/ac201824z
Sturtz, J. et al. Deep learning approaches for the protein scaffold filling problem. In 2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI) (2022).
Stokes, J. M. et al. A deep learning approach to antibiotic discovery. Cell 180(4), 688–702 (2020).
pubmed: 32084340 pmcid: 8349178 doi: 10.1016/j.cell.2020.01.021
Pudžiuvelytė, I. et al. TemStaPro: Protein thermostability prediction using sequence representations from protein language models. Bioinformatics 40, 4 (2024).
doi: 10.1093/bioinformatics/btae157
Deszyński, P. et al. INDI—Integrated nanobody database for immunoinformatics. Nucleic Acids Res. 50(D1), D1273–D1281 (2021).
pmcid: 8728276 doi: 10.1093/nar/gkab1021
Legler, P. M. et al. Structure of a low-melting-temperature anti-cholera toxin: llama V(H)H domain. Acta Crystallogr Sect. F Struct. Biol. Cryst. Commun. 69, 90–93 (2013).
pubmed: 23385744 pmcid: 3564605 doi: 10.1107/S1744309112050750
George, J. et al. Structural and mutational analysis of a monomeric and dimeric form of a single domain antibody with implications for protein misfolding. Proteins Struct. Funct. Bioinform. 82(11), 3101–3116 (2014).
doi: 10.1002/prot.24671
Legler, P. M. et al. Stability of isolated antibody-antigen complexes as a predictive tool for selecting toxin neutralizing antibodies. mAbs 9(1), 43–57 (2017).
pubmed: 27660893 doi: 10.1080/19420862.2016.1236882
Krah, S. et al. Single-domain antibodies for biomedical applications. Immunopharmacol. Immunotoxicol. 38(1), 21–28 (2016).
pubmed: 26551147 doi: 10.3109/08923973.2015.1102934
Tomimoto, Y., Yamazaki, R. & Shirai, H. Increasing the melting temperature of VHH with the in silico free energy score. Sci. Rep. 13(1), 4922 (2023).
pubmed: 36966210 pmcid: 10039853 doi: 10.1038/s41598-023-32022-8
Hagihara, Y., Mine, S. & Uegaki, K. Stabilization of an immunoglobulin fold domain by an engineered disulfide bond at the buried hydrophobic region. J. Biol. Chem. 282(50), 36489–36495 (2007).
pubmed: 17932041 doi: 10.1074/jbc.M707078200
Orlando, M. et al. CDR1 Composition can affect nanobody recombinant expression yields. Biomolecules 11, 9 (2021).
doi: 10.3390/biom11091362
Yang, K. K. et al. Learned protein embeddings for machine learning. Bioinformatics 34(15), 2642–2648 (2018).
pubmed: 29584811 pmcid: 6061698 doi: 10.1093/bioinformatics/bty178
Yeung, W. et al. Tree visualizations of protein sequence embedding space enable improved functional clustering of diverse protein superfamilies. Brief. Bioinform. 24, 1 (2023).
doi: 10.1093/bib/bbac619
Littmann, M. et al. Protein embeddings and deep learning predict binding residues for various ligand classes. Sci. Rep. 11(1), 23916 (2021).
pubmed: 34903827 pmcid: 8668950 doi: 10.1038/s41598-021-03431-4
Ferruz, N., Schmidt, S. & Höcker, B. ProtGPT2 is a deep unsupervised language model for protein design. Nat. Commun. 13(1), 4348 (2022).
pubmed: 35896542 pmcid: 9329459 doi: 10.1038/s41467-022-32007-7
Dean, S. N. et al. PepVAE: Variational autoencoder framework for antimicrobial peptide generation and activity prediction. Front. Microbiol. 12, 725727 (2021).
pubmed: 34659152 pmcid: 8515052 doi: 10.3389/fmicb.2021.725727
Saka, K. et al. Antibody design using LSTM based deep generative model from phage display library for affinity maturation. Sci. Rep. 11(1), 5852 (2021).
pubmed: 33712669 pmcid: 7955064 doi: 10.1038/s41598-021-85274-7
Humpe, A. & Peipp, M. Antibody engineering—Tailor-made next generation antibodies by molecular design. Transfus Med. Hemother. 44(5), 290–291 (2017).
pubmed: 29070973 pmcid: 5649259 doi: 10.1159/000479617

Auteurs

Jerome Anthony E Alvarez (JAE)

Naval Research Laboratory, Center for Bio/Molecular Science and Engineering, Washington, DC, USA.

Scott N Dean (SN)

Naval Research Laboratory, Center for Bio/Molecular Science and Engineering, Washington, DC, USA. Scott.N.Dean.civ@us.navy.mil.

Articles similaires

alpha-Synuclein Humans Animals Mice Lewy Body Disease
Humans Colorectal Neoplasms Biomarkers, Tumor Prognosis Gene Expression Regulation, Neoplastic
Metabolic Networks and Pathways Saccharomyces cerevisiae Computational Biology Synthetic Biology Computer Simulation

Classifications MeSH