TEMPRO: nanobody melting temperature estimation model using protein embeddings.
Antibodies
Machine learning
Nanobodies
Neural networks
Proteins
Single-domain antibodies
Thermostability
Journal
Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288
Informations de publication
Date de publication:
17 Aug 2024
17 Aug 2024
Historique:
received:
21
06
2024
accepted:
13
08
2024
medline:
18
8
2024
pubmed:
18
8
2024
entrez:
17
8
2024
Statut:
epublish
Résumé
Single-domain antibodies (sdAbs) or nanobodies have received widespread attention due to their small size (~ 15 kDa) and diverse applications in bio-derived therapeutics. As many modern biotechnology breakthroughs are applied to antibody engineering and design, nanobody thermostability or melting temperature (T
Identifiants
pubmed: 39154093
doi: 10.1038/s41598-024-70101-6
pii: 10.1038/s41598-024-70101-6
doi:
Substances chimiques
Single-Domain Antibodies
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
19074Informations de copyright
© 2024. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.
Références
Henry, K. A. & MacKenzie, C. R. Antigen recognition by single-domain antibodies: Structural latitudes and constraints. MAbs 10(6), 815–826 (2018).
pubmed: 29916758
pmcid: 6260137
doi: 10.1080/19420862.2018.1489633
Wesolowski, J. et al. Single domain antibodies: Promising experimental and therapeutic tools in infection and immunity. Med. Microbiol. Immunol. 198, 157–174 (2009).
pubmed: 19529959
pmcid: 2714450
doi: 10.1007/s00430-009-0116-7
Ventola, C. L. The antibiotic resistance crisis. Pharm. Therap. 40(4), 277–283 (2015).
Gould, I. M. & Bal, A. M. New antibiotic agents in the pipeline and how they can help overcome microbial resistance. Virulence 4(2), 185–191 (2013).
pubmed: 23302792
pmcid: 3654619
doi: 10.4161/viru.22507
McConnell, A. D. et al. A general approach to antibody thermostabilization. MAbs 6(5), 1274–1282 (2014).
pubmed: 25517312
pmcid: 4623350
doi: 10.4161/mabs.29680
Ward, E. S. et al. Binding activities of a repertoire of single immunoglobulin variable domains secreted from Escherichia coli. Nature 341(6242), 544–546 (1989).
pubmed: 2677748
doi: 10.1038/341544a0
Hamers-Casterman, C. et al. Naturally occurring antibodies devoid of light chains. Nature 363(6428), 446–448 (1993).
pubmed: 8502296
doi: 10.1038/363446a0
Ovchinnikov, V. et al. Role of framework mutations and antibody flexibility in the evolution of broadly neutralizing antibodies. Elife 7, 1 (2018).
doi: 10.7554/eLife.33038
Kiguchi, Y. et al. The VH framework region 1 as a target of efficient mutagenesis for generating a variety of affinity-matured scFv mutants. Sci. Rep. 11(1), 8201 (2021).
pubmed: 33859250
pmcid: 8050046
doi: 10.1038/s41598-021-87501-7
Nguyen, V. K. et al. Camel heavy-chain antibodies: Diverse germline V(H)H and specific mechanisms enlarge the antigen-binding repertoire. EMBO J. 19(5), 921–930 (2000).
pubmed: 10698934
pmcid: 305632
doi: 10.1093/emboj/19.5.921
Muyldermans, S. et al. Sequence and structure of VH domain from naturally occurring camel heavy chain immunoglobulins lacking light chains. Protein Eng. 7(9), 1129–1135 (1994).
pubmed: 7831284
doi: 10.1093/protein/7.9.1129
Ding, L. et al. Structural insights into the mechanism of single domain VHH antibody binding to cortisol. FEBS Lett. 593(11), 1248–1256 (2019).
pubmed: 31049949
doi: 10.1002/1873-3468.13398
Rudolph, M. J. et al. Contribution of an unusual CDR2 element of a single domain antibody in ricin toxin binding affinity and neutralizing activity. Protein Eng. Des. Select. 31(7–8), 277–287 (2018).
doi: 10.1093/protein/gzy022
Bever, C. S. et al. VHH antibodies: Emerging reagents for the analysis of environmental chemicals. Anal. Bioanal. Chem. 408(22), 5985–6002 (2016).
pubmed: 27209591
pmcid: 4983233
doi: 10.1007/s00216-016-9585-x
Polonelli, L. et al. Antibody complementarity-determining regions (CDRs) can display differential antimicrobial, antiviral and antitumor activities. PLoS ONE 3(6), e2371 (2008).
pubmed: 18545659
pmcid: 2396520
doi: 10.1371/journal.pone.0002371
Liu, J. L. et al. Thermal stability and refolding capability of shark derived single domain antibodies. Mol. Immunol. 59(2), 194–199 (2014).
pubmed: 24667069
doi: 10.1016/j.molimm.2014.02.014
Kunz, P. et al. The structural basis of nanobody unfolding reversibility and thermoresistance. Sci. Rep. 8(1), 7934 (2018).
pubmed: 29784954
pmcid: 5962586
doi: 10.1038/s41598-018-26338-z
Bekker, G. J., Ma, B. & Kamiya, N. Thermal stability of single-domain antibodies estimated by molecular dynamics simulations. Protein Sci. 28(2), 429–438 (2019).
pubmed: 30394618
doi: 10.1002/pro.3546
Jung, F. et al. DeepSTABp: A deep learning approach for the prediction of thermal protein stability. Int. J. Mol. Sci. 24(8), 7444 (2023).
pubmed: 37108605
pmcid: 10138888
doi: 10.3390/ijms24087444
Li, M. et al. DeepTM: A deep learning algorithm for prediction of melting temperature of thermophilic proteins directly from sequences. Comput. Struct. Biotechnol. J. 21, 5544–5560 (2023).
pubmed: 38034401
pmcid: 10681957
doi: 10.1016/j.csbj.2023.11.006
Yang, Y. et al. ProTstab2 for prediction of protein thermal stabilities. Int. J. Mol. Sci. 23, 18 (2022).
Ku, T. et al. Predicting melting temperature directly from protein sequences. Comput. Biol. Chem. 33(6), 445–450 (2009).
pubmed: 19896904
doi: 10.1016/j.compbiolchem.2009.10.002
Haselbeck, F. et al. Superior protein thermophilicity prediction with protein language model embeddings. NAR Genom. Bioinform. 5(4), 087 (2023).
Outeiral, C. & Deane, C. M. Codon language embeddings provide strong signals for use in protein engineering. Nat. Mach. Intell. 6(2), 170–179 (2024).
doi: 10.1038/s42256-024-00791-0
Valdés-Tresanco, M. S. et al. NbThermo: A new thermostability database for nanobodies. Database 2023, 21 (2023).
doi: 10.1093/database/baad021
Kunz, P. et al. Exploiting sequence and stability information for directing nanobody stability engineering. Biochim. Biophys. Acta Gen. Subj. 1861(9), 2196–2205 (2017).
pubmed: 28642127
doi: 10.1016/j.bbagen.2017.06.014
Osorio, D., Rondón-Villarreal, P. & Torres, R. Peptides: A package for data mining of antimicrobial peptides. R J. 7(1), 4–14 (2015).
doi: 10.32614/RJ-2015-001
Ikai, A. Thermostability and aliphatic index of globular proteins. J. Biochem. 88(6), 1895–1898 (1980).
pubmed: 7462208
Guruprasad, K., Reddy, B. V. & Pandit, M. W. Correlation between stability of a protein and its dipeptide composition: A novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng. 4(2), 155–161 (1990).
pubmed: 2075190
doi: 10.1093/protein/4.2.155
Kyte, J. & Doolittle, R. F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157(1), 105–132 (1982).
pubmed: 7108955
doi: 10.1016/0022-2836(82)90515-0
Bannas, P., Hambach, J. & Koch-Nolte, F. Nanobodies and nanobody-based human heavy chain antibodies as antitumor therapeutics. Front. Immunol. 8, 1 (2017).
doi: 10.3389/fimmu.2017.01603
Bhaskaran, R. & Ponnuswamy, P. K. Positional flexibilities of amino acid residues in globular proteins. Int. J. Peptide Protein Res. 32(4), 241–255 (1988).
doi: 10.1111/j.1399-3011.1988.tb01258.x
Dong, Y.-W. et al. Structural flexibility and protein adaptation to temperature: Molecular dynamics analysis of malate dehydrogenases of marine molluscs. Proc. Natl. Acad. Sci. 115(6), 1274–1279 (2018).
pubmed: 29358381
pmcid: 5819447
doi: 10.1073/pnas.1718910115
Sheriff, S. et al. Influence of solvent accessibility and intermolecular contacts on atomic mobilities in hemerythrins. Proc. Natl. Acad. Sci. 82(4), 1104–1107 (1985).
pubmed: 3856249
pmcid: 397202
doi: 10.1073/pnas.82.4.1104
Sandberg, M. et al. New chemical descriptors relevant for the design of biologically active peptides. A multivariate characterization of 87 amino acids. J. Med. Chem. 41(14), 2481–2491 (1998).
pubmed: 9651153
doi: 10.1021/jm9700575
Tesfaye, D. Y. et al. Targeting conventional dendritic cells to fine-tune antibody responses. Front. Immunol. 10, 1529 (2019).
pubmed: 31333661
pmcid: 6620736
doi: 10.3389/fimmu.2019.01529
Pervez, S. et al. Effect of polarity and differentiation on antibody localization in multicellular tumour spheroid and xenograft models and its potential importance for in vivo immunotargeting. Int. J. Cancer 44(5), 940–947 (1989).
pubmed: 2583872
doi: 10.1002/ijc.2910440532
Wang, Y. et al. Investigation of the small size of nanobodies for a sensitive fluorescence polarization immunoassay for small molecules: 3-Phenoxybenzoic acid, an exposure biomarker of pyrethroid insecticides as a model. J. Agric. Food Chem. 67(41), 11536–11541 (2019).
pubmed: 31589045
pmcid: 7134064
doi: 10.1021/acs.jafc.9b04621
Moore, D. S. Amino acid and peptide net charges: A simple calculational procedure. Biochem. Educ. 13(1), 10–11 (1985).
doi: 10.1016/0307-4412(85)90114-1
Lehninger, A. L. Lehninger Principles of Biochemistry 6th edn. (W.H. Freeman, 2013).
Rabia, L. A. et al. Net charge of antibody complementarity-determining regions is a key predictor of specificity. Protein Eng. Des. Select. 31(11), 409 (2018).
doi: 10.1093/protein/gzz002
Frank, S. A. Specificity and Cross-Reactivity, in Immunology and Evolution of Infectious Disease (Princeton University Press, 2002).
Ghisaidoobe, A. B. & Chung, S. J. Intrinsic tryptophan fluorescence in the detection and analysis of proteins: A focus on Förster resonance energy transfer techniques. Int. J. Mol. Sci. 15(12), 22518–22538 (2014).
pubmed: 25490136
pmcid: 4284722
doi: 10.3390/ijms151222518
Goldman, E. R. et al. Enhancing stability of camelid and shark single domain antibodies: An overview. Front. Immunol. 8, 1 (2017).
doi: 10.3389/fimmu.2017.00865
Meitzler, J. L. et al. Conserved cysteine residues provide a protein-protein interaction surface in dual oxidase (DUOX) proteins. J. Biol. Chem. 288(10), 7147–7157 (2013).
pubmed: 23362256
pmcid: 3591624
doi: 10.1074/jbc.M112.414797
Wilkins, M. R. et al. Protein identification and analysis tools in the ExPASy server. Methods Mol. Biol. 112, 531–552 (1999).
pubmed: 10027275
Simonian, M. H. Spectrophotometric determination of protein concentration. Curr. Protoc. Toxicol. 1, 1–7 (2004).
Maity, H. et al. Comparison of predicted extinction coefficients of monoclonal antibodies with experimental values as measured by the Edelhoch method. Int. J. Biol. Macromol. 77, 260–265 (2015).
pubmed: 25819219
doi: 10.1016/j.ijbiomac.2015.03.027
Holt, L. J. et al. Domain antibodies: Proteins for therapy. Trends Biotechnol. 21(11), 484–490 (2003).
pubmed: 14573361
doi: 10.1016/j.tibtech.2003.08.007
Laimer, J. et al. MAESTRO—Multi agent stability prediction upon point mutations. BMC Bioinform. 16(1), 116 (2015).
doi: 10.1186/s12859-015-0548-6
Høie, M. H. et al. NetSurfP-3.0: Accurate and fast prediction of protein structural features by protein language models and deep learning. Nucleic Acids Res. 50(W1), W510–W515 (2022).
pubmed: 35648435
pmcid: 9252760
doi: 10.1093/nar/gkac439
Cohen, T., Halfon, M. & Schneidman-Duhovny, D. NanoNet: Rapid and accurate end-to-end nanobody modeling by deep learning. Front. Immunol. 13, 958584 (2022).
pubmed: 36032123
pmcid: 9411858
doi: 10.3389/fimmu.2022.958584
Ruffolo, J. A. & Gray, J. J. Fast, accurate antibody structure prediction from deep learning on massive set of natural antibodies. Biophys. J. 121(3), 155–156 (2022).
doi: 10.1016/j.bpj.2021.11.1942
Lin, Z. et al. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 379(6637), 1123–1130 (2023).
pubmed: 36927031
doi: 10.1126/science.ade2574
Wu, R. et al. High-resolution de novo structure prediction from primary sequence. BioRxiv 21, 500999 (2022).
AlQuraishi, M. Machine learning in protein structure prediction. Curr. Opin. Chem. Biol. 65, 1–8 (2021).
pubmed: 34015749
doi: 10.1016/j.cbpa.2021.04.005
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596(7873), 583–589 (2021).
pubmed: 34265844
pmcid: 8371605
doi: 10.1038/s41586-021-03819-2
Valdés-Tresanco, M. S. et al. Structural modeling of nanobodies: A benchmark of state-of-the-art artificial intelligence programs. Molecules 28(10), 3991 (2023).
pubmed: 37241731
pmcid: 10220908
doi: 10.3390/molecules28103991
Honegger, A. & Plückthun, A. Yet another numbering scheme for immunoglobulin variable domains: An automatic modeling and analysis tool. J. Mol. Biol. 309(3), 657–670 (2001).
pubmed: 11397087
doi: 10.1006/jmbi.2001.4662
Dunbar, J. & Deane, C. M. ANARCI: Antigen receptor numbering and receptor classification. Bioinformatics 32(2), 298–300 (2015).
pubmed: 26424857
pmcid: 4708101
doi: 10.1093/bioinformatics/btv552
Apweiler, R. et al. UniProt: The universal protein knowledgebase. Nucleic Acids Res. 32, 115–119 (2004).
doi: 10.1093/nar/gkh131
Pedregosa, F. et al. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proc. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 785–794 (Association for Computing Machinery, 2016).
Breiman, L. Random Forests. Mach. Learn. 45(1), 5–32 (2001).
doi: 10.1023/A:1010933404324
Hearst, M. A. et al. Support vector machines. IEEE Intell. Syst. Appl. 13(4), 18–28 (1998).
doi: 10.1109/5254.708428
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 58(1), 267–288 (1996).
doi: 10.1111/j.2517-6161.1996.tb02080.x
Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986).
doi: 10.1038/323533a0
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521(7553), 436–444 (2015).
pubmed: 26017442
doi: 10.1038/nature14539
Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313(5786), 504–507 (2006).
pubmed: 16873662
doi: 10.1126/science.1127647
Chollet, F. Deep Learning with Python (Simon and Schuster, 2021).
Waskom, M. Seaborn: Statistical data visualization. J. Open Source Softw. 6, 3021 (2021).
doi: 10.21105/joss.03021
Kurgan, L. & Miri Disfani, F. Structural protein descriptors in 1-dimension and their sequence-based predictions. Curr. Protein Peptide Sci. 12(6), 470–489 (2011).
doi: 10.2174/138920311796957711
Singh, H., Singh, S. & Raghava, G. P. Evaluation of protein dihedral angle prediction methods. PLoS ONE 9(8), e105667 (2014).
pubmed: 25166857
pmcid: 4148315
doi: 10.1371/journal.pone.0105667
Jin, B. K. et al. NANOBODIES®: A review of diagnostic and therapeutic applications. Int. J. Mol. Sci. 24, 6 (2023).
doi: 10.3390/ijms24065994
Natesan, R. et al. Heterogeneity in disulfide bond reduction in IgG1 antibodies is governed by solvent accessibility of the cysteines. Antibodies 12(4), 83 (2023).
pubmed: 38131805
pmcid: 10741012
doi: 10.3390/antib12040083
Yin, R. et al. Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants. Protein Sci. 31(8), e4379 (2022).
pubmed: 35900023
pmcid: 9278006
doi: 10.1002/pro.4379
Yin, R. & Pierce, B. G. Evaluation of AlphaFold antibody–antigen modeling with implications for improving predictive accuracy. Protein Sci. 33(1), e4865 (2024).
pubmed: 38073135
pmcid: 10751731
doi: 10.1002/pro.4865
Saerens, D. et al. Disulfide bond introduction for general stabilization of immunoglobulin heavy-chain variable domains. J. Mol. Biol. 377(2), 478–488 (2008).
pubmed: 18262543
doi: 10.1016/j.jmb.2008.01.022
Zabetakis, D. et al. Evaluation of disulfide bond position to enhance the thermal stability of a highly stable single domain antibody. PLoS ONE 9(12), e115405 (2014).
pubmed: 25526640
pmcid: 4272287
doi: 10.1371/journal.pone.0115405
Hussack, G. et al. Engineered single-domain antibodies with high protease resistance and thermal stability. PLoS ONE 6(11), e28218 (2011).
pubmed: 22140551
pmcid: 3227653
doi: 10.1371/journal.pone.0028218
Tabares-da Rosa, S. et al. Competitive selection from single domain antibody libraries allows isolation of high-affinity antihapten antibodies that are not favored in the llama immune response. Anal. Chem. 83(18), 7213–7220 (2011).
pubmed: 21827167
doi: 10.1021/ac201824z
Sturtz, J. et al. Deep learning approaches for the protein scaffold filling problem. In 2022 IEEE 34th International Conference on Tools with Artificial Intelligence (ICTAI) (2022).
Stokes, J. M. et al. A deep learning approach to antibiotic discovery. Cell 180(4), 688–702 (2020).
pubmed: 32084340
pmcid: 8349178
doi: 10.1016/j.cell.2020.01.021
Pudžiuvelytė, I. et al. TemStaPro: Protein thermostability prediction using sequence representations from protein language models. Bioinformatics 40, 4 (2024).
doi: 10.1093/bioinformatics/btae157
Deszyński, P. et al. INDI—Integrated nanobody database for immunoinformatics. Nucleic Acids Res. 50(D1), D1273–D1281 (2021).
pmcid: 8728276
doi: 10.1093/nar/gkab1021
Legler, P. M. et al. Structure of a low-melting-temperature anti-cholera toxin: llama V(H)H domain. Acta Crystallogr Sect. F Struct. Biol. Cryst. Commun. 69, 90–93 (2013).
pubmed: 23385744
pmcid: 3564605
doi: 10.1107/S1744309112050750
George, J. et al. Structural and mutational analysis of a monomeric and dimeric form of a single domain antibody with implications for protein misfolding. Proteins Struct. Funct. Bioinform. 82(11), 3101–3116 (2014).
doi: 10.1002/prot.24671
Legler, P. M. et al. Stability of isolated antibody-antigen complexes as a predictive tool for selecting toxin neutralizing antibodies. mAbs 9(1), 43–57 (2017).
pubmed: 27660893
doi: 10.1080/19420862.2016.1236882
Krah, S. et al. Single-domain antibodies for biomedical applications. Immunopharmacol. Immunotoxicol. 38(1), 21–28 (2016).
pubmed: 26551147
doi: 10.3109/08923973.2015.1102934
Tomimoto, Y., Yamazaki, R. & Shirai, H. Increasing the melting temperature of VHH with the in silico free energy score. Sci. Rep. 13(1), 4922 (2023).
pubmed: 36966210
pmcid: 10039853
doi: 10.1038/s41598-023-32022-8
Hagihara, Y., Mine, S. & Uegaki, K. Stabilization of an immunoglobulin fold domain by an engineered disulfide bond at the buried hydrophobic region. J. Biol. Chem. 282(50), 36489–36495 (2007).
pubmed: 17932041
doi: 10.1074/jbc.M707078200
Orlando, M. et al. CDR1 Composition can affect nanobody recombinant expression yields. Biomolecules 11, 9 (2021).
doi: 10.3390/biom11091362
Yang, K. K. et al. Learned protein embeddings for machine learning. Bioinformatics 34(15), 2642–2648 (2018).
pubmed: 29584811
pmcid: 6061698
doi: 10.1093/bioinformatics/bty178
Yeung, W. et al. Tree visualizations of protein sequence embedding space enable improved functional clustering of diverse protein superfamilies. Brief. Bioinform. 24, 1 (2023).
doi: 10.1093/bib/bbac619
Littmann, M. et al. Protein embeddings and deep learning predict binding residues for various ligand classes. Sci. Rep. 11(1), 23916 (2021).
pubmed: 34903827
pmcid: 8668950
doi: 10.1038/s41598-021-03431-4
Ferruz, N., Schmidt, S. & Höcker, B. ProtGPT2 is a deep unsupervised language model for protein design. Nat. Commun. 13(1), 4348 (2022).
pubmed: 35896542
pmcid: 9329459
doi: 10.1038/s41467-022-32007-7
Dean, S. N. et al. PepVAE: Variational autoencoder framework for antimicrobial peptide generation and activity prediction. Front. Microbiol. 12, 725727 (2021).
pubmed: 34659152
pmcid: 8515052
doi: 10.3389/fmicb.2021.725727
Saka, K. et al. Antibody design using LSTM based deep generative model from phage display library for affinity maturation. Sci. Rep. 11(1), 5852 (2021).
pubmed: 33712669
pmcid: 7955064
doi: 10.1038/s41598-021-85274-7
Humpe, A. & Peipp, M. Antibody engineering—Tailor-made next generation antibodies by molecular design. Transfus Med. Hemother. 44(5), 290–291 (2017).
pubmed: 29070973
pmcid: 5649259
doi: 10.1159/000479617