PASSerRank: Prediction of allosteric sites with learning to rank.

learning to rank machine learning protein allostery web server

Journal

Journal of computational chemistry
ISSN: 1096-987X
Titre abrégé: J Comput Chem
Pays: United States
ID NLM: 9878362

Informations de publication

Date de publication:
30 10 2023
Historique:
revised: 19 06 2023
received: 02 05 2023
accepted: 10 07 2023
medline: 4 9 2023
pubmed: 10 8 2023
entrez: 10 8 2023
Statut: ppublish

Résumé

Allostery plays a crucial role in regulating protein activity, making it a highly sought-after target in drug development. One of the major challenges in allosteric drug research is the identification of allosteric sites. In recent years, many computational models have been developed for accurate allosteric site prediction. Most of these models focus on designing a general rule that can be applied to pockets of proteins from various families. In this study, we present a new approach using the concept of Learning to Rank (LTR). The LTR model ranks pockets based on their relevance to allosteric sites, that is, how well a pocket meets the characteristics of known allosteric sites. After the training and validation on two datasets, the Allosteric Database (ASD) and CASBench, the LTR model was able to rank an allosteric pocket in the top three positions for 83.6% and 80.5% of test proteins, respectively. The model outperforms other common machine learning models with higher F1 scores (0.662 in ASD and 0.608 in CASBench) and Matthews correlation coefficients (0.645 in ASD and 0.589 in CASBench). The trained model is available on the PASSer platform (https://passer.smu.edu) to aid in drug discovery research.

Identifiants

pubmed: 37561047
doi: 10.1002/jcc.27193
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

2223-2229

Subventions

Organisme : NIH HHS
ID : 2R15GM122013
Pays : United States

Commentaires et corrections

Type : UpdateOf

Informations de copyright

© 2023 The Authors. Journal of Computational Chemistry published by Wiley Periodicals LLC.

Références

J. Liu, R. Nussinov, PLoS Comput. Biol. 2016, 12, e1004966.
S. J. Wodak, E. Paci, N. V. Dokholyan, I. N. Berezovsky, A. Horovitz, J. Li, V. J. Hilser, I. Bahar, J. Karanicolas, G. Stock, P. Hamm, R. H. Stote, J. Eberhardt, Y. Chebaro, A. Dejaegere, M. Cecchini, J. P. Changeux, P. G. Bolhuis, J. Vreede, P. Faccioli, S. Orioli, R. Ravasio, L. Yan, C. Brito, M. Wyart, P. Gkeka, I. Rivalta, G. Palermo, J. A. McCammon, J. Panecka-Hofman, R. C. Wade, A. di Pizio, M. Y. Niv, R. Nussinov, C. J. Tsai, H. Jang, D. Padhorny, D. Kozakov, T. McLeish, Structure 2019, 27, 566.
K. Krishnan, H. Tian, P. Tao, G. M. Verkhivker, J Chem Phys 2022, 157, 245101.
A. W. Fenton, Trends Biochem. Sci. 2008, 33, 420.
R. Nussinov, C.-J. Tsai, Cell 2013, 153, 293.
S. Xiao, G. M. Verkhivker, P. Tao, Trends Biochem. Sci. 2022a, 48, 375.
A. Peracchi, A. Mozzarelli, Biochim. Biophys. Acta 2011, 1814, 922.
N. Wu, L. Strömich, S. N. Yaliraki, Patterns 2022, 3, 100408.
A. Christopoulos, L. May, V. A. Avlani, P. M. Sexton, Biochem. Soc. Trans. 2004, 32, 873.
F. De Smet, A. Christopoulos, P. Carmeliet, Nat. Biotechnol. 2014, 32, 1113.
S. Lu, W. Huang, J. Zhang, Drug Discovery Today 2014, 19, 1595.
S. Lu, Q. Shen, J. Zhang, Acc. Chem. Res. 2019, 52, 492.
W. Huang, S. Lu, Z. Huang, X. Liu, L. Mou, Y. Luo, Y. Zhao, Y. Liu, Z. Chen, T. Hou, J. Zhang, Bioinformatics 2013, 29, 2357.
A. S.-Y. Chen, N. J. Westwood, P. Brear, G. W. Rogers, L. Mavridis, J. B. Mitchell, Mol. Inf. 2016, 35, 125.
H. Tian, X. Jiang, P. Tao, Mach. Learn. Sci. Technol. 2021, 2, 035015.
S. Xiao, H. Tian, P. Tao, Front. Mol. Biosci. 2022b, 9, 879251.
H. Tian, S. Xiao, X. Jiang, P. Tao, Nucl. Acids Res. 2023, 51, W427. https://doi.org/10.1093/nar/gkad303
T. Chen, C. Guestrin. Proc. 22nd ACM SIGKDD Int. Conf. Knowl. Discov. Data Mining. 2016, 785.
T. N. Kipf, M. Welling. arXiv preprint arXiv:1609.02907. 2016.
A. Panjkovich, X. Daura, BMC Bioinform. 2012, 13, 1.
E. Laine, C. Goncalves, J. C. Karst, A. Lesnard, S. Rault, W.-J. Tang, T. E. Malliavin, D. Ladant, A. Blondel, Proc. Natl. Acad. Sci. 2010, 107, 11277.
A. Goncearenco, S. Mitternacht, T. Yong, B. Eisenhaber, F. Eisenhaber, I. N. Berezovsky, Nucleic Acids Res. 2013, 41, W266.
A. Panjkovich, X. Daura, Bioinformatics 2014, 30, 1314.
Z. Huang, L. Zhu, Y. Cao, G. Wu, X. Liu, Y. Chen, Q. Wang, T. Shi, Y. Zhao, Y. Wang, W. Li, Y. Li, H. Chen, G. Chen, J. Zhang, Nucleic Acids Res. 2011, 39, D663.
W. Huang, G. Wang, Q. Shen, X. Liu, S. Lu, L. Geng, Z. Huang, J. Zhang, Bioinformatics 2015, 31, 2598.
S. K. Burley, H. M. Berman, G. J. Kleywegt, J. L. Markley, H. Nakamura, S. Velankar, Protein Crystal. 2017, 1607, 627.
A. Zlobin, D. Suplatov, K. Kopylov, V. Švedas, Acta Naturae 2019, 11, 74.
A. Trotman, Inform. Retriev. 2005, 8, 359.
X. Ru, X. Ye, T. Sakurai, Q. Zou, Bioinformatics 2022, 38, 1964.
K. Furui, M. Ohue. arXiv preprint arXiv:2205.02169. 2022.
V. Le Guilloux, P. Schmidtke, P. Tuffery, BMC Bioinform. 2009, 10, 1.
D. Ni, Z. Chai, Y. Wang, M. Li, Z. Yu, Y. Liu, S. Lu, J. Zhang, Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2022, 12, e1585.
G. Ke, Q. Meng, T. Finley, T. Wang, W. Chen, W. Ma, Q. Ye, T.-Y. Liu, Adv. Neural Inform. Processing Syst. 2017, 30.
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, E. Duchesnay, J. Mach. Learn. Res. 2011, 12, 2825.
C. Yin, Z. Song, H. Tian, T. Palzkill, P. Tao, Phys. Chem. Chem. Phys. 2023, 25, 1349.
S. M. Lundberg, G. Erion, H. Chen, A. DeGrave, J. M. Prutkin, B. Nair, R. Katz, J. Himmelfarb, N. Bansal, S.-I. Lee, Nature Mach. Intell. 2020, 2, 56.
D. Chicco, G. Jurman, BMC Genom. 2020, 21, 1.
J. G. Greener, M. J. Sternberg, BMC Bioinform. 2015, 16, 1.
H. Tian, F. Trozzi, B. D. Zoltowski, P. Tao, J. Phys. Chem. B 2020, 124, 8960.
M. T. Ibrahim, F. Trozzi, P. Tao, Comput. Struct. Biotechnol. J. 2022, 20, 50.
J. Xie, G. Pan, Y. Li, L. Lai, J. Chem. Phys. 2023, 158, 105102.

Auteurs

Hao Tian (H)

Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas, USA.

Sian Xiao (S)

Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas, USA.

Xi Jiang (X)

Department of Statistics, Southern Methodist University, Dallas, Texas, USA.

Peng Tao (P)

Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH