Comparative evaluation of AlphaFold2 and disorder predictors for prediction of intrinsic disorder, disorder content and fully disordered proteins.

AlphaFold2 Deep learning Disorder content Fully disordered proteins Intrinsic disorder Intrinsically disordered protein Prediction

Journal

Computational and structural biotechnology journal
ISSN: 2001-0370
Titre abrégé: Comput Struct Biotechnol J
Pays: Netherlands
ID NLM: 101585369

Informations de publication

Date de publication:
2023
Historique:
received: 28 02 2023
revised: 31 05 2023
accepted: 01 06 2023
medline: 12 1 2024
pubmed: 12 1 2024
entrez: 12 1 2024
Statut: epublish

Résumé

We expand studies of AlphaFold2 (AF2) in the context of intrinsic disorder prediction by comparing it against a broad selection of 20 accurate, popular and recently released disorder predictors. We use 25% larger benchmark dataset with 646 proteins and cover protein-level predictions of disorder content and fully disordered proteins. AF2-based disorder predictions secure a relatively high Area Under receiver operating characteristic Curve (AUC) of 0.77 and are statistically outperformed by several modern disorder predictors that secure AUCs around 0.8 with median runtime of about 20 s compared to 1200 s for AF2. Moreover, AF2 provides modestly accurate predictions of fully disordered proteins (F1 = 0.59 vs. 0.91 for the best disorder predictor) and disorder content (mean absolute error of 0.21 vs. 0.15). AF2 also generates statistically more accurate disorder predictions for about 20% of proteins that have relatively short sequences and a few disordered regions that tend to be located at the sequence termini, and which are absent of disordered protein-binding regions. Interestingly, AF2 and the most accurate disorder predictors rely on deep neural networks, suggesting that these models are useful for protein structure and disorder predictions.

Identifiants

pubmed: 38213902
doi: 10.1016/j.csbj.2023.06.001
pii: S2001-0370(23)00214-3
pmc: PMC10782001
doi:

Types de publication

Journal Article

Langues

eng

Pagination

3248-3258

Informations de copyright

© 2023 The Authors.

Déclaration de conflit d'intérêts

The authors declare no conflicts of interest.

Auteurs

Bi Zhao (B)

Genomics program, College of Public Health, University of South Florida, Tampa, FL, United States.

Sina Ghadermarzi (S)

Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States.

Lukasz Kurgan (L)

Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States.

Classifications MeSH