Novel machine learning approaches revolutionize protein knowledge.
AI
AlphaFold2
embeddings
machine learning
pLM
protein structure prediction
structure alignment
Journal
Trends in biochemical sciences
ISSN: 0968-0004
Abbreviated title: Trends Biochem Sci
Country: England
NLM ID: 7610674
Publication information
Publication date:
04 2023
History:
received: 14 07 2022
revised: 24 10 2022
accepted: 17 11 2022
pubmed: 13 12 2022
medline: 22 3 2023
entrez: 12 12 2022
Status:
ppublish
Abstract
Breakthrough methods in machine learning (ML), protein structure prediction, and novel ultrafast structural aligners are revolutionizing structural biology. Obtaining accurate models of proteins and annotating their functions on a large scale is no longer limited by time and resources. The most recent method to be top ranked by the Critical Assessment of Structure Prediction (CASP) assessment, AlphaFold 2 (AF2), is capable of building structural models with an accuracy comparable to that of experimental structures. Annotations of 3D models are keeping pace with the deposition of the structures due to advancements in protein language models (pLMs) and structural aligners that help validate these transferred annotations. In this review we describe how recent developments in ML for protein science are making large-scale structural bioinformatics available to the general scientific community.
Identifiers
pubmed: 36504138
pii: S0968-0004(22)00308-5
doi: 10.1016/j.tibs.2022.11.001
pmc: PMC10570143
Chemical substances
Proteins
0
Publication types
Journal Article
Review
Research Support, Non-U.S. Gov't
Languages
eng
Citation subsets
IM
Pagination
345-359
Grants
Agency: Wellcome Trust
ID: 221327/Z/20/Z
Country: United Kingdom
Agency: Biotechnology and Biological Sciences Research Council
ID: BB/T002735/1
Country: United Kingdom
Copyright information
Copyright © 2022 The Author(s). Published by Elsevier Ltd. All rights reserved.
Conflict of interest statement
Declaration of interests: No interests are declared.