Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering.
Journal
ACS chemical biology
ISSN: 1554-8937
Titre abrégé: ACS Chem Biol
Pays: United States
ID NLM: 101282906
Informations de publication
Date de publication:
16 06 2023
16 06 2023
Historique:
medline:
19
6
2023
pubmed:
23
5
2023
entrez:
23
5
2023
Statut:
ppublish
Résumé
In the past decade, macrocyclic peptides gained increasing interest as a new therapeutic modality to tackle intracellular and extracellular therapeutic targets that had been previously classified as "undruggable". Several technological advances have made discovering macrocyclic peptides against these targets possible: 1) the inclusion of noncanonical amino acids (NCAAs) into mRNA display, 2) increased availability of next generation sequencing (NGS), and 3) improvements in rapid peptide synthesis platforms. This type of directed-evolution based screening can produce large numbers of potential hit sequences given that DNA sequencing is the functional output of this platform. The current standard for selecting hit peptides from these selections for downstream follow-up relies on the frequency counting and sorting of unique peptide sequences which can result in the generation of false negatives due to technical reasons including low translation efficiency or other experimental factors. To overcome our inability to detect weakly enriched peptide sequences among our large data sets, we wanted to develop a clustering method that would enable the identification of peptide families. Unfortunately, utilizing traditional clustering algorithms, such as ClustalW, is not possible for this technology due to the incorporation of NCAAs in these libraries. Therefore, we developed a new atomistic clustering method with a Pairwise Aligned Peptide (PAP) chemical similarity metric to perform sequence alignments and identify macrocyclic peptide families. With this method, low enriched peptides, including isolated sequences (singletons), can now be clustered into families providing a comprehensive analysis of NGS data resulting from macrocycle discovery selections. Additionally, upon identification of a hit peptide with the desired activity, this clustering algorithm can be used to identify derivatives from the initial data set for structure-activity relationship (SAR) analysis without requiring additional selection experiments.
Identifiants
pubmed: 37220419
doi: 10.1021/acschembio.3c00159
pmc: PMC10278063
doi:
Substances chimiques
Amino Acids
0
Peptides
0
Peptide Library
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
1425-1434Références
J Cheminform. 2015 Mar 25;7:11
pubmed: 25866564
Nat Protoc. 2011 Apr;6(4):540-52
pubmed: 21455189
Acc Chem Res. 2021 Sep 21;54(18):3604-3617
pubmed: 34505781
J Med Chem. 2014 Apr 24;57(8):3186-204
pubmed: 24151987
J Am Chem Soc. 2019 Mar 13;141(10):4167-4181
pubmed: 30768253
J Chem Inf Model. 2010 May 24;50(5):742-54
pubmed: 20426451
Nat Chem Biol. 2023 Jan;19(1):55-63
pubmed: 36577875
Nucleic Acids Res. 1994 Nov 11;22(22):4673-80
pubmed: 7984417
Proc Natl Acad Sci U S A. 1992 Nov 15;89(22):10915-9
pubmed: 1438297
ACS Med Chem Lett. 2016 Jun 06;7(8):757-61
pubmed: 27563399
Curr Opin Biotechnol. 2019 Dec;60:168-178
pubmed: 30974337
Annu Rev Biochem. 2014;83:727-52
pubmed: 24580641
Chemistry. 2021 Jan 21;27(5):1487-1513
pubmed: 32875673
J Mol Biol. 1970 Mar;48(3):443-53
pubmed: 5420325
J Chem Inf Model. 2022 Mar 14;62(5):1259-1267
pubmed: 35192366
PLoS Comput Biol. 2006 Jun 9;2(6):e65
pubmed: 16789818
J Cheminform. 2013 May 30;5(1):26
pubmed: 23721588
Science. 1960 Oct 21;132(3434):1115-8
pubmed: 17790723
Nat Commun. 2020 Mar 13;11(1):1392
pubmed: 32170178
J Cheminform. 2017 Jun 12;9(1):38
pubmed: 29086196