EncoderMap(II): Visualizing Important Molecular Motions with Improved Generation of Protein Conformations.
Journal
Journal of chemical information and modeling
ISSN: 1549-960X
Titre abrégé: J Chem Inf Model
Pays: United States
ID NLM: 101230060
Informations de publication
Date de publication:
25 11 2019
25 11 2019
Historique:
pubmed:
28
10
2019
medline:
8
10
2020
entrez:
25
10
2019
Statut:
ppublish
Résumé
Dimensionality reduction can be used to project high-dimensional molecular data into a simplified, low-dimensional map. One feature of our recently introduced dimensionality reduction technique EncoderMap, which relies on the combination of an autoencoder with multidimensional scaling, is its ability to do the reverse. It is able to generate conformations for any selected points in the low-dimensional map. This transfers the simplified, low-dimensional map back into the high-dimensional conformational space. Although the output is again high-dimensional, certain aspects of the simplification are preserved. The generated conformations only mirror the most dominant conformational differences that determine the positions of conformational states in the low-dimensional map. This allows depicting such differences and-in consequence-visualizing molecular motions and gives a unique perspective on high-dimensional conformational data. In our previous work, protein conformations described in backbone dihedral angle space were used as the input for EncoderMap, and conformations were also generated in this space. For large proteins, however, the generation of conformations is inaccurate with this approach due to the local character of backbone dihedral angles. Here, we present an improved variant of EncoderMap which is able to generate large protein conformations that are accurate in short-range and long-range orders. This is achieved by differentiable reconstruction of Cartesian coordinates from the generated dihedrals, which allows adding a contribution to the cost function that monitors the accuracy of all pairwise distances between the C
Identifiants
pubmed: 31647645
doi: 10.1021/acs.jcim.9b00675
doi:
Substances chimiques
HSP70 Heat-Shock Proteins
0
Proteins
0
Saccharomyces cerevisiae Proteins
0
UBD protein, human
0
Ubiquitins
0
Adenosine Triphosphatases
EC 3.6.1.-
SSA1 protein, S cerevisiae
EC 3.6.1.3
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM