Exploring protein-mediated compaction of DNA by coarse-grained simulations and unsupervised learning.
Journal
Biophysical journal
ISSN: 1542-0086
Titre abrégé: Biophys J
Pays: United States
ID NLM: 0370626
Informations de publication
Date de publication:
23 Jul 2024
23 Jul 2024
Historique:
received:
28
03
2024
revised:
18
06
2024
accepted:
18
07
2024
medline:
24
7
2024
pubmed:
24
7
2024
entrez:
24
7
2024
Statut:
aheadofprint
Résumé
Protein-DNA interactions and protein-mediated DNA compaction play key roles in a range of biological processes. The length scales typically involved in DNA bending, bridging, looping, and compaction (≥ 1 kbp) are challenging to address experimentally or by all-atom molecular dynamics simulations, making coarse-grained simulations a natural approach. Here we present a simple and generic coarse-grained model for the DNA-protein and protein-protein interactions, and investigate the role of the latter in the protein-induced compaction of DNA. Our approach models the DNA as a discrete worm-like chain. The proteins are treated in the grand-canonical ensemble and the protein-DNA binding strength is taken from experimental measurements. Protein-DNA interactions are modeled as an isotropic binding potential with an imposed binding valency, without specific assumptions about the binding geometry. To systematically and quantitatively classify DNA-protein complexes, we present an unsupervised machine learning pipeline that receives a large set of structural order parameters as input, reduces the dimensionality via principal component analysis, and groups the results using a Gaussian mixture model. We apply our method to recent data on the compaction of viral genome-length DNA by HIV integrase and we find that protein-protein interactions are critical to the formation of looped intermediate structures seen experimentally. Our methodology is broadly applicable to DNA-binding proteins and to protein-induced DNA compaction and provides a systematic and semi-quantitative approach for analyzing their mesoscale complexes.
Identifiants
pubmed: 39044429
pii: S0006-3495(24)00482-X
doi: 10.1016/j.bpj.2024.07.023
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
Copyright © 2024. Published by Elsevier Inc.