Prediction of histone post-translational modifications using deep learning.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
05 Apr 2021
05 Apr 2021
Historique:
received:
23
04
2020
revised:
27
11
2020
accepted:
16
12
2020
medline:
29
12
2020
pubmed:
29
12
2020
entrez:
28
12
2020
Statut:
ppublish
Résumé
Histone post-translational modifications (PTMs) are involved in a variety of essential regulatory processes in the cell, including transcription control. Recent studies have shown that histone PTMs can be accurately predicted from the knowledge of transcription factor binding or DNase hypersensitivity data. Similarly, it has been shown that one can predict PTMs from the underlying DNA primary sequence. In this study, we introduce a deep learning architecture called DeepPTM for predicting histone PTMs from transcription factor binding data and the primary DNA sequence. Extensive experimental results show that our deep learning model outperforms the prediction accuracy of the model proposed in Benveniste et al. (PNAS 2014) and DeepHistone (BMC Genomics 2019). The competitive advantage of our framework lies in the synergistic use of deep learning combined with an effective pre-processing step. Our classification framework has also enabled the discovery that the knowledge of a small subset of transcription factors (which are histone-PTM and cell-type-specific) can provide almost the same prediction accuracy that can be obtained using all the transcription factors data. https://github.com/dDipankar/DeepPTM. Supplementary data are available at Bioinformatics online.
Identifiants
pubmed: 33367499
pii: 6050715
doi: 10.1093/bioinformatics/btaa1075
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
5610-5617Subventions
Organisme : U.S. National Science Foundation
ID : IOS-1543963
Organisme : U.S. Department of Energy
Organisme : Office of Science
Organisme : Office of Biological and Environmental Research
Organisme : Genomic Science Program
ID : DE-SC0019093
Informations de copyright
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.