Prediction of histone post-translational modifications using deep learning.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
05 Apr 2021
Historique:
received: 23 04 2020
revised: 27 11 2020
accepted: 16 12 2020
medline: 29 12 2020
pubmed: 29 12 2020
entrez: 28 12 2020
Statut: ppublish

Résumé

Histone post-translational modifications (PTMs) are involved in a variety of essential regulatory processes in the cell, including transcription control. Recent studies have shown that histone PTMs can be accurately predicted from the knowledge of transcription factor binding or DNase hypersensitivity data. Similarly, it has been shown that one can predict PTMs from the underlying DNA primary sequence. In this study, we introduce a deep learning architecture called DeepPTM for predicting histone PTMs from transcription factor binding data and the primary DNA sequence. Extensive experimental results show that our deep learning model outperforms the prediction accuracy of the model proposed in Benveniste et al. (PNAS 2014) and DeepHistone (BMC Genomics 2019). The competitive advantage of our framework lies in the synergistic use of deep learning combined with an effective pre-processing step. Our classification framework has also enabled the discovery that the knowledge of a small subset of transcription factors (which are histone-PTM and cell-type-specific) can provide almost the same prediction accuracy that can be obtained using all the transcription factors data. https://github.com/dDipankar/DeepPTM. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 33367499
pii: 6050715
doi: 10.1093/bioinformatics/btaa1075
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

5610-5617

Subventions

Organisme : U.S. National Science Foundation
ID : IOS-1543963
Organisme : U.S. Department of Energy
Organisme : Office of Science
Organisme : Office of Biological and Environmental Research
Organisme : Genomic Science Program
ID : DE-SC0019093

Informations de copyright

© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Auteurs

Dipankar Ranjan Baisya (DR)

Department of Computer Science and Engineering, University of California, Riverside, CA, 92521, USA.

Stefano Lonardi (S)

Department of Computer Science and Engineering, University of California, Riverside, CA, 92521, USA.

Classifications MeSH