iDNA-ITLM: An interpretable and transferable learning model for identifying DNA methylation.
Journal
PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081
Informations de publication
Date de publication:
2024
2024
Historique:
received:
16
10
2023
accepted:
20
03
2024
medline:
1
11
2024
pubmed:
1
11
2024
entrez:
31
10
2024
Statut:
epublish
Résumé
In this study, from the perspective of image processing, we propose the iDNA-ITLM model, using a novel data enhance strategy by continuously self-replicating a short DNA sequence into a longer DNA sequence and then embedding it into a high-dimensional matrix to enlarge the receptive field, for identifying DNA methylation sites. Our model consistently outperforms the current state-of-the-art sequence-based DNA methylation site recognition methods when evaluated on 17 benchmark datasets that cover multiple species and include three DNA methylation modifications (4mC, 5hmC, and 6mA). The experimental results demonstrate the robustness and superior performance of our model across these datasets. In addition, our model can transfer learning to RNA methylation sequences and produce good results without modifying the hyperparameters in the model. The proposed iDNA-ITLM model can be considered a universal predictor across DNA and RNA methylation species.
Identifiants
pubmed: 39480834
doi: 10.1371/journal.pone.0301791
pii: PONE-D-23-33766
doi:
Substances chimiques
DNA
9007-49-2
RNA
63231-63-0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
e0301791Informations de copyright
Copyright: © 2024 Yu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Déclaration de conflit d'intérêts
The authors declare no competing interests.