PATO: genome-wide prediction of lncRNA-DNA triple helices.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
01 03 2023
01 03 2023
Historique:
received:
16
11
2022
revised:
06
03
2023
accepted:
12
03
2023
medline:
30
3
2023
pubmed:
17
3
2023
entrez:
16
3
2023
Statut:
ppublish
Résumé
Long non-coding RNA (lncRNA) plays a key role in many biological processes. For instance, lncRNA regulates chromatin using different molecular mechanisms, including direct RNA-DNA hybridization via triplexes, cotranscriptional RNA-RNA interactions, and RNA-DNA binding mediated by protein complexes. While the functional annotation of lncRNA transcripts has been widely studied over the last 20 years, barely a handful of tools have been developed with the specific purpose of detecting and evaluating lncRNA-DNA triple helices. What is worse, some of these tools have nearly grown a decade old, making new triplex-centric pipelines depend on legacy software that cannot thoroughly process all the data made available by next-generation sequencing (NGS) technologies. We present PATO, a modern, fast, and efficient tool for the detection of lncRNA-DNA triplexes that matches NGS processing capabilities. PATO enables the prediction of triple helices at the genome scale and can process in as little as 1 h more than 60 GB of sequence data using a two-socket server. Moreover, PATO's efficiency allows a more exhaustive search of the triplex-forming solution space, and so PATO achieves higher levels of prediction accuracy in far less time than other tools in the state of the art. Source code, user manual, and tests are freely available to download under the MIT License at https://github.com/UDC-GAC/pato.
Identifiants
pubmed: 36924420
pii: 7078802
doi: 10.1093/bioinformatics/btad134
pmc: PMC10049783
pii:
doi:
Substances chimiques
RNA, Long Noncoding
0
DNA
9007-49-2
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
© The Author(s) 2023. Published by Oxford University Press.
Références
Genome Res. 2012 Jul;22(7):1372-81
pubmed: 22550012
Nat Rev Genet. 2014 Jan;15(1):7-21
pubmed: 24296535
J Biotechnol. 2017 Nov 10;261:157-168
pubmed: 28888961
Brief Bioinform. 2019 Mar 22;20(2):551-564
pubmed: 29697742
Angew Chem Int Ed Engl. 2017 Nov 27;56(48):15210-15233
pubmed: 28444822
Int J Mol Sci. 2020 Jan 28;21(3):
pubmed: 32012884