PATO: genome-wide prediction of lncRNA-DNA triple helices.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 03 2023
Historique:
received: 16 11 2022
revised: 06 03 2023
accepted: 12 03 2023
medline: 30 3 2023
pubmed: 17 3 2023
entrez: 16 3 2023
Statut: ppublish

Résumé

Long non-coding RNA (lncRNA) plays a key role in many biological processes. For instance, lncRNA regulates chromatin using different molecular mechanisms, including direct RNA-DNA hybridization via triplexes, cotranscriptional RNA-RNA interactions, and RNA-DNA binding mediated by protein complexes. While the functional annotation of lncRNA transcripts has been widely studied over the last 20 years, barely a handful of tools have been developed with the specific purpose of detecting and evaluating lncRNA-DNA triple helices. What is worse, some of these tools have nearly grown a decade old, making new triplex-centric pipelines depend on legacy software that cannot thoroughly process all the data made available by next-generation sequencing (NGS) technologies. We present PATO, a modern, fast, and efficient tool for the detection of lncRNA-DNA triplexes that matches NGS processing capabilities. PATO enables the prediction of triple helices at the genome scale and can process in as little as 1 h more than 60 GB of sequence data using a two-socket server. Moreover, PATO's efficiency allows a more exhaustive search of the triplex-forming solution space, and so PATO achieves higher levels of prediction accuracy in far less time than other tools in the state of the art. Source code, user manual, and tests are freely available to download under the MIT License at https://github.com/UDC-GAC/pato.

Identifiants

pubmed: 36924420
pii: 7078802
doi: 10.1093/bioinformatics/btad134
pmc: PMC10049783
pii:
doi:

Substances chimiques

RNA, Long Noncoding 0
DNA 9007-49-2

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2023. Published by Oxford University Press.

Références

Genome Res. 2012 Jul;22(7):1372-81
pubmed: 22550012
Nat Rev Genet. 2014 Jan;15(1):7-21
pubmed: 24296535
J Biotechnol. 2017 Nov 10;261:157-168
pubmed: 28888961
Brief Bioinform. 2019 Mar 22;20(2):551-564
pubmed: 29697742
Angew Chem Int Ed Engl. 2017 Nov 27;56(48):15210-15233
pubmed: 28444822
Int J Mol Sci. 2020 Jan 28;21(3):
pubmed: 32012884

Auteurs

Iñaki Amatria-Barral (I)

Computer Architecture Group, Department of Computer Engineering, CITIC, Universidade da Coruña, Campus de Elviña, A Coruña 15071, Spain.

Jorge González-Domínguez (J)

Computer Architecture Group, Department of Computer Engineering, CITIC, Universidade da Coruña, Campus de Elviña, A Coruña 15071, Spain.

Juan Touriño (J)

Computer Architecture Group, Department of Computer Engineering, CITIC, Universidade da Coruña, Campus de Elviña, A Coruña 15071, Spain.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH