RNAsolo: a repository of cleaned PDB-derived RNA 3D structures.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Abbreviated title: Bioinformatics
Country: England
NLM ID: 9808944

Publication information

Publication date:
11 07 2022
History:
received: 12 01 2022
revised: 22 04 2022
accepted: 02 06 2022
pubmed: 9 6 2022
medline: 15 11 2022
entrez: 8 6 2022
Status: ppublish

Abstract

The development of algorithms dedicated to RNA three-dimensional (3D) structures contributes to the demand for training, testing and benchmarking data. A reliable source of such data derived from computational prediction is the RNA-Puzzles repository. In contrast, the largest resource with experimentally determined structures is the Protein Data Bank (PDB). However, files in this archive often contain other molecular data in addition to the RNA structure itself, and these data must be removed before the files can be used by RNA-processing algorithms. RNAsolo is a self-updating database dedicated to RNA bioinformatics. It systematically collects experimentally determined RNA 3D structures stored in the PDB, removes non-RNA chains from them, and groups them into equivalence classes. With a single click, users can download various subsets of the data (clustered by resolution, source, data format, etc.) for further processing and analysis. The repository is publicly available at https://rnasolo.cs.put.poznan.pl.
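
The cleaning step described in the abstract (dropping non-RNA chains from a PDB entry) can be illustrated with a minimal Python sketch. This is not RNAsolo's actual implementation: it assumes legacy fixed-column PDB ATOM records, treats only the residue names A, C, G and U as RNA, and discards HETATM and all other record types. The function names rna_chain_ids and keep_rna_only, and the helper atom_line, are hypothetical and introduced only for illustration.

```python
# Simplified illustration only; NOT the RNAsolo pipeline.
# Assumption: input is legacy PDB text with fixed columns
# (residue name in columns 18-20, chain ID in column 22).
RNA_RESIDUES = {"A", "C", "G", "U"}  # assumption: modified nucleotides are ignored


def rna_chain_ids(pdb_lines):
    """Return IDs of chains whose ATOM records contain only standard RNA residues."""
    residues_by_chain = {}
    for line in pdb_lines:
        if line.startswith("ATOM") and len(line) > 21:
            res_name = line[17:20].strip()
            chain_id = line[21]
            residues_by_chain.setdefault(chain_id, set()).add(res_name)
    return {c for c, names in residues_by_chain.items() if names <= RNA_RESIDUES}


def keep_rna_only(pdb_lines):
    """Keep only ATOM records that belong to RNA-only chains."""
    keep = rna_chain_ids(pdb_lines)
    return [
        line
        for line in pdb_lines
        if line.startswith("ATOM") and len(line) > 21 and line[21] in keep
    ]


def atom_line(serial, atom, res, chain, resseq):
    """Hypothetical helper: build a truncated ATOM record with correct column layout."""
    return f"ATOM  {serial:>5} {atom:<4} {res:>3} {chain}{resseq:>4}"


# Toy entry: chain A is RNA (G, U), chain B is protein (ALA).
example = [
    atom_line(1, "P", "G", "A", 1),
    atom_line(2, "CA", "ALA", "B", 1),
    atom_line(3, "P", "U", "A", 2),
]
cleaned = keep_rna_only(example)
print(len(cleaned))
```

A real pipeline would more likely rely on an established structure parser and on the polymer-type annotations in mmCIF files rather than on residue names alone; the sketch only shows the idea of filtering at the chain level.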

Identifiers

pubmed: 35674373
pii: 6604270
doi: 10.1093/bioinformatics/btac386
pmc: PMC9272803

Chemical substances

RNA 63231-63-0

Publication types

Journal Article
Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

3668-3670

Grants

Agency: Poznan University of Technology
Agency: Institute of Bioorganic Chemistry PAS
Agency: National Science Centre
ID: 2019/35/B/ST6/03074

Copyright information

© The Author(s) 2022. Published by Oxford University Press.

Authors

Bartosz Adamczyk (B)

Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland.

Maciej Antczak (M)

Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland.
Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland.

Marta Szachniuk (M)

Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland.
Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland.

MeSH classifications