EZcount: An all-in-one software for microRNA expression quantification from NGS sequencing data.


Journal

Computers in biology and medicine
ISSN: 1879-0534
Titre abrégé: Comput Biol Med
Pays: United States
ID NLM: 1250250

Informations de publication

Date de publication:
06 2021
Historique:
received: 03 02 2021
revised: 17 03 2021
accepted: 17 03 2021
pubmed: 15 4 2021
medline: 29 6 2021
entrez: 14 4 2021
Statut: ppublish

Résumé

MicroRNAs (miRNAs) are short endogenous molecules of RNA that influence cell regulation by suppressing genes. Their ubiquity throughout all branches of the tree of life has suggested their central role in many cellular functions. Nowadays, several personalized medicine applications rely on miRNAs as biomarkers for diagnoses, prognoses, and prediction of drug response. The increasing ease of sequencing miRNAs contrasts with the difficulty of accurately quantifying their concentration. The use of general purpose aligners is only a partial solution as they have limited possibilities to accurately solve ambiguous mapping due to the short length of these sequences. We developed EZcount, an all-in-one software that, with a single command, performs the entire quantification process: from raw fastq files to read counts. Experiments show that EZcount is more sensitive and accurate than methods based on sequence alignment, independently of the library preparation protocol and sequencing machine. The parallel architecture of EZcount makes it fast enough to process a sample in minutes using a standard workstation. EZcount runs on all of the most common operating systems (Linux, Windows and MacOS) and is freely available for download at https://gitlab.com/BioAlgo/miR-pipe. A detailed description of the datasets, the raw experimental results, and all the scripts used for testing are available as supplementary material.

Identifiants

pubmed: 33852974
pii: S0010-4825(21)00146-3
doi: 10.1016/j.compbiomed.2021.104352
pii:
doi:

Substances chimiques

MicroRNAs 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

104352

Informations de copyright

Copyright © 2021 Elsevier Ltd. All rights reserved.

Auteurs

Filippo Geraci (F)

Institute for Informatics and Telematics, CNR, Pisa, 56124, Italy. Electronic address: filippo.geraci@iit.cnr.it.

Giovanni Manzini (G)

Institute for Informatics and Telematics, CNR, Pisa, 56124, Italy; Department of Computer Science, University of Pisa, Pisa, 56127, Italy. Electronic address: giovanni.manzini@unipi.it.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Classifications MeSH