SAMStat 2: quality control for next generation sequencing data.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 01 2023
Historique:
received: 14 07 2022
revised: 04 11 2022
accepted: 12 01 2023
pubmed: 14 1 2023
medline: 21 1 2023
entrez: 13 1 2023
Statut: ppublish

Résumé

SAMStat is an efficient program to extract quality control metrics from fastq and SAM/BAM files. A distinguishing feature is that it displays sequence composition, base quality composition and mapping error profiles split by mapping quality. This allows users to rapidly identify reasons for poor mapping including the presence of untrimmed adapters or poor sequencing quality at individual read positions. Here, we present a major update to SAMStat. The new version now supports paired-end and long-read data. Quality control plots are drawn using the ploty javascript library. The source code of SAMStat and code to reproduce the results are found here: https://github.com/timolassmann/samstat.

Identifiants

pubmed: 36637208
pii: 6986964
doi: 10.1093/bioinformatics/btad019
pmc: PMC9850270
pii:
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : Feilman and Stan Perron Foundation

Informations de copyright

© The Author(s) 2023. Published by Oxford University Press.

Références

Gigascience. 2021 Feb 16;10(2):
pubmed: 33594436
Bioinformatics. 2021 Mar 02;:
pubmed: 33677499
Nature. 2012 Sep 6;489(7414):57-74
pubmed: 22955616
Bioinformatics. 2016 Oct 1;32(19):3047-8
pubmed: 27312411
Genome Res. 2008 Nov;18(11):1851-8
pubmed: 18714091
Bioinformatics. 2011 Jan 1;27(1):130-1
pubmed: 21088025
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943

Auteurs

Timo Lassmann (T)

Precision Health, Telethon Kids Institute, University of Western Australia, Perth, WA 6009, Australia.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Classifications MeSH