ModDotPlot-rapid and interactive visualization of tandem repeats.
dotplot
heatmap
modimizer
sketching
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
07 Aug 2024
07 Aug 2024
Historique:
received:
15
04
2024
revised:
02
07
2024
accepted:
05
08
2024
medline:
7
8
2024
pubmed:
7
8
2024
entrez:
7
8
2024
Statut:
aheadofprint
Résumé
A common method for analyzing genomic repeats is to produce a sequence similarity matrix visualized via a dot plot. Innovative approaches such as StainedGlass have improved upon this classic visualization by rendering dot plots as a heatmap of sequence identity, enabling researchers to better visualize multi-megabase tandem repeat arrays within centromeres and other heterochromatic regions of the genome. However, computing the similarity estimates for heatmaps requires high computational overhead and can suffer from decreasing accuracy. In this work we introduce ModDotPlot, an interactive and alignment-free dot plot viewer. By approximating average nucleotide identity via a k-mer-based containment index, ModDotPlot produces accurate plots orders of magnitude faster than StainedGlass. We accomplish this through the use of a hierarchical modimizer scheme that can visualize the full 128 Mbp genome of Arabidopsis thaliana in under 5 minutes on a laptop. ModDotPlot is bundled with a graphical user interface supporting real-time interactive navigation of entire chromosomes. ModDotPlot is available at https://github.com/marbl/ModDotPlot.
Identifiants
pubmed: 39110522
pii: 7729118
doi: 10.1093/bioinformatics/btae493
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
Published by Oxford University Press 2024.