ModDotPlot: rapid and interactive visualization of tandem repeats.

Keywords: dotplot, heatmap, modimizer, sketching

Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Abbreviated title: Bioinformatics
Country: England
NLM ID: 9808944

Publication information

Publication date:
07 Aug 2024
History:
received: 15 Apr 2024
revised: 02 Jul 2024
accepted: 05 Aug 2024
medline: 07 Aug 2024
pubmed: 07 Aug 2024
entrez: 07 Aug 2024
Status: ahead of print

Abstract

A common method for analyzing genomic repeats is to produce a sequence similarity matrix visualized via a dot plot. Innovative approaches such as StainedGlass have improved upon this classic visualization by rendering dot plots as a heatmap of sequence identity, enabling researchers to better visualize multi-megabase tandem repeat arrays within centromeres and other heterochromatic regions of the genome. However, computing the similarity estimates for heatmaps incurs high computational overhead and can suffer from decreasing accuracy. In this work, we introduce ModDotPlot, an interactive and alignment-free dot plot viewer. By approximating average nucleotide identity via a k-mer-based containment index, ModDotPlot produces accurate plots orders of magnitude faster than StainedGlass. We accomplish this through the use of a hierarchical modimizer scheme that can visualize the full 128 Mbp genome of Arabidopsis thaliana in under 5 minutes on a laptop. ModDotPlot is bundled with a graphical user interface supporting real-time interactive navigation of entire chromosomes. ModDotPlot is available at https://github.com/marbl/ModDotPlot.
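
To make the idea of a k-mer-based containment index with modimizer sampling concrete, here is a minimal Python sketch. It is not ModDotPlot's code and does not use its API: the function names, the BLAKE2b hash, the sampling rule (keep a k-mer when its hash is divisible by s), and the conversion from containment to identity as c^(1/k) (a simple model in which a k-mer survives only if all k bases match) are all assumptions chosen for illustration. The hierarchical part of the scheme described above is not modelled.

```python
import hashlib


def kmers(seq, k):
    """Return the set of all k-mers in seq (forward strand only)."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def stable_hash(kmer):
    """Deterministic 64-bit hash; Python's built-in hash() is salted per process."""
    digest = hashlib.blake2b(kmer.encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def modimizers(seq, k, s):
    """Hypothetical helper: keep hashes divisible by s (about 1/s of all k-mers)."""
    return {h for h in map(stable_hash, kmers(seq, k)) if h % s == 0}


def containment(a, b):
    """Containment index of set a in set b: |a & b| / |a|."""
    return len(a & b) / len(a) if a else 0.0


def identity_estimate(c, k):
    """Assumed model: a k-mer is shared with probability identity**k."""
    return c ** (1.0 / k)


if __name__ == "__main__":
    k = 3
    a = modimizers("ACGTACGTAC", k, 1)  # s=1 keeps every k-mer
    b = modimizers("ACGTTTTT", k, 1)
    c = containment(a, b)
    print(c, identity_estimate(c, k))
```

With s = 1 every k-mer is kept, so the result matches the exact containment index; larger values of s trade accuracy for speed by comparing only a sample of the k-mers in each window.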

Identifiers

pubmed: 39110522
pii: 7729118
doi: 10.1093/bioinformatics/btae493

Publication types

Journal Article

Languages

eng

Citation subsets

IM

Copyright information

Published by Oxford University Press 2024.

Authors

Alexander P Sweeten (AP)

Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, 9000 Rockville Pike, MD 20892, USA.
Department of Computer Science, Johns Hopkins University, MD 21211, USA.

Michael C Schatz (MC)

Department of Computer Science, Johns Hopkins University, MD 21211, USA.

Adam M Phillippy (AM)

Genome Informatics Section, Center for Genomics and Data Science Research, National Human Genome Research Institute, National Institutes of Health, 9000 Rockville Pike, MD 20892, USA.

MeSH classifications