DNAModAnnot: a R toolbox for DNA modification filtering and annotation.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
09 Sep 2021
Historique:
received: 24 09 2020
revised: 17 12 2020
accepted: 13 01 2021
medline: 21 1 2021
pubmed: 21 1 2021
entrez: 20 1 2021
Statut: ppublish

Résumé

Long-read sequencing technologies can be employed to detect and map DNA modifications at the nucleotide resolution on a genome-wide scale. However, published software packages neglect the integration of genomic annotation and comprehensive filtering when analyzing patterns of modified bases detected using Pacific Biosciences (PacBio) or Oxford Nanopore Technologies (ONT) data. Here, we present DNA Modification Annotation (DNAModAnnot), a R package designed for the global analysis of DNA modification patterns using adapted filtering and visualization tools. We tested our package using PacBio sequencing data to analyze patterns of the 6-methyladenine (6mA) in the ciliate Paramecium tetraurelia, in which high 6mA amounts were previously reported. We found P. tetraurelia 6mA genome-wide distribution to be similar to other ciliates. We also performed 5-methylcytosine (5mC) analysis in human lymphoblastoid cells using ONT data and confirmed previously known patterns of 5mC. DNAModAnnot provides a toolbox for the genome-wide analysis of different DNA modifications using PacBio and ONT long-read sequencing data. DNAModAnnot is distributed as a R package available via GitHub (https://github.com/AlexisHardy/DNAModAnnot). Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 33471071
pii: 6104827
doi: 10.1093/bioinformatics/btab032
pmc: PMC8428616
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

2738-2740

Subventions

Organisme : CNRS
Organisme : Fondation de la Recherche Médicale (Equipe FRM
ID : DEQ20160334868
Organisme : Agence Nationale de la Recherche
ID : ANR-18-CE12-0005-03
Organisme : GeT core facility
Organisme : France Génomique National infrastructure
Organisme : Agence Nationale pour la Recherche
ID : ANR-10-INBS-09
Organisme : GET-PACBIO
Organisme : FEDER-FSE MIDI-PYRENEES ET GARONNE 2014-2020

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press.

Références

Biochim Biophys Acta. 1974 Nov 20;374(1):1-11
pubmed: 4429738
Bioinformatics. 2020 May 1;36(10):3236-3238
pubmed: 32053166
Epigenetics Chromatin. 2015 Jan 27;8:6
pubmed: 25972926
Genes Dev. 2011 May 15;25(10):1010-22
pubmed: 21576262
Genome Biol. 2020 Feb 7;21(1):30
pubmed: 32033565
Nucleic Acids Res. 2017 Nov 16;45(20):11594-11606
pubmed: 29036602
Cell. 2019 Jun 13;177(7):1781-1796.e25
pubmed: 31104845
Nat Biotechnol. 2018 Apr;36(4):338-345
pubmed: 29431738
Methods Mol Biol. 2016;1418:335-51
pubmed: 27008022
BMC Genomics. 2019 Jun 3;20(1):445
pubmed: 31159718
Genome Res. 2018 Jul;28(7):1067-1078
pubmed: 29764913
Essays Biochem. 2019 Dec 20;63(6):639-648
pubmed: 31755932

Auteurs

Alexis Hardy (A)

Université de Paris, CNRS, Institut Jacques Monod, F-75006 Paris, France.

Mélody Matelot (M)

Université de Paris, CNRS, Institut Jacques Monod, F-75006 Paris, France.

Amandine Touzeau (A)

Université de Paris, CNRS, Institut Jacques Monod, F-75006 Paris, France.

Christophe Klopp (C)

Plateforme bioinformatique Genotoul, UR875 Mathématique et Informatique Appliquée de Toulouse, INRA, 31326 Castanet-Tolosan, France.

Céline Lopez-Roques (C)

INRAe, US 1426, GeT-PlaGe, Genotoul, 31326 Castanet-Tolosan, France.

Sandra Duharcourt (S)

Université de Paris, CNRS, Institut Jacques Monod, F-75006 Paris, France.

Matthieu Defrance (M)

Université Libre de Bruxelles, Interuniversity Institute of Bioinformatics in Brussels (IB2), Brussels 1050, Belgium.

Classifications MeSH