BML: a versatile web server for bipartite motif discovery.


Journal

Briefings in bioinformatics
ISSN: 1477-4054
Titre abrégé: Brief Bioinform
Pays: England
ID NLM: 100912837

Informations de publication

Date de publication:
17 01 2022
Historique:
received: 18 06 2021
revised: 18 11 2021
accepted: 19 11 2021
pubmed: 3 1 2022
medline: 12 3 2022
entrez: 2 1 2022
Statut: ppublish

Résumé

Motif discovery and characterization are important for gene regulation analysis. The lack of intuitive and integrative web servers impedes the effective use of motifs. Most motif discovery web tools are either not designed for non-expert users or lacking optimization steps when using default settings. Here we describe bipartite motifs learning (BML), a parameter-free web server that provides a user-friendly portal for online discovery and analysis of sequence motifs, using high-throughput sequencing data as the input. BML utilizes both position weight matrix and dinucleotide weight matrix, the latter of which enables the expression of the interdependencies of neighboring bases. With input parameters concerning the motifs are given, the BML achieves significantly higher accuracy than other available tools for motif finding. When no parameters are given by non-expert users, unlike other tools, BML employs a learning method to identify motifs automatically and achieve accuracy comparable to the scenario where the parameters are set. The BML web server is freely available at http://motif.t-ridership.com/ (https://github.com/Mohammad-Vahed/BML).

Identifiants

pubmed: 34974623
pii: 6490318
doi: 10.1093/bib/bbab536
pmc: PMC8769915
pii:
doi:

Substances chimiques

Transcription Factors 0

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : NIEHS NIH HHS
ID : K01 ES025434
Pays : United States
Organisme : NICHD NIH HHS
ID : R01 HD084633
Pays : United States
Organisme : NLM NIH HHS
ID : R01 LM012373
Pays : United States
Organisme : NLM NIH HHS
ID : R01 LM012907
Pays : United States

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press.

Références

Pac Symp Biocomput. 2001;:127-38
pubmed: 11262934
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D75-7
pubmed: 14681362
Nucleic Acids Res. 2020 Jan 8;48(D1):D87-D92
pubmed: 31701148
Nat Biotechnol. 2005 Jan;23(1):137-44
pubmed: 15637633
Biol Direct. 2006 Apr 06;1:11
pubmed: 16600018
Nucleic Acids Res. 2017 Mar 17;45(5):e27
pubmed: 27899659
Genetics. 2012 Jul;191(3):781-90
pubmed: 22505627
Nature. 2005 Mar 17;434(7031):338-45
pubmed: 15735639
Nucleic Acids Res. 2010 Jul;38(12):e135
pubmed: 20439311
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W202-8
pubmed: 19458158
Bioinformatics. 2000 Jan;16(1):16-23
pubmed: 10812473
PLoS Comput Biol. 2011 Aug;7(8):e1002100
pubmed: 21829340
PLoS Comput Biol. 2008 May 09;4(4):e1000071
pubmed: 18437229
Mol Pharm. 2008 Jan-Feb;5(1):3-16
pubmed: 18076137
BMC Bioinformatics. 2010 Apr 09;11:179
pubmed: 20380693
Front Genet. 2016 Feb 23;7:24
pubmed: 26941778
PLoS One. 2011;6(9):e24576
pubmed: 21931761
Nucleic Acids Res. 2004 Sep 23;32(17):4979-91
pubmed: 15388800
PLoS One. 2010 Mar 22;5(3):e9722
pubmed: 20339533
Nat Rev Genet. 2004 Apr;5(4):276-87
pubmed: 15131651
Nucleic Acids Res. 2018 May 4;46(8):e44
pubmed: 29385521
Proc Int Conf Intell Syst Mol Biol. 1994;2:28-36
pubmed: 7584402
J Mol Biol. 2002 Apr 12;317(5):753-64
pubmed: 11955022
Proteins. 1990;7(1):41-51
pubmed: 2184437
Nucleic Acids Res. 2016 Jan 4;44(D1):D133-43
pubmed: 26527724
PLoS One. 2019 Aug 30;14(8):e0220207
pubmed: 31469855
Proc Int Conf Intell Syst Mol Biol. 1995;3:21-9
pubmed: 7584439
Bioinformatics. 2014 Jun 15;30(12):1667-73
pubmed: 24532725

Auteurs

Mohammad Vahed (M)

Department of Pathology & Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles (UCLA), California, USA.
Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, 48105, USA.

Majid Vahed (M)

Pharmaceutical Sciences Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran.

Lana X Garmire (LX)

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, 48105, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH