TSPTFBS 2.0: trans-species prediction of transcription factor binding sites and identification of their core motifs in plants.
DenseNet
biological interpretability
core motif
trans-species prediction
transcription factor binding sites
Journal
Frontiers in plant science
ISSN: 1664-462X
Titre abrégé: Front Plant Sci
Pays: Switzerland
ID NLM: 101568200
Informations de publication
Date de publication:
2023
2023
Historique:
received:
28
02
2023
accepted:
13
04
2023
medline:
25
5
2023
pubmed:
25
5
2023
entrez:
25
5
2023
Statut:
epublish
Résumé
An emerging approach using promoter tiling deletion via genome editing is beginning to become popular in plants. Identifying the precise positions of core motifs within plant gene promoter is of great demand but they are still largely unknown. We previously developed TSPTFBS of 265 Here, we additionally introduced 104 maize and 20 rice TFBS datasets and utilized DenseNet for model construction on a large-scale dataset of a total of 389 plant TFs. More importantly, we combined three biological interpretability methods including DeepLIFT, For the results, DenseNet not only has achieved greater predictability than baseline methods such as LS-GKM and MEME for above 389 TFs from Arabidopsis, maize and rice, but also has greater performance on trans-species prediction of a total of 15 TFs from other six plant species. A motif analysis based on TF-MoDISco and global importance analysis (GIA) further provide the biological implication of the core motif identified by three interpretability methods. Finally, we developed a pipeline of TSPTFBS 2.0, which integrates 389 DenseNet-based models of TF binding and the above three interpretability methods. TSPTFBS 2.0 was implemented as a user-friendly web-server (http://www.hzau-hulab.com/TSPTFBS/), which can support important references for editing targets of any given plant promoters and it has great potentials to provide reliable editing target of genetic screen experiments in plants.
Identifiants
pubmed: 37229121
doi: 10.3389/fpls.2023.1175837
pmc: PMC10203575
doi:
Types de publication
Journal Article
Langues
eng
Pagination
1175837Informations de copyright
Copyright © 2023 Cheng, Liu, Zhou, Deng, Ge and Hu.
Déclaration de conflit d'intérêts
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Références
Plant Cell. 2013 Sep;25(9):3360-76
pubmed: 24076974
PLoS Comput Biol. 2015 Aug 20;11(8):e1004418
pubmed: 26291518
Bioinformatics. 2011 Apr 1;27(7):1017-8
pubmed: 21330290
Brief Bioinform. 2022 Nov 19;23(6):
pubmed: 36155619
Nucleic Acids Res. 2016 Jun 20;44(11):e107
pubmed: 27084946
Nucleic Acids Res. 2020 Jan 8;48(D1):D87-D92
pubmed: 31701148
Bioinformatics. 2016 Jul 15;32(14):2196-8
pubmed: 27153584
Brief Bioinform. 2021 Jul 20;22(4):
pubmed: 33005921
Nature. 2015 Feb 19;518(7539):317-30
pubmed: 25693563
Nat Biotechnol. 2022 Sep;40(9):1403-1411
pubmed: 35449414
Nat Commun. 2020 Oct 9;11(1):5089
pubmed: 33037196
Science. 2007 Jun 8;316(5830):1497-502
pubmed: 17540862
Nat Genet. 2022 May;54(5):613-624
pubmed: 35551305
Bioinformatics. 2000 Jan;16(1):16-23
pubmed: 10812473
PLoS Comput Biol. 2021 May 13;17(5):e1008925
pubmed: 33983921
Nat Protoc. 2009;4(3):393-411
pubmed: 19265799
Nat Commun. 2015 Oct 22;6:8733
pubmed: 26490019
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D108-10
pubmed: 16381825
Nat Biotechnol. 2015 Aug;33(8):831-8
pubmed: 26213851
Plant J. 2022 Aug;111(4):1203-1215
pubmed: 35713985
Nat Rev Genet. 2004 Apr;5(4):276-87
pubmed: 15131651
Cell. 2021 Mar 18;184(6):1621-1635
pubmed: 33581057
Annu Rev Genet. 2018 Nov 23;52:421-444
pubmed: 30285496
Nature. 2020 Jul;583(7818):699-710
pubmed: 32728249
Mol Plant. 2019 Jun 3;12(6):743-763
pubmed: 30447332
Genome Res. 2010 Jun;20(6):861-73
pubmed: 20378718
Nat Methods. 2015 Oct;12(10):931-4
pubmed: 26301843
Cell. 2016 May 19;165(5):1280-1292
pubmed: 27203113
Nat Commun. 2022 Jun 14;13(1):3413
pubmed: 35701419
Genome Res. 2016 Jul;26(7):990-9
pubmed: 27197224
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W202-8
pubmed: 19458158
Bioinformatics. 2021 Apr 19;37(2):260-262
pubmed: 33416862