Scalable inference of transcriptional kinetic parameters from MS2 time series data.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
27 01 2022
27 01 2022
Historique:
received:
06
12
2020
revised:
22
07
2021
accepted:
03
11
2021
pubmed:
18
11
2021
medline:
3
2
2023
entrez:
17
11
2021
Statut:
ppublish
Résumé
The MS2-MCP (MS2 coat protein) live imaging system allows for visualization of transcription dynamics through the introduction of hairpin stem-loop sequences into a gene. A fluorescent signal at the site of nascent transcription in the nucleus quantifies mRNA production. Computational modelling can be used to infer the promoter states along with the kinetic parameters governing transcription, such as promoter switching frequency and polymerase loading rate. However, modelling of the fluorescent trace presents a challenge due its persistence; the observed fluorescence at a given time point depends on both current and previous promoter states. A compound state Hidden Markov Model (cpHMM) was recently introduced to allow inference of promoter activity from MS2-MCP data. However, the computational time for inference scales exponentially with gene length and the cpHMM is therefore not currently practical for application to many eukaryotic genes. We present a scalable implementation of the cpHMM for fast inference of promoter activity and transcriptional kinetic parameters. This new method can model genes of arbitrary length through the use of a time-adaptive truncated compound state space. The truncated state space provides a good approximation to the full state space by retaining the most likely set of states at each time during the forward pass of the algorithm. Testing on MS2-MCP fluorescent data collected from early Drosophila melanogaster embryos indicates that the method provides accurate inference of kinetic parameters within a computationally feasible timeframe. The inferred promoter traces generated by the model can also be used to infer single-cell transcriptional parameters. Python implementation is available at https://github.com/ManchesterBioinference/burstInfer, along with code to reproduce the examples presented here. Supplementary data are available at Bioinformatics online.
Identifiants
pubmed: 34788793
pii: 6426074
doi: 10.1093/bioinformatics/btab765
pmc: PMC8796374
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1030-1036Subventions
Organisme : Wellcome Trust
ID : 204832/Z/16/Z
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 204832/B/16/Z
Pays : United Kingdom
Organisme : Wellcome Trust
ID : 215187/Z/19/Z
Pays : United Kingdom
Informations de copyright
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Références
Nature. 2010 Sep 9;467(7312):167-73
pubmed: 20829787
Curr Biol. 2006 May 23;16(10):1018-25
pubmed: 16713960
Annu Rev Biophys. 2016 Jul 5;45:25-47
pubmed: 27145880
Dev Cell. 2020 Sep 28;54(6):727-741.e7
pubmed: 32758422
Nat Commun. 2016 Jul 27;7:12248
pubmed: 27461529
Cell. 2016 Jul 14;166(2):358-368
pubmed: 27293191
Science. 2015 Jun 26;348(6242):aaa5838
pubmed: 26113727
Nat Genet. 2011 Jun;43(6):554-60
pubmed: 21532574
Trends Genet. 2014 Aug;30(8):364-75
pubmed: 25005921
Nat Rev Genet. 2013 Aug;14(8):572-84
pubmed: 23835438
Mol Cell. 1998 Oct;2(4):437-45
pubmed: 9809065
Cell Syst. 2018 Apr 25;6(4):409-423.e11
pubmed: 29454937
Development. 2000 Aug;127(15):3305-12
pubmed: 10887086
Nat Commun. 2021 Jul 23;12(1):4504
pubmed: 34301936
Nature. 2002 Sep 19;419(6904):304-8
pubmed: 12239569
Dev Cell. 2010 Feb 16;18(2):260-74
pubmed: 20159596
Mol Cell. 2018 Aug 2;71(3):468-480
pubmed: 30075145
Elife. 2016 Feb 20;5:
pubmed: 26896676
Proc Natl Acad Sci U S A. 2020 Jan 14;117(2):836-847
pubmed: 31882445
PLoS Genet. 2016 Jul 05;12(7):e1006164
pubmed: 27379389
Proc Natl Acad Sci U S A. 2012 Oct 23;109(43):17454-9
pubmed: 23064634
Cell. 2008 Oct 17;135(2):216-26
pubmed: 18957198
Mol Cell. 2015 Apr 2;58(1):147-56
pubmed: 25728770
Genome Biol. 2017 Sep 4;18(1):164
pubmed: 28870226
Curr Biol. 2013 Nov 4;23(21):2140-5
pubmed: 24139738
Cell. 2005 Dec 16;123(6):1025-36
pubmed: 16360033
Nat Methods. 2013 Apr;10(4):277-8
pubmed: 23538861
Curr Biol. 2013 Nov 4;23(21):2135-9
pubmed: 24139736
Proc Natl Acad Sci U S A. 2014 Jul 22;111(29):10598-603
pubmed: 24994903
Elife. 2020 Dec 10;9:
pubmed: 33300492
PLoS Biol. 2006 Oct;4(10):e309
pubmed: 17048983
Cell. 2018 Oct 18;175(3):835-847.e25
pubmed: 30340044