Implementing and evaluating a Gaussian mixture framework for identifying gene function from TnSeq data.


Journal

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
ISSN: 2335-6936
Titre abrégé: Pac Symp Biocomput
Pays: United States
ID NLM: 9711271

Informations de publication

Date de publication:
2019
Historique:
entrez: 14 3 2019
pubmed: 14 3 2019
medline: 24 8 2019
Statut: ppublish

Résumé

The rapid acceleration of microbial genome sequencing increases opportunities to understand bacterial gene function. Unfortunately, only a small proportion of genes have been studied. Recently, TnSeq has been proposed as a cost-effective, highly reliable approach to predict gene functions as a response to changes in a cell's fitness before-after genomic changes. However, major questions remain about how to best determine whether an observed quantitative change in fitness represents a meaningful change. To address the limitation, we develop a Gaussian mixture model framework for classifying gene function from TnSeq experiments. In order to implement the mixture model, we present the Expectation-Maximization algorithm and a hierarchical Bayesian model sampled using Stan's Hamiltonian Monte-Carlo sampler. We compare these implementations against the frequentist method used in current TnSeq literature. From simulations and real data produced by E.coli TnSeq experiments, we show that the Bayesian implementation of the Gaussian mixture framework provides the most consistent classification results.

Identifiants

pubmed: 30864320
pii: 9789813279827_0016

Substances chimiques

DNA Transposable Elements 0

Types de publication

Journal Article Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Pagination

172-183

Auteurs

Kevin Li (K)

Department of Mathematics, Columbia University, New York, NY 10027, USA, kl2918@columbia.edu.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Coal Metagenome Phylogeny Bacteria Genome, Bacterial

Classifications MeSH