DCA for genome-wide epistasis analysis: the statistical genetics perspective.
Journal
Physical biology
ISSN: 1478-3975
Titre abrégé: Phys Biol
Pays: England
ID NLM: 101197454
Informations de publication
Date de publication:
29 01 2019
29 01 2019
Historique:
pubmed:
4
1
2019
medline:
26
2
2019
entrez:
4
1
2019
Statut:
epublish
Résumé
Direct coupling analysis (DCA) is a now widely used method to leverage statistical information from many similar biological systems to draw meaningful conclusions on each system separately. DCA has been applied with great success to sequences of homologous proteins, and also more recently to whole-genome population-wide sequencing data. We here argue that the use of DCA on the genome scale is contingent on fundamental issues of population genetics. DCA can be expected to yield meaningful results when a population is in the quasi-linkage equilibrium (QLE) phase studied by Kimura and others, but not, for instance, in a phase of clonal competition. We discuss how the exponential (Potts model) distributions emerge in QLE, and compare couplings to correlations obtained in a study of about 3000 genomes of the human pathogen Streptococcus pneumoniae.
Identifiants
pubmed: 30605896
doi: 10.1088/1478-3975/aafbe0
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM