DILS: Demographic inferences with linked selection by using ABC.


Journal

Molecular ecology resources
ISSN: 1755-0998
Titre abrégé: Mol Ecol Resour
Pays: England
ID NLM: 101465604

Informations de publication

Date de publication:
Nov 2021
Historique:
revised: 09 12 2020
received: 18 06 2020
accepted: 21 12 2020
pubmed: 16 1 2021
medline: 9 11 2021
entrez: 15 1 2021
Statut: ppublish

Résumé

We present DILS, a deployable statistical analysis platform for conducting demographic inferences with linked selection from population genomic data using an Approximate Bayesian Computation framework. DILS takes as input single-population or two-population data sets (multilocus fasta sequences) and performs three types of analyses in a hierarchical manner, identifying: (a) the best demographic model to study the importance of gene flow and population size change on the genetic patterns of polymorphism and divergence, (b) the best genomic model to determine whether the effective size Ne and migration rate N, m are heterogeneously distributed along the genome (implying linked selection) and (c) loci in genomic regions most associated with barriers to gene flow. Also available via a Web interface, an objective of DILS is to facilitate collaborative research in speciation genomics. Here, we show the performance and limitations of DILS by using simulations and finally apply the method to published data on a divergence continuum composed by 28 pairs of Mytilus mussel populations/species.

Identifiants

pubmed: 33448666
doi: 10.1111/1755-0998.13323
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

2629-2644

Subventions

Organisme : Austrian Science Foundation
ID : M 2463-B29

Informations de copyright

© 2021 John Wiley & Sons Ltd.

Références

Barton, N., & Bengtsson, B. O. (1986). The barrier to genetic exchange between hybridising populations. Heredity, 57(3), 357-376.
Beaumont, M. A., Zhang, W., & Balding, D. J. (2002). Approximate Bayesian computation in population genetics. Genetics, 162(4), 2025-2035.
Bierne, N., Borsa, P., Daguin, C., Jollivet, D., Viard, F., Bonhomme, F., & David, P. (2003). Introgression patterns in the mosaic hybrid zone between Mytilus edulis and M. galloprovincialis. Molecular Ecology, 12(2), 447-461.
Braverman, J. M., Hudson, R. R., Kaplan, N. L., Langley, C. H., & Stephan, W. (1995). The hitchhiking effect on the site frequency spectrum of DNA polymorphisms. Genetics, 140(2), 783-796.
Bunnefeld, L., Hearn, J., Stone, G. N., & Lohse, K. (2018). Whole-genome data reveal the complex history of a diverse ecological community. Proceedings of the National Academy of Sciences of the United States of America, 115(28), E6507-E6515.
Chang, W., Cheng, J., Allaire, J., Xie, Y., & McPherson, J. (2019). shiny: Web application framework for R. R package version 1.4.0. Retrieved from https://CRAN.R-project.org/package=shiny
Charlesworth, B., Morgan, M., & Charlesworth, D. (1993). The effect of deleterious mutations on neutral molecular variation. Genetics, 134(4), 1289-1303.
Cong, Q., Zhang, J., & Grishin, N. (2019). Genomic determinants of speciation. bioRxiv, 837666.
Cornuet, J.-M., Pudlo, P., Veyssier, J., Dehne-Garcia, A., Gautier, M., Leblois, R., Marin, J.-M., & Estoup, A. (2014). Diyabc v2. 0: A software to make approximate Bayesian computation inferences about population history using single nucleotide polymorphism, DNA sequence and microsatellite data. Bioinformatics, 30(8), 1187-1189.
Cornuet, J.-M., Santos, F., Beaumont, M. A., Robert, C. P., Marin, J.-M., Balding, D. J., Guillemaud, T., & Estoup, A. (2008). Inferring population history with DIY ABC: A user-friendly approach to approximate Bayesian computation. Bioinformatics, 24(23), 2713-2719.
Csilléry, K., François, O., & Blum, M. G. B. (2012). abc: An R package for approximate Bayesian computation (ABC). Methods in Ecology and Evolution, 3(3), 475-479.
Ebdon, S., Laetsch, D. R., Dapporto, L., Hayward, A., Ritchie, M. G., Dincă, V., Vila, R., & Lohse, K. (2020). The pleistocene species pump past its prime: Evidence from European butterfly sister species. bioRxiv. https://doi.org/10.1101/2020.09.04.282962
Edelman, N. B., Frandsen, P. B., Miyagi, M., Clavijo, B., Davey, J., Dikow, R. B., García-Accinelli, G., Van Belleghem, S. M., Patterson, N., Neafsey, D. E., Kumar, S., Gilson, M., Salazar, C., Chouteau, M., Counterman, B., Papa, R., Blaxter, M., Reed, R., Dasmahapatra, K. … Mallet, J. (2019). Genomic architecture and introgression shape a butterfly radiation. Science, 366(6465), 594-599.
Ewing, G. B., & Jensen, J. D. (2016). The consequences of not accounting for background selection in demographic inference. Molecular Ecology, 25(1), 135-141.
Excoffier, L., Dupanloup, I., Huerta-Sánchez, E., Sousa, V. C., & Foll, M. (2013). Robust demographic inference from genomic and SNP data. PLoS Genetics, 9(10), e1003905.
Fraimout, A., Debat, V., Fellous, S., Hufbauer, R. A., Foucaud, J., Pudlo, P., Marin, J.-M., Price, D. K., Cattel, J., Chen, X., Deprá, M. (2017). Deciphering the routes of invasion of Drosophila suzukii by means of ABC random forest. Molecular Biology and Evolution, 34(4), 980-996.
Galtier, N. (2019). Delineating species in the speciation continuum: A proposal. Evolutionary Applications, 12(4), 657-663.
Galtier, N., Roux, C., Rousselle, M., Romiguier, J., Figuet, E., Glemin, S., Bierne, N., & Duret, L. (2017). Codon usage bias in animals: Disentangling the effects of natural selection, effective population size and GC-biased gene conversion. Molecular Biology and Evolution, 35, 1092-1103.
Gaut, B. S., Seymour, D. K., Liu, Q., & Zhou, Y. (2018). Demography and its effects on genomic variation in crop domestication. Nature Plants, 4(8), 512-520.
Gutenkunst, R. N., Hernandez, R. D., Williamson, S. H., & Bustamante, C. D. (2009). Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genetics, 5(10), e1000695.
Hahn, M. W. (2019). Molecular population genetics. Sinauer Associates.
Haller, B. C., & Messer, P. W. (2019). Slim 3: Forward genetic simulations beyond the wright-fisher model. Molecular Biology and Evolution, 36(3), 632-637.
Harris, K., & Nielsen, R. (2013). Inferring demographic history from a spectrum of shared haplotype lengths. PLoS Genetics, 9(6), e1003521.
Hey, J., & Pinho, C. (2012). Population genetics and objectivity in species diagnosis. Evolution, 66(5), 1413-1429.
Hudson, R. R. (2002). Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics, 18(2), 337-338.
Hudson, R. R., & Kaplan, N. L. (1985). Statistical properties of the number of recombination events in the history of a sample of DNA sequences. Genetics, 111(1), 147-164.
Johri, P., Charlesworth, B., & Jensen, J. D. (2020). Toward an evolutionarily appropriate null model: Jointly inferring demography and purifying selection. Genetics, 215(1), 173-192.
Jouganous, J., Long, W., Ragsdale, A. P., & Gravel, S. (2017). Inferring the joint demographic history of multiple populations: Beyond the diffusion approximation. Genetics, 206(3), 1549-1567.
Kelleher, J., Etheridge, A. M., & McVean, G. (2016). Efficient coalescent simulation and genealogical analysis for large sample sizes. PLoS Computational Biology, 12(5), e1004842.
Köster, J., & Rahmann, S. (2012). Snakemake-a scalable bioinformatics workflow engine. Bioinformatics, 28(19), 2520-2522.
Langmead, B., & Salzberg, S. L. (2012). Fast gapped-read alignment with bowtie 2. Nature Methods, 9(4), 357.
Li, H., & Durbin, R. (2011). Inference of human population history from individual whole genome sequences. Nature, 475(7357), 493-496.
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G., & Durbin, R. (2009). The sequence alignment/map format and SAMtools. Bioinformatics, 25(16), 2078-2079.
Lohse, K., Harrison, R. J., & Barton, N. H. (2011). A general method for calculating likelihoods under the coalescent process. Genetics, 189(3), 977-987.
Malinsky, M., Svardal, H., Tyers, A. M., Miska, E. A., Genner, M. J., Turner, G. F., & Durbin, R. (2018). Whole-genome sequences of Malawi cichlids reveal multiple radiations interconnected by gene flow. Nature Ecology & Evolution, 2(12), 1940-1955.
Martin, S. H., Davey, J. W., Salazar, C., & Jiggins, C. D. (2019). Recombination rate variation shapes barriers to introgression across butterfly genomes. PLoS Biology, 17(2), e2006288.
Momigliano, P., Florin, A.-B., & Merilä, J. (2020). Biases in demographic modelling affect our understanding of the process of speciation. bioRxiv. https://doi.org/10.1101/2020.06.03.128298.
Nei, M., & Li, W. H. (1979). Mathematical model for studying genetic variation in terms of restriction endonucleases. Proceedings of the National Academy of Sciences of the United States of America, 76(10), 5269-5273.
Peñalba, J. V., Joseph, L., & Moritz, C. (2019). Current geography masks dynamic history of gene flow during speciation in northern Australian birds. Molecular Ecology, 28(3), 630-643.
Popovic I., Matias A. M. A., Bierne N., Riginos C. (2020). Twin introductions by independent invader mussel lineages are both associated with recent admixture with a native congener in Australia. Evolutionary Applications, 13, (3), 515-532. http://dx.doi.org/10.1111/eva.12857
Pudlo, P., Marin, J.-M., Estoup, A., Cornuet, J.-M., Gautier, M., & Robert, C. P. (2015). Reliable ABC model choice via random forests. Bioinformatics, 32(6), 859-866.
R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Retrieved from https://www.R-project.org/
Ravinet, M., Faria, R., Butlin, R., Galindo, J., Bierne, N., Rafajlović, M., Noor, M., Mehlig, B., & Westram, A. (2017). Interpreting the genomic landscape of speciation: A road map for finding barriers to gene flow. Journal of Evolutionary Biology, 30(8), 1450-1477.
Raynal, L., Marin, J.-M., Pudlo, P., Ribatet, M., Robert, C. P., & Estoup, A. (2019). ABC random forests for Bayesian parameter inference. Bioinformatics, 35(10), 1720-1728.
Robinson, J. D., Bunnefeld, L., Hearn, J., Stone, G. N., & Hickerson, M. J. (2014). ABC inference of multi-population divergence with admixture from unphased population genomic data. Molecular Ecology, 23(18), 4458-4471.
Ross-Ibarra, J., Wright, S. I., Foxe, J. P., Kawabe, A., DeRose-Wilson, L., Gos, G., Charlesworth, D., & Gaut, B. S. (2008). Patterns of polymorphism and demographic history in natural populations of Arabidopsis lyrata. PLoS One, 3(6), e2411.
Roux, C., Fraïsse, C., Castric, V., Vekemans, X., Pogson, G. H., & Bierne, N. (2014). Can we continue to neglect genomic variation in introgression rates when inferring the history of speciation? A case study in a Mytilus hybrid zone. Journal of Evolutionary Biology, 27(8), 1662-1675.
Roux, C., Fraisse, C., Romiguier, J., Anciaux, Y., Galtier, N., & Bierne, N. (2016). Shedding light on the grey zone of speciation along a continuum of genomic divergence. PLoS Biology, 14(12), e2000234.
Roux, C., Tsagkogeorga, G., Bierne, N., & Galtier, N. (2013). Crossing the species barrier: Genomic hotspots of introgression between two highly divergent Ciona intestinalis species. Molecular Biology and Evolution, 30(7), 1574-1587.
Sackman, A. M., Harris, R. B., & Jensen, J. D. (2019). Inferring demography and selection in organisms characterized by skewed offspring distributions. Genetics, 211(3), 1019-1028.
Schiffels, S., & Durbin, R. (2014). Inferring human population size and separation history from multiple genome sequences. Nature Genetics, 46(8), 919-925.
Schrider, D. R., & Kern, A. D. (2018). Supervised machine learning for population genetics: A new paradigm. Trends in Genetics, 34(4), 301-312.
Schrider, D. R., Shanku, A. G., & Kern, A. D. (2016). Effects of linked selective sweeps on demographic inference and model selection. Genetics, 204(3), 1207-1223.
Sethuraman, A., Sousa, V., & Hey, J. (2019). Model-based assessments of differential introgression and linked natural selection during divergence and speciation. bioRxiv, 786038.
Sievert, C. (2018). plotly for R. Retrieved from https://plotly-r.com
Smith, C. C., & Flaxman, S. M. (2020). Leveraging whole genome sequencing data for demographic inference with approximate Bayesian computation. Molecular Ecology Resources, 20(1), 125-139.
Sousa, V. C., Carneiro, M., Ferrand, N., & Hey, J. (2013). Identifying loci under selection against gene flow in isolation-with-migration models. Genetics, 194(1), 211-233.
Sousa, V., & Hey, J. (2013). Understanding the origin of species with genome-scale data: Modelling gene flow. Nature Reviews Genetics, 14(6), 404-414.
Stankowski, S., Chase, M. A., Fuiten, A. M., Rodrigues, M. F., Ralph, P. L., & Streisfeld, M. A. (2019). Widespread selection and gene flow shape the genomic landscape during a radiation of monkey flowers. PLoS Biology, 17(7), e3000391.
Steinrücken, M., Kamm, J., Spence, J. P., & Song, Y. S. (2019). Inference of complex population histories using whole-genome sequences from multiple populations. Proceedings of the National Academy of Sciences of the United States of America, 116(34), 17115-17120.
Tajima, F. (1983). Evolutionary relationship of DNA sequences in finite populations. Genetics, 105(2), 437-460.
Tajima, F. (1989). The effect of change in population size on DNA polymorphism. Genetics, 123(3), 597-601.
Terhorst, J., Kamm, J. A., & Song, Y. S. (2017). Robust and scalable inference of population history from hundreds of unphased whole genomes. Nature Genetics, 49(2), 303.
Terhorst, J., & Song, Y. S. (2015). Fundamental limits on the accuracy of demographic inference based on the sample frequency spectrum. Proceedings of the National Academy of Sciences of the United States of America, 112(25), 7677-7682.
Tsagkogeorga, G., Cahais, V., & Galtier, N. (2012). The population genomics of a fast evolver: High levels of diversity, functional constraint, and molecular adaptation in the tunicate Ciona intestinalis. Genome Biology and Evolution, 4(8), 740-749.
Wakeley, J., & Hey, J. (1997). Estimating ancestral population parameters. Genetics, 145(3), 847-855.
Wang, K., Mathieson, I., O’Connell, J., & Schiffels, S. (2020). Tracking human population structure through time from whole genome sequences. PLoS Genetics, 16(3), e1008552.
Watterson, G. A. (1975). On the number of segregating sites in genetical models without recombination. Theoretical Population Biology, 7(2), 256-276.
Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D., François, R., Grolemund, G., Hayes, A., Henry, L., Hester, J., Kuhn, M., Pedersen, T. L., Miller, E., Bache, S. M., Müller, K., Ooms, J., Robinson, D., Seidel, D. P., Spinu, V., … Yutani, H. (2019). Welcome to the Tidyverse. Journal of Open Source Software, 4(43), 1686.
Wright, S. (1943). Isolation by distance. Genetics, 28(2), 114.

Auteurs

Christelle Fraïsse (C)

Institute of Science and Technology Austria, Klosterneuœburg, Austria.
Univ. Lille, CNRS, UMR 8198 - Evo-Eco-Paleo, Lille, France.

Iva Popovic (I)

School of Biological Sciences, University of Queensland, St Lucia, Qld, Australia.

Clément Mazoyer (C)

Univ. Lille, CNRS, UMR 8198 - Evo-Eco-Paleo, Lille, France.

Bruno Spataro (B)

Laboratoire de Biologie et Biométrie Évolutive CNRS UMR 5558, Université Claude Bernard, Lyon, France.

Stéphane Delmotte (S)

Laboratoire de Biologie et Biométrie Évolutive CNRS UMR 5558, Université Claude Bernard, Lyon, France.

Jonathan Romiguier (J)

ISEM, Univ Montpellier, CNRS, EPHE, IRD, Montpellier, France.

Étienne Loire (É)

Centre de Coopération Internationale en Recherche Agronomique pour le Développement (CIRAD), UMR, ASTRE, Montpellier, France.

Alexis Simon (A)

ISEM, Univ Montpellier, CNRS, EPHE, IRD, Montpellier, France.

Nicolas Galtier (N)

ISEM, Univ Montpellier, CNRS, EPHE, IRD, Montpellier, France.

Laurent Duret (L)

Laboratoire de Biologie et Biométrie Évolutive CNRS UMR 5558, Université Claude Bernard, Lyon, France.

Nicolas Bierne (N)

ISEM, Univ Montpellier, CNRS, EPHE, IRD, Montpellier, France.

Xavier Vekemans (X)

Univ. Lille, CNRS, UMR 8198 - Evo-Eco-Paleo, Lille, France.

Camille Roux (C)

Univ. Lille, CNRS, UMR 8198 - Evo-Eco-Paleo, Lille, France.

Articles similaires

A scenario for an evolutionary selection of ageing.

Tristan Roget, Claire Macmurray, Pierre Jolivet et al.
1.00
Aging Selection, Genetic Biological Evolution Animals Fertility
Coal Metagenome Phylogeny Bacteria Genome, Bacterial
Biological Evolution History, 20th Century Selection, Genetic History, 19th Century Biology
Humans Immunization, Secondary COVID-19 Vaccines COVID-19 SARS-CoV-2

Classifications MeSH