Site-and-branch-heterogeneous analyses of an expanded dataset favour mitochondria as sister to known Alphaproteobacteria.
Journal
Nature ecology & evolution
ISSN: 2397-334X
Titre abrégé: Nat Ecol Evol
Pays: England
ID NLM: 101698577
Informations de publication
Date de publication:
03 2022
03 2022
Historique:
received:
24
05
2021
accepted:
29
11
2021
pubmed:
15
1
2022
medline:
17
3
2022
entrez:
14
1
2022
Statut:
ppublish
Résumé
Determining the phylogenetic origin of mitochondria is key to understanding the ancestral mitochondrial symbiosis and its role in eukaryogenesis. However, the precise evolutionary relationship between mitochondria and their closest bacterial relatives remains hotly debated. The reasons include pervasive phylogenetic artefacts as well as limited protein and taxon sampling. Here we developed a new model of protein evolution that accommodates both across-site and across-branch compositional heterogeneity. We applied this site-and-branch-heterogeneous model (MAM60 + GFmix) to a considerably expanded dataset that comprises 108 mitochondrial proteins of alphaproteobacterial origin, and novel metagenome-assembled genomes from microbial mats, microbialites and sediments. The MAM60 + GFmix model fits the data much better and agrees with analyses of compositionally homogenized datasets with conventional site-heterogenous models. The consilience of evidence thus suggests that mitochondria are sister to the Alphaproteobacteria to the exclusion of MarineProteo1 and Magnetococcia. We also show that the ancestral presence of the crista-developing mitochondrial contact site and cristae organizing system (a mitofilin-domain-containing Mic60 protein) in mitochondria and the Alphaproteobacteria only supports their close relationship.
Identifiants
pubmed: 35027725
doi: 10.1038/s41559-021-01638-2
pii: 10.1038/s41559-021-01638-2
doi:
Substances chimiques
Mitochondrial Proteins
0
Banques de données
figshare
['10.6084/m9.figshare.14355845']
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
253-262Informations de copyright
© 2022. The Author(s), under exclusive licence to Springer Nature Limited.
Références
Roger, A. J., Muñoz-Gómez, S. A. & Kamikawa, R. The origin and diversification of mitochondria. Curr. Biol. 27, R1177–R1192 (2017).
pubmed: 29112874
doi: 10.1016/j.cub.2017.09.015
Stairs, C. W., Leger, M. M. & Roger, A. J. Diversity and origins of anaerobic metabolism in mitochondria and related organelles. Phil. Trans. R. Soc. B 370, 20140326 (2015).
pubmed: 26323757
pmcid: 4571565
doi: 10.1098/rstb.2014.0326
Müller, M. et al. Biochemistry and evolution of anaerobic energy metabolism in eukaryotes. Microbiol. Mol. Biol. Rev. 76, 444–495 (2012).
pubmed: 22688819
pmcid: 3372258
doi: 10.1128/MMBR.05024-11
Lane, N. & Martin, W. The energetics of genome complexity. Nature 467, 929–934 (2010).
pubmed: 20962839
doi: 10.1038/nature09486
Cavalier-Smith, T. Predation and eukaryote cell origins: a coevolutionary perspective. Int. J. Biochem. Cell Biol. 41, 307–322 (2009).
pubmed: 18935970
doi: 10.1016/j.biocel.2008.10.002
Spang, A. et al. Complex archaea that bridge the gap between prokaryotes and eukaryotes. Nature 521, 173–179 (2015).
pubmed: 25945739
pmcid: 4444528
doi: 10.1038/nature14447
Zaremba-Niedzwiedzka, K. et al. Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 541, 353–358 (2017).
pubmed: 28077874
doi: 10.1038/nature21031
Eme, L., Spang, A., Lombard, J., Stairs, C. W. & Ettema, T. J. G. Archaea and the origin of eukaryotes. Nat. Rev. Microbiol. 15, 711–723 (2017).
pubmed: 29123225
doi: 10.1038/nrmicro.2017.133
Gray, M. W. Mitochondrial evolution. Cold Spring Harb. Perspect. Biol. 4, a011403 (2012).
pubmed: 22952398
pmcid: 3428767
doi: 10.1101/cshperspect.a011403
Gray, M. W. Mosaic nature of the mitochondrial proteome: implications for the origin and evolution of mitochondria. Proc. Natl Acad. Sci. USA 112, 10133–10138 (2015).
pubmed: 25848019
pmcid: 4547279
doi: 10.1073/pnas.1421379112
Martijn, J., Vosseberg, J., Guy, L., Offre, P. & Ettema, T. J. G. Deep mitochondrial origin outside the sampled alphaproteobacteria. Nature 557, 101–105 (2018).
pubmed: 29695865
doi: 10.1038/s41586-018-0059-5
Fan, L. et al. Phylogenetic analyses with systematic taxon sampling show that mitochondria branch within Alphaproteobacteria. Nat. Ecol. Evol. 4, 1213–1219 (2020).
pubmed: 32661403
doi: 10.1038/s41559-020-1239-x
Viale, A. M. & Arakaki, A. K. The chaperone connection to the origins of the eukaryotic organelles. FEBS Lett. 341, 146–151 (1994).
pubmed: 7907991
doi: 10.1016/0014-5793(94)80446-X
Andersson, S. G. E. et al. The genome sequence of Rickettsia prowazekii and the origin of mitochondria. Nature 396, 133–140 (1998).
pubmed: 9823893
doi: 10.1038/24094
Wu, M. et al. Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements. PLoS Biol. 2, E69 (2004).
pubmed: 15024419
pmcid: 368164
doi: 10.1371/journal.pbio.0020069
Fitzpatrick, D. A., Creevey, C. J. & McInerney, J. O. Genome phylogenies indicate a meaningful Α-proteobacterial phylogeny and support a grouping of the mitochondria with the Rickettsiales. Mol. Biol. Evol. 23, 74–85 (2006).
pubmed: 16151187
doi: 10.1093/molbev/msj009
Williams, K. P., Sobral, B. W. & Dickerman, A. W. A robust species tree for the alphaproteobacteria. J. Bacteriol. 189, 4578–4586 (2007).
pubmed: 17483224
pmcid: 1913456
doi: 10.1128/JB.00269-07
Sassera, D. et al. Phylogenomic evidence for the presence of a flagellum and cbb3 oxidase in the free-living mitochondrial ancestor. Mol. Biol. Evol. 28, 3285–3296 (2011).
pubmed: 21690562
doi: 10.1093/molbev/msr159
Wang, Z. & Wu, M. Phylogenomic reconstruction indicates mitochondrial ancestor was an energy parasite. PLoS ONE 9, e110685 (2014).
pubmed: 25333787
pmcid: 4198247
doi: 10.1371/journal.pone.0110685
Wang, Z. & Wu, M. An integrated phylogenomic approach toward pinpointing the origin of mitochondria. Sci. Rep. 5, 7949 (2015).
pubmed: 25609566
pmcid: 4302308
doi: 10.1038/srep07949
Ball, S. G., Bhattacharya, D. & Weber, A. P. M. Pathogen to powerhouse. Science 351, 659–660 (2016).
pubmed: 26912842
doi: 10.1126/science.aad8864
Thrash, J. C. et al. Phylogenomic evidence for a common ancestor of mitochondria and the SAR11 clade. Sci. Rep. 1, 13 (2011).
pubmed: 22355532
pmcid: 3216501
doi: 10.1038/srep00013
Georgiades, K., Madoui, M.-A., Le, P., Robert, C. & Raoult, D. Phylogenomic analysis of Odyssella thessalonicensis fortifies the common origin of Rickettsiales, Pelagibacter ubique and Reclimonas americana mitochondrion. PLoS ONE 6, e24857 (2011).
pubmed: 21957463
pmcid: 3177885
doi: 10.1371/journal.pone.0024857
Abhishek, A., Bavishi, A., Bavishi, A. & Choudhary, M. Bacterial genome chimaerism and the origin of mitochondria. Can. J. Microbiol. 57, 49–61 (2011).
pubmed: 21217797
doi: 10.1139/W10-099
Thiergart, T., Landan, G., Schenk, M., Dagan, T. & Martin, W. F. An evolutionary network of genes present in the eukaryote common ancestor polls genomes on eukaryotic and mitochondrial origin. Genome Biol. Evol. 4, 466–485 (2012).
pubmed: 22355196
pmcid: 3342870
doi: 10.1093/gbe/evs018
Gawryluk, R. M. R. Evolutionary biology: a new home for the powerhouse? Curr. Biol. 28, R798–R800 (2018).
pubmed: 30040943
doi: 10.1016/j.cub.2018.05.073
Eme, L., Sharpe, S. C., Brown, M. W. & Roger, A. J. On the age of eukaryotes: evaluating evidence from fossils and molecular clocks. Cold Spring Harb. Perspect. Biol. 6, a016139 (2014).
pubmed: 25085908
pmcid: 4107988
doi: 10.1101/cshperspect.a016139
Betts, H. C. et al. Integrated genomic and fossil evidence illuminates life’s early evolution and eukaryote origin. Nat. Ecol. Evol. 2, 1556–1562 (2018).
pubmed: 30127539
pmcid: 6152910
doi: 10.1038/s41559-018-0644-x
Muñoz-Gómez, S. A. et al. An updated phylogeny of the Alphaproteobacteria reveals that the parasitic Rickettsiales and Holosporales have independent origins. eLife 8, e42535 (2019).
pubmed: 30789345
pmcid: 6447387
doi: 10.7554/eLife.42535
Luo, H. Evolutionary origin of a streamlined marine bacterioplankton lineage. ISME J. 9, 1423–1433 (2015).
pubmed: 25431989
doi: 10.1038/ismej.2014.227
Foster, P. G. Modeling compositional heterogeneity. Syst. Biol. 53, 485–495 (2004).
pubmed: 15503675
doi: 10.1080/10635150490445779
Rodríguez-Ezpeleta, N. & Embley, T. M. The SAR11 group of alpha-proteobacteria is not related to the origin of mitochondria. PLoS ONE 7, e30520 (2012).
pubmed: 22291975
pmcid: 3264578
doi: 10.1371/journal.pone.0030520
Anantharaman, K. et al. Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system. Nat. Commun. 7, 13219 (2016).
pubmed: 27774985
pmcid: 5079060
doi: 10.1038/ncomms13219
Graham, E. D., Heidelberg, J. F. & Tully, B. J. Potential for primary productivity in a globally-distributed bacterial phototroph. ISME J. 12, 1861–1866 (2018).
pubmed: 29523891
pmcid: 6018677
doi: 10.1038/s41396-018-0091-3
Delmont, T. O. et al. Nitrogen-fixing populations of Planctomycetes and Proteobacteria are abundant in surface ocean metagenomes. Nat. Microbiol. 3, 804–813 (2018).
pubmed: 29891866
pmcid: 6792437
doi: 10.1038/s41564-018-0176-9
Mehrshad, M., Amoozegar, M. A., Ghai, R., Shahzadeh Fazeli, S. A. & Rodriguez-Valera, F. Genome reconstruction from metagenomic data sets reveals novel microbes in the brackish waters of the Caspian Sea. Appl. Environ. Microbiol. 82, 1599–1612 (2016).
pubmed: 26729711
pmcid: 4771326
doi: 10.1128/AEM.03381-15
Tully, B. J., Sachdeva, R., Graham, E. D. & Heidelberg, J. F. 290 metagenome-assembled genomes from the Mediterranean Sea: a resource for marine microbiology. PeerJ 5, e3558 (2017).
pubmed: 28713657
pmcid: 5507172
doi: 10.7717/peerj.3558
Tully, B. J., Graham, E. D. & Heidelberg, J. F. The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans. Sci. Data 5, 170203 (2018).
Parks, D. H. et al. Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life. Nat. Microbiol. 2, 1533–1542 (2017).
pubmed: 28894102
doi: 10.1038/s41564-017-0012-7
Parks, D. H. et al. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat. Biotechnol. 36, 996–1004 (2018).
pubmed: 30148503
doi: 10.1038/nbt.4229
Gaston, D., Susko, E. & Roger, A. J. A phylogenetic mixture model for the identification of functionally divergent protein residues. Bioinformatics 27, 2655–2663 (2011).
pubmed: 21840876
doi: 10.1093/bioinformatics/btr470
Susko, E., Lincker, L. & Roger, A. J. Accelerated estimation of frequency classes in site-heterogeneous profile mixture models. Mol. Biol. Evol. 35, 1266–1283 (2018).
pubmed: 29688541
doi: 10.1093/molbev/msy026
Muñoz-Gómez, S. A. et al. Additional Supplementary Data for ‘Site-and-branch-heterogeneous analyses of an expanded dataset favor mitochondria as sister to known Alphaproteobacteria. Mendeley Data https://doi.org/10.17632/dnbdzmjjkp.1 (2021).
Viklund, J., Ettema, T. J. G. & Andersson, S. G. E. Independent genome reduction and phylogenetic reclassification of the oceanic SAR11 clade. Mol. Biol. Evol. 29, 599–615 (2012).
pubmed: 21900598
doi: 10.1093/molbev/msr203
Blanquart, S. & Lartillot, N. A Bayesian compound stochastic process for modeling nonstationary and nonhomogeneous sequence evolution. Mol. Biol. Evol. 23, 2058–2071 (2006).
pubmed: 16931538
doi: 10.1093/molbev/msl091
Blanquart, S. & Lartillot, N. A site- and time-heterogeneous model of amino acid replacement. Mol. Biol. Evol. 25, 842–858 (2008).
pubmed: 18234708
doi: 10.1093/molbev/msn018
Ferla, M. P., Thrash, J. C., Giovannoni, S. J. & Patrick, W. M. New rRNA gene-based phylogenies of the Alphaproteobacteria provide perspective on major groups, mitochondrial ancestry and phylogenetic instability. PLoS ONE 8, e83383 (2013).
pubmed: 24349502
pmcid: 3859672
doi: 10.1371/journal.pone.0083383
Smith, D. R. Updating our view of organelle genome nucleotide landscape. Front. Genet. 3, 175 (2012).
pubmed: 22973299
pmcid: 3438683
doi: 10.3389/fgene.2012.00175
Muñoz-Gómez, S. A. et al. Ancient homology of the mitochondrial contact site and cristae organizing system points to an endosymbiotic origin of mitochondrial cristae. Curr. Biol. 25, 1489–1495 (2015).
pubmed: 26004762
doi: 10.1016/j.cub.2015.04.006
Muñoz-Gómez, S. A., Wideman, J. G., Roger, A. J. & Slamovits, C. H. The origin of mitochondrial cristae from Alphaproteobacteria. Mol. Biol. Evol. 34, 943–956 (2017).
pubmed: 28087774
Gutiérrez-Preciado, A. et al. Functional shifts in microbial mats recapitulate early Earth metabolic transitions. Nat. Ecol. Evol. 2, 1700–1708 (2018).
pubmed: 30297749
pmcid: 6217971
doi: 10.1038/s41559-018-0683-3
Saghaï, A. et al. Comparative metagenomics unveils functions and genome features of microbialite-associated communities along a depth gradient. Environ. Microbiol. 18, 4990–5004 (2016).
pubmed: 27422734
pmcid: 5477898
doi: 10.1111/1462-2920.13456
Saghaï, A. et al. Metagenome-based diversity analyses suggest a significant contribution of non-cyanobacterial lineages to carbonate precipitation in modern microbialites. Front. Microbiol. 6, 797 (2015).
pubmed: 26300865
pmcid: 4525015
doi: 10.3389/fmicb.2015.00797
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
pubmed: 24695404
pmcid: 4103590
doi: 10.1093/bioinformatics/btu170
Bankevich, A. et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477 (2012).
pubmed: 22506599
pmcid: 3342519
doi: 10.1089/cmb.2012.0021
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
pubmed: 22388286
pmcid: 3322381
doi: 10.1038/nmeth.1923
Wu, Y.-W., Simmons, B. A. & Singer, S. W. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics 32, 605–607 (2016).
pubmed: 26515820
doi: 10.1093/bioinformatics/btv638
Parks, D. H., Imelfort, M., Skennerton, C. T., Hugenholtz, P. & Tyson, G. W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 25, 1043–1055 (2015).
pubmed: 25977477
pmcid: 4484387
doi: 10.1101/gr.186072.114
Eren, A. M. et al. Anvi’o: an advanced analysis and visualization platform for ‘omics data. PeerJ 3, e1319 (2015).
pubmed: 26500826
pmcid: 4614810
doi: 10.7717/peerj.1319
Buchfink, B., Xie, C. & Huson, D. H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60 (2015).
pubmed: 25402007
doi: 10.1038/nmeth.3176
Alneberg, J. et al. Binning metagenomic contigs by coverage and composition. Nat. Methods 11, 1144–1146 (2014).
pubmed: 25218180
doi: 10.1038/nmeth.3103
Li, D., Liu, C.-M., Luo, R., Sadakane, K. & Lam, T.-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31, 1674–1676 (2015).
pubmed: 25609793
doi: 10.1093/bioinformatics/btv033
Kang, D. D. et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ 7, e7359 (2019).
pubmed: 31388474
pmcid: 6662567
doi: 10.7717/peerj.7359
Sieber, C. M. K. et al. Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nat. Microbiol. 3, 836–843 (2018).
pubmed: 29807988
pmcid: 6786971
doi: 10.1038/s41564-018-0171-1
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl. Acids Res 25, 3389–3402 (1997).
pubmed: 9254694
pmcid: 146917
doi: 10.1093/nar/25.17.3389
Katoh, K., Kuma, K., Toh, H. & Miyata, T. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 33, 511–518 (2005).
pubmed: 15661851
pmcid: 548345
doi: 10.1093/nar/gki198
Criscuolo, A. & Gribaldo, S. BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol. Biol. 10, 210 (2010).
pubmed: 20626897
pmcid: 3017758
doi: 10.1186/1471-2148-10-210
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
pubmed: 24451623
pmcid: 3998144
doi: 10.1093/bioinformatics/btu033
Kannan, S., Rogozin, I. B. & Koonin, E. V. MitoCOGs: clusters of orthologous genes from mitochondria and implications for the evolution of eukaryotes. BMC Evol. Biol. 14, 237 (2014).
pubmed: 25421434
pmcid: 4256733
doi: 10.1186/s12862-014-0237-5
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
pubmed: 19505945
pmcid: 2712344
doi: 10.1093/bioinformatics/btp348
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
pubmed: 25371430
doi: 10.1093/molbev/msu300
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
pubmed: 28481363
pmcid: 5453245
doi: 10.1038/nmeth.4285
Menardo, F. et al. Treemmer: a tool to reduce large phylogenetic datasets with minimal loss of diversity. BMC Bioinformatics 19, 164 (2018).
pubmed: 29716518
pmcid: 5930393
doi: 10.1186/s12859-018-2164-8
Ali, R. H., Bogusz, M. & Whelan, S. Identifying clusters of high confidence homologies in multiple sequence alignments. Mol. Biol. Evol. 36, 2340–2351 (2019).
pubmed: 31209473
pmcid: 6933875
doi: 10.1093/molbev/msz142
de Vienne, D. M., Ollier, S. & Aguileta, G. Phylo-MCOA: a fast and efficient method to detect outlier genes and species in phylogenomics using multiple co-inertia analysis. Mol. Biol. Evol. 29, 1587–1598 (2012).
pubmed: 22319162
doi: 10.1093/molbev/msr317
Vaidya, G., Lohman, D. J. & Meier, R. SequenceMatrix: concatenation software for the fast assembly of multi‐gene datasets with character set and codon information. Cladistics 27, 171–180 (2011).
pubmed: 34875773
doi: 10.1111/j.1096-0031.2010.00329.x
Muñoz-Gómez, S. A. et al. Alignments for 108 mitochondrial proteins of alphaproteobacterial origin, and alphaproteobacterial MAGs from microbial mats, microbialites, and sediments. figshare https://doi.org/10.6084/m9.figshare.14355845.v2 (2021).
Wang, H.-C., Minh, B. Q., Susko, E. & Roger, A. J. Modeling site heterogeneity with posterior mean site frequency profiles accelerates accurate phylogenomic estimation. Syst. Biol. 67, 216–235 (2018).
pubmed: 28950365
doi: 10.1093/sysbio/syx068
Schrempf, D., Lartillot, N. & Szöllősi, G. Scalable empirical mixture models that account for across-site compositional heterogeneity. Mol. Biol. Evol. 37, 3616–3631 (2020).
pubmed: 32877529
pmcid: 7743758
doi: 10.1093/molbev/msaa145
Lartillot, N. A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. Mol. Biol. Evol. 21, 1095–1109 (2004).
pubmed: 15014145
doi: 10.1093/molbev/msh112
Lartillot, N., Rodrigue, N., Stubbs, D. & Richer, J. PhyloBayes MPI: phylogenetic reconstruction with infinite mixtures of profiles in a parallel environment. Syst. Biol. 62, 611–615 (2013).
pubmed: 23564032
doi: 10.1093/sysbio/syt022
Kumar, S., Stecher, G., Li, M., Knyaz, C. & Tamura, K. MEGA X: molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 35, 1547–1549 (2018).
pubmed: 29722887
pmcid: 5967553
doi: 10.1093/molbev/msy096
Susko, E. Tests for two trees using likelihood methods. Mol. Biol. Evol. 31, 1029–1039 (2014).
pubmed: 24401182
doi: 10.1093/molbev/msu039
Shimodaira, H. & Hasegawa, M. Multiple comparisons of log-likelihoods with applications to phylogenetic inference. Mol. Biol. Evol. 16, 1114 (1999).
doi: 10.1093/oxfordjournals.molbev.a026201
Markowski, E. A Comparison of Methods for Constructing Confidence Sets of Phylogenetic Trees Using Maximum Likelihood. MSc thesis, Dalhousie Univ. (2021).
Lee, M. D. GToTree: a user-friendly workflow for phylogenomics. Bioinformatics 35, 4162–4164 (2019).
pubmed: 30865266
pmcid: 6792077
doi: 10.1093/bioinformatics/btz188