Universal prediction of vertebrate species age at maturity.
Journal
Communications biology
ISSN: 2399-3642
Titre abrégé: Commun Biol
Pays: England
ID NLM: 101719179
Informations de publication
Date de publication:
30 Oct 2024
30 Oct 2024
Historique:
received:
20
05
2024
accepted:
10
10
2024
medline:
31
10
2024
pubmed:
31
10
2024
entrez:
31
10
2024
Statut:
epublish
Résumé
Animal age at maturity can be used as a universal and simple predictor of species extinction risk. At present, methods to estimate age at maturity are typically species-specific, limiting comparisons among species, or are infeasible due to practical constraints. To overcome this, here we develop a universal predictor of species-level age at maturity for vertebrates. We show that modelling the frequency of 'CG' sequences (CpG sites) in gene promoter regions yields rapid predictions of vertebrate age at maturity. Our models predict age at maturity with remarkable accuracy and generalisability, with median error rates of 30% (less than 1 year) and are robust to genome assemblies of varying quality. We generate predictions for 1912 vertebrate species for which age at maturity estimates were previously absent from public databases. The predictions can be used to help to inform management decisions for the many species for which more detailed population information is currently unavailable.
Identifiants
pubmed: 39478142
doi: 10.1038/s42003-024-07046-z
pii: 10.1038/s42003-024-07046-z
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
1414Informations de copyright
© 2024. The Author(s).
Références
Hoffmann, M. et al. The impact of conservation on the status of the world’s vertebrates. Science (1979) 330, 1503–1509 (2010).
Wilder, A. P. et al. The contribution of historical processes to contemporary extinction risk in placental mammals. Science (1979) 380, eabn5856 (2023).
Hutchings, J. A., Myers, R. A., García, V. B., Lucifora, L. O. & Kuparinen, A. Life-history correlates of extinction risk and recovery potential. Ecol. Appl. 22, 1061–1067 (2012).
pubmed: 22827118
doi: 10.1890/11-1313.1
Reynolds, J. D., Dulvy, N. K., Goodwin, N. B. & Hutchings, J. A. Biology of extinction risk in marine fishes. Proc. R. Soc. B: Biol. Sci. 272, 2337–2344 (2005).
doi: 10.1098/rspb.2005.3281
Oli, M. K. & Dobson, F. S. The relative importance of life‐history variables to population growth rate in Mammals: Cole’s prediction revisited. Am. Nat. 161, 422–440 (2003).
pubmed: 12699222
doi: 10.1086/367591
Healy, K., Ezard, T. H. G., Jones, O. R., Salguero-Gómez, R. & Buckley, Y. M. Animal life history is shaped by the pace of life and the distribution of age-specific mortality and reproduction. Nat. Ecol. Evol. 3, 1217–1224 (2019).
pubmed: 31285573
doi: 10.1038/s41559-019-0938-7
Rogers, T. D., Cambiè, G. & Kaiser, M. J. Determination of size, sex and maturity stage of free swimming catsharks using laser photogrammetry. Mar. Biol. 164, 213 (2017).
pubmed: 29104316
pmcid: 5651705
doi: 10.1007/s00227-017-3241-7
Levasseur, K. E., Stapleton, S. P. & Quattro, J. M. Precise natal homing and an estimate of age at sexual maturity in hawksbill turtles. Anim. Conserv. 24, 523–535 (2021).
doi: 10.1111/acv.12657
Valenzuela-Molina, M., Atkinson, S., Mashburn, K., Gendron, D. & Brownell, R. L. Fecal steroid hormones reveal reproductive state in female blue whales sampled in the Gulf of California, Mexico. Gen. Comp. Endocrinol. 261, 127–135 (2018).
pubmed: 29476760
doi: 10.1016/j.ygcen.2018.02.015
Heydenrych, M. J., Budd, A. M., Mayne, B. & Jarman, S. A genomic predictor for age at sexual maturity for mammalian species. Evol. Appl. 17, e13635 (2024).
pubmed: 38343778
pmcid: 10853647
doi: 10.1111/eva.13635
Budd, A. M., Mayne, B., Berry, O. & Jarman, S. Fish species lifespan prediction from promoter cytosine-phosphate-guanine density. Mol. Ecol. Resour. https://doi.org/10.1111/1755-0998.13774 , 1–14 (2023).
Mayne, B., Berry, O. & Jarman, S. Redefining life expectancy and maximum lifespan for wildlife management. Austral Ecol. 45, 855–857 (2020).
doi: 10.1111/aec.12931
McLain, A. T. & Faulk, C. The evolution of CpG density and lifespan in conserved primate and mammalian promoters. Aging (Albany, NY) 10, 561 (2018).
pubmed: 29661983
doi: 10.18632/aging.101413
Weber, M. et al. Distribution, silencing potential and evolutionary impact of promoter DNA methylation in the human genome. Nat. Genet. 39, 457–466 (2007).
pubmed: 17334365
doi: 10.1038/ng1990
Saxonov, S., Berg, P. & Brutlag, D. L. A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters. Proc. Natl Acad. Sci. USA 103, 1412–1417 (2006).
pubmed: 16432200
pmcid: 1345710
doi: 10.1073/pnas.0510310103
Tian, H., He, Y., Xue, Y. & Gao, Y. Q. Expression regulation of genes is linked to their CpG density distributions around transcription start sites. Life Sci. Alliance 5, e202101302 (2022).
pubmed: 35580989
pmcid: 9113945
doi: 10.26508/lsa.202101302
Gardiner-Garden, M. & Frommer, M. CpG Islands in vertebrate genomes. J. Mol. Biol. 196, 261–282 (1987).
pubmed: 3656447
doi: 10.1016/0022-2836(87)90689-9
Li, C. Z. et al. Epigenetic predictors of species maximum life span and other life-history traits in mammals. Sci. Adv. 10, eadm7273 (2024).
pubmed: 38848365
pmcid: 11160467
doi: 10.1126/sciadv.adm7273
Bergeron, L. A. et al. Evolution of the germline mutation rate across vertebrates. Nature 615, 285–291 (2023).
pubmed: 36859541
pmcid: 9995274
doi: 10.1038/s41586-023-05752-y
Herculano-Houzel, S. Longevity and sexual maturity vary across species with number of cortical neurons, and humans are no exception. J. Comp. Neurol. 527, 1689–1705 (2019).
pubmed: 30350858
doi: 10.1002/cne.24564
de Magalhães, J. P., Costa, J. & Church, G. M. An analysis of the relationship between metabolism, developmental schedules, and longevity using phylogenetic independent contrasts. J. Gerontology: Ser. A 62, 149–160 (2007).
Aylwin, C. F., Toro, C. A., Shirtcliff, E. & Lomniczi, A. Emerging genetic and epigenetic mechanisms underlying pubertal maturation in adolescence. J. Res. Adolesc. 29, 54–79 (2019).
pubmed: 30869843
doi: 10.1111/jora.12385
Li, C. et al. MKRN3 regulates the epigenetic switch of mammalian puberty via ubiquitination of MBD3. Natl Sci. Rev. 7, 671–685 (2020).
pubmed: 34692086
pmcid: 8288866
doi: 10.1093/nsr/nwaa023
Ball, G. F. & Wade, J. The value of comparative approaches to our understanding of puberty as illustrated by investigations in birds and reptiles. Horm. Behav. 64, 211–214 (2013).
pubmed: 23998665
doi: 10.1016/j.yhbeh.2013.05.002
MacLean, J. A. et al. Rhox: a new homeobox gene cluster. Cell 120, 369–382 (2005).
pubmed: 15707895
doi: 10.1016/j.cell.2004.12.022
Yang, N. et al. A hyper-quiescent chromatin state formed during aging is reversed by regeneration. Mol. Cell 83, 1659–1676.e11 (2023).
pubmed: 37116496
pmcid: 10228348
doi: 10.1016/j.molcel.2023.04.005
Choi, Y., Ballow, D. J., Xin, Y. & Rajkovic, A. Lim homeobox gene, Lhx8, is essential for mouse oocyte differentiation and survival1. Biol. Reprod. 79, 442–449 (2008).
pubmed: 18509161
pmcid: 2710541
doi: 10.1095/biolreprod.108.069393
Fan, G. et al. Initial data release and announcement of the 10,000 Fish Genomes Project (Fish10K). Gigascience 9, giaa080 (2020).
pubmed: 32810278
pmcid: 7433795
doi: 10.1093/gigascience/giaa080
Lewin, H. A. et al. The Earth BioGenome Project 2020: starting the clock. Proc. Natl Acad. Sci. USA 119, e2115635118 (2022).
pubmed: 35042800
pmcid: 8795548
doi: 10.1073/pnas.2115635118
Dennis, B., Munholland, P. L. & Scott, J. M. Estimation of growth and extinction parameters for endangered species. Ecol. Monogr. 61, 115–143 (1991).
doi: 10.2307/1943004
Reid, J. M., Bignal, E. M., Bignal, S., McCracken, D. I. & Monaghan, P. Identifying the demographic determinants of population growth rate: a case study of red-billed choughs Pyrrhocorax pyrrhocorax. J. Anim. Ecol. 73, 777–788 (2004).
doi: 10.1111/j.0021-8790.2004.00854.x
Jones, O. R. et al. Diversity of ageing across the tree of life. Nature 505, 169–173 (2014).
pubmed: 24317695
doi: 10.1038/nature12789
Staerk, J. et al. Performance of generation time approximations for extinction risk assessments. J. Appl. Ecol. 56, 1436–1446 (2019).
doi: 10.1111/1365-2664.13368
Supple, M. A. & Shapiro, B. Conservation of biodiversity in the genomics era. Genome Biol. 19, 131 (2018).
pubmed: 30205843
pmcid: 6131752
doi: 10.1186/s13059-018-1520-3
Theissinger, K. et al. How genomics can help biodiversity conservation. Trends Genet. 39, 545–559 (2023).
pubmed: 36801111
doi: 10.1016/j.tig.2023.01.005
Mayne, B., Berry, O., Davies, C., Farley, J. & Jarman, S. A genomic predictor of lifespan in vertebrates. Sci. Rep. 9, 17866 (2019).
pubmed: 31831772
pmcid: 6908713
doi: 10.1038/s41598-019-54447-w
De Magalhães, J. P. & Costa, J. A database of vertebrate longevity records and their relation to other life-history traits. J. Evol. Biol. 22, 1770–1774 (2009).
pubmed: 19522730
doi: 10.1111/j.1420-9101.2009.01783.x
Myhrvold, N. P. et al. An amniote life-history database to perform comparative analyses with birds, mammals, and reptiles. Ecology 96, 3109 (2015).
doi: 10.1890/15-0846R.1
Jones, K. E. et al. PanTHERIA: a species-level database of life history, ecology, and geography of extant and recently extinct mammals. Ecology 90, 2648 (2009).
doi: 10.1890/08-1494.1
Périer, R. C., Praz, V., Junier, T., Bonnard, C. & Bucher, P. The Eukaryotic Promoter Database (EPD). Nucleic Acids Res. 28, 302–303 (2000).
pubmed: 10592254
pmcid: 102462
doi: 10.1093/nar/28.1.302
Dreos, R., Ambrosini, G., Périer, R. C. & Bucher, P. The Eukaryotic Promoter Database: expansion of EPDnew and new promoter analysis tools. Nucleic Acids Res. 43, D92–D96 (2015).
pubmed: 25378343
doi: 10.1093/nar/gku1111
Lenhard, B., Sandelin, A. & Carninci, P. Metazoan promoters: emerging characteristics and insights into transcriptional regulation. Nat. Rev. Genet. 13, 233–245 (2012).
pubmed: 22392219
doi: 10.1038/nrg3163
R Core Team. R: A Language and Environment for Statistical Computing (R Core Team, 2013).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1 (2010).
pubmed: 20808728
pmcid: 2929880
doi: 10.18637/jss.v033.i01
Ooi, H. glmnetUtils: utilities for’Glmnet’. R package version 1.1. 5. https://cran.r-project.org/web/packages/glmnetUtils/glmnetUtils.pdf (2020).
Breiman, L. Bagging predictors. Mach. Learn. 24, 123–140 (1996).
doi: 10.1007/BF00058655
Stock, A., Gregr, E. J. & Chan, K. M. A. Data leakage jeopardizes ecological applications of machine learning. Nat. Ecol. Evol. 7, 1743–1745 (2023).
pubmed: 37528205
doi: 10.1038/s41559-023-02162-1
Kapoor, S. & Narayanan, A. Leakage and the reproducibility crisis in machine-learning-based science. Patterns 4, 100804 (2023).
pubmed: 37720327
pmcid: 10499856
doi: 10.1016/j.patter.2023.100804
Taquet, V., Blot, V., Morzadec, T., Lacombe, L. & Brunel, N. MAPIE: an open-source library for distribution-free uncertainty quantification. arXiv preprint arXiv:2207.12274 (2022).
Kim, B., Xu, C. & Barber, R. Predictive inference is free with the jackknife+-after-bootstrap. Adv. Neural Inf. Process. Syst. 33, 4138–4149 (2020).
Yong, S. Y. & Ong, C. S. Uncertainty quantification of the virial black hole mass with conformal prediction. Mon. Not. R. Astron Soc. 524, 3116–3129 (2023).
doi: 10.1093/mnras/stad2080
Quenouille, M. H. Problems in plane sampling. Ann. Math. Stat. 20, 355–375 (1949).
doi: 10.1214/aoms/1177729989
Khosravi, A., Nahavandi, S. & Creighton, D. Construction of optimal prediction intervals for load forecasting problems. IEEE Trans. Power Syst. 25, 1496–1503 (2010).
doi: 10.1109/TPWRS.2010.2042309
Kolberg, L., Raudvere, U., Kuzmin, I., Vilo, J. & Peterson, H. gprofiler2—an R package for gene list functional enrichment analysis and namespace conversion toolset g: Profiler. F1000Res 9, ELIXIR–709 (2020).
pubmed: 33564394
pmcid: 7859841
doi: 10.12688/f1000research.24956.2
Wickham, H. ggplot2. Wiley Interdiscip. Rev. Comput. Stat. 3, 180–185 (2011).
doi: 10.1002/wics.147
Yu, G., Smith, D. K., Zhu, H., Guan, Y. & Lam, T. T.-Y. ggtree: an r package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol. Evol. 8, 28–36 (2017).
doi: 10.1111/2041-210X.12628
Xu, S. et al. ggtreeExtra: compact visualization of richly annotated phylogenetic data. Mol. Biol. Evol. 38, 4039–4042 (2021).
pubmed: 34097064
pmcid: 8382893
doi: 10.1093/molbev/msab166
Wickham, H. et al. Welcome to the Tidyverse. J. Open Source Softw. 4, 1686 (2019).
doi: 10.21105/joss.01686
Kuhn, M. et al. Package ‘caret’. R. J. 223, 7 (2020).
Harris et al. Array programming with NumPy. Nature 585, 357–362 (2020).
pubmed: 32939066
pmcid: 7759461
doi: 10.1038/s41586-020-2649-2
McKinney, W. Data structures for statistical computing in Python. SciPy 445, 51–56 (2010).
Virtanen, P. et al. Fundamental algorithms for scientific computing in Python and SciPy 1.0 contributors. SciPy 1.0. Nat. Methods 17, 261–272 (2020).
pubmed: 32015543
pmcid: 7056644
doi: 10.1038/s41592-019-0686-2
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Budd, A. M. & Yong, S. Y. Code and Data for ‘Universal Prediction of Vertebrate Species Age at Maturity’ https://doi.org/10.5281/zenodo.13637779 (2024).