Establishing the reliability of metrics extracted from long-form recordings using LENA and the ACLEW pipeline.

Accuracy Big data Daylong recordings Speech technology

Journal

Behavior research methods

ISSN: 1554-3528

Titre abrégé: Behav Res Methods

Pays: United States

ID NLM: 101244316

Informations de publication

Date de publication:
20 Sep 2024

Historique:

accepted: 31 07 2024

medline: 21 9 2024

pubmed: 21 9 2024

entrez: 20 9 2024

Statut: aheadofprint

Résumé

Long-form audio recordings are increasingly used to study individual variation, group differences, and many other topics in theoretical and applied fields of developmental science, particularly for the description of children's language input (typically speech from adults) and children's language output (ranging from babble to sentences). The proprietary LENA software has been available for over a decade, and with it, users have come to rely on derived metrics like adult word count (AWC) and child vocalization counts (CVC), which have also more recently been derived using an open-source alternative, the ACLEW pipeline. Yet, there is relatively little work assessing the reliability of long-form metrics in terms of the stability of individual differences across time. Filling this gap, we analyzed eight spoken-language datasets: four from North American English-learning infants, and one each from British English-, French-, American English-/Spanish-, and Quechua-/Spanish-learning infants. The audio data were analyzed using two types of processing software: LENA and the ACLEW open-source pipeline. When all corpora were included, we found relatively low to moderate reliability (across multiple recordings, intraclass correlation coefficient attributed to the child identity [Child ICC], was < 50% for most metrics). There were few differences between the two pipelines. Exploratory analyses suggested some differences as a function of child age and corpora. These findings suggest that, while reliability is likely sufficient for various group-level analyses, caution is needed when using either LENA or ACLEW tools to study individual variation. We also encourage improvement of extant tools, specifically targeting accurate measurement of individual variation.

Identifiants

DOI: 10.3758/s13428-024-02493-2 PMID: 39304601

pubmed: 39304601

doi: 10.3758/s13428-024-02493-2

pii: 10.3758/s13428-024-02493-2

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Subventions

Organisme : GENCI-IDRIS

ID : Grant-A0071011046

Organisme : Research Council of Finland

ID : 314602

Organisme : Agence Nationale de la Recherche

ID : ANR-14-CE30-0003 MechELex

Organisme : Agence Nationale de la Recherche

ID : ANR-16-DATA-0004 ACLEW

Organisme : James S. McDonnell Foundation

ID : Understanding Human Cognition Scholar Award

Organisme : Social Sciences and Humanities Research Council of Canada

ID : 435-2015-0628

Organisme : Social Sciences and Humanities Research Council of Canada

ID : 869-2016-0003 (ACLEW)

Organisme : Directorate for Engineering

ID : ACI-1445606

Organisme : Directorate for Engineering

ID : OCI-1053575

Organisme : Directorate for Engineering

ID : Pittsburgh Supercomputing Center (PSC)

Organisme : Horizon 2020 Framework Programme

ID : ExELang

Organisme : Horizon 2020 Framework Programme

ID : Grant agreement No. 101001095

Informations de copyright

Références

Al Futaisi, N., Zhang, Z., Cristia, A., Warlaumont, A., & Schuller, B. (2019). VCMNet: Weakly supervised learning for automatic infant vocalisation maturity analysis. International Conference on Multimodal Interaction, 2019, 205–209. https://doi.org/10.1145/3340555.3353751

doi: 10.1145/3340555.3353751

Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1. https://doi.org/10.18637/jss.v067.i01

doi: 10.18637/jss.v067.i01

Bergelson, E. (2017). Bergelson seedlings homebank. Corpus. https://doi.org/10.21415/T5PK6D

doi: 10.21415/T5PK6D

Bergelson, E., Amatuni, A., Dailey, S., Koorathota, S., & Tor, S. (2019). Day by day, hour by hour: Naturalistic language input to infants. Developmental Science, 22(1), e12715. https://doi.org/10.1111/desc.12715

doi: 10.1111/desc.12715 pubmed: 30094888

Bergelson, E., Soderstrom, M., Schwarz, I.-C., Rowland, C. F., Ramírez-Esparza, N., Hamrick, L. R., Marklund, E., Kalashnikova, M., Guez, A., Casillas, M., Benetti, L., van Alphen, P., & Cristia, A. (2023). Everyday language input and production in 1,001 children from six continents. Proceedings of the National Academy of Sciences, 120(52), e2300671120. https://doi.org/10.1073/pnas.2300671120

doi: 10.1073/pnas.2300671120

Canault, M., Le Normand, M.-T., Foudil, S., Loundon, N., & Thai-Van, H. (2016). Reliability of the Language ENvironment Analysis system (LENA

doi: 10.3758/s13428-015-0634-8 pubmed: 26174716

Casillas, M., & Cristia, A. (2019). A step-by-step guide to collecting and analyzing long-format speech environment (LFSE) recordings. Collabra: Psychology, 5(1), 24. 10/gnzf8w.

doi: 10.1525/collabra.209

Cristia, A., Seidl, A., Singh, L., & Houston, D. (2016). Test-Retest reliability in infant speech perception tasks. Infancy, 21(5), 648–667. https://doi.org/10.1111/infa.12127

doi: 10.1111/infa.12127

Cristia, A., Bulgarelli, F., & Bergelson, E. (2020). Accuracy of the language environment analysis system segmentation and metrics: A systematic review. Journal of Speech, Language, and Hearing Research, 63(4), 1093–1105. https://doi.org/10.1044/2020_JSLHR-19-00017

doi: 10.1044/2020_JSLHR-19-00017 pubmed: 32302262 pmcid: 7242991

Cychosz, M. (2022). Language exposure predicts children’s phonetic patterning: Evidence from language shift. Language, 98(3), 461–498. https://doi.org/10.1353/lan.0.0269

doi: 10.1353/lan.0.0269 pubmed: 37034148 pmcid: 10079255

Cychosz, M., Edwards, J., Munson, B., Romeo, R. R., Kosie, J., & Newman, R. (2024). The everyday speech environments of preschoolers with and without cochlear implants. Journal of Child Language, 1–22. https://doi.org/10.1017/S0305000924000023

d’Apice, K., & Stumm, S. von. (2019). Does Age Moderate the Influence of Early Life Language Experiences? A Naturalistic Home Observation Study. PsyArXiv. https://doi.org/10.31234/osf.io/jr4by

Denman, D., Speyer, R., Munro, N., Pearce, W. M., Chen, Y.-W., & Cordier, R. (2017). Psychometric Properties of Language Assessments for Children Aged 4–12 Years: A Systematic Review. Frontiers in Psychology, 8, 1515. https://doi.org/10.3389/fpsyg.2017.01515

doi: 10.3389/fpsyg.2017.01515 pubmed: 28936189 pmcid: 5594094

Drude, S., Broeder, D., Trilsbeek, P., & Wittenburg, P. (2012). The Language Archive – a new hub for language resources. LREC (pp. 3264–3267). European Language Resources Association.

Fausey, C. M., & Mendoza, J. K. (2018). FauseyTrio HomeBank Corpus [dataset]. Homebank. https://doi.org/10.21415/T5JM4R

Fenson, L., Dale, P. S., Reznick, J. S., Bates, E., Thal, D. J., & Pethick, S. J. (1994). Variability in early communicative development. Monographs of the Society for Research in Child Development, 59(5), 1–173. discussion 174–185.

doi: 10.2307/1166093 pubmed: 7845413

Fibla Reixachs, L. (2021). Relating language input to language processes early in development: Using the early language processing task in UK and India [Doctoral dissertation, University of East Anglia]. https://ueaeprints.uea.ac.uk/id/eprint/83017/

Ganek, H., & Eriks-Brophy, A. (2018). Language ENvironment analysis (LENA) system investigation of day long recordings in children: A literature review. Journal of Communication Disorders, 72, 77–85. 10/gmtkpd.

doi: 10.1016/j.jcomdis.2017.12.005 pubmed: 29402382

Gilkerson, J., Richards, J. A., Warren, S. F., Montgomery, J. K., Greenwood, C. R., Kimbrough Oller, D., Hansen, J. H. L., & Paul, T. D. (2017). Mapping the Early Language Environment Using All-Day Recordings and Automated Analysis. American Journal of Speech-Language Pathology, 26(2), 248–265. 10/gfzjg3.

doi: 10.1044/2016_AJSLP-15-0169 pubmed: 28418456 pmcid: 6195063

Grahek, I., Schaller, M., & Tackett, J. L. (2021). Anatomy of a psychological theory: Integrating construct-validation and computational-modeling methods to advance theorizing. Perspectives on Psychological Science, 16(4), 803–815. 10/ghtczb.

doi: 10.1177/1745691620966794 pubmed: 33404380

Greenwood, C. R., Thiemann-Bourque, K., Walker, D., Buzhardt, J., & Gilkerson, J. (2011). Assessing children’s home language environments using automatic speech recognition technology. Communication Disorders Quarterly, 32(2), 83–92. 10/fgpwcm.

doi: 10.1177/1525740110367826

Kobsar, D., Charlton, J. M., Tse, C. T. F., Esculier, J.-F., Graffos, A., Krowchuk, N. M., Thatcher, D., & Hunt, M. A. (2020). Validity and reliability of wearable inertial sensors in healthy adult walking: A systematic review and meta-analysis. Journal of NeuroEngineering and Rehabilitation, 17(1), 62. https://doi.org/10.1186/s12984-020-00685-3

doi: 10.1186/s12984-020-00685-3 pubmed: 32393301 pmcid: 7216606

Koo, T. K., & Li, M. Y. (2016). A guideline of selecting and reporting intraclass correlation coefficients for reliability research. Journal of Chiropractic Medicine, 15(2), 155–163. https://doi.org/10.1016/j.jcm.2016.02.012

doi: 10.1016/j.jcm.2016.02.012 pubmed: 27330520 pmcid: 4913118

Lavechin, M., Bousbib, R., Bredin, H., Dupoux, E., & Cristia, A. (2020). An open-source voice type classifier for child-centered daylong recordings. Proceedings of Interspeech. http://arxiv.org/abs/2005.12656

Levin-Asher, B., Segal, O., & Kishon-Rabin, L. (2023). The validity of LENA technology for assessing the linguistic environment and interactions of infants learning Hebrew and Arabic. Behavior Research Methods, 55(3), 1480–1495. https://doi.org/10.3758/s13428-022-01874-9

doi: 10.3758/s13428-022-01874-9 pubmed: 35668342

Lüdecke, D., Ben-Shachar, M. S., Patil, I., Waggoner, P., & Makowski, D. (2021). performance: An R Package for assessment. Comparison and Testing of Statistical Models, 6, 3139. https://doi.org/10.21105/joss.03139

doi: 10.21105/joss.03139

McDivitt, K., & Soderstrom, M. (2016). HomeBank English McDivitt/Winnipeg Corpora [dataset]. Homebank. https://doi.org/10.21415/T5KK6G

Pisani, S., Gautheron, L., & Cristia, A. (2021). Long-form recordings: From A to Z [Unpublished manuscript]. https://doi.org/10.5281/zenodo.6685828

R Core Team. (2023). R: A language and environment for statistical computing (4.3.0) [Computer software]. R Foundation for Statistical Computing https://www.R-project.org/

Räsänen, O., Seshadri, S., Lavechin, M., Cristia, A., & Casillas, M. (2021). ALICE: An open-source tool for automatic measurement of phoneme, syllable, and word counts from child-centered daylong recordings. Behavior Research Methods, 53(2), 818–835. 10/ghb2g8.

doi: 10.3758/s13428-020-01460-x pubmed: 32875399

Rowland, C. F., Bidgood, A., Durrant, S., Peter, M., & Pine, J. M. (2018). The Language 0–5 Project Corpus [dataset]. University of Liverpool. https://doi.org/10.17605/OSF.IO/KAU5F

doi: 10.17605/OSF.IO/KAU5F

Schuller, B., Räsänen, O., Metze, F., Dupoux, E., & Cristia, A. (2024). ACLEW tools report [Unpublished manuscript]. https://doi.org/10.17605/OSF.IO/47RZD

Soderstrom, M., Casillas, M., Bergelson, E., Rosemberg, C., Alam, F., Warlaumont, A. S., & Bunce, J. (2021). Developing a cross-cultural annotation system and MetaCorpus for studying infants’ real world language experience. Collabra Psychology, 7(1), 23445. 10/gnzf8x

doi: 10.1525/collabra.23445

VanDam, M. (2018). HomeBank English Cougar Corpus [dataset]. Homebank. https://doi.org/10.21415/T5WT25

VanDam, M., Warlaumont, A. S., Bergelson, E., Cristia, A., Soderstrom, M., De Palma, P., & Macwhinney, B. (2016). HomeBank: An online repository of daylong child-centered audio recordings. Seminars in Speech and Language, 37(2), 128–142. https://doi.org/10.1055/s-0036-1580745

doi: 10.1055/s-0036-1580745 pubmed: 27111272 pmcid: 5570530

Velikonja, T., Edbrooke-Childs, J., Calderon, A., Sleed, M., Brown, A., & Deighton, J. (2017). The psychometric properties of the Ages & Stages Questionnaires for ages 2–25: A systematic review. Child Care, Health and Development, 43(1), 1–17. https://doi.org/10.1111/cch.12397

doi: 10.1111/cch.12397 pubmed: 27554865

Wang, Y., Williams, R., Dilley, L., & Houston, D. M. (2020). A meta-analysis of the predictability of LENA

doi: 10.1016/j.dr.2020.100921 pubmed: 32632339 pmcid: 7337141

Warlaumont, A. S., Pretzer, G. M., Mendoza, S., & Walle, E. A. (2016). Warlaumont HomeBank Corpus. https://doi.org/10.21415/T54S3C

Establishing the reliability of metrics extracted from long-form recordings using LENA and the ACLEW pipeline.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Subventions

Informations de copyright

Références

Auteurs

Alejandrina Cristia (A)

Lucas Gautheron (L)

Zixing Zhang (Z)

Björn Schuller (B)

Camila Scaff (C)

Caroline Rowland (C)

Okko Räsänen (O)

Loann Peurey (L)

Marvin Lavechin (M)

William Havard (W)

Caitlin M Fausey (CM)

Margaret Cychosz (M)

Elika Bergelson (E)

Heather Anderson (H)

Najla Al Futaisi (N)

Melanie Soderstrom (M)

Classifications MeSH