Establishing the reliability of metrics extracted from long-form recordings using LENA and the ACLEW pipeline.
Accuracy
Big data
Daylong recordings
Speech technology
Journal
Behavior research methods
ISSN: 1554-3528
Abbreviated title: Behav Res Methods
Country: United States
NLM ID: 101244316
Publication information
Publication date:
20 Sep 2024
History:
accepted: 31 Jul 2024
medline: 21 Sep 2024
pubmed: 21 Sep 2024
entrez: 20 Sep 2024
Status:
aheadofprint
Abstract
Long-form audio recordings are increasingly used to study individual variation, group differences, and many other topics in theoretical and applied fields of developmental science, particularly for the description of children's language input (typically speech from adults) and children's language output (ranging from babble to sentences). The proprietary LENA software has been available for over a decade, and with it, users have come to rely on derived metrics like adult word count (AWC) and child vocalization count (CVC), which have also more recently been derived using an open-source alternative, the ACLEW pipeline. Yet there is relatively little work assessing the reliability of long-form metrics in terms of the stability of individual differences across time. Filling this gap, we analyzed eight spoken-language datasets: four from North American English-learning infants, and one each from British English-, French-, American English-/Spanish-, and Quechua-/Spanish-learning infants. The audio data were analyzed using two types of processing software: LENA and the ACLEW open-source pipeline. When all corpora were included, we found relatively low to moderate reliability (across multiple recordings, the intraclass correlation coefficient attributed to child identity [Child ICC] was < 50% for most metrics). There were few differences between the two pipelines. Exploratory analyses suggested some differences as a function of child age and corpus. These findings suggest that, while reliability is likely sufficient for various group-level analyses, caution is needed when using either LENA or ACLEW tools to study individual variation. We also encourage improvement of extant tools, specifically targeting accurate measurement of individual variation.
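To make the Child ICC reported above concrete, the following is a minimal sketch of how an intraclass correlation can be estimated from repeated recordings per child. It is an assumption-laden illustration and not the analysis used in the paper: it uses the classical one-way ANOVA estimator for a balanced design (same number of recordings per child), and the function name `child_icc`, the dictionary layout, and the example values are all hypothetical.

```python
from statistics import mean


def child_icc(recordings):
    """One-way ANOVA estimate of the ICC attributed to child identity.

    `recordings` maps a child identifier to a list of metric values,
    one value per recording (for example, adult word counts).
    Assumes a balanced design: every child has the same number of recordings.
    """
    groups = list(recordings.values())
    n_children = len(groups)
    if n_children < 2:
        raise ValueError("need at least two children")
    k = len(groups[0])
    if k < 2 or any(len(g) != k for g in groups):
        raise ValueError("need the same number (>= 2) of recordings per child")

    grand_mean = mean(v for g in groups for v in g)
    child_means = [mean(g) for g in groups]

    # Between-child mean square: how much child averages differ from each other.
    ms_between = k * sum((m - grand_mean) ** 2 for m in child_means) / (n_children - 1)
    # Within-child mean square: how much a child's recordings differ from each other.
    ms_within = sum(
        (v - m) ** 2 for g, m in zip(groups, child_means) for v in g
    ) / (n_children * (k - 1))

    denominator = ms_between + (k - 1) * ms_within
    if denominator == 0:
        raise ValueError("no variance in the data; ICC is undefined")
    return (ms_between - ms_within) / denominator


# Illustrative, made-up adult word counts for three children, three recordings each.
awc_by_child = {
    "child_01": [12000.0, 15500.0, 13200.0],
    "child_02": [8000.0, 14000.0, 9500.0],
    "child_03": [16000.0, 11000.0, 17500.0],
}
print(f"Child ICC (one-way estimate): {child_icc(awc_by_child):.2f}")
```

Under this estimator, a value near 1 means that most of the variance in a metric lies between children, so a single recording ranks children consistently; a value below 0.5, as the abstract reports for most metrics, means that more than half of the variance is not attributable to stable differences between children.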
Identifiers
pubmed: 39304601
doi: 10.3758/s13428-024-02493-2
pii: 10.3758/s13428-024-02493-2
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Grants
Agency: GENCI-IDRIS
ID: Grant-A0071011046
Agency: Research Council of Finland
ID: 314602
Agency: Agence Nationale de la Recherche
ID: ANR-14-CE30-0003 MechELex
Agency: Agence Nationale de la Recherche
ID: ANR-16-DATA-0004 ACLEW
Agency: James S. McDonnell Foundation
ID: Understanding Human Cognition Scholar Award
Agency: Social Sciences and Humanities Research Council of Canada
ID: 435-2015-0628
Agency: Social Sciences and Humanities Research Council of Canada
ID: 869-2016-0003 (ACLEW)
Agency: Directorate for Engineering
ID: ACI-1445606
Agency: Directorate for Engineering
ID: OCI-1053575
Agency: Directorate for Engineering
ID: Pittsburgh Supercomputing Center (PSC)
Agency: Horizon 2020 Framework Programme
ID: ExELang
Agency: Horizon 2020 Framework Programme
ID: Grant agreement No. 101001095
Copyright information
© 2024. The Psychonomic Society, Inc.