Establishing the reliability of metrics extracted from long-form recordings using LENA and the ACLEW pipeline.

Accuracy Big data Daylong recordings Speech technology

Journal

Behavior research methods
ISSN: 1554-3528
Titre abrégé: Behav Res Methods
Pays: United States
ID NLM: 101244316

Informations de publication

Date de publication:
20 Sep 2024
Historique:
accepted: 31 07 2024
medline: 21 9 2024
pubmed: 21 9 2024
entrez: 20 9 2024
Statut: aheadofprint

Résumé

Long-form audio recordings are increasingly used to study individual variation, group differences, and many other topics in theoretical and applied fields of developmental science, particularly for the description of children's language input (typically speech from adults) and children's language output (ranging from babble to sentences). The proprietary LENA software has been available for over a decade, and with it, users have come to rely on derived metrics like adult word count (AWC) and child vocalization counts (CVC), which have also more recently been derived using an open-source alternative, the ACLEW pipeline. Yet, there is relatively little work assessing the reliability of long-form metrics in terms of the stability of individual differences across time. Filling this gap, we analyzed eight spoken-language datasets: four from North American English-learning infants, and one each from British English-, French-, American English-/Spanish-, and Quechua-/Spanish-learning infants. The audio data were analyzed using two types of processing software: LENA and the ACLEW open-source pipeline. When all corpora were included, we found relatively low to moderate reliability (across multiple recordings, intraclass correlation coefficient attributed to the child identity [Child ICC], was < 50% for most metrics). There were few differences between the two pipelines. Exploratory analyses suggested some differences as a function of child age and corpora. These findings suggest that, while reliability is likely sufficient for various group-level analyses, caution is needed when using either LENA or ACLEW tools to study individual variation. We also encourage improvement of extant tools, specifically targeting accurate measurement of individual variation.

Identifiants

pubmed: 39304601
doi: 10.3758/s13428-024-02493-2
pii: 10.3758/s13428-024-02493-2
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : GENCI-IDRIS
ID : Grant-A0071011046
Organisme : Research Council of Finland
ID : 314602
Organisme : Agence Nationale de la Recherche
ID : ANR-14-CE30-0003 MechELex
Organisme : Agence Nationale de la Recherche
ID : ANR-16-DATA-0004 ACLEW
Organisme : James S. McDonnell Foundation
ID : Understanding Human Cognition Scholar Award
Organisme : Social Sciences and Humanities Research Council of Canada
ID : 435-2015-0628
Organisme : Social Sciences and Humanities Research Council of Canada
ID : 869-2016-0003 (ACLEW)
Organisme : Directorate for Engineering
ID : ACI-1445606
Organisme : Directorate for Engineering
ID : OCI-1053575
Organisme : Directorate for Engineering
ID : Pittsburgh Supercomputing Center (PSC)
Organisme : Horizon 2020 Framework Programme
ID : ExELang
Organisme : Horizon 2020 Framework Programme
ID : Grant agreement No. 101001095

Informations de copyright

© 2024. The Psychonomic Society, Inc.

Références

Al Futaisi, N., Zhang, Z., Cristia, A., Warlaumont, A., & Schuller, B. (2019). VCMNet: Weakly supervised learning for automatic infant vocalisation maturity analysis. International Conference on Multimodal Interaction, 2019, 205–209. https://doi.org/10.1145/3340555.3353751
doi: 10.1145/3340555.3353751
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1. https://doi.org/10.18637/jss.v067.i01
doi: 10.18637/jss.v067.i01
Bergelson, E. (2017). Bergelson seedlings homebank. Corpus. https://doi.org/10.21415/T5PK6D
doi: 10.21415/T5PK6D
Bergelson, E., Amatuni, A., Dailey, S., Koorathota, S., & Tor, S. (2019). Day by day, hour by hour: Naturalistic language input to infants. Developmental Science, 22(1), e12715. https://doi.org/10.1111/desc.12715
doi: 10.1111/desc.12715 pubmed: 30094888
Bergelson, E., Soderstrom, M., Schwarz, I.-C., Rowland, C. F., Ramírez-Esparza, N., Hamrick, L. R., Marklund, E., Kalashnikova, M., Guez, A., Casillas, M., Benetti, L., van Alphen, P., & Cristia, A. (2023). Everyday language input and production in 1,001 children from six continents. Proceedings of the National Academy of Sciences, 120(52), e2300671120. https://doi.org/10.1073/pnas.2300671120
doi: 10.1073/pnas.2300671120
Canault, M., Le Normand, M.-T., Foudil, S., Loundon, N., & Thai-Van, H. (2016). Reliability of the Language ENvironment Analysis system (LENA
doi: 10.3758/s13428-015-0634-8 pubmed: 26174716
Casillas, M., & Cristia, A. (2019). A step-by-step guide to collecting and analyzing long-format speech environment (LFSE) recordings. Collabra: Psychology, 5(1), 24. 10/gnzf8w.
doi: 10.1525/collabra.209
Cristia, A., Seidl, A., Singh, L., & Houston, D. (2016). Test-Retest reliability in infant speech perception tasks. Infancy, 21(5), 648–667. https://doi.org/10.1111/infa.12127
doi: 10.1111/infa.12127
Cristia, A., Bulgarelli, F., & Bergelson, E. (2020). Accuracy of the language environment analysis system segmentation and metrics: A systematic review. Journal of Speech, Language, and Hearing Research, 63(4), 1093–1105. https://doi.org/10.1044/2020_JSLHR-19-00017
doi: 10.1044/2020_JSLHR-19-00017 pubmed: 32302262 pmcid: 7242991
Cychosz, M. (2022). Language exposure predicts children’s phonetic patterning: Evidence from language shift. Language, 98(3), 461–498. https://doi.org/10.1353/lan.0.0269
doi: 10.1353/lan.0.0269 pubmed: 37034148 pmcid: 10079255
Cychosz, M., Edwards, J., Munson, B., Romeo, R. R., Kosie, J., & Newman, R. (2024). The everyday speech environments of preschoolers with and without cochlear implants. Journal of Child Language, 1–22. https://doi.org/10.1017/S0305000924000023
d’Apice, K., & Stumm, S. von. (2019). Does Age Moderate the Influence of Early Life Language Experiences? A Naturalistic Home Observation Study. PsyArXiv. https://doi.org/10.31234/osf.io/jr4by
Denman, D., Speyer, R., Munro, N., Pearce, W. M., Chen, Y.-W., & Cordier, R. (2017). Psychometric Properties of Language Assessments for Children Aged 4–12 Years: A Systematic Review. Frontiers in Psychology, 8, 1515. https://doi.org/10.3389/fpsyg.2017.01515
doi: 10.3389/fpsyg.2017.01515 pubmed: 28936189 pmcid: 5594094
Drude, S., Broeder, D., Trilsbeek, P., & Wittenburg, P. (2012). The Language Archive – a new hub for language resources. LREC (pp. 3264–3267). European Language Resources Association.
Fausey, C. M., & Mendoza, J. K. (2018). FauseyTrio HomeBank Corpus [dataset]. Homebank. https://doi.org/10.21415/T5JM4R
Fenson, L., Dale, P. S., Reznick, J. S., Bates, E., Thal, D. J., & Pethick, S. J. (1994). Variability in early communicative development. Monographs of the Society for Research in Child Development, 59(5), 1–173. discussion 174–185.
doi: 10.2307/1166093 pubmed: 7845413
Fibla Reixachs, L. (2021). Relating language input to language processes early in development: Using the early language processing task in UK and India [Doctoral dissertation, University of East Anglia]. https://ueaeprints.uea.ac.uk/id/eprint/83017/
Ganek, H., & Eriks-Brophy, A. (2018). Language ENvironment analysis (LENA) system investigation of day long recordings in children: A literature review. Journal of Communication Disorders, 72, 77–85. 10/gmtkpd.
doi: 10.1016/j.jcomdis.2017.12.005 pubmed: 29402382
Gilkerson, J., Richards, J. A., Warren, S. F., Montgomery, J. K., Greenwood, C. R., Kimbrough Oller, D., Hansen, J. H. L., & Paul, T. D. (2017). Mapping the Early Language Environment Using All-Day Recordings and Automated Analysis. American Journal of Speech-Language Pathology, 26(2), 248–265. 10/gfzjg3.
doi: 10.1044/2016_AJSLP-15-0169 pubmed: 28418456 pmcid: 6195063
Grahek, I., Schaller, M., & Tackett, J. L. (2021). Anatomy of a psychological theory: Integrating construct-validation and computational-modeling methods to advance theorizing. Perspectives on Psychological Science, 16(4), 803–815. 10/ghtczb.
doi: 10.1177/1745691620966794 pubmed: 33404380
Greenwood, C. R., Thiemann-Bourque, K., Walker, D., Buzhardt, J., & Gilkerson, J. (2011). Assessing children’s home language environments using automatic speech recognition technology. Communication Disorders Quarterly, 32(2), 83–92. 10/fgpwcm.
doi: 10.1177/1525740110367826
Kobsar, D., Charlton, J. M., Tse, C. T. F., Esculier, J.-F., Graffos, A., Krowchuk, N. M., Thatcher, D., & Hunt, M. A. (2020). Validity and reliability of wearable inertial sensors in healthy adult walking: A systematic review and meta-analysis. Journal of NeuroEngineering and Rehabilitation, 17(1), 62. https://doi.org/10.1186/s12984-020-00685-3
doi: 10.1186/s12984-020-00685-3 pubmed: 32393301 pmcid: 7216606
Koo, T. K., & Li, M. Y. (2016). A guideline of selecting and reporting intraclass correlation coefficients for reliability research. Journal of Chiropractic Medicine, 15(2), 155–163. https://doi.org/10.1016/j.jcm.2016.02.012
doi: 10.1016/j.jcm.2016.02.012 pubmed: 27330520 pmcid: 4913118
Lavechin, M., Bousbib, R., Bredin, H., Dupoux, E., & Cristia, A. (2020). An open-source voice type classifier for child-centered daylong recordings. Proceedings of Interspeech. http://arxiv.org/abs/2005.12656
Levin-Asher, B., Segal, O., & Kishon-Rabin, L. (2023). The validity of LENA technology for assessing the linguistic environment and interactions of infants learning Hebrew and Arabic. Behavior Research Methods, 55(3), 1480–1495. https://doi.org/10.3758/s13428-022-01874-9
doi: 10.3758/s13428-022-01874-9 pubmed: 35668342
Lüdecke, D., Ben-Shachar, M. S., Patil, I., Waggoner, P., & Makowski, D. (2021). performance: An R Package for assessment. Comparison and Testing of Statistical Models, 6, 3139. https://doi.org/10.21105/joss.03139
doi: 10.21105/joss.03139
McDivitt, K., & Soderstrom, M. (2016). HomeBank English McDivitt/Winnipeg Corpora [dataset]. Homebank. https://doi.org/10.21415/T5KK6G
Pisani, S., Gautheron, L., & Cristia, A. (2021). Long-form recordings: From A to Z [Unpublished manuscript]. https://doi.org/10.5281/zenodo.6685828
R Core Team. (2023). R: A language and environment for statistical computing (4.3.0) [Computer software]. R Foundation for Statistical Computing https://www.R-project.org/
Räsänen, O., Seshadri, S., Lavechin, M., Cristia, A., & Casillas, M. (2021). ALICE: An open-source tool for automatic measurement of phoneme, syllable, and word counts from child-centered daylong recordings. Behavior Research Methods, 53(2), 818–835. 10/ghb2g8.
doi: 10.3758/s13428-020-01460-x pubmed: 32875399
Rowland, C. F., Bidgood, A., Durrant, S., Peter, M., & Pine, J. M. (2018). The Language 0–5 Project Corpus [dataset]. University of Liverpool. https://doi.org/10.17605/OSF.IO/KAU5F
doi: 10.17605/OSF.IO/KAU5F
Schuller, B., Räsänen, O., Metze, F., Dupoux, E., & Cristia, A. (2024). ACLEW tools report [Unpublished manuscript]. https://doi.org/10.17605/OSF.IO/47RZD
Soderstrom, M., Casillas, M., Bergelson, E., Rosemberg, C., Alam, F., Warlaumont, A. S., & Bunce, J. (2021). Developing a cross-cultural annotation system and MetaCorpus for studying infants’ real world language experience. Collabra Psychology, 7(1), 23445. 10/gnzf8x
doi: 10.1525/collabra.23445
VanDam, M. (2018). HomeBank English Cougar Corpus [dataset]. Homebank. https://doi.org/10.21415/T5WT25
VanDam, M., Warlaumont, A. S., Bergelson, E., Cristia, A., Soderstrom, M., De Palma, P., & Macwhinney, B. (2016). HomeBank: An online repository of daylong child-centered audio recordings. Seminars in Speech and Language, 37(2), 128–142. https://doi.org/10.1055/s-0036-1580745
doi: 10.1055/s-0036-1580745 pubmed: 27111272 pmcid: 5570530
Velikonja, T., Edbrooke-Childs, J., Calderon, A., Sleed, M., Brown, A., & Deighton, J. (2017). The psychometric properties of the Ages & Stages Questionnaires for ages 2–25: A systematic review. Child Care, Health and Development, 43(1), 1–17. https://doi.org/10.1111/cch.12397
doi: 10.1111/cch.12397 pubmed: 27554865
Wang, Y., Williams, R., Dilley, L., & Houston, D. M. (2020). A meta-analysis of the predictability of LENA
doi: 10.1016/j.dr.2020.100921 pubmed: 32632339 pmcid: 7337141
Warlaumont, A. S., Pretzer, G. M., Mendoza, S., & Walle, E. A. (2016). Warlaumont HomeBank Corpus. https://doi.org/10.21415/T54S3C

Auteurs

Alejandrina Cristia (A)

Laboratoire de Sciences Cognitives et de Psycholinguistique, Département d'Etudes cognitives, ENS, EHESS, CNRS, PSL University, Paris, France. alecristia@gmail.com.

Lucas Gautheron (L)

Laboratoire de Sciences Cognitives et de Psycholinguistique, Département d'Etudes cognitives, ENS, EHESS, CNRS, PSL University, Paris, France.
Interdisciplinary Centre for Science and Technology Studies (IZWT) Wuppertal, University of Wuppertal, Nordrhein-Westfalen, Germany.

Zixing Zhang (Z)

School of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan, China.

Björn Schuller (B)

Technische Universität München, Institute for Human-Machine Communication, Munich, Germany.
Imperial College London, GLAM - Group on Language, Audio, & Music, London, UK.

Camila Scaff (C)

Laboratoire de Sciences Cognitives et de Psycholinguistique, Département d'Etudes cognitives, ENS, EHESS, CNRS, PSL University, Paris, France.
Human Ecology group, Institute of Evolutionary Medicine, University of Zurich, Zurich, Switzerland.

Caroline Rowland (C)

Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands.

Okko Räsänen (O)

Unit of Computing Sciences, Tampere University, Tampere, Finland.

Loann Peurey (L)

Laboratoire de Sciences Cognitives et de Psycholinguistique, Département d'Etudes cognitives, ENS, EHESS, CNRS, PSL University, Paris, France.

Marvin Lavechin (M)

Laboratoire de Sciences Cognitives et de Psycholinguistique, Département d'Etudes cognitives, ENS, EHESS, CNRS, PSL University, Paris, France.

William Havard (W)

Laboratoire de Sciences Cognitives et de Psycholinguistique, Département d'Etudes cognitives, ENS, EHESS, CNRS, PSL University, Paris, France.
LLL, Université d'Orléans, CNRS, Orléans, France.

Caitlin M Fausey (CM)

Psychology, University of Oregon, Eugene, OR, USA.

Margaret Cychosz (M)

Department of Hearing and Speech Sciences, University of Maryland at College Park, College Park, MD, USA.

Elika Bergelson (E)

Harvard University, Psychology, Cambridge, MA, USA.

Heather Anderson (H)

Psychology, University of Oregon, Eugene, OR, USA.

Najla Al Futaisi (N)

Imperial College London, GLAM - Group on Language, Audio, & Music, London, UK.

Melanie Soderstrom (M)

University of Manitoba, Psychology, Winnipeg, MB, Canada.

Classifications MeSH