Single-neuronal elements of speech production in humans.
Journal
Nature
ISSN: 1476-4687
Titre abrégé: Nature
Pays: England
ID NLM: 0410462
Informations de publication
Date de publication:
31 Jan 2024
31 Jan 2024
Historique:
received:
22
06
2023
accepted:
14
12
2023
medline:
1
2
2024
pubmed:
1
2
2024
entrez:
31
1
2024
Statut:
aheadofprint
Résumé
Humans are capable of generating extraordinarily diverse articulatory movement combinations to produce meaningful speech. This ability to orchestrate specific phonetic sequences, and their syllabification and inflection over subsecond timescales allows us to produce thousands of word sounds and is a core component of language
Identifiants
pubmed: 38297120
doi: 10.1038/s41586-023-06982-w
pii: 10.1038/s41586-023-06982-w
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
© 2024. The Author(s).
Références
Levelt, W. J. M., Roelofs, A. & Meyer, A. S. A Theory of Lexical Access in Speech Production Vol. 22 (Cambridge Univ. Press, 1999).
Kazanina, N., Bowers, J. S. & Idsardi, W. Phonemes: lexical access and beyond. Psychon. Bull. Rev. 25, 560–585 (2018).
pubmed: 28875456
doi: 10.3758/s13423-017-1362-0
Bohland, J. W. & Guenther, F. H. An fMRI investigation of syllable sequence production. NeuroImage 32, 821–841 (2006).
pubmed: 16730195
doi: 10.1016/j.neuroimage.2006.04.173
Basilakos, A., Smith, K. G., Fillmore, P., Fridriksson, J. & Fedorenko, E. Functional characterization of the human speech articulation network. Cereb. Cortex 28, 1816–1830 (2017).
pmcid: 5907347
doi: 10.1093/cercor/bhx100
Tourville, J. A., Nieto-Castañón, A., Heyne, M. & Guenther, F. H. Functional parcellation of the speech production cortex. J. Speech Lang. Hear. Res. 62, 3055–3070 (2019).
pubmed: 31465713
pmcid: 6813033
doi: 10.1044/2019_JSLHR-S-CSMC7-18-0442
Lee, D. K. et al. Neural encoding and production of functional morphemes in the posterior temporal lobe. Nat. Commun. 9, 1877 (2018).
pubmed: 29760465
pmcid: 5951905
doi: 10.1038/s41467-018-04235-3
Glanz, O., Hader, M., Schulze-Bonhage, A., Auer, P. & Ball, T. A study of word complexity under conditions of non-experimental, natural overt speech production using ECoG. Front. Hum. Neurosci. 15, 711886 (2021).
pubmed: 35185491
doi: 10.3389/fnhum.2021.711886
Yellapantula, S., Forseth, K., Tandon, N. & Aazhang, B. NetDI: methodology elucidating the role of power and dynamical brain network features that underpin word production. eNeuro 8, ENEURO.0177-20.2020 (2020).
Hoffman, P. Reductions in prefrontal activation predict off-topic utterances during speech production. Nat. Commun. 10, 515 (2019).
pubmed: 30705284
pmcid: 6355898
doi: 10.1038/s41467-019-08519-0
Glasser, M. F. et al. A multi-modal parcellation of human cerebral cortex. Nature 536, 171–178 (2016).
pubmed: 27437579
pmcid: 4990127
doi: 10.1038/nature18933
Chang, E. F. et al. Pure apraxia of speech after resection based in the posterior middle frontal gyrus. Neurosurgery 87, E383–E389 (2020).
pubmed: 32097489
pmcid: 7690655
doi: 10.1093/neuros/nyaa002
Hazem, S. R. et al. Middle frontal gyrus and area 55b: perioperative mapping and language outcomes. Front. Neurol. 12, 646075 (2021).
pubmed: 33776898
pmcid: 7988187
doi: 10.3389/fneur.2021.646075
Fedorenko, E. et al. Neural correlate of the construction of sentence meaning. Proc. Natl Acad. Sci. USA 113, E6256–E6262 (2016).
pubmed: 27671642
pmcid: 5068329
doi: 10.1073/pnas.1612132113
Nelson, M. J. et al. Neurophysiological dynamics of phrase-structure building during sentence processing. Proc. Natl Acad. Sci. USA 114, E3669–E3678 (2017).
pubmed: 28416691
pmcid: 5422821
doi: 10.1073/pnas.1701590114
Walenski, M., Europa, E., Caplan, D. & Thompson, C. K. Neural networks for sentence comprehension and production: an ALE-based meta-analysis of neuroimaging studies. Hum. Brain Mapp. 40, 2275–2304 (2019).
pubmed: 30689268
pmcid: 6597252
doi: 10.1002/hbm.24523
Elin, K. et al. A new functional magnetic resonance imaging localizer for preoperative language mapping using a sentence completion task: validity, choice of baseline condition and test–retest reliability. Front. Hum. Neurosci. 16, 791577 (2022).
pubmed: 35431846
pmcid: 9006995
doi: 10.3389/fnhum.2022.791577
Duffau, H. et al. The role of dominant premotor cortex in language: a study using intraoperative functional mapping in awake patients. Neuroimage 20, 1903–1914 (2003).
pubmed: 14683696
doi: 10.1016/S1053-8119(03)00203-9
Ikeda, S. et al. Neural decoding of single vowels during covert articulation using electrocorticography. Front. Hum. Neurosci. 8, 125 (2014).
pubmed: 24639642
pmcid: 3945950
doi: 10.3389/fnhum.2014.00125
Ghosh, S. S., Tourville, J. A. & Guenther, F. H. A neuroimaging study of premotor lateralization and cerebellar involvement in the production of phonemes and syllables. J. Speech Lang. Hear. Res. 51, 1183–1202 (2008).
pubmed: 18664692
doi: 10.1044/1092-4388(2008/07-0119)
Bouchard, K. E., Mesgarani, N., Johnson, K. & Chang, E. F. Functional organization of human sensorimotor cortex for speech articulation. Nature 495, 327–332 (2013).
pubmed: 23426266
pmcid: 3606666
doi: 10.1038/nature11911
Anumanchipalli, G. K., Chartier, J. & Chang, E. F. Speech synthesis from neural decoding of spoken sentences. Nature 568, 493–498 (2019).
pubmed: 31019317
pmcid: 9714519
doi: 10.1038/s41586-019-1119-1
Moses, D. A. et al. Neuroprosthesis for decoding speech in a paralyzed person with anarthria. N. Engl. J. Med. 385, 217–227 (2021).
pubmed: 34260835
pmcid: 8972947
doi: 10.1056/NEJMoa2027540
Wang, R. et al. Distributed feedforward and feedback cortical processing supports human speech production. Proc. Natl Acad. Sci. USA 120, e2300255120 (2023).
pubmed: 37819985
pmcid: 10589651
doi: 10.1073/pnas.2300255120
Coudé, G. et al. Neurons controlling voluntary vocalization in the Macaque ventral premotor cortex. PLoS ONE 6, e26822 (2011).
pubmed: 22073201
pmcid: 3206851
doi: 10.1371/journal.pone.0026822
Hahnloser, R. H. R., Kozhevnikov, A. A. & Fee, M. S. An ultra-sparse code underlies the generation of neural sequences in a songbird. Nature 419, 65–70 (2002).
Aronov, D., Andalman, A. S. & Fee, M. S. A specialized forebrain circuit for vocal babbling in the juvenile songbird. Science 320, 630–634 (2008).
pubmed: 18451295
doi: 10.1126/science.1155140
Stavisky, S. D. et al. Neural ensemble dynamics in dorsal motor cortex during speech in people with paralysis. eLife 8, e46015 (2019).
pubmed: 31820736
pmcid: 6954053
doi: 10.7554/eLife.46015
Tankus, A., Fried, I. & Shoham, S. Structured neuronal encoding and decoding of human speech features. Nat. Commun. 3, 1015 (2012).
pubmed: 22910361
doi: 10.1038/ncomms1995
Basilakos, A., Smith, K. G., Fillmore, P., Fridriksson, J. & Fedorenko, E. Functional characterization of the human speech articulation network. Cereb. Cortex 28, 1816–1830 (2018).
pubmed: 28453613
doi: 10.1093/cercor/bhx100
Keating, P. & Shattuck-Hufnagel, S. A prosodic view of word form encoding for speech production. UCLA Work. Pap. Phon. 101, 112–156 (1989).
Vyas, S., Golub, M. D., Sussillo, D. & Shenoy, K. V. Computation through neural population dynamics. Ann. Rev. Neurosci. 43, 249–275 (2020).
pubmed: 32640928
doi: 10.1146/annurev-neuro-092619-094115
Churchland, M. M., Cunningham, J. P., Kaufman, M. T., Ryu, S. I. & Shenoy, K. V. Cortical preparatory activity: representation of movement or first cog in a dynamical machine? Neuron 68, 387–400 (2010).
pubmed: 21040842
pmcid: 2991102
doi: 10.1016/j.neuron.2010.09.015
Shenoy, K. V., Sahani, M. & Churchland, M. M. Cortical control of arm movements: a dynamical systems perspective. Ann. Rev. Neurosci. 36, 337–359 (2013).
pubmed: 23725001
doi: 10.1146/annurev-neuro-062111-150509
Kaufman, M. T., Churchland, M. M., Ryu, S. I. & Shenoy, K. V. Cortical activity in the null space: permitting preparation without movement. Nat. Neurosci. 17, 440–448 (2014).
pubmed: 24487233
pmcid: 3955357
doi: 10.1038/nn.3643
Mante, V., Sussillo, D., Shenoy, K. V. & Newsome, W. T. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature 503, 78–84 (2013).
pubmed: 24201281
pmcid: 4121670
doi: 10.1038/nature12742
Vitevitch, M. S. & Luce, P. A. Phonological neighborhood effects in spoken word perception and production. Ann. Rev. Linguist. 2, 75–94 (2016).
Jamali, M. et al. Dorsolateral prefrontal neurons mediate subjective decisions and their variation in humans. Nat. Neurosci. 22, 1010–1020 (2019).
Mian, M. K. et al. Encoding of rules by neurons in the human dorsolateral prefrontal cortex. Cereb. Cortex 24, 807–816 (2014).
Patel, S. R. et al. Studying task-related activity of individual neurons in the human brain. Nat. Protoc. 8, 949–957 (2013).
pubmed: 23598445
doi: 10.1038/nprot.2013.050
Sheth, S. A. et al. Human dorsal anterior cingulate cortex neurons mediate ongoing behavioural adaptation. Nature 488, 218–221 (2012).
pubmed: 22722841
pmcid: 3416924
doi: 10.1038/nature11239
Williams, Z. M., Bush, G., Rauch, S. L., Cosgrove, G. R. & Eskandar, E. N. Human anterior cingulate neurons and the integration of monetary reward with motor responses. Nat. Neurosci. 7, 1370–1375 (2004).
pubmed: 15558064
doi: 10.1038/nn1354
Jang, A. I., Wittig, J. H. Jr., Inati, S. K. & Zaghloul, K. A. Human cortical neurons in the anterior temporal lobe reinstate spiking activity during verbal memory retrieval. Curr. Biol. 27, 1700–1705 (2017).
pubmed: 28552361
pmcid: 5508588
doi: 10.1016/j.cub.2017.05.014
Ponce, C. R. et al. Evolving images for visual neurons using a deep generative network reveals coding principles and neuronal preferences. Cell 177, 999–1009 (2019).
pubmed: 31051108
pmcid: 6718199
doi: 10.1016/j.cell.2019.04.005
Yoshor, D., Ghose, G. M., Bosking, W. H., Sun, P. & Maunsell, J. H. Spatial attention does not strongly modulate neuronal responses in early human visual cortex. J. Neurosci. 27, 13205–13209 (2007).
pubmed: 18045914
pmcid: 6673408
doi: 10.1523/JNEUROSCI.2944-07.2007
Jamali, M. et al. Single-neuronal predictions of others’ beliefs in humans. Nature 591, 610–614 (2021).
Patel, S. R. et al. Studying task-related activity of individual neurons in the human brain. Nat. Protoc. 8, 949–957 (2013).
Hickok, G. & Poeppel, D. Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language. Cognition 92, 67–99 (2004).
pubmed: 15037127
doi: 10.1016/j.cognition.2003.10.011
Poologaindran, A., Lowe, S. R. & Sughrue, M. E. The cortical organization of language: distilling human connectome insights for supratentorial neurosurgery. J. Neurosurg. 134, 1959–1966 (2020).
pubmed: 32736348
doi: 10.3171/2020.5.JNS191281
Genon, S. et al. The heterogeneity of the left dorsal premotor cortex evidenced by multimodal connectivity-based parcellation and functional characterization. Neuroimage 170, 400–411 (2018).
pubmed: 28213119
doi: 10.1016/j.neuroimage.2017.02.034
Milton, C. K. et al. Parcellation-based anatomic model of the semantic network. Brain Behav. 11, e02065 (2021).
pubmed: 33599397
pmcid: 8035438
doi: 10.1002/brb3.2065
Basilakos, A., Smith, K. G., Fillmore, P., Fridriksson, J. & Fedorenko, E. Functional characterization of the human speech articulation network. Cereb. Cortex 28, 1816–1830 (2018).
Sun, H. et al. Functional segregation in the left premotor cortex in language processing: evidence from fMRI. J. Integr. Neurosci. 12, 221–233 (2013).
pubmed: 23869862
doi: 10.1142/S0219635213500131
Peeva, M. G. et al. Distinct representations of phonemes, syllables and supra-syllabic sequences in the speech production network. Neuroimage 50, 626–638 (2010).
pubmed: 20035884
doi: 10.1016/j.neuroimage.2009.12.065
Paulk, A. C. et al. Large-scale neural recordings with single neuron resolution using Neuropixels probes in human cortex. Nat. Neurosci. 25, 252–263 (2022).
pubmed: 35102333
doi: 10.1038/s41593-021-00997-0
Coughlin, B. et al. Modified Neuropixels probes for recording human neurophysiology in the operating room. Nat. Protoc. 18, 2927–2953 (2023).
pubmed: 37697108
doi: 10.1038/s41596-023-00871-2
Windolf, C. et al. Robust online multiband drift estimation in electrophysiology data.In Proc. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 1–5 (IEEE, Rhodes Island, 2023).
Mehri, A. & Jalaie, S. A systematic review on methods of evaluate sentence production deficits in agrammatic aphasia patients: validity and reliability issues. J. Res. Med. Sci. 19, 885–898 (2014).
pubmed: 25535505
pmcid: 4268199
Abbott, L. F. & Sejnowski, T. J. Neural Codes and Distributed Representations: Foundations of Neural Computation (MIT, 1999).
Green, D. M. & Swets, J. A. Signal Detection Theory and Psychophysics (Wiley, 1966).
Association, I. P. & Staff, I. P. A. Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet (Cambridge Univ. Press, 1999).
Indefrey, P. & Levelt, W. J. M. in The New Cognitive Neurosciences 2nd edn (ed. Gazzaniga, M. S.) 845–865 (MIT, 2000).
Slobin, D. I. Thinking for speaking. In Proc. 13th Annual Meeting of the Berkeley Linguistics Society (eds Aske, J. et al.) 435–445 (Berkeley Linguistics Society, 1987).
Pillon, A. Morpheme units in speech production: evidence from laboratory-induced verbal slips. Lang. Cogn. Proc. 13, 465–498 (1998).
doi: 10.1080/016909698386465
King, J. R. & Dehaene, S. Characterizing the dynamics of mental representations: the temporal generalization method. Trends Cogn. Sci. 18, 203–210 (2014).
pubmed: 24593982
pmcid: 5635958
doi: 10.1016/j.tics.2014.01.002
Machens, C. K., Romo, R. & Brody, C. D. Functional, but not anatomical, separation of “what” and “when” in prefrontal cortex. J. Neurosci. 30, 350–360 (2010).
pubmed: 20053916
pmcid: 2947945
doi: 10.1523/JNEUROSCI.3276-09.2010
Elsayed, G. F., Lara, A. H., Kaufman, M. T., Churchland, M. M. & Cunningham, J. P. Reorganization between preparatory and movement population responses in motor cortex. Nat. Commun. 7, 13239 (2016).
Roy, S., Zhao, L. & Wang, X. Distinct neural activities in premotor cortex during natural vocal behaviors in a New World primate, the Common Marmoset (Callithrix jacchus). J. Neurosci. 36, 12168–12179 (2016).
pubmed: 27903726
pmcid: 5148218
doi: 10.1523/JNEUROSCI.1646-16.2016
Eliades, S. J. & Miller, C. T. Marmoset vocal communication: behavior and neurobiology. Dev. Neurobiol. 77, 286–299 (2017).
pubmed: 27739195
doi: 10.1002/dneu.22464
Okobi, D. E. Jr, Banerjee, A., Matheson, A. M. M., Phelps, S. M. & Long, M. A. Motor cortical control of vocal interaction in neotropical singing mice. Science 363, 983–988 (2019).
pubmed: 30819963
doi: 10.1126/science.aau9480
Cohen, Y. et al. Hidden neural states underlie canary song syntax. Nature 582, 539–544 (2020).
pubmed: 32555461
pmcid: 7380505
doi: 10.1038/s41586-020-2397-3
Hickok, G. Computational neuroanatomy of speech production. Nat. Rev. Neurosci. 13, 135–145 (2012).
pubmed: 22218206
pmcid: 5367153
doi: 10.1038/nrn3158
Sahin, N. T., Pinker, S., Cash, S. S., Schomer, D. & Halgren, E. Sequential processing of lexical, grammatical and phonological information within Broca’s area. Science 326, 445–449 (2009).
pubmed: 19833971
pmcid: 4030760
doi: 10.1126/science.1174481
Russo, A. A. et al. Neural trajectories in the supplementary motor area and motor cortex exhibit distinct geometries, compatible with different classes of computation. Neuron 107, 745–758 (2020).
pubmed: 32516573
pmcid: 9395139
doi: 10.1016/j.neuron.2020.05.020
Willett, F. R. et al. A high-performance speech neuroprosthesis. Nature 620, 1031–1036 (2023).
pubmed: 37612500
pmcid: 10468393
doi: 10.1038/s41586-023-06377-x
Boersma, P. & Weenink, D. Praat: Doing Phonetics by Computer (2020); www.fon.hum.uva.nl/praat/ .
McAuliffe, M., Socolof, M., Mihuc, S., Wagner, M. & Sonderegger, M. Montreal forced aligner: trainable text-speech alignment using kaldi. In Proc. Annual Conference of the International Speech Communication Association 498–502 (ISCA, 2017).
Lancaster, J. L. et al. Automated regional behavioral analysis for human brain images. Front. Neuroinform. 6, 23 (2012).
pubmed: 22973224
pmcid: 3428588
doi: 10.3389/fninf.2012.00023
Lancaster, J. L. et al. Automated analysis of fundamental features of brain structures. Neuroinformatics 9, 371–380 (2011).
pubmed: 21360205
pmcid: 3738178
doi: 10.1007/s12021-011-9108-z
Fischl, B. & Dale, A. M. Measuring the thickness of the human cerebral cortex from magnetic resonance images. Proc. Natl Acad. Sci. USA 97, 11050–11055 (2000).
pubmed: 10984517
pmcid: 27146
doi: 10.1073/pnas.200033797
Fischl, B., Liu, A. & Dale, A. M. Automated manifold surgery: constructing geometrically accurate and topologically correct models of the human cerebral cortex. IEEE Trans. Med. Imaging 20, 70–80 (2001).
pubmed: 11293693
doi: 10.1109/42.906426
Reuter, M., Schmansky, N. J., Rosas, H. D. & Fischl, B. Within-subject template estimation for unbiased longitudinal image analysis. Neuroimage 61, 1402–1418 (2012).
pubmed: 22430496
doi: 10.1016/j.neuroimage.2012.02.084
Oostenveld, R., Fries, P., Maris, E. & Schoffelen, J. M. FieldTrip: open source software for advanced analysis of MEG, EEG and invasive electrophysiological data. Comput. Intell. Neurosci. 2011, 156869 (2011).
pubmed: 21253357
doi: 10.1155/2011/156869
Noiray, A., Iskarous, K., Bolanos, L. & Whalen, D. Tongue–jaw synergy in vowel height production: evidence from American English. In 8th International Seminar on Speech Production (eds Sock, R. et al.) 81–84 (ISSP, 2008).
Flege, J. E., Fletcher, S. G., McCutcheon, M. J. & Smith, S. C. The physiological specification of American English vowels. Lang. Speech 29, 361–388 (1986).
pubmed: 3444366
doi: 10.1177/002383098602900404
Wells, J. Longman Pronunciation Dictionary (Pearson, 2008).
Seabold, S. & Perktold, J. Statsmodels: econometric and statistical modeling with Python. In Proc. 9th Python in Science Conference (eds van der Walt, S. & Millman, J.) 92–96 (SCIPY, 2010).
Cameron, A. C. & Windmeijer, F. A. G. An R-squared measure of goodness of fit for some common nonlinear regression models. J. Econometr. 77, 329–342 (1997).
doi: 10.1016/S0304-4076(96)01818-0
Hamilton, L. S. & Huth, A. G. The revolution will not be controlled: natural stimuli in speech neuroscience. Lang. Cogn. Neurosci. 35, 573–582 (2020).
pubmed: 32656294
doi: 10.1080/23273798.2018.1499946
Hamilton, L. S., Oganian, Y., Hall, J. & Chang, E. F. Parallel and distributed encoding of speech across human auditory cortex. Cell 184, 4626–4639 (2021).
pubmed: 34411517
pmcid: 8456481
doi: 10.1016/j.cell.2021.07.019
Van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Ye, K. & Lim, L.-H. Schubert varieties and distances between subspaces of different dimensions. SIAM J. Matrix Anal. Appl. 37, 1176–1197 (2016).
doi: 10.1137/15M1054201