Single-neuronal elements of speech production in humans.


Journal

Nature
ISSN: 1476-4687
Titre abrégé: Nature
Pays: England
ID NLM: 0410462

Informations de publication

Date de publication:
31 Jan 2024
Historique:
received: 22 06 2023
accepted: 14 12 2023
medline: 1 2 2024
pubmed: 1 2 2024
entrez: 31 1 2024
Statut: aheadofprint

Résumé

Humans are capable of generating extraordinarily diverse articulatory movement combinations to produce meaningful speech. This ability to orchestrate specific phonetic sequences, and their syllabification and inflection over subsecond timescales allows us to produce thousands of word sounds and is a core component of language

Identifiants

pubmed: 38297120
doi: 10.1038/s41586-023-06982-w
pii: 10.1038/s41586-023-06982-w
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© 2024. The Author(s).

Références

Levelt, W. J. M., Roelofs, A. & Meyer, A. S. A Theory of Lexical Access in Speech Production Vol. 22 (Cambridge Univ. Press, 1999).
Kazanina, N., Bowers, J. S. & Idsardi, W. Phonemes: lexical access and beyond. Psychon. Bull. Rev. 25, 560–585 (2018).
pubmed: 28875456 doi: 10.3758/s13423-017-1362-0
Bohland, J. W. & Guenther, F. H. An fMRI investigation of syllable sequence production. NeuroImage 32, 821–841 (2006).
pubmed: 16730195 doi: 10.1016/j.neuroimage.2006.04.173
Basilakos, A., Smith, K. G., Fillmore, P., Fridriksson, J. & Fedorenko, E. Functional characterization of the human speech articulation network. Cereb. Cortex 28, 1816–1830 (2017).
pmcid: 5907347 doi: 10.1093/cercor/bhx100
Tourville, J. A., Nieto-Castañón, A., Heyne, M. & Guenther, F. H. Functional parcellation of the speech production cortex. J. Speech Lang. Hear. Res. 62, 3055–3070 (2019).
pubmed: 31465713 pmcid: 6813033 doi: 10.1044/2019_JSLHR-S-CSMC7-18-0442
Lee, D. K. et al. Neural encoding and production of functional morphemes in the posterior temporal lobe. Nat. Commun. 9, 1877 (2018).
pubmed: 29760465 pmcid: 5951905 doi: 10.1038/s41467-018-04235-3
Glanz, O., Hader, M., Schulze-Bonhage, A., Auer, P. & Ball, T. A study of word complexity under conditions of non-experimental, natural overt speech production using ECoG. Front. Hum. Neurosci. 15, 711886 (2021).
pubmed: 35185491 doi: 10.3389/fnhum.2021.711886
Yellapantula, S., Forseth, K., Tandon, N. & Aazhang, B. NetDI: methodology elucidating the role of power and dynamical brain network features that underpin word production. eNeuro 8, ENEURO.0177-20.2020 (2020).
Hoffman, P. Reductions in prefrontal activation predict off-topic utterances during speech production. Nat. Commun. 10, 515 (2019).
pubmed: 30705284 pmcid: 6355898 doi: 10.1038/s41467-019-08519-0
Glasser, M. F. et al. A multi-modal parcellation of human cerebral cortex. Nature 536, 171–178 (2016).
pubmed: 27437579 pmcid: 4990127 doi: 10.1038/nature18933
Chang, E. F. et al. Pure apraxia of speech after resection based in the posterior middle frontal gyrus. Neurosurgery 87, E383–E389 (2020).
pubmed: 32097489 pmcid: 7690655 doi: 10.1093/neuros/nyaa002
Hazem, S. R. et al. Middle frontal gyrus and area 55b: perioperative mapping and language outcomes. Front. Neurol. 12, 646075 (2021).
pubmed: 33776898 pmcid: 7988187 doi: 10.3389/fneur.2021.646075
Fedorenko, E. et al. Neural correlate of the construction of sentence meaning. Proc. Natl Acad. Sci. USA 113, E6256–E6262 (2016).
pubmed: 27671642 pmcid: 5068329 doi: 10.1073/pnas.1612132113
Nelson, M. J. et al. Neurophysiological dynamics of phrase-structure building during sentence processing. Proc. Natl Acad. Sci. USA 114, E3669–E3678 (2017).
pubmed: 28416691 pmcid: 5422821 doi: 10.1073/pnas.1701590114
Walenski, M., Europa, E., Caplan, D. & Thompson, C. K. Neural networks for sentence comprehension and production: an ALE-based meta-analysis of neuroimaging studies. Hum. Brain Mapp. 40, 2275–2304 (2019).
pubmed: 30689268 pmcid: 6597252 doi: 10.1002/hbm.24523
Elin, K. et al. A new functional magnetic resonance imaging localizer for preoperative language mapping using a sentence completion task: validity, choice of baseline condition and test–retest reliability. Front. Hum. Neurosci. 16, 791577 (2022).
pubmed: 35431846 pmcid: 9006995 doi: 10.3389/fnhum.2022.791577
Duffau, H. et al. The role of dominant premotor cortex in language: a study using intraoperative functional mapping in awake patients. Neuroimage 20, 1903–1914 (2003).
pubmed: 14683696 doi: 10.1016/S1053-8119(03)00203-9
Ikeda, S. et al. Neural decoding of single vowels during covert articulation using electrocorticography. Front. Hum. Neurosci. 8, 125 (2014).
pubmed: 24639642 pmcid: 3945950 doi: 10.3389/fnhum.2014.00125
Ghosh, S. S., Tourville, J. A. & Guenther, F. H. A neuroimaging study of premotor lateralization and cerebellar involvement in the production of phonemes and syllables. J. Speech Lang. Hear. Res. 51, 1183–1202 (2008).
pubmed: 18664692 doi: 10.1044/1092-4388(2008/07-0119)
Bouchard, K. E., Mesgarani, N., Johnson, K. & Chang, E. F. Functional organization of human sensorimotor cortex for speech articulation. Nature 495, 327–332 (2013).
pubmed: 23426266 pmcid: 3606666 doi: 10.1038/nature11911
Anumanchipalli, G. K., Chartier, J. & Chang, E. F. Speech synthesis from neural decoding of spoken sentences. Nature 568, 493–498 (2019).
pubmed: 31019317 pmcid: 9714519 doi: 10.1038/s41586-019-1119-1
Moses, D. A. et al. Neuroprosthesis for decoding speech in a paralyzed person with anarthria. N. Engl. J. Med. 385, 217–227 (2021).
pubmed: 34260835 pmcid: 8972947 doi: 10.1056/NEJMoa2027540
Wang, R. et al. Distributed feedforward and feedback cortical processing supports human speech production. Proc. Natl Acad. Sci. USA 120, e2300255120 (2023).
pubmed: 37819985 pmcid: 10589651 doi: 10.1073/pnas.2300255120
Coudé, G. et al. Neurons controlling voluntary vocalization in the Macaque ventral premotor cortex. PLoS ONE 6, e26822 (2011).
pubmed: 22073201 pmcid: 3206851 doi: 10.1371/journal.pone.0026822
Hahnloser, R. H. R., Kozhevnikov, A. A. & Fee, M. S. An ultra-sparse code underlies the generation of neural sequences in a songbird. Nature 419, 65–70 (2002).
Aronov, D., Andalman, A. S. & Fee, M. S. A specialized forebrain circuit for vocal babbling in the juvenile songbird. Science 320, 630–634 (2008).
pubmed: 18451295 doi: 10.1126/science.1155140
Stavisky, S. D. et al. Neural ensemble dynamics in dorsal motor cortex during speech in people with paralysis. eLife 8, e46015 (2019).
pubmed: 31820736 pmcid: 6954053 doi: 10.7554/eLife.46015
Tankus, A., Fried, I. & Shoham, S. Structured neuronal encoding and decoding of human speech features. Nat. Commun. 3, 1015 (2012).
pubmed: 22910361 doi: 10.1038/ncomms1995
Basilakos, A., Smith, K. G., Fillmore, P., Fridriksson, J. & Fedorenko, E. Functional characterization of the human speech articulation network. Cereb. Cortex 28, 1816–1830 (2018).
pubmed: 28453613 doi: 10.1093/cercor/bhx100
Keating, P. & Shattuck-Hufnagel, S. A prosodic view of word form encoding for speech production. UCLA Work. Pap. Phon. 101, 112–156 (1989).
Vyas, S., Golub, M. D., Sussillo, D. & Shenoy, K. V. Computation through neural population dynamics. Ann. Rev. Neurosci. 43, 249–275 (2020).
pubmed: 32640928 doi: 10.1146/annurev-neuro-092619-094115
Churchland, M. M., Cunningham, J. P., Kaufman, M. T., Ryu, S. I. & Shenoy, K. V. Cortical preparatory activity: representation of movement or first cog in a dynamical machine? Neuron 68, 387–400 (2010).
pubmed: 21040842 pmcid: 2991102 doi: 10.1016/j.neuron.2010.09.015
Shenoy, K. V., Sahani, M. & Churchland, M. M. Cortical control of arm movements: a dynamical systems perspective. Ann. Rev. Neurosci. 36, 337–359 (2013).
pubmed: 23725001 doi: 10.1146/annurev-neuro-062111-150509
Kaufman, M. T., Churchland, M. M., Ryu, S. I. & Shenoy, K. V. Cortical activity in the null space: permitting preparation without movement. Nat. Neurosci. 17, 440–448 (2014).
pubmed: 24487233 pmcid: 3955357 doi: 10.1038/nn.3643
Mante, V., Sussillo, D., Shenoy, K. V. & Newsome, W. T. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature 503, 78–84 (2013).
pubmed: 24201281 pmcid: 4121670 doi: 10.1038/nature12742
Vitevitch, M. S. & Luce, P. A. Phonological neighborhood effects in spoken word perception and production. Ann. Rev. Linguist. 2, 75–94 (2016).
Jamali, M. et al. Dorsolateral prefrontal neurons mediate subjective decisions and their variation in humans. Nat. Neurosci. 22, 1010–1020 (2019).
Mian, M. K. et al. Encoding of rules by neurons in the human dorsolateral prefrontal cortex. Cereb. Cortex 24, 807–816 (2014).
Patel, S. R. et al. Studying task-related activity of individual neurons in the human brain. Nat. Protoc. 8, 949–957 (2013).
pubmed: 23598445 doi: 10.1038/nprot.2013.050
Sheth, S. A. et al. Human dorsal anterior cingulate cortex neurons mediate ongoing behavioural adaptation. Nature 488, 218–221 (2012).
pubmed: 22722841 pmcid: 3416924 doi: 10.1038/nature11239
Williams, Z. M., Bush, G., Rauch, S. L., Cosgrove, G. R. & Eskandar, E. N. Human anterior cingulate neurons and the integration of monetary reward with motor responses. Nat. Neurosci. 7, 1370–1375 (2004).
pubmed: 15558064 doi: 10.1038/nn1354
Jang, A. I., Wittig, J. H. Jr., Inati, S. K. & Zaghloul, K. A. Human cortical neurons in the anterior temporal lobe reinstate spiking activity during verbal memory retrieval. Curr. Biol. 27, 1700–1705 (2017).
pubmed: 28552361 pmcid: 5508588 doi: 10.1016/j.cub.2017.05.014
Ponce, C. R. et al. Evolving images for visual neurons using a deep generative network reveals coding principles and neuronal preferences. Cell 177, 999–1009 (2019).
pubmed: 31051108 pmcid: 6718199 doi: 10.1016/j.cell.2019.04.005
Yoshor, D., Ghose, G. M., Bosking, W. H., Sun, P. & Maunsell, J. H. Spatial attention does not strongly modulate neuronal responses in early human visual cortex. J. Neurosci. 27, 13205–13209 (2007).
pubmed: 18045914 pmcid: 6673408 doi: 10.1523/JNEUROSCI.2944-07.2007
Jamali, M. et al. Single-neuronal predictions of others’ beliefs in humans. Nature 591, 610–614 (2021).
Patel, S. R. et al. Studying task-related activity of individual neurons in the human brain. Nat. Protoc. 8, 949–957 (2013).
Hickok, G. & Poeppel, D. Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language. Cognition 92, 67–99 (2004).
pubmed: 15037127 doi: 10.1016/j.cognition.2003.10.011
Poologaindran, A., Lowe, S. R. & Sughrue, M. E. The cortical organization of language: distilling human connectome insights for supratentorial neurosurgery. J. Neurosurg. 134, 1959–1966 (2020).
pubmed: 32736348 doi: 10.3171/2020.5.JNS191281
Genon, S. et al. The heterogeneity of the left dorsal premotor cortex evidenced by multimodal connectivity-based parcellation and functional characterization. Neuroimage 170, 400–411 (2018).
pubmed: 28213119 doi: 10.1016/j.neuroimage.2017.02.034
Milton, C. K. et al. Parcellation-based anatomic model of the semantic network. Brain Behav. 11, e02065 (2021).
pubmed: 33599397 pmcid: 8035438 doi: 10.1002/brb3.2065
Basilakos, A., Smith, K. G., Fillmore, P., Fridriksson, J. & Fedorenko, E. Functional characterization of the human speech articulation network. Cereb. Cortex 28, 1816–1830 (2018).
Sun, H. et al. Functional segregation in the left premotor cortex in language processing: evidence from fMRI. J. Integr. Neurosci. 12, 221–233 (2013).
pubmed: 23869862 doi: 10.1142/S0219635213500131
Peeva, M. G. et al. Distinct representations of phonemes, syllables and supra-syllabic sequences in the speech production network. Neuroimage 50, 626–638 (2010).
pubmed: 20035884 doi: 10.1016/j.neuroimage.2009.12.065
Paulk, A. C. et al. Large-scale neural recordings with single neuron resolution using Neuropixels probes in human cortex. Nat. Neurosci. 25, 252–263 (2022).
pubmed: 35102333 doi: 10.1038/s41593-021-00997-0
Coughlin, B. et al. Modified Neuropixels probes for recording human neurophysiology in the operating room. Nat. Protoc. 18, 2927–2953 (2023).
pubmed: 37697108 doi: 10.1038/s41596-023-00871-2
Windolf, C. et al. Robust online multiband drift estimation in electrophysiology data.In Proc. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 1–5 (IEEE, Rhodes Island, 2023).
Mehri, A. & Jalaie, S. A systematic review on methods of evaluate sentence production deficits in agrammatic aphasia patients: validity and reliability issues. J. Res. Med. Sci. 19, 885–898 (2014).
pubmed: 25535505 pmcid: 4268199
Abbott, L. F. & Sejnowski, T. J. Neural Codes and Distributed Representations: Foundations of Neural Computation (MIT, 1999).
Green, D. M. & Swets, J. A. Signal Detection Theory and Psychophysics (Wiley, 1966).
Association, I. P. & Staff, I. P. A. Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet (Cambridge Univ. Press, 1999).
Indefrey, P. & Levelt, W. J. M. in The New Cognitive Neurosciences 2nd edn (ed. Gazzaniga, M. S.) 845–865 (MIT, 2000).
Slobin, D. I. Thinking for speaking. In Proc. 13th Annual Meeting of the Berkeley Linguistics Society (eds Aske, J. et al.) 435–445 (Berkeley Linguistics Society, 1987).
Pillon, A. Morpheme units in speech production: evidence from laboratory-induced verbal slips. Lang. Cogn. Proc. 13, 465–498 (1998).
doi: 10.1080/016909698386465
King, J. R. & Dehaene, S. Characterizing the dynamics of mental representations: the temporal generalization method. Trends Cogn. Sci. 18, 203–210 (2014).
pubmed: 24593982 pmcid: 5635958 doi: 10.1016/j.tics.2014.01.002
Machens, C. K., Romo, R. & Brody, C. D. Functional, but not anatomical, separation of “what” and “when” in prefrontal cortex. J. Neurosci. 30, 350–360 (2010).
pubmed: 20053916 pmcid: 2947945 doi: 10.1523/JNEUROSCI.3276-09.2010
Elsayed, G. F., Lara, A. H., Kaufman, M. T., Churchland, M. M. & Cunningham, J. P. Reorganization between preparatory and movement population responses in motor cortex. Nat. Commun. 7, 13239 (2016).
Roy, S., Zhao, L. & Wang, X. Distinct neural activities in premotor cortex during natural vocal behaviors in a New World primate, the Common Marmoset (Callithrix jacchus). J. Neurosci. 36, 12168–12179 (2016).
pubmed: 27903726 pmcid: 5148218 doi: 10.1523/JNEUROSCI.1646-16.2016
Eliades, S. J. & Miller, C. T. Marmoset vocal communication: behavior and neurobiology. Dev. Neurobiol. 77, 286–299 (2017).
pubmed: 27739195 doi: 10.1002/dneu.22464
Okobi, D. E. Jr, Banerjee, A., Matheson, A. M. M., Phelps, S. M. & Long, M. A. Motor cortical control of vocal interaction in neotropical singing mice. Science 363, 983–988 (2019).
pubmed: 30819963 doi: 10.1126/science.aau9480
Cohen, Y. et al. Hidden neural states underlie canary song syntax. Nature 582, 539–544 (2020).
pubmed: 32555461 pmcid: 7380505 doi: 10.1038/s41586-020-2397-3
Hickok, G. Computational neuroanatomy of speech production. Nat. Rev. Neurosci. 13, 135–145 (2012).
pubmed: 22218206 pmcid: 5367153 doi: 10.1038/nrn3158
Sahin, N. T., Pinker, S., Cash, S. S., Schomer, D. & Halgren, E. Sequential processing of lexical, grammatical and phonological information within Broca’s area. Science 326, 445–449 (2009).
pubmed: 19833971 pmcid: 4030760 doi: 10.1126/science.1174481
Russo, A. A. et al. Neural trajectories in the supplementary motor area and motor cortex exhibit distinct geometries, compatible with different classes of computation. Neuron 107, 745–758 (2020).
pubmed: 32516573 pmcid: 9395139 doi: 10.1016/j.neuron.2020.05.020
Willett, F. R. et al. A high-performance speech neuroprosthesis. Nature 620, 1031–1036 (2023).
pubmed: 37612500 pmcid: 10468393 doi: 10.1038/s41586-023-06377-x
Boersma, P. & Weenink, D. Praat: Doing Phonetics by Computer (2020); www.fon.hum.uva.nl/praat/ .
McAuliffe, M., Socolof, M., Mihuc, S., Wagner, M. & Sonderegger, M. Montreal forced aligner: trainable text-speech alignment using kaldi. In Proc. Annual Conference of the International Speech Communication Association 498–502 (ISCA, 2017).
Lancaster, J. L. et al. Automated regional behavioral analysis for human brain images. Front. Neuroinform. 6, 23 (2012).
pubmed: 22973224 pmcid: 3428588 doi: 10.3389/fninf.2012.00023
Lancaster, J. L. et al. Automated analysis of fundamental features of brain structures. Neuroinformatics 9, 371–380 (2011).
pubmed: 21360205 pmcid: 3738178 doi: 10.1007/s12021-011-9108-z
Fischl, B. & Dale, A. M. Measuring the thickness of the human cerebral cortex from magnetic resonance images. Proc. Natl Acad. Sci. USA 97, 11050–11055 (2000).
pubmed: 10984517 pmcid: 27146 doi: 10.1073/pnas.200033797
Fischl, B., Liu, A. & Dale, A. M. Automated manifold surgery: constructing geometrically accurate and topologically correct models of the human cerebral cortex. IEEE Trans. Med. Imaging 20, 70–80 (2001).
pubmed: 11293693 doi: 10.1109/42.906426
Reuter, M., Schmansky, N. J., Rosas, H. D. & Fischl, B. Within-subject template estimation for unbiased longitudinal image analysis. Neuroimage 61, 1402–1418 (2012).
pubmed: 22430496 doi: 10.1016/j.neuroimage.2012.02.084
Oostenveld, R., Fries, P., Maris, E. & Schoffelen, J. M. FieldTrip: open source software for advanced analysis of MEG, EEG and invasive electrophysiological data. Comput. Intell. Neurosci. 2011, 156869 (2011).
pubmed: 21253357 doi: 10.1155/2011/156869
Noiray, A., Iskarous, K., Bolanos, L. & Whalen, D. Tongue–jaw synergy in vowel height production: evidence from American English. In 8th International Seminar on Speech Production (eds Sock, R. et al.) 81–84 (ISSP, 2008).
Flege, J. E., Fletcher, S. G., McCutcheon, M. J. & Smith, S. C. The physiological specification of American English vowels. Lang. Speech 29, 361–388 (1986).
pubmed: 3444366 doi: 10.1177/002383098602900404
Wells, J. Longman Pronunciation Dictionary (Pearson, 2008).
Seabold, S. & Perktold, J. Statsmodels: econometric and statistical modeling with Python. In Proc. 9th Python in Science Conference (eds van der Walt, S. & Millman, J.) 92–96 (SCIPY, 2010).
Cameron, A. C. & Windmeijer, F. A. G. An R-squared measure of goodness of fit for some common nonlinear regression models. J. Econometr. 77, 329–342 (1997).
doi: 10.1016/S0304-4076(96)01818-0
Hamilton, L. S. & Huth, A. G. The revolution will not be controlled: natural stimuli in speech neuroscience. Lang. Cogn. Neurosci. 35, 573–582 (2020).
pubmed: 32656294 doi: 10.1080/23273798.2018.1499946
Hamilton, L. S., Oganian, Y., Hall, J. & Chang, E. F. Parallel and distributed encoding of speech across human auditory cortex. Cell 184, 4626–4639 (2021).
pubmed: 34411517 pmcid: 8456481 doi: 10.1016/j.cell.2021.07.019
Van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Ye, K. & Lim, L.-H. Schubert varieties and distances between subspaces of different dimensions. SIAM J. Matrix Anal. Appl. 37, 1176–1197 (2016).
doi: 10.1137/15M1054201

Auteurs

Arjun R Khanna (AR)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

William Muñoz (W)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Young Joon Kim (YJ)

Harvard Medical School, Boston, MA, USA.

Yoav Kfir (Y)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Angelique C Paulk (AC)

Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Mohsen Jamali (M)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Jing Cai (J)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Martina L Mustroph (ML)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Irene Caprara (I)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Richard Hardstone (R)

Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Mackenna Mejdell (M)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Domokos Meszéna (D)

Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Abigail Zuckerman (A)

Harvard Medical School, Boston, MA, USA.

Jeffrey Schweitzer (J)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Sydney Cash (S)

Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.

Ziv M Williams (ZM)

Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA. zwilliams@mgh.harvard.edu.
Harvard-MIT Division of Health Sciences and Technology, Boston, MA, USA. zwilliams@mgh.harvard.edu.
Harvard Medical School, Program in Neuroscience, Boston, MA, USA. zwilliams@mgh.harvard.edu.

Classifications MeSH