Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
- PMID: 31824257
- PMCID: PMC6882773
- DOI: 10.3389/fnins.2019.01267
Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices
Abstract
Neural interfaces that directly produce intelligible speech from brain activity would allow people with severe impairment from neurological disorders to communicate more naturally. Here, we record neural population activity in motor, premotor and inferior frontal cortices during speech production using electrocorticography (ECoG) and show that ECoG signals alone can be used to generate intelligible speech output that can preserve conversational cues. To produce speech directly from neural data, we adapted a method from the field of speech synthesis called unit selection, in which units of speech are concatenated to form audible output. In our approach, which we call Brain-To-Speech, we chose subsequent units of speech based on the measured ECoG activity to generate audio waveforms directly from the neural recordings. Brain-To-Speech employed the user's own voice to generate speech that sounded very natural and included features such as prosody and accentuation. By investigating the brain areas involved in speech production separately, we found that speech motor cortex provided more information for the reconstruction process than the other cortical areas.
Keywords: BCI; ECoG; brain-computer interface; brain-to-speech; speech; synthesis.
Copyright © 2019 Herff, Diener, Angrick, Mugler, Tate, Goldrick, Krusienski, Slutzky and Schultz.
Figures
Similar articles
-
Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.Sci Rep. 2024 Apr 26;14(1):9617. doi: 10.1038/s41598-024-60277-2. Sci Rep. 2024. PMID: 38671062 Free PMC article.
-
Recording human electrocorticographic (ECoG) signals for neuroscientific research and real-time functional cortical mapping.J Vis Exp. 2012 Jun 26;(64):3993. doi: 10.3791/3993. J Vis Exp. 2012. PMID: 22782131 Free PMC article.
-
Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication.Neurotherapeutics. 2022 Jan;19(1):263-273. doi: 10.1007/s13311-022-01190-2. Epub 2022 Jan 31. Neurotherapeutics. 2022. PMID: 35099768 Free PMC article. Review.
-
Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models.J Neural Eng. 2023 Sep 20;20(5):056010. doi: 10.1088/1741-2552/ace8be. J Neural Eng. 2023. PMID: 37467739 Free PMC article.
-
The Potential for a Speech Brain-Computer Interface Using Chronic Electrocorticography.Neurotherapeutics. 2019 Jan;16(1):144-165. doi: 10.1007/s13311-018-00692-2. Neurotherapeutics. 2019. PMID: 30617653 Free PMC article. Review.
Cited by
-
Iterative alignment discovery of speech-associated neural activity.J Neural Eng. 2024 Aug 28;21(4):046056. doi: 10.1088/1741-2552/ad663c. J Neural Eng. 2024. PMID: 39194182 Free PMC article.
-
A bilingual speech neuroprosthesis driven by cortical articulatory representations shared between languages.Nat Biomed Eng. 2024 Aug;8(8):977-991. doi: 10.1038/s41551-024-01207-5. Epub 2024 May 20. Nat Biomed Eng. 2024. PMID: 38769157
-
Representation of internal speech by single neurons in human supramarginal gyrus.Nat Hum Behav. 2024 Jun;8(6):1136-1149. doi: 10.1038/s41562-024-01867-y. Epub 2024 May 13. Nat Hum Behav. 2024. PMID: 38740984 Free PMC article.
-
A flexible intracortical brain-computer interface for typing using finger movements.bioRxiv [Preprint]. 2024 Apr 26:2024.04.22.590630. doi: 10.1101/2024.04.22.590630. bioRxiv. 2024. PMID: 38712189 Free PMC article. Preprint.
-
Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.Sci Rep. 2024 Apr 26;14(1):9617. doi: 10.1038/s41598-024-60277-2. Sci Rep. 2024. PMID: 38671062 Free PMC article.
References
-
- Black A. W., Taylor P. A. (1997). Automatically clustering similar units for unit selection in speech synthesis. EUROSPEECH (Rhodes: ), 601–604.
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous