Speech imagery decoding as a window to speech planning and production

Joan Orpella,Francesco Mantegna,M. Florencia Assaneo,David Poeppel,Florencia Assaneo
DOI: https://doi.org/10.1101/2022.05.30.494046
2022-06-01
bioRxiv
Abstract:Speech imagery (the ability to generate internally quasi-perceptual experiences of speech events) is a fundamental ability tightly linked to important cognitive functions such as inner speech, phonological working memory, and predictive processing. Speech imagery is also considered an ideal tool to test theories of overt speech. Despite its pervasive nature, the study and use of speech imagery for clinical or basic research has been tremendously challenging, primarily because of the lack of behavioral outputs and the difficulty in temporally aligning imagery events across trials and individuals. Here we used magnetoencephalography (MEG) paired with time-resolved decoding and a novel behavioral protocol to map out the processing stages underlying speech imagery. We monitored participants' upper lip and jaw micromovements during imagery using electromyography. Decoding of participants' imagined syllables revealed a rapid sequence of representations from visual encoding to the imagined speech event. Importantly, participants' micromovements did not discriminate between the syllables. The neural correlates of the decoded sequence maps neatly onto the predictions of current computational models of speech motor control and provide some evidence for hypothesized internal and external feedback loops for speech planning and production, respectively. Additionally, a windowed multinomial classification (WMC) analysis revealed the presence of two nested and concurrent levels of representation (syllable and consonant-vowel transition) and the compressed nature of representations during planning. It is assumed that the same sequence underlies the motor-based generation of sensory predictions that modulate speech perception and the articulatory loop of phonological working memory. The results highlight the potential of speech imagery for different research domains, based on these new experimental approaches and analytical methods, and further pave the way for successful non-invasive brain-computer interfaces.
What problem does this paper attempt to address?