Abstract:Building computational models to account for the cortical representation of language plays an important role in understanding the human linguistic system. Recent progress in distributed semantic models (DSMs), especially transformer-based methods, has driven advances in many language understanding tasks, making DSM a promising methodology to probe brain language processing. DSMs have been shown to reliably explain cortical responses to word stimuli. However, characterizing the brain activities for sentence processing is much less exhaustively explored with DSMs, especially the deep neural network-based methods. What is the relationship between cortical sentence representations against DSMs? What linguistic features that a DSM catches better explain its correlation with the brain activities aroused by sentence stimuli? Could distributed sentence representations help to reveal the semantic selectivity of different brain areas? We address these questions through the lens of neural encoding and decoding, fueled by the latest developments in natural language representation learning. We begin by evaluating the ability of a wide range of 12 DSMs to predict and decipher the functional magnetic resonance imaging (fMRI) images from humans reading sentences. Most models deliver high accuracy in the left middle temporal gyrus (LMTG) and left occipital complex (LOC). Notably, encoders trained with transformer-based DSMs consistently outperform other unsupervised structured models and all the unstructured baselines. With probing and ablation tasks, we further find that differences in the performance of the DSMs in modeling brain activities can be at least partially explained by the granularity of their semantic representations. We also illustrate the DSM's selectivity for concept categories and show that the topics are represented by spatially overlapping and distributed cortical patterns. Our results corroborate and extend previous findings in understanding t-e relation between DSMs and neural activation patterns and contribute to building solid brain–machine interfaces with deep neural network representations.

DRWS: A Model for Learning Distributed Representations for Words and Sentences.

Dependency-based Siamese Long Short-Term Memory Network for Learning Sentence Representations.

A Model of Coherence Based on Distributed Sentence Representation.

Automatic Document Summarization Via Deep Neural Networks

Incorporating Linguistic Knowledge for Learning Distributed Word Representations.

Voxel2vec: A Natural Language Processing Approach to Learning Distributed Representations for Scientific Data

Voxel2vec: A Natural Language Processing Approach to Learning Distributed Representations for Scientific Data.

Sparse Lifting of Dense Vectors: Unifying Word and Sentence Representations

Learning Word Representations by Jointly Modeling Syntagmatic and Paradigmatic Relations.

A Unified Framework for Jointly Learning Distributed Representations of Word and Attributes.

Topic Modeling Using Distributed Word Embeddings

Measuring Word Significance using Distributed Representations of Words

Query and Output: Generating Words by Querying Distributed Word Representations for Paraphrase Generation

Representing Sentences as Low-Rank Subspaces

Non-distributional Word Vector Representations

Neural Encoding and Decoding With Distributed Sentence Representations

Recurrent Neural Network Language Model Adaptation Derived Document Vector

DRr-Net: Dynamic Re-Read Network for Sentence Semantic Matching.

A comparative study for wordnet guided text representation

Learning Word Embedding with Better Distance Weighting and Window Size Scheduling

WRS: A Novel Word-embedding Method for Real-time Sentiment with Integrated LSTM-CNN Model