May I see what you see? Predicting visual features from neuronal activity

Vikram Ravindra, Chih-Hao Fang, Ananth Grama
DOI: https://doi.org/10.1016/j.isci.2024.108819
IF: 5.8
2024-02-01
iScience
Abstract: Understanding brain response to audiovisual stimuli is a key challenge in understanding neuronal processes. In this paper, we describe our effort aimed at reconstructing video frames from observed functional MRI images. We also demonstrate that our model can predict visual objects. Our method constructs an autoencoder model for a set of training video segments to code video streams into their corresponding latent representations. Next, we learn a mapping from the observed fMRI response to the corresponding latent video frame representation. Finally, we pass the latent vectors computed using the fMRI response through the decoder to reconstruct the predicted image. We show that the representations of video frames and those constructed from corresponding fMRI images are highly clustered, the latent representations can be used to predict objects in video frames using just the fMRI frames, and fMRI responses can be used to reconstruct the inputs to predict the presence of faces.
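The three stages described in the abstract (encode frames, map fMRI to the latent space, decode) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a linear autoencoder fitted by PCA in place of the paper's autoencoder, a ridge regression as the fMRI-to-latent mapping, and purely synthetic data. All function names, array sizes, and the `alpha` parameter are hypothetical choices made for illustration.

```python
import numpy as np


def fit_linear_autoencoder(frames, latent_dim):
    """PCA stand-in for an autoencoder (assumption): returns mean and components."""
    mean = frames.mean(axis=0)
    _, _, vt = np.linalg.svd(frames - mean, full_matrices=False)
    return mean, vt[:latent_dim]


def encode(frames, mean, components):
    """Project flattened frames onto the latent space."""
    return (frames - mean) @ components.T


def decode(latents, mean, components):
    """Map latent vectors back to flattened frames."""
    return latents @ components + mean


def fit_ridge(x, y, alpha=1.0):
    """Ridge regression weights W such that x @ W approximates y."""
    d = x.shape[1]
    return np.linalg.solve(x.T @ x + alpha * np.eye(d), x.T @ y)


# Synthetic data (assumption): frames and fMRI share hidden low-dimensional factors.
rng = np.random.default_rng(0)
n_frames, n_pixels, n_voxels, latent_dim = 200, 64, 30, 5
factors = rng.normal(size=(n_frames, latent_dim))
frames = factors @ rng.normal(size=(latent_dim, n_pixels))
frames += 0.01 * rng.normal(size=frames.shape)
fmri = factors @ rng.normal(size=(latent_dim, n_voxels))
fmri += 0.01 * rng.normal(size=fmri.shape)

train, test = slice(0, 150), slice(150, 200)

# Stage 1: fit the (linear) autoencoder on training frames.
mean, components = fit_linear_autoencoder(frames[train], latent_dim)
train_latents = encode(frames[train], mean, components)

# Stage 2: learn the mapping from centered fMRI responses to frame latents.
fmri_mean = fmri[train].mean(axis=0)
weights = fit_ridge(fmri[train] - fmri_mean, train_latents)

# Stage 3: predict latents from held-out fMRI and decode them into frames.
predicted_latents = (fmri[test] - fmri_mean) @ weights
reconstructed = decode(predicted_latents, mean, components)

rel_error = np.linalg.norm(reconstructed - frames[test]) / np.linalg.norm(frames[test])
print(f"relative reconstruction error on held-out frames: {rel_error:.3f}")
```

On real data, the linear pieces would presumably be replaced by a learned nonlinear encoder and decoder, and the low error seen here reflects only the way the synthetic data was generated.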
multidisciplinary sciences