Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal Masking

Zijian Dong, Ruilin Li, Yilei Wu, Thuan Tinh Nguyen, Joanna Su Xian Chong, Fang Ji, Nathanael Ren Jie Tong, Christopher Li Hsian Chen, Juan Helen Zhou

2024-09-29

Abstract: We introduce Brain-JEPA, a brain dynamics foundation model with the Joint-Embedding Predictive Architecture (JEPA). This pioneering model achieves state-of-the-art performance in demographic prediction, disease diagnosis/prognosis, and trait prediction through fine-tuning. Furthermore, it excels in off-the-shelf evaluations (e.g., linear probing) and demonstrates superior generalizability across different ethnic groups, surpassing the previous large model for brain activity significantly. Brain-JEPA incorporates two innovative techniques: Brain Gradient Positioning and Spatiotemporal Masking. Brain Gradient Positioning introduces a functional coordinate system for brain functional parcellation, enhancing the positional encoding of different Regions of Interest (ROIs). Spatiotemporal Masking, tailored to the unique characteristics of fMRI data, addresses the challenge of heterogeneous time-series patches. These methodologies enhance model performance and advance our understanding of the neural circuits underlying cognition. Overall, Brain-JEPA is paving the way to address pivotal questions of building brain functional coordinate system and masking brain activity at the AI-neuroscience interface, and setting a potentially new paradigm in brain activity analysis through downstream adaptation.

Neurons and Cognition, Artificial Intelligence, Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve

The paper primarily aims to address the following issues:

1. **Improving Model Generalization**: Existing fMRI analysis models for specific tasks (such as BrainNetCNN, BrainGNN, BNT, etc.) have limited generalization capabilities across different tasks and cannot fully utilize large amounts of unlabeled fMRI data.
2. **Enhancing Pre-training Strategies**: Current foundational fMRI models (such as BrainLM) use self-supervised pre-training methods, but directly reconstructing masked time series segments may not be suitable for noisy fMRI data, making it difficult for the model to distinguish between signal and noise.
3. **Enhancing Downstream Task Performance**: Existing models perform poorly in downstream tasks (such as linear probing) and are limited to specific populations (such as Caucasians), restricting their clinical application scope.

To address these issues, the paper proposes the Brain-JEPA model, which enhances model performance through the following two innovative techniques:

- **Brain Gradient Positioning**: Introduces a functional coordinate system based on functional connectivity gradients to enhance the positional encoding of different regions of interest (ROIs).
- **Spatiotemporal Masking**: Designs a specialized masking strategy tailored to the unique characteristics of fMRI data to handle heterogeneous time series segments.

Through these techniques, Brain-JEPA demonstrates excellent performance in various downstream tasks, including demographic prediction, disease diagnosis/prognosis, and trait prediction, and shows outstanding generalization capabilities across different ethnic groups.
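To make the idea of a gradient-based functional coordinate system more concrete, the following is a minimal sketch of how ROI coordinates might be derived from functional connectivity. It is not the paper's implementation: the function name `gradient_coordinates`, the thresholding of negative correlations, the simplified spectral embedding (eigenvectors of a degree-normalized affinity matrix), the random placeholder projection, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np

def gradient_coordinates(ts, n_components=3):
    """Illustrative sketch (hypothetical helper, not from the paper).

    ts: array of shape (n_rois, n_timepoints) holding ROI time series.
    Returns an (n_rois, n_components) array of gradient-like coordinates.
    """
    # ROI-by-ROI functional connectivity as Pearson correlation.
    fc = np.corrcoef(ts)
    # Assumption: keep only non-negative connectivity as the affinity.
    affinity = np.clip(fc, 0.0, None)
    np.fill_diagonal(affinity, 1.0)
    # Symmetric degree normalization: D^{-1/2} A D^{-1/2}.
    degree = affinity.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(degree)
    norm_aff = affinity * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    # eigh returns eigenvalues in ascending order for symmetric matrices.
    eigvals, eigvecs = np.linalg.eigh(norm_aff)
    order = np.argsort(eigvals)[::-1]
    # Skip the leading (trivial) eigenvector; keep the next n_components.
    return eigvecs[:, order[1:n_components + 1]]

# Synthetic stand-in for fMRI data: 20 ROIs, 200 time points.
rng = np.random.default_rng(0)
ts = rng.standard_normal((20, 200))
coords = gradient_coordinates(ts, n_components=3)

# Assumption: a learned linear layer would map coordinates to the model
# dimension; a fixed random matrix stands in for it here.
embed_dim = 16
projection = rng.standard_normal((coords.shape[1], embed_dim))
roi_pos_embedding = coords @ projection  # shape (n_rois, embed_dim)
```

In this sketch, each ROI receives a position determined by its connectivity profile rather than by its index in the parcellation, which is the intuition behind a functional coordinate system for positional encoding.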

JDAT: Joint-Dimension-Aware Transformer with Strong Flexibility for EEG Emotion Recognition

S-JEPA: towards seamless cross-dataset transfer through dynamic spatial attention

A-JEPA: Joint-Embedding Predictive Architecture Can Listen

Joint Prediction of Longitudinal Development of Cortical Surfaces and White Matter Fibers from Neonatal MRI

Joint-Embedding Masked Autoencoder for Self-supervised Learning of Dynamic Functional Connectivity from the Human Brain

Denoising with a Joint-Embedding Predictive Architecture

Towards a "universal translator" for neural dynamics at single-cell, single-spike resolution

EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training

DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture

Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning

BrainLM: A foundation model for brain activity recordings

Multi-Task Collaborative Pre-Training and Adaptive Token Selection: A Unified Framework for Brain Representation Learning

Neuroadaptive electroencephalography: a proof-of-principle study in infants

BrainPy, a Flexible, Integrative, Efficient, and Extensible Framework for General-Purpose Brain Dynamics Programming.

BrainMass: Advancing Brain Network Analysis for Diagnosis with Large-scale Self-Supervised Learning

Benchmarking Deep Jansen-Rit Parameter Inference: An in Silico Study

An Explainable Autoencoder with Multi-Paradigm fMRI Fusion for Identifying Differences in Dynamic Functional Connectivity During Brain Development

BrainMAE: A Region-aware Self-supervised Learning Framework for Brain Signals

Bayesian Joint Modeling of Multiple Brain Functional Networks