Abstract: Functional magnetic resonance imaging (fMRI) is a powerful tool for probing the human brain's perception and cognition. Besides being extensively exploited in clinical applications, fMRI techniques are also useful in everyday life. In this paper, we investigate a novel application: leveraging fMRI techniques for video clustering and retrieval. We integrate semantic, human-centric features derived from natural-stimulus fMRI data with low-level visual-audio features to facilitate video clustering and retrieval, a significant innovation over previous works that rely on either fMRI-derived features or low-level visual-audio features alone. Our system consists of several algorithmic modules. First, fMRI data are acquired while subjects watch video shot samples. Then, a newly developed brain network localization system is employed to locate the cortical regions of interest (ROIs) for each individual subject. The functional interactions, computed by wavelet transform coherence, are quantified, and the human-centric features are derived from them. Afterwards, a Gaussian process regression model that maps the visual-audio feature space to the fMRI-derived feature space is trained on the training samples. The trained model is then used to predict fMRI-derived features for videos that have no fMRI data. Finally, multi-modal spectral clustering and multi-modal ranking algorithms are adopted and proposed to integrate these two heterogeneous feature types for video clustering and retrieval, respectively. Our experiments on the TRECVID database demonstrate that the precision of video clustering and retrieval can be substantially improved by integrating visual-audio features with fMRI-derived features.
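
The regression step described in the abstract (learning a map from visual-audio features to fMRI-derived features, then predicting for videos that were never scanned) can be illustrated with a minimal sketch of the Gaussian process posterior mean. This is not the authors' implementation: the squared-exponential kernel, the `noise` and `length_scale` values, the scalar target, and all function names are assumptions chosen for illustration. A vector-valued fMRI feature could be handled by fitting one such model per output dimension.

```python
import math

def rbf(a, b, length_scale=1.0):
    """Squared-exponential kernel between two feature vectors (assumed kernel)."""
    sq = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-sq / (2.0 * length_scale ** 2))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(A)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def gp_fit(X, y, noise=1e-6, length_scale=1.0):
    """Return alpha = (K + noise * I)^-1 y, the weights of the posterior mean."""
    n = len(X)
    K = [[rbf(X[i], X[j], length_scale) + (noise if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    return solve(K, y)

def gp_predict(X, alpha, x_new, length_scale=1.0):
    """Posterior mean at x_new: sum_i alpha_i * k(x_i, x_new)."""
    return sum(a * rbf(x, x_new, length_scale) for x, a in zip(X, alpha))

# Toy data: 1-D stand-ins for visual-audio features (X) and one
# fMRI-derived feature (y). Real features would be high-dimensional.
X_train = [[0.0], [1.0], [2.0], [3.0]]
y_train = [0.0, 1.0, 4.0, 9.0]
alpha = gp_fit(X_train, y_train)
print(gp_predict(X_train, alpha, [1.5]))  # prediction for an unscanned video
```

With a small noise term the posterior mean nearly interpolates the training targets, and far from the training data it falls back to the zero prior mean. In practice a library implementation with learned hyperparameters would be preferable to this hand-rolled solver.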

Individual Home-Video Collecting Using a Co-clustering Method

Person-Based Video Summarization And Retrieval By Tracking And Clustering Temporal Face Sequences

Video object segmentation by motion-based sequential feature clustering.

Integrated System for Face Detection, Clustering and Recognition

A Video Face Clustering Approach Based on Sparse Subspace Representation

Human Facial Expression Recognition Based On 3d Cuboids And Improved K-Means Clustering Algorithm

Appearance-based Video Clustering in 2D Locality Preserving Projection Subspace.

Fast Person-Specific Image Retrieval Using A Simple And Efficient Clustering Method

VideoClusterNet: Self-Supervised and Adaptive Face Clustering For Videos

Clustering and retrieval of video shots based on natural stimulus fMRI

Video Face Matching using Subset Selection and Clustering of Probabilistic Multi-Region Histograms

Unsupervised Person Clustering in Videos with Cross-Modal Communication.

Multi-Clue Based Facial Feature Detection and Tracking in Video

Motion Clustering For Similar Video Segments Mining

A novel video face clustering algorithm based on divide and conquer strategy

Towards Unlocking Web Video: Automatic People Tracking and Clustering

Face Extraction from Video Sequences by K-means Clustering and Fusion

Video Face Clustering via Constrained Sparse Representation

Human Instance Segmentation and Tracking via Data Association and Single-stage Detector

Clustering-based multi-featured self-supervised learning for human activities and video retrieval

A New Method for Multi-view Face Clustering in Video Sequence