Abstract:Nowadays, multimedia big data have grown exponentially in diverse applications like social networks, transportation, health, and e-commerce, etc. Accessing preferred data in large-scale datasets needs efficient and sophisticated retrieval approaches. Multimedia big data consists of the most significant features with different types of data. Even though the multimedia supports various data formats with corresponding storage frameworks, similar semantic information is expressed by the multimedia. The overlap of semantic features is most efficient for theory and research related to semantic memory. Correspondingly, in recent years, deep multimodal hashing gets more attention owing to the efficient performance of huge-scale multimedia retrieval applications. On the other hand, the deep multimodal hashing has limited efforts for exploring the complex multilevel semantic structure. The main intention of this proposal is to develop enhanced deep multimedia big data retrieval with the Adaptive Semantic Similarity Function (A-SSF). The proposed model of this research covers several phases "(a) Data collection, (b) deep feature extraction, (c) semantic feature selection and (d) adaptive similarity function for retrieval. The two main processes of multimedia big data retrieval are training and testing. Once after collecting the dataset involved with video, text, images, and audio, the training phase starts. Here, the deep semantic feature extraction is performed by the Convolutional Neural Network (CNN), which is again subjected to the semantic feature selection process by the new hybrid algorithm termed Spider Monkey-Deer Hunting Optimization Algorithm (SM-DHOA). The final optimal semantic features are stored in the feature library. During testing, selected semantic features are added to the map-reduce framework in the Hadoop environment for handling the big data, thus ensuring the proper big data distribution. Here, the main contribution termed A-SSF is introduced to compute the correlation between the multimedia semantics of the testing data and training data, thus retrieving the data with minimum similarity. Extensive experiments on benchmark multimodal datasets demonstrate that the proposed method can outperform the state-of-the-art performance for all types of data.

Multi-modal active learning with deep reinforcement learning for target feature extraction in multi-media image processing applications

RECL: Responsive Resource-Efficient Continuous Learning for Video Analytics

The image annotation algorithm using convolutional features from intermediate layer of deep learning

Scalable Multimedia Retrieval By Deep Learning Hashing With Relative Similarity Learning

An Intelligent Multi-View Active Learning Method Based on a Double-Branch Network

Edge-Cloud Collaborative Streaming Video Analytics with Multi-agent Deep Reinforcement Learning

A multimodal deep learning framework for scalable content based visual media retrieval

Multimedia Analysis and Retrieval System

Improving Web-Based Learning: Automatic Annotation of Multimedia Semantics and Cross-Media Indexing

Online Multi-Label Active Annotation

Proactive hybrid learning framework for real-time multi-vehicle detection in unregulated traffic environments

CSL Net: Convoluted SE and LSTM Blocks Based Network for Automatic Image Annotation

Image Quality Assessment in Visual Reinforcement Learning for Fast-moving Targets

Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval

Annotation Cost Efficient Active Learning for Content Based Image Retrieval

Deep Learning Based Automatic Video Annotation Tool for Self-Driving Car

Deep Learning Techniques for Future Intelligent Cross-Media Retrieval

A new design of multimedia big data retrieval enabled by deep feature learning and Adaptive Semantic Similarity Function

M-VAAL: Multimodal Variational Adversarial Active Learning for Downstream Medical Image Analysis Tasks

Streaming Machine Learning and Online Active Learning for Automated Visual Inspection

Multi-Feature Fusion Via Hierarchical Regression for Multimedia Analysis