Abstract:Nowadays, multimedia big data have grown exponentially in diverse applications like social networks, transportation, health, and e-commerce, etc. Accessing preferred data in large-scale datasets needs efficient and sophisticated retrieval approaches. Multimedia big data consists of the most significant features with different types of data. Even though the multimedia supports various data formats with corresponding storage frameworks, similar semantic information is expressed by the multimedia. The overlap of semantic features is most efficient for theory and research related to semantic memory. Correspondingly, in recent years, deep multimodal hashing gets more attention owing to the efficient performance of huge-scale multimedia retrieval applications. On the other hand, the deep multimodal hashing has limited efforts for exploring the complex multilevel semantic structure. The main intention of this proposal is to develop enhanced deep multimedia big data retrieval with the Adaptive Semantic Similarity Function (A-SSF). The proposed model of this research covers several phases "(a) Data collection, (b) deep feature extraction, (c) semantic feature selection and (d) adaptive similarity function for retrieval. The two main processes of multimedia big data retrieval are training and testing. Once after collecting the dataset involved with video, text, images, and audio, the training phase starts. Here, the deep semantic feature extraction is performed by the Convolutional Neural Network (CNN), which is again subjected to the semantic feature selection process by the new hybrid algorithm termed Spider Monkey-Deer Hunting Optimization Algorithm (SM-DHOA). The final optimal semantic features are stored in the feature library. During testing, selected semantic features are added to the map-reduce framework in the Hadoop environment for handling the big data, thus ensuring the proper big data distribution. Here, the main contribution termed A-SSF is introduced to compute the correlation between the multimedia semantics of the testing data and training data, thus retrieving the data with minimum similarity. Extensive experiments on benchmark multimodal datasets demonstrate that the proposed method can outperform the state-of-the-art performance for all types of data.

Large Scale Cross-Media Data Retrieval Based on Hadoop.

Online latent semantic hashing for cross-media retrieval.

Supervised Coarse-to-Fine Semantic Hashing for Cross-Media Retrieval.

Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval

Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval

Massive Image Data Management Using Hbase And Mapreduce

Internet Cross-Media Retrieval Based on Deep Learning.

Inter-media Hashing for Large-Scale Retrieval from Heterogeneous Data Sources.

Progressive Image Retrieval with Quality Guarantee under MapReduce Framework

Efficient Retrieval of Massive Ocean Remote Sensing Images via a Cloud-Based Mean-Shift Algorithm

Parallel Approach and Platform for Large-Scale WEB Data Extraction

Medical Cloud Computing Data Processing to Optimize the Effect of Drugs

A Content-Based Image Retrieval System Based on Hadoop and Lucene

Research on remote sensing image storage management and a fast visualization system based on cloud computing technology

Multimedia Information Retrieval System Design Based on Cloud Computing

A new design of multimedia big data retrieval enabled by deep feature learning and Adaptive Semantic Similarity Function

Parallel Image Texture Feature Extraction Under Hadoop Cloud Platform

Discrete Robust Matrix Factorization Hashing for Large-scale Cross-media Retrieval

Efficient Supervised Graph Embedding Hashing for large-scale cross-media retrieval

Massive Remote Sensing Image Data Management Based On Hbase And Geosot

An Efficient Organization Method for Large-Scale and Long Time-Series Remote Sensing Data in a Cloud Computing Environment