Abstract:With the rapid development of the Internet technology and the digital imaging device,massive pictures are uploaded and saved in the network disks every day.Traditional image coding methods (such as JPEG,JPEG2000) only exploit spatial correlations inside a picture,thus their coding efficiency cannot satisfy the demand of the huge picture data.In order to compress the images in the Internet more efficiently,we proposed a novel cloud-based image compression method by taking advantage of the ‘ big data’ challenge,rather than simply treating it as a difficulty.The proposed system adopts the processing mode of ‘encoding one image immediately after it is uploaded’,which effectively adapts to the rapid change of the big-data environment.It compresses an image by fully exploiting the correlations between the image-to-be-compressed and other images in the cloud.We had three innovative technical designs to serve for the compression framework.(a) We designed a scheme based on an efficient image retrieval technique to effectively filter the ‘big data’ and to only keep the ‘small data’ that well matches the image to be compressed.(b) By utilizing the selected ‘small data’,we did inter-image prediction coding,which is essential for the performance of the compression system.We exploited the techniques of block matching-based prediction coding and rate-distortion optimization,so that coding bits are tremendously reduced in the premise of good reconstruction quality.(c) We ensured the prediction accuracy by applying a couple of preprocessing techniques on the retrieved similar image to obtain good references.In this step,we applied projective transformation to align the retrieved image and the current image,and did illumination compensation to make the values of corresponding pixels as close as possible.This paper presents a novel framework for image compression in the big-data era,which tremendously saves storage requirement for images in the cloud disks.The proposed system has significant compression performance gains compared to traditional image coding methods and even the advanced HEVC intra coding.The experimental results demonstrate that it outperforms JPEG and HEVC intra coding by 78.5％ and 67.2％ on average,respectively.Moreover,compared to the existing state-of-the-art attempt for cloud-based image coding in the literature,both the objective and subjective reconstruction quality of our algorithm have obvious improvement.

Compression-aware I/O performance analysis for big data clustering.

Visual Analysis of Cloud Computing Performance Using Behavioral Lines

Research on Spatial Statistical Data Compression Algorithm Based on Point Cloud Clustering

IC-Data: Improving Compressed Data Processing in Hadoop.

A High Performance Compression Method For Climate

Distance-aware Virtual Cluster Performance Optimization: A Hadoop Case Study

Impression Store: Compressive Sensing-based Storage for Big Data Analytics.

Optimizing Data Migration Using Online Clustering.

Comparative Analysis of Optimization Strategies for K-means Clustering in Big Data Contexts: A Review

Compressed Subspace Clustering: A Case Study

To Compress or Not To Compress: Energy Trade-Offs and Benefits of Lossy Compressed I/O

Settling Time vs. Accuracy Tradeoffs for Clustering Big Data

Compressing Big Graph Data: A Relative Node Importance Approach

A Study of Performance Optimization Method for Massive Spaito-temporal Data Based on Spatio-temporal Partition Clustering

Content-Aware Partial Compression for Textual Big Data Analysis in Hadoop

Distributed Outlier Detection Using Compressive Sensing

Spatiotemporally adaptive compression for scientific dataset with feature preservation -- a case study on simulation data with extreme climate events analysis

An Efficient Image Compression Method Based on Cloud Data

Data Compression for Analytics over Large-scale In-memory Column Databases

Attention Based Machine Learning Methods for Data Reduction with Guaranteed Error Bounds