Abstract:The emergent ecosystems of intelligent edge devices in diverse Internet of Things (IoT) applications, from automatic surveillance to precision agriculture, increasingly rely on recording and processing variety of image data. Due to resource constraints, e.g., energy and communication bandwidth requirements, these applications require compressing the recorded images before transmission. For these applications, image compression commonly requires: (1) maintaining features for coarse-grain pattern recognition instead of the high-level details for human perception due to machine-to-machine communications; (2) high compression ratio that leads to improved energy and transmission efficiency; (3) large dynamic range of compression and an easy trade-off between compression factor and quality of reconstruction to accommodate a wide diversity of IoT applications as well as their time-varying energy/performance needs. To address these requirements, we propose, MAGIC, a novel machine learning (ML) guided image compression framework that judiciously sacrifices visual quality to achieve much higher compression when compared to traditional techniques, while maintaining accuracy for coarse-grained vision tasks. The central idea is to capture application-specific domain knowledge and efficiently utilize it in achieving high compression. We demonstrate that the MAGIC framework is configurable across a wide range of compression/quality and is capable of compressing beyond the standard quality factor limits of both JPEG 2000 and WebP. We perform experiments on representative IoT applications using two vision datasets and show up to 42.65x compression at similar accuracy with respect to the source. We highlight low variance in compression rate across images using our technique as compared to JPEG 2000 and WebP.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the need for edge devices to compress image data in a resource - constrained environment (such as limited energy and communication bandwidth) in Internet of Things (IoT) applications. Specifically, the paper proposes solutions for the following three key requirements: 1. **Maintain coarse - grained pattern recognition features**: Due to the need for machine - to - machine communication, these applications need to maintain the coarse - grained pattern recognition features in the image, rather than the high - detail required for human perception. 2. **High compression ratio**: In order to improve energy and transmission efficiency, these applications need to achieve an extremely high image compression ratio. 3. **Flexible trade - off between compression and reconstruction quality**: In order to adapt to a wide variety of IoT application scenarios and their time - varying energy/performance requirements, it is necessary to be able to make a flexible trade - off between the compression factor and the reconstruction quality. To solve the above problems, the paper proposes a new machine - learning - based image compression framework - MAGIC (Machine Learning Guided Image Compression). MAGIC achieves a higher compression ratio than traditional techniques by sacrificing visual quality while maintaining the accuracy of coarse - grained visual tasks. Its core idea is to capture domain knowledge in specific application areas and use this knowledge efficiently to achieve efficient image compression. ### Main contributions - **High configurability**: MAGIC can be configured within a wide range of compression/quality. - **Beyond standard compression limits**: Experiments show that MAGIC can achieve a higher compression ratio than JPEG 2000 and WebP, reaching a compression rate of 42.65 times respectively. - **Low compression rate fluctuation**: Compared with JPEG 2000 and WebP, MAGIC has less compression rate fluctuation on different images. ### Method overview The MAGIC framework is mainly divided into three steps: 1. **Knowledge acquisition**: Extract application - and domain - specific information, such as color distribution, common pattern deviation, and region - of - interest deviation. 2. **Encoding**: Use the Delaunay triangulation method to divide the image into non - overlapping triangles and assign colors according to the color quantization dictionary. 3. **Decoding**: Restore the image by reconstructing the triangles and assigning colors from the color quantization dictionary. ### Experimental results The authors conducted experiments on two public datasets, namely fire detection and building crack detection. The experimental results show that MAGIC can significantly improve the compression ratio while maintaining a certain accuracy. For example, in the building crack detection dataset, MAGIC achieved a 22.09 - fold compression with a 1.06% precision loss; in the fire detection dataset, MAGIC achieved a 42.65 - fold compression with a 2.99% precision loss. Through these improvements, MAGIC effectively solves the key challenges of image compression in IoT applications, especially on resource - constrained edge devices.

Leveraging Domain Knowledge using Machine Learning for Image Compression in Internet-of-Things

Optimal deep learning based image compression technique for data transmission on industrial Internet of things applications

Learned Image Compression for Machine Perception

Slimmable Multi-Task Image Compression for Human and Machine Vision

Deep Residual Learning-Based Enhanced JPEG Compression in the Internet of Things

Collaborative image compression and classification with multi-task learning for visual Internet of Things

Deep Learning-based Compressed Domain Multimedia for Man and Machine: A Taxonomy and Application to Point Cloud Classification

Deep Image Compression Towards Machine Vision: A Unified Optimization Framework

Deep Image Compression Toward Machine Vision: A Unified Optimization Framework

Probing Image Compression For Class-Incremental Learning

Machine Perception-Driven Image Compression: A Layered Generative Approach

Unified and Scalable Deep Image Compression Framework for Human and Machine

Learning-Based Image Compression for Machines

A New Approach to Image Compression in Industrial Internet of Things

Breaking the Barriers of One-to-One Usage of Implicit Neural Representation in Image Compression: A Linear Combination Approach with Performance Guarantees

Pragmatic Image Compression for Human-in-the-Loop Decision-Making

A Unified End-to-End Framework for Efficient Deep Image Compression

Learned Image Compression for Both Humans and Machines Via Dynamic Adaptation

Context-aware Optimization for Bandwidth-Efficient Image Analytics Offloading

Adaptive Compression for Online Computer Vision: an Edge Reinforcement Learning Approach

Unveiling the Future of Human and Machine Coding: A Survey of End-to-End Learned Image Compression