Leveraging Domain Knowledge using Machine Learning for Image Compression in Internet-of-Things

Prabuddha Chakraborty, Jonathan Cruz, Swarup Bhunia
DOI: https://doi.org/10.48550/arXiv.2009.06742
2020-09-15
Abstract: The emergent ecosystems of intelligent edge devices in diverse Internet of Things (IoT) applications, from automatic surveillance to precision agriculture, increasingly rely on recording and processing a variety of image data. Due to resource constraints, e.g., energy and communication bandwidth requirements, these applications require compressing the recorded images before transmission. For these applications, image compression commonly requires: (1) maintaining features for coarse-grain pattern recognition instead of the high-level details for human perception due to machine-to-machine communications; (2) high compression ratio that leads to improved energy and transmission efficiency; (3) large dynamic range of compression and an easy trade-off between compression factor and quality of reconstruction to accommodate a wide diversity of IoT applications as well as their time-varying energy/performance needs. To address these requirements, we propose MAGIC, a novel machine learning (ML) guided image compression framework that judiciously sacrifices visual quality to achieve much higher compression when compared to traditional techniques, while maintaining accuracy for coarse-grained vision tasks. The central idea is to capture application-specific domain knowledge and efficiently utilize it in achieving high compression. We demonstrate that the MAGIC framework is configurable across a wide range of compression/quality and is capable of compressing beyond the standard quality factor limits of both JPEG 2000 and WebP. We perform experiments on representative IoT applications using two vision datasets and show up to 42.65x compression at similar accuracy with respect to the source. We highlight low variance in compression rate across images using our technique as compared to JPEG 2000 and WebP.
Computer Vision and Pattern Recognition, Image and Video Processing
What problem does this paper attempt to address?
The paper addresses image compression on edge devices in Internet of Things (IoT) applications, where resources such as energy and communication bandwidth are limited. It identifies three key requirements:

1. **Maintain coarse-grained pattern recognition features**: Because the images are consumed in machine-to-machine communication, compression needs to preserve the features used for coarse-grained pattern recognition rather than the fine detail required for human perception.
2. **High compression ratio**: A high compression ratio improves energy and transmission efficiency.
3. **Flexible trade-off between compression and reconstruction quality**: To suit a wide variety of IoT applications and their time-varying energy/performance needs, the compression factor must be easy to trade against reconstruction quality over a large dynamic range.

To meet these requirements, the paper proposes MAGIC, a machine learning (ML) guided image compression framework. MAGIC deliberately sacrifices visual quality to achieve a much higher compression ratio than traditional techniques, while maintaining accuracy on coarse-grained vision tasks. Its core idea is to capture application-specific domain knowledge and use it to achieve high compression.

### Main contributions

- **High configurability**: MAGIC can be configured across a wide range of compression/quality settings.
- **Beyond standard compression limits**: MAGIC can compress beyond the standard quality-factor limits of both JPEG 2000 and WebP, reaching up to 42.65x compression at similar accuracy with respect to the source.
- **Low variance in compression rate**: Compared with JPEG 2000 and WebP, MAGIC's compression rate varies less from image to image.

### Method overview

The MAGIC framework consists of three main steps:

1. **Knowledge acquisition**: Extract application-specific and domain-specific information, such as color distribution, common pattern deviation, and region-of-interest deviation.
2. **Encoding**: Use Delaunay triangulation to divide the image into non-overlapping triangles and assign each triangle a color from the color quantization dictionary.
3. **Decoding**: Restore the image by reconstructing the triangles and looking up their colors in the color quantization dictionary.

### Experimental results

The authors ran experiments on two public datasets, one for fire detection and one for building crack detection. The results show that MAGIC substantially increases the compression ratio at a small cost in accuracy. On the building crack detection dataset, MAGIC achieved 22.09x compression with a 1.06% accuracy loss; on the fire detection dataset, it achieved 42.65x compression with a 2.99% accuracy loss.

These results indicate that MAGIC addresses the key challenges of image compression in IoT applications, particularly on resource-constrained edge devices.
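The encoding and decoding steps in the method overview can be illustrated with a deliberately simplified sketch. It is not the authors' implementation and makes several assumptions: a fixed grid, with each cell split along a diagonal, stands in for the Delaunay triangulation of ML-selected points; the four-entry `PALETTE` is a hand-picked stand-in for a learned color quantization dictionary; and every name (`triangle_id`, `encode`, `decode`) is hypothetical.

```python
# Toy triangulation-based codec. Illustrative only, NOT the MAGIC implementation.
# Assumption: a hand-picked palette plays the role of the color quantization dictionary.
PALETTE = [(0, 0, 0), (255, 0, 0), (255, 160, 0), (255, 255, 255)]


def nearest_palette_index(color):
    """Return the index of the palette entry closest to `color` (squared RGB distance)."""
    return min(
        range(len(PALETTE)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(color, PALETTE[i])),
    )


def triangle_id(px, py, width, step):
    """Map a pixel to one of two triangles inside its step-by-step grid cell.

    Assumption: this fixed grid split replaces the Delaunay triangulation
    that the paper describes.
    """
    cols = -(-width // step)  # ceiling division: number of cells per row
    cell = (py // step) * cols + (px // step)
    upper = (px % step) > (py % step)  # which side of the cell diagonal
    return 2 * cell + (1 if upper else 0)


def encode(image, step):
    """Encode `image` (rows of (r, g, b) tuples) as one palette index per triangle."""
    height, width = len(image), len(image[0])
    sums = {}  # triangle id -> [sum_r, sum_g, sum_b, pixel_count]
    for py, row in enumerate(image):
        for px, (r, g, b) in enumerate(row):
            acc = sums.setdefault(triangle_id(px, py, width, step), [0, 0, 0, 0])
            acc[0] += r
            acc[1] += g
            acc[2] += b
            acc[3] += 1
    # Quantize the mean color of each triangle to the nearest palette entry.
    indices = {
        t: nearest_palette_index((s[0] / s[3], s[1] / s[3], s[2] / s[3]))
        for t, s in sums.items()
    }
    return {"width": width, "height": height, "step": step, "indices": indices}


def decode(code):
    """Rebuild the image by painting every pixel with its triangle's palette color."""
    w, h, step = code["width"], code["height"], code["step"]
    return [
        [PALETTE[code["indices"][triangle_id(px, py, w, step)]] for px in range(w)]
        for py in range(h)
    ]


if __name__ == "__main__":
    # 4x4 toy image: left half reddish, right half near-white.
    img = [[(250, 10, 5)] * 2 + [(250, 250, 250)] * 2 for _ in range(4)]
    code = encode(img, step=2)
    print(len(code["indices"]), "triangles")
    print(decode(code)[0])
```

In this sketch, a larger `step` means fewer triangles and therefore fewer stored indices, at the cost of a coarser reconstruction. This loosely mirrors the compression/quality trade-off described above, although the actual framework is described as choosing its triangulation from domain knowledge rather than from a fixed grid.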