Abstract:Designing a videocodec involves a four-way tradeoff among computational complexity, data rate, picture quality, and latency. Rapid advancement in very large-scale integration technology has provided CPUs with enough power to accommodate a software-only videocodec. Accordingly, computational complexity has resurfaced as a major element in this tradeoff. With a view toward significantly reducing computational complexity relative to standards-based videocodecs, we introduce a pixelwise conditional differential replenishment scheme to compress video via perception-sensitive decomposition of difference frames into a facsimile map and an intensity vector. Our schemes, which apply techniques from facsimile, are transform free. Some of them also involve no motion compensation and hence are completely free of block-based artifacts and particularly computationally economical. The fusion of our facsimile-based video-coding schemes and spatio-temporal perceptual-coding techniques facilitates powerful software-only video conferencing on today's medium- and highend personal computers. Indeed, assuming that a frame-capture driver has been provided, our motion-compensation-free approach has yielded a software-only, full-duplex, full-color videoconferencing system that conveys high-quality, CIF/Q-NTSC-sized video at 30 frames per second on 200-MHz Pentium PCs sending less than 300 Kbps in each direction. We also present new spatio-temporal compression techniques for perceptual coding of video. These techniques, motivated by the classical psychological experiments that led to formulation of the Weber-Fechner law, allow videocodec systems to capitalize on properties of the human visual system. Some of our spatiotemporal perceptual techniques not only apply to our proprietary pixelwise conditional differential replenishment schemes that we describe for video conferencing but also can readily be incorporated into today's popular video standards.

Hybrid model-and-object-based real-time conversational video coding

An Improved Method for Scalable Video Coding at Low Bit Rates

2D/3D Model-Based Facial Video Coding/Decoding at Ultra-Low Bit-Rate.

A Hybrid Deep Animation Codec for Low-bitrate Video Conferencing

&Lt;title>layered Coding System for Very Low Bitrate Videophone</title>

A Scalable Video Conferencing System Using Cached Facial Expressions.

Model-based Low Bit-Rate Video Coding for Resource-Deficient Wireless Visual Communication

The Study On The Layered Coding System For Very Low Bit Rate Videophone

Improving Low Bitrate Video Coding Via Computation Incorporating a Priori Information

Semantic Neural Rendering-based Video Coding: Towards Ultra-Low Bitrate Video Conferencing

Low Bit-Rate Video Coding Via Mode-Dependent Adaptive Regression for Wireless Visual Communications.

Extended application of scalable video coding methods

A software-only videocodec using pixelwise conditional differential replenishment and perceptual enhancements

Beyond Keypoint Coding: Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression

Transmission of Scalable Video over Networks

Software-Based video codec for mobile devices

Model-based portrait video compression with spatial constraint and adaptive pose processing

A Layered Video Coding Scheme for Very Low Bit Rate Videophone

Predictive Coding For Animation-Based Video Compression

Resolution-Agnostic Neural Compression for High-Fidelity Portrait Video Conferencing via Implicit Radiance Fields