Abstract:Designing a videocodec involves a four-way tradeoff among computational complexity, data rate, picture quality, and latency. Rapid advancement in very large-scale integration technology has provided CPUs with enough power to accommodate a software-only videocodec. Accordingly, computational complexity has resurfaced as a major element in this tradeoff. With a view toward significantly reducing computational complexity relative to standards-based videocodecs, we introduce a pixelwise conditional differential replenishment scheme to compress video via perception-sensitive decomposition of difference frames into a facsimile map and an intensity vector. Our schemes, which apply techniques from facsimile, are transform free. Some of them also involve no motion compensation and hence are completely free of block-based artifacts and particularly computationally economical. The fusion of our facsimile-based video-coding schemes and spatio-temporal perceptual-coding techniques facilitates powerful software-only video conferencing on today's medium- and highend personal computers. Indeed, assuming that a frame-capture driver has been provided, our motion-compensation-free approach has yielded a software-only, full-duplex, full-color videoconferencing system that conveys high-quality, CIF/Q-NTSC-sized video at 30 frames per second on 200-MHz Pentium PCs sending less than 300 Kbps in each direction. We also present new spatio-temporal compression techniques for perceptual coding of video. These techniques, motivated by the classical psychological experiments that led to formulation of the Weber-Fechner law, allow videocodec systems to capitalize on properties of the human visual system. Some of our spatiotemporal perceptual techniques not only apply to our proprietary pixelwise conditional differential replenishment schemes that we describe for video conferencing but also can readily be incorporated into today's popular video standards.

A Scalable Video Conferencing System Using Cached Facial Expressions.

Video Conference System for Enhancing Quality of Target Region under Low Bit Rate

Towards Ultra-Low-Bitrate Video Conferencing Using Facial Landmarks

An Improved Method for Scalable Video Coding at Low Bit Rates

Extended application of scalable video coding methods

Resolution-Agnostic Neural Compression for High-Fidelity Portrait Video Conferencing via Implicit Radiance Fields

A software-only videocodec using pixelwise conditional differential replenishment and perceptual enhancements

Low-Complexity 3D-Vision Conferencing System Based on Accelerated RIFE Model

A Hybrid Deep Animation Codec for Low-bitrate Video Conferencing

Hybrid model-and-object-based real-time conversational video coding

A Streaming-Based Approach for Remote Interaction of the Multi-Channel Display System for Group Users

Real-Time expression mapping with ratio image

Collaborative Processing System for Networked Video Applications

A 3D-DCT and Convolutional FEC Approach to Agile Video Streaming

Scalable Multipoint Videoconferencing Scheme without MCU.

Scalable Distributed Standard Definition Video Conferencing System: Architecture and Forwarding Model

Ultra-low bitrate video conferencing using deep image animation

Demo: A Talking-head Semantic Communication System for 6G.

Encoder-Decoder Joint Enhancement for Video Chat

Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras

Robust Ultralow Bitrate Video Conferencing with Second Order Motion Coherency