Abstract:We present the content deformation field CoDeF as a new type of video representation, which consists of a canonical content field aggregating the static contents in the entire video and a temporal deformation field recording the transformations from the canonical image (i.e., rendered from the canonical content field) to each individual frame along the time <a class="link-external link-http" href="http://axis.Given" rel="external noopener nofollow">this http URL</a> a target video, these two fields are jointly optimized to reconstruct it through a carefully tailored rendering <a class="link-external link-http" href="http://pipeline.We" rel="external noopener nofollow">this http URL</a> advisedly introduce some regularizations into the optimization process, urging the canonical content field to inherit semantics (e.g., the object shape) from the <a class="link-external link-http" href="http://video.With" rel="external noopener nofollow">this http URL</a> such a design, CoDeF naturally supports lifting image algorithms for video processing, in the sense that one can apply an image algorithm to the canonical image and effortlessly propagate the outcomes to the entire video with the aid of the temporal deformation <a class="link-external link-http" href="http://field.We" rel="external noopener nofollow">this http URL</a> experimentally show that CoDeF is able to lift image-to-image translation to video-to-video translation and lift keypoint detection to keypoint tracking without any <a class="link-external link-http" href="http://training.More" rel="external noopener nofollow">this http URL</a> importantly, thanks to our lifting strategy that deploys the algorithms on only one image, we achieve superior cross-frame consistency in processed videos compared to existing video-to-video translation approaches, and even manage to track non-rigid objects like water and <a class="link-external link-http" href="http://smog.Project" rel="external noopener nofollow">this http URL</a> page can be found at <a class="link-external link-https" href="https://qiuyu96.github.io/CoDeF/" rel="external noopener nofollow">this https URL</a>.

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Spatio-Temporal Deformable Convolution for Compressed Video Quality Enhancement

GenDeF: Learning Generative Deformation Field for Video Generation

DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency

Coarse-to-Fine Spatio-Temporal Information Fusion for Compressed Video Quality Enhancement

A New Framework Based on Spatio-Temporal Information for Enhancing Compressed Video

Visual preserving video retargeting with deformable shape consistency

STDF: Spatio-Temporal Deformable Fusion for Video Quality Enhancement on Embedded Platforms

DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment

Video Deflickering Using Multi - Frame Optimization

MoDA: Modeling Deformable 3D Objects from Casual Videos

A Multiscale Gradient-Backpropagation Optimization Framework for Deformable Convolution Based Compressed Video Enhancement

DeformStream: Deformation-based Adaptive Volumetric Video Streaming

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Deformable Shape Preserving Video Retargeting With Salient Curve Matching

Contour Counts: Restricting Deformation for Accurate Animation Interpolation

FusionDeformer: text-guided mesh deformation using diffusion models

CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation

DeCoF: Generated Video Detection via Frame Consistency: The First Benchmark Dataset

DeFLOCNet: Deep Image Editing via Flexible Low-level Controls

Enlarged Motion-Aware and Frequency-Aware Network for Compressed Video Artifact Reduction