CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback
¹HKUST(GZ) ²HKUST ³Kling Team, Kuaishou Technology
* Equal Contribution ‡ Corresponding Author
† This work was conducted during the author’s internship at Kling.
Abstract
We propose CamPilot, a framework for precise camera control in video generation.
Using a camera-aware 3D decoder based on 3D Gaussian Splatting (3DGS), CamPilot efficiently evaluates the geometric consistency
of generated videos and provides robust reward signals for feedback learning. This approach avoids the computational bottlenecks
of traditional methods and enables strict adherence to camera trajectories.
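To make the idea of a geometric-consistency reward concrete, here is a minimal sketch. It assumes the reward compares each generated frame with the frame rendered from the 3D decoder's output under the same camera pose, and scores the agreement with PSNR. That choice of metric, the function names `psnr` and `consistency_reward`, and the `cap_db` parameter are all illustrative assumptions, not the reward definition used by CamPilot.

```python
import math


def psnr(frame_a, frame_b, max_val=1.0):
    """PSNR (dB) between two frames given as equal-length flat lists
    of pixel values in [0, max_val]."""
    if len(frame_a) != len(frame_b) or not frame_a:
        raise ValueError("frames must be non-empty and the same size")
    mse = sum((a - b) ** 2 for a, b in zip(frame_a, frame_b)) / len(frame_a)
    if mse == 0.0:
        return float("inf")  # identical frames
    return 10.0 * math.log10(max_val ** 2 / mse)


def consistency_reward(generated, rendered, cap_db=50.0):
    """Hypothetical reward: mean per-frame PSNR between generated frames
    and frames rendered along the same camera trajectory, capped at
    cap_db and scaled to [0, 1]. An assumption for illustration only."""
    if len(generated) != len(rendered) or not generated:
        raise ValueError("videos must be non-empty and the same length")
    scores = [min(psnr(g, r), cap_db) for g, r in zip(generated, rendered)]
    return sum(scores) / (len(scores) * cap_db)


# Toy two-frame "videos" with two pixels per frame.
generated = [[0.0, 0.5], [1.0, 0.25]]
rendered_same = [[0.0, 0.5], [1.0, 0.25]]
rendered_off = [[0.1, 0.6], [0.9, 0.35]]

reward_same = consistency_reward(generated, rendered_same)
reward_off = consistency_reward(generated, rendered_off)
```

Under this sketch, a video whose frames match their renderings scores higher than one that drifts from them, which is the kind of signal a feedback-learning stage could optimize.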
Results on RealEstate10K
From Left to Right: [ GT Video, Camera Trajectory, Generated Video, Rendered Video ]
Out-of-Distribution (OOD) Results
From Left to Right: [ Input Image, Camera Trajectory, Generated Video, Rendered Video ]
Ablation Studies
From Left to Right: [ Input Image, Generated Video, Rendered Video ]
Results with Different Scales
From Left to Right: [ Input Image, Generated Video, Rendered Video ]