We introduce PoseTraj, the first open-domain, Pose-Aware video dragging model for reliable 3D-aligned animations from 2D trajectories. Our method incorporates a novel Two-Stage Pose-Aware Pretraining framework, improving 3D understanding across diverse trajectories.
Specifically, we 1. construct a large-scale synthetic dataset containing 10k videos of objects following rotational trajectories and 2. enhance the model perception of object pose changes by generating 3D bounding boxes as intermediate supervision signals. Following this, we fine-tune the trajectory-controlling module on open-domain videos, applying an additional camera disentanglement module to further refine motion accuracy. Experiments on various benchmark scenarios demonstrate that PoseTraj not only excels in 3D Pose-Aligned dragging for rotational scenarios but also outperforms existing baselines in trajectory accuracy and video quality.
@article{ji@2025posetraj,
author = {Longbin, Ji and Zhong, Lei and Pengfei, Wei and Changjian, Li},
title = {PoseTraj: Pose-Aware Trajectory Control in Video Diffusion},
journal = {CVPR},
year = {2025},
}