Task Definition
Reference-guided video editing takes a source video, a text instruction, and an edited reference image (the first frame modified according to the instruction) as input. The goal is to propagate the visual edits from the reference throughout the video while preserving original motion and unedited content.