Sync Mode
sync_mode controls what happens when your video and audio inputs are different lengths. It applies to video + audio generations. For image inputs, sync_mode is ignored because images have no intrinsic duration.
Set sync_mode in the options object when creating a generation.
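A minimal sketch of such a request body follows. Only the `options.sync_mode` field is taken from this page. The `model` and `input` fields, their shapes, and the helper name `build_generation_payload` are assumptions for illustration, so check the API reference for the real request schema.

```python
import json

# The five mode names documented on this page.
VALID_SYNC_MODES = {"cut_off", "bounce", "loop", "silence", "remap"}


def build_generation_payload(video_url: str, audio_url: str, sync_mode: str) -> dict:
    """Build a request body with sync_mode set inside the options object.

    Everything except options.sync_mode is a hypothetical placeholder.
    """
    if sync_mode not in VALID_SYNC_MODES:
        raise ValueError(f"unknown sync_mode: {sync_mode!r}")
    return {
        "model": "example-model",  # assumed field, placeholder value
        "input": [  # assumed shape for the video and audio inputs
            {"type": "video", "url": video_url},
            {"type": "audio", "url": audio_url},
        ],
        "options": {"sync_mode": sync_mode},
    }


payload = build_generation_payload(
    "https://example.com/clip.mp4",
    "https://example.com/voice.wav",
    "loop",
)
print(json.dumps(payload, indent=2))
```

Validating the mode name before sending keeps a typo from reaching the API at all.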
Quick recommendation
For the most predictable result, bring your video and audio durations close to each other before submitting. Large duration mismatches can make the effects of bounce, loop, or remap visually obvious.
Behavior matrix
This matrix shows the effective behavior when the durations do not match.
Example output durations
For a 4.00s video with 8.64s audio:
For a 10.00s video with 1.74s audio:
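The mode descriptions in the Mode details section imply the output length for most of these combinations. The sketch below works them out for the two examples. `expected_output_duration` is a hypothetical helper, not part of any API. It returns None for combinations this page does not describe, such as bounce when the video is longer than the audio.

```python
from typing import Optional

MODES = ("cut_off", "bounce", "loop", "silence", "remap")


def expected_output_duration(mode: str, video_s: float, audio_s: float) -> Optional[float]:
    """Output length in seconds, or None where this page does not state the behavior."""
    if mode not in MODES:
        raise ValueError(f"unknown sync_mode: {mode!r}")
    if video_s == audio_s:
        return video_s  # no mismatch, nothing to resolve
    video_is_shorter = video_s < audio_s
    if mode == "cut_off":
        return min(video_s, audio_s)  # the shorter input wins
    if mode == "remap":
        return audio_s  # video is retimed to the audio length
    if mode in ("bounce", "loop"):
        # Described only for video shorter than audio: repeat until the audio ends.
        return audio_s if video_is_shorter else None
    # silence: described only for audio shorter than video, which keeps the full video.
    return None if video_is_shorter else video_s


for video_s, audio_s in [(4.00, 8.64), (10.00, 1.74)]:
    for mode in MODES:
        out = expected_output_duration(mode, video_s, audio_s)
        label = "not described here" if out is None else f"{out:.2f}s"
        print(f"{mode:8s} video={video_s:.2f}s audio={audio_s:.2f}s -> {label}")
```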
Mode details
cut_off
cut_off uses the shorter input length. If audio is longer than video, the extra audio is removed. If video is longer than audio, the video is trimmed.
Choose this when you want a simple, predictable output and do not need to preserve every second of both inputs.
bounce
bounce is useful when the video is shorter than the audio. The video plays forward, then reverses direction, and repeats until the audio ends.
Choose this for short looping clips where a forward/backward motion looks acceptable.
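The forward-then-reverse playback described above can be pictured as a sequence of source frame indices. This is an illustrative sketch only: `bounce_frame_indices` is a hypothetical name, and whether the real implementation repeats the first and last frame at each turnaround is not stated on this page (this version does not repeat them).

```python
def bounce_frame_indices(num_frames: int, frames_needed: int) -> list[int]:
    """Return source frame indices for a ping-pong playback of a short clip."""
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    if num_frames == 1:
        return [0] * frames_needed  # a single frame can only be held
    # One full cycle: forward 0..n-1, then backward n-2..1.
    # Endpoints are not duplicated (an assumption).
    cycle = list(range(num_frames)) + list(range(num_frames - 2, 0, -1))
    return [cycle[i % len(cycle)] for i in range(frames_needed)]


# A 4-frame clip stretched to cover 10 output frames.
print(bounce_frame_indices(4, 10))
```

Because the clip reverses instead of jumping back to frame 0, there is no hard cut at the turnaround, which is why this mode suits motion that looks acceptable played backward.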
loop
loop restarts the video from the beginning each time it reaches the end, repeating until the audio ends.
Choose this when your source video is designed to loop cleanly.
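For comparison with bounce, a looped clip maps output frames back to source frames with a simple wrap-around. This is an illustrative sketch with a hypothetical helper name, not the service's implementation.

```python
def loop_frame_indices(num_frames: int, frames_needed: int) -> list[int]:
    """Return source frame indices for a clip that restarts from frame 0 when it ends."""
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    return [i % num_frames for i in range(frames_needed)]


# A 4-frame clip stretched to cover 10 output frames.
print(loop_frame_indices(4, 10))
```

Each restart places the last source frame directly before the first one, so any difference between those two frames shows up as a visible jump. That is why a clip designed to loop cleanly works best here.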
silence
silence preserves the full video when the audio is shorter by padding the remaining duration with silence. During the silent portion, there is no spoken audio to drive new lip movement.
Choose this when the full visual clip matters more than ending exactly when speech ends.
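Padding with silence amounts to appending zero-valued samples until the audio is as long as the video. The sketch below shows the idea on a plain list of samples. `pad_with_silence` is a hypothetical helper, and the toy sample rate is chosen only to keep the numbers small.

```python
def pad_with_silence(samples: list[float], sample_rate: int, target_s: float) -> list[float]:
    """Append zero samples so the audio lasts at least target_s seconds."""
    target_len = round(target_s * sample_rate)
    missing = max(0, target_len - len(samples))  # never trims longer audio
    return samples + [0.0] * missing


# Toy example: 1.74s of audio at 100 samples per second, padded to a 10.00s video.
speech = [0.5] * 174
padded = pad_with_silence(speech, sample_rate=100, target_s=10.0)
print(len(speech), len(padded))
```

Everything after the original samples is zero, which matches the note above that the silent portion carries no speech to drive lip movement.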
remap
remap changes video playback speed so the video duration matches the audio duration.
Choose this when you need to preserve all video and audio content. Avoid it for extreme mismatches, because large speed changes can look unnatural.
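To judge whether a mismatch is too large for remap, it helps to compute the implied playback speed. The sketch below assumes a single uniform speed change across the whole clip, which this page does not confirm. `remap_speed` is a hypothetical helper.

```python
def remap_speed(video_s: float, audio_s: float) -> float:
    """Playback speed factor that makes the video last exactly audio_s seconds.

    Values below 1.0 mean slow motion, values above 1.0 mean fast playback.
    """
    if video_s <= 0 or audio_s <= 0:
        raise ValueError("durations must be positive")
    return video_s / audio_s


for video_s, audio_s in [(4.00, 8.64), (10.00, 1.74)]:
    speed = remap_speed(video_s, audio_s)
    print(f"video={video_s:.2f}s audio={audio_s:.2f}s -> {speed:.2f}x speed")
```

For the two examples used on this page, the 4.00s video would play at roughly 0.46x and the 10.00s video at roughly 5.75x. Both are large enough changes to be noticeable.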
Studio defaults
When Studio detects a duration mismatch, it only shows modes that are valid for that direction:
Segments
Segment-level sync_mode works the same way, but the mismatch is evaluated per segment. If a segment window and its audio are different lengths, the mode determines whether that segment is trimmed, padded, looped, bounced, or time-remapped.
See the Segments guide for segment request examples.

