Vidu Q3 is a next-generation AI video generation model featuring native synchronized audio-video output and up to 16-second 1080p video generation in a single pass. It supports frame-accurate camera control, multi-character dialogue, intelligent scene cutting, multilingual output, and high-quality image-to-video and text-to-video generation, making it ideal for anime, cinematic storytelling, short dramas, and commercial content creation.
Generate video
All API requests must be authenticated with a Bearer token in the Authorization header. Make sure your API key is active.
Authorization: Bearer sk-xxxxxx
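As a minimal sketch of the header described above, the helper below builds the request headers in Python. The function name `build_auth_headers` is hypothetical, and the `Content-Type: application/json` entry is an assumption (the text does not state the request body format); only the `Authorization: Bearer ...` form comes from this page.

```python
def build_auth_headers(api_key: str) -> dict:
    """Return HTTP headers carrying the API key as a Bearer token."""
    key = api_key.strip()
    if not key:
        raise ValueError("API key must not be empty")
    return {
        "Authorization": f"Bearer {key}",
        # Assumption: the request body is sent as JSON.
        "Content-Type": "application/json",
    }


# Placeholder key, as shown in the documentation.
headers = build_auth_headers("sk-xxxxxx")
```

The resulting dictionary can be passed to whichever HTTP client you use; the endpoint URL is not given here, so it is left out of the sketch.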
Choose the call mode; only one mode can be used per request. Simple: upload 1-7 reference images plus a prompt (≤2000 characters); this is the easiest option. Subject: name the reference subjects and refer to them with @name in the prompt, which supports multi-subject consistency (prompt ≤5000 characters).
images* (array): [Simple mode] Upload 1-7 reference images; the model generates a consistent video based on the subjects in them. Supported formats: PNG, JPEG, JPG, WebP. Each image must be at least 128×128 pixels, have an aspect ratio between 1:4 and 4:1, and be no larger than 50 MB.
prompt* (string): Text description. Max 2000 characters in simple mode; max 5000 characters in subject mode, where you can use @name to reference subjects for consistency.
duration? (number): Video duration in seconds. viduq3 range: 3-16 seconds. Default: 5.
aspect_ratio? (string): Video aspect ratio. viduq3 supports 16:9, 9:16, and 1:1 (4:3 and 3:4 are q2-only). Default: 16:9.
resolution? (string): Video output resolution. viduq3 supports 540p, 720p, and 1080p. Default: 720p.
audio? (boolean): Enable direct audio-video output, producing video with dialogue and sound effects. Enabled by default for viduq3. Off-peak mode is unavailable when this is off.
audio_type? (string): Audio type; required when audio is true. Default: all.
seed? (number): Random seed. Enter a fixed integer for reproducible results; leave empty or set to 0 for a random seed.
off_peak? (boolean): Off-peak generation mode, which consumes fewer credits; tasks complete within 48 hours. Supported for viduq3 only when audio is enabled. Disabled by default.
watermark? (boolean): Whether to add a watermark (a fixed AI-generated content marker) to the video. Disabled by default.
wm_position? (string): Watermark position: 1 = top-left, 2 = top-right, 3 = bottom-right, 4 = bottom-left. Effective only when watermark is enabled.
wm_url? (string): URL of a custom watermark image. Leave empty to use the default watermark. Effective only when watermark is enabled.
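The limits listed above can be checked on the client before a request is sent. The following is a sketch only: `build_payload` and its `mode` argument are hypothetical names (the field name for the call mode is not given in this text, so it is used for validation only and not sent), image entries are treated as opaque strings, and the structure of subject-mode references is not covered.

```python
ASPECT_RATIOS = {"16:9", "9:16", "1:1"}
RESOLUTIONS = {"540p", "720p", "1080p"}
PROMPT_LIMITS = {"simple": 2000, "subject": 5000}  # max prompt characters per mode
WM_POSITIONS = {"1", "2", "3", "4"}


def build_payload(prompt, images=None, mode="simple", duration=5,
                  aspect_ratio="16:9", resolution="720p", audio=True,
                  audio_type="all", off_peak=False,
                  watermark=False, wm_position=None):
    """Validate arguments against the documented viduq3 limits, then return a dict."""
    if mode not in PROMPT_LIMITS:
        raise ValueError(f"unknown mode: {mode!r}")
    if not prompt or len(prompt) > PROMPT_LIMITS[mode]:
        raise ValueError(f"prompt must be 1-{PROMPT_LIMITS[mode]} characters in {mode} mode")
    images = list(images or [])
    if mode == "simple" and not 1 <= len(images) <= 7:
        raise ValueError("simple mode needs 1-7 reference images")
    if not 3 <= duration <= 16:
        raise ValueError("duration must be between 3 and 16 seconds")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect_ratio: {aspect_ratio!r}")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"unsupported resolution: {resolution!r}")
    if off_peak and not audio:
        raise ValueError("off_peak is only available when audio is enabled")
    if wm_position is not None and str(wm_position) not in WM_POSITIONS:
        raise ValueError("wm_position must be one of 1, 2, 3, 4")

    payload = {
        "prompt": prompt,
        "duration": duration,
        "aspect_ratio": aspect_ratio,
        "resolution": resolution,
        "audio": audio,
        "off_peak": off_peak,
        "watermark": watermark,
    }
    if images:
        payload["images"] = images
    if audio:
        payload["audio_type"] = audio_type  # documented default is "all"
    if watermark and wm_position is not None:
        payload["wm_position"] = str(wm_position)  # documented type is string
    return payload


# Example: a 16-second 1080p request with one reference image.
example = build_payload("A fox runs through snow", images=["fox.png"],
                        duration=16, resolution="1080p")
```

Image-level constraints (format, pixel size, aspect ratio, file size) are not checked here, since they require reading the image files themselves.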
task_id? (string): Task ID.
status? (string): Task status.
error? (object): Error details, containing the following fields:
message? (string): Error message.
type? (string): Error type.
param? (string): Related parameter.
code? (string): Error code.
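A response with these fields might be handled as sketched below. The helper name `summarize_response` and both sample bodies (including the status value `queued` and the error values) are illustrative assumptions, not documented values; the sketch also assumes that message, type, param, and code are nested inside the error object.

```python
import json


def summarize_response(body: str) -> str:
    """Turn a JSON response body into a one-line summary."""
    data = json.loads(body)
    error = data.get("error")
    if error:
        # Every field is optional, so read each one defensively.
        return "error {} ({}): {} [param={}]".format(
            error.get("code"), error.get("type"),
            error.get("message"), error.get("param"),
        )
    return "task {} is {}".format(data.get("task_id"), data.get("status"))


# Illustrative bodies only; real values may differ.
ok_body = '{"task_id": "abc123", "status": "queued"}'
err_body = ('{"error": {"message": "duration out of range", '
            '"type": "invalid_request", "param": "duration", "code": "400"}}')

print(summarize_response(ok_body))
print(summarize_response(err_body))
```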