The Qwen3 series VL models effectively integrates thinking and non-thinking modes, achieving world-leading performance in visual agent capabilities on public benchmark datasets such as OS World. This version features comprehensive upgrades in areas like visual coding, spatial perception, and multimodal reasoning, significantly enhancing visual perception and recognition abilities, and supporting the understanding of ultra-long videos. This version is functionally equivalent to snapshot qwen3-vl-plus-2025-12-19. https://bailian.console.alibabacloud.com/cn-beijing?tab=model#/model-market/detail/qwen3-vl-plus?serviceSite=asia-pacific-china
Generate image
All API requests must be authenticated using a Bearer token in the Authorization header. Please ensure your API key is active.Authorization: Bearer sk-xxxxxx
These parameters come from the selected model form_schema. Switching models updates this list and the request example.
system_prompt?stringGlobal instructions or persona for the model.
prompt*stringimage_urls?arraySupports multi-image understanding (optional).
image_detail?stringreasoning_effort?stringControls reasoning effort; `none` disables thinking.
temperature?numberHigher values make output more random, lower more deterministic.
top_p?numberNucleus sampling threshold; an alternative to temperature.
presence_penalty?numberIncreases the tendency to talk about new topics.
max_tokens?numberresponse_format?stringstream?booleancreated?integerParameter description for Created
data?arrayParameter description for Data