gemini-2.5-flash is a highly efficient, lightweight multimodal model designed for speed and cost-effectiveness. It delivers rapid, high-quality responses across text, code, images, and audio, making it the perfect choice for high-frequency, everyday tasks and scalable applications. http://vwdv30vppp.feishu.cn/docx/FC8ZdnCiRo7WKRx6fMvcl0rGnlg
Generate image
All API requests must be authenticated using a Bearer token in the Authorization header. Please ensure your API key is active.Authorization: Bearer sk-xxxxxx
These parameters come from the selected model form_schema. Switching models updates this list and the request example.
system_prompt?stringGlobal instructions or persona for the model.
prompt*stringimage_urls?arraySupports multi-image understanding (optional).
image_detail?stringtemperature?numberHigher values make output more random, lower more deterministic.
top_p?numberNucleus sampling threshold; an alternative to temperature.
presence_penalty?numberIncreases the tendency to talk about new topics.
frequency_penalty?numberReduces the likelihood of repeating the same text verbatim.
max_tokens?numberresponse_format?stringstream?booleancreated?integerParameter description for Created
data?arrayParameter description for Data