This is an efficient, lightweight Mixture-of-Experts (MoE) model with 284 billion total parameters, of which 13 billion are activated per inference. It natively supports context windows of up to one million tokens and offers fast, low-latency, cost-effective inference while maintaining well-balanced overall performance. It is designed for high-concurrency, lightweight workloads such as everyday dialogue, content creation, basic RAG applications, and batch text processing.
Create chat completion
All API requests must be authenticated with a Bearer token in the Authorization header. Ensure your API key is active.
Authorization: Bearer sk-xxxxxx
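As a minimal sketch of the authentication rule above, the helper below builds the request headers from an API key. The function name build_auth_headers is hypothetical, and the Content-Type value is an assumption, since this page does not state the body format.

```python
def build_auth_headers(api_key: str) -> dict[str, str]:
    """Return HTTP headers that carry the API key as a Bearer token."""
    key = api_key.strip()
    if not key:
        raise ValueError("API key must not be empty")
    return {
        "Authorization": f"Bearer {key}",
        # Assumption: the request body is JSON; the page does not say so.
        "Content-Type": "application/json",
    }


# Placeholder key copied from the documentation; substitute a real, active key.
headers = build_auth_headers("sk-xxxxxx")
```

Keeping the header construction in one place makes it harder to send a request with a missing or blank key.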
The parameters below come from the selected model's form_schema, so the list can differ from model to model.
system_prompt (string, optional): Global instructions or persona for the model.
prompt (string, required)
temperature (number, optional): Higher values make output more random; lower values make it more deterministic.
top_p (number, optional): Nucleus sampling threshold; an alternative to temperature.
presence_penalty (number, optional): Increases the tendency to talk about new topics.
frequency_penalty (number, optional): Reduces the likelihood of repeating the same text verbatim.
max_tokens (number, optional)
response_format (string, optional)
stream (boolean, optional)
Response fields
id (string, optional)
object (string, optional)
created (integer, optional)
model (string, optional): Model ID used.
choices (array, optional)
usage (object, optional)
  prompt_tokens (integer, optional)
  completion_tokens (integer, optional)
  total_tokens (integer, optional)
  prompt_tokens_details (object, optional)
  completion_tokens_details (object, optional)
system_fingerprint (string, optional)
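The request parameters and the usage fields listed above can be sketched as follows. This is an illustration under stated assumptions, not the service's official client: the helper names build_payload and read_usage are hypothetical, the body is assumed to be JSON, and the sample response values are made up for demonstration. How the model itself is selected is not shown on this page, so the sketch leaves it out.

```python
import json

# Optional request parameters, taken from the parameter list above.
OPTIONAL_PARAMS = {
    "system_prompt", "temperature", "top_p", "presence_penalty",
    "frequency_penalty", "max_tokens", "response_format", "stream",
}


def build_payload(prompt: str, **options: object) -> str:
    """Serialize the required prompt plus any documented optional parameters."""
    if not prompt:
        raise ValueError("prompt is required")
    unknown = set(options) - OPTIONAL_PARAMS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    body = {"prompt": prompt}
    # Skip options left as None so that server-side defaults apply.
    body.update({k: v for k, v in options.items() if v is not None})
    return json.dumps(body)  # assumption: the endpoint accepts a JSON body


def read_usage(response: dict) -> dict:
    """Extract the three token counters; a missing field becomes None."""
    usage = response.get("usage") or {}
    keys = ("prompt_tokens", "completion_tokens", "total_tokens")
    return {k: usage.get(k) for k in keys}


payload = build_payload("Summarize this paragraph.", temperature=0.2, max_tokens=256)

# Illustrative response with invented values, not real API output.
sample = {"usage": {"prompt_tokens": 12, "completion_tokens": 30, "total_tokens": 42}}
counts = read_usage(sample)
```

Because every response field is marked optional, read_usage tolerates a missing usage object instead of raising.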
error (object, optional)
  message (string, optional): Error message.
  type (string, optional): Error type.
  param (string, optional): Related parameter.
  code (string, optional): Error code.
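One way to handle the error object described above is to turn it into an exception. The sketch below assumes the response has already been parsed into a dictionary. The names ApiError and raise_for_error are hypothetical, and the error values in the usage lines are invented for illustration.

```python
class ApiError(Exception):
    """Carries the four documented fields of the error object."""

    def __init__(self, message, type_=None, param=None, code=None):
        super().__init__(message or "unknown error")
        self.type = type_
        self.param = param
        self.code = code


def raise_for_error(response: dict) -> dict:
    """Return the response unchanged, or raise ApiError if it holds an error object."""
    error = response.get("error")
    if error:
        raise ApiError(
            error.get("message"),
            type_=error.get("type"),
            param=error.get("param"),
            code=error.get("code"),
        )
    return response


# Hypothetical error payload; real messages and codes depend on the service.
try:
    raise_for_error({"error": {"message": "example failure", "code": "example_code"}})
except ApiError as exc:
    failure = (str(exc), exc.code)
```

Raising early keeps downstream code from reading choices or usage out of a response that only contains an error.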