← All modelsKey Specifications
Context Window131K
Max Output16K
TokenizerOther
Input Modalitiesimage, text
Output Modalitiestext
Pricing
Input (per M tokens)$0.4200per million tokens
Output (per M tokens)$1.2500per million tokens
Capabilities
✗Function calling
✗Structured outputs
✓Unmoderated
About
ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baiduâs ERNIE 4.5 series, featuring 424B total parameters with 47B active per token. It is trained jointly on text and image data...
Supported parameters
frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningrepetition_penaltyseedstoptemperaturetop_ktop_p