Capabilities
Minimum system memory
Tags
Last updated
Updated on March 18byREADME
Parameters
Custom configuration options included with this model
Sources
The underlying model files this model uses
Delivers strong vision-language performance across diverse tasks including document analysis, visual question answering, video understanding, and agentic interactions.
Forked from qwen/qwen3-vl-4b
Based on