glm-4.6v-flash

Public

GLM 4.6V Flash is a 9B vision-language model optimized for local deployment and low-latency applications. It supports a context length of 128k tokens and achieves strong performance in visual understanding among models of similar scale.

1K Downloads

1 star

Capabilities

Vision Input
Reasoning

Minimum system memory

8GB

Tags

9B
glm4v

README

GLM 4.6V by Z.ai

GLM 4.6V Flash is a 9B vision-language model optimized for local deployment and low-latency applications. It supports a context length of 128k tokens and achieves strong performance in visual understanding among models of similar scale.

The model introduces native multimodal function calling, enabling vision-driven tool use where images, screenshots, and document pages can be passed directly as tool inputs without text conversion.

Parameters

Custom configuration options included with this model

Repeat Penalty
1.1
Temperature
0.8
Top K Sampling
2
Top P Sampling
0.6

Sources

The underlying model files this model uses