glm-4.6v-flash

Public

Description

GLM 4.6V Flash is a 9B vision-language model optimized for local deployment and low-latency applications. It supports a context length of 128k tokens and achieves strong performance in visual understanding among models of similar scale.

Stats

280.3K Downloads

71 stars

3 forks

Capabilities

Vision Input

Trained for tool use

ReasoningSupports reasoning

Minimum system memory

8GB

GLM 4.6V by Z.ai

The model introduces native multimodal function calling, enabling vision-driven tool use where images, screenshots, and document pages can be passed directly as tool inputs without text conversion.

Parameters

Custom configuration options included with this model

Repeat Penalty

1.1

Temperature

0.8

Top K Sampling

Top P Sampling

0.6

Sources

The underlying model files this model uses

Based on