41 Downloads
1 star
Capabilities
Minimum system memory
Tags
Last updated
Updated on April 18byREADME
Custom Fields
Special features defined by the model author
Enable Thinking
: boolean
(default=true)
Controls whether the model will think before replying
Preserve Thinking
: boolean
(default=false)
Preserve reasoning content in all prior assistant turns instead of only the most recent one
Parameters
Custom configuration options included with this model
Sources
The underlying model files this model uses
Based on
Source: https://huggingface.co/mlx-community/Qwen3.6-35B-A3B-4bit
This model was converted to MLX format from Qwen/Qwen3.6-35B-A3B
using mlx-vlm version 0.4.4.
Refer to the original model card for more details on the model.
pip install -U mlx-vlm
python -m mlx_vlm.generate --model unn/Qwen3.6-35B-A3B-4bit --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>