Capabilities
Minimum system memory
Tags
Last updated
Updated on December 1byREADME
Hermes 4 70B is a hybrid-mode reasoning model based on Llama-3.1-70B by Nous Research. Compared to Hermes 3, this model delivers enhanced mathematical and scientific reasoning, superior instruction following, and precise schema-adherent outputs with nuanced roleplay and creative writing capabilities.
The model supports a context length of 131k tokens.
<think>β¦</think> segments when the model decides to deliberate, and options to make your responses faster when you want.Custom Fields
Special features defined by the model author
Enable Thinking
: boolean
(default=false)
Controls whether the model will think before replying
Keep CoT
: boolean
(default=false)
Include Chain of Thought in subsequent requests
Parameters
Custom configuration options included with this model
Sources
The underlying model files this model uses
Based on
GGUF
MLX
MLX
MLX
MLX
Forked from nousresearch/hermes-4-70b