nemotron-3-nano-omni

Public

Description

Nemotron Nano V3 Omni is a multi-modal large language model designed to integrate image, and text understanding, enabling various workflows such as Q&A, summarization, and document intelligence

Stats

314K Downloads

49 stars

Capabilities

Vision Input

Trained for tool use

ReasoningSupports reasoning

Minimum system memory

25GB

Nemotron 3 Nano Omni by NVIDIA

Nemotron Nano V3 Omni is a multi-modal large language model designed to integrate image, and text understanding, enabling various workflows such as Q&A, summarization, and document intelligence. New features include understanding of Graphical User Interface (GUI) and Optical Character Recognition (OCR) capabilities, providing seamless end-to-end processing for multi-modal use-cases.

Features a reasoning toggle to enable or disable intermediate reasoning traces, with improved accuracy on complex queries when reasoning is enabled.

Supports a context length of 256K tokens.

Custom Fields

Special features defined by the model author

Enable Thinking

: boolean

(default=true)

Controls whether the model will think before replying

Truncate Thinking History

: boolean

(default=false)

Controls whether thinking history will be truncated to save context space

Parameters

Custom configuration options included with this model

Temperature

0.6

Top P Sampling

0.95

Sources

The underlying model files this model uses

Based on

🤗lmstudio-community/nemotron-3-nano-omni-30b-a3b-reasoning-gguf→

GGUF