← All Models

nemotron-3-nano-omni

Public

Nemotron Nano V3 Omni is a multi-modal large language model designed to integrate image, and text understanding, enabling various workflows such as Q&A, summarization, and document intelligence

2.2K Downloads

1 star

Capabilities

Vision Input
Reasoning

Minimum system memory

25GB

Tags

30B
nemotron_h_moe

README

Nemotron 3 Nano Omni by NVIDIA

Nemotron Nano V3 Omni is a multi-modal large language model designed to integrate image, and text understanding, enabling various workflows such as Q&A, summarization, and document intelligence. New features include understanding of Graphical User Interface (GUI) and Optical Character Recognition (OCR) capabilities, providing seamless end-to-end processing for multi-modal use-cases.

Features a reasoning toggle to enable or disable intermediate reasoning traces, with improved accuracy on complex queries when reasoning is enabled.

Supports a context length of 256K tokens.

Custom Fields

Special features defined by the model author

Enable Thinking

: boolean

(default=true)

Controls whether the model will think before replying

Truncate Thinking History

: boolean

(default=false)

Controls whether thinking history will be truncated to save context space

Parameters

Custom configuration options included with this model

Temperature
0.6
Top P Sampling
0.95

Sources

The underlying model files this model uses