Description
Nemotron Nano V3 Omni is a multi-modal large language model designed to integrate image, and text understanding, enabling various workflows such as Q&A, summarization, and document intelligence
Stats
2.2K Downloads
1 star
Capabilities
Minimum system memory
Tags
Last updated
Updated 3 hours agobyREADME
Nemotron Nano V3 Omni is a multi-modal large language model designed to integrate image, and text understanding, enabling various workflows such as Q&A, summarization, and document intelligence. New features include understanding of Graphical User Interface (GUI) and Optical Character Recognition (OCR) capabilities, providing seamless end-to-end processing for multi-modal use-cases.
Features a reasoning toggle to enable or disable intermediate reasoning traces, with improved accuracy on complex queries when reasoning is enabled.
Supports a context length of 256K tokens.
Custom Fields
Special features defined by the model author
Enable Thinking
: boolean
(default=true)
Controls whether the model will think before replying
Truncate Thinking History
: boolean
(default=false)
Controls whether thinking history will be truncated to save context space
Parameters
Custom configuration options included with this model
Sources
The underlying model files this model uses