AIThe Decoder2h ago

With Nemotron 3 Nano Omni, Nvidia reveals what really goes into a modern multimodal model

With Nemotron 3 Nano Omni, Nvidia reveals what really goes into a modern multimodal model

Nvidia releases Nemotron 3 Nano Omni, an open multimodal model for text, image, video and audio. Not only the performance is exciting, but also a look at the training data: it comes from Qwen, GPT-OSS, Kimi and DeepSeek OCR, among others. The article With Nemotron 3 Nano Omni,…

Read full article

Source: The Decoder · Opens in new tab