
NVIDIA is best known as the GPU company that powers the AI industry, but its model lineup matters too. The Nemotron family is purpose-built to showcase what NVIDIA hardware can do, and the April 2026 release of Nemotron 3 Nano Omni 30B-A3B-Reasoning landed as one of the strongest open multimodal models of the year. The model processes text, image, video, and audio in a single unified sequence through a hybrid Mamba-Transformer-MoE backbone (30B total parameters, 3B active per token). It has a 256K-token context window and handles audio natively, up to 20 minutes per clip. It tops six public leaderboards covering document intelligence, video understanding, and voice interaction. The weights are available on Hugging Face under an open-weight license in BF16, FP8, and NVFP4 quantizations, including formats suited to consumer GPUs.
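The parameter counts and quantization formats above invite a quick back-of-the-envelope check of what "consumer-GPU formats" means in practice. The sketch below is illustrative arithmetic only, not a figure published by NVIDIA. It assumes nominal widths of 16, 8, and 4 bits per parameter and ignores quantization scale factors, activations, and KV or state caches, so real checkpoints will be somewhat larger. The names `weight_gb` and `footprint_table` are hypothetical helpers introduced for this example.

```python
# Rough weight-memory estimate for a 30B-total / 3B-active MoE model.
# Assumption: nominal bits per parameter, with no overhead for
# quantization scales or layers kept in higher precision.

TOTAL_PARAMS = 30e9   # from the profile: 30B total parameters
ACTIVE_PARAMS = 3e9   # from the profile: 3B active per token

# Nominal storage width per parameter, in bits (assumed, not measured).
BITS_PER_PARAM = {"BF16": 16, "FP8": 8, "NVFP4": 4}


def weight_gb(num_params: float, bits: int) -> float:
    """Return approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits / 8 / 1e9


def footprint_table() -> dict[str, float]:
    """Map each format name to the approximate size of all weights in GB."""
    return {fmt: weight_gb(TOTAL_PARAMS, bits) for fmt, bits in BITS_PER_PARAM.items()}


if __name__ == "__main__":
    for fmt, gb in footprint_table().items():
        print(f"{fmt:>5}: ~{gb:.0f} GB of weights")
    print(f"Active fraction per token: {ACTIVE_PARAMS / TOTAL_PARAMS:.0%}")
```

Under these assumptions the estimate comes to roughly 60 GB in BF16, 30 GB in FP8, and 15 GB in NVFP4. In a typical mixture-of-experts deployment all experts stay resident in memory, so the 3B active parameters mainly reduce compute per token rather than the memory needed to hold the weights.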

Founded: 1993

Headquarters: Santa Clara, CA

CEO: Jensen Huang

Models: 1 active

Key Products

  • Nemotron 3 Nano Omni
  • NIM inference microservices
  • build.nvidia.com
  • NeMo framework
  • Parakeet ASR

Strengths

  • Open weights with consumer GPU support
  • Multimodal input with a 256K-token context window
  • Top document and video benchmarks
  • Native audio as first-class modality
  • Self-hosted deployment focus

NVIDIA Models

Model | Input / 1M | Output / 1M | Context | Capabilities
Nemotron 3 Nano Omni | Free | Free | 256K | text, vision, audio, video, code, reasoning, tool-use

Prices per 1M tokens in USD. See the full pricing guide for detailed analysis.
