NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Name: NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
Price: 0.10 USD
Author: NVIDIA

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and multi-turn chat, followed by multiple RL stages; Reward-aware Preference Optimization (RPO) for alignment, RL with Verifiable Rewards (RLVR) for step-wise reasoning, and iterative DPO to refine tool-use behavior. A distillation-driven Neural Architecture Search (“Puzzle”) replaces some attention blocks and varies FFN widths to shrink memory footprint and improve throughput, enabling single-GPU (H100/H200) deployment while preserving instruction following and CoT quality. In internal evaluations (NeMo-Skills, up to 16 runs, temp = 0.6, top_p = 0.95), the model reports strong reasoning/coding results, e.g., MATH500 pass@1 = 97.4, AIME-2024 = 87.5, AIME-2025 = 82.71, GPQA = 71.97, LiveCodeBench (24.10–25.02) = 73.58, and MMLU-Pro (CoT) = 79.53. The model targets practical inference efficiency (high tokens/s, reduced VRAM) with Transformers/vLLM support and explicit “reasoning on/off” modes (chat-first defaults, greedy recommended when disabled). Suitable for building agents, assistants, and long-context retrieval systems where balanced accuracy-to-cost and reliable tool use matter.

Context Window

131K tokens

Parameters

49B

Input Price

$0.10/1M

Output Price

$0.40/1M

Price Tier

budget

Provider

NVIDIA

How to Use NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

With CoreAI, you can start chatting with NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 instantly — no separate subscription needed. CoreAI bundles access to NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 along with 300+ other AI models from NVIDIA and other providers like OpenAI, Anthropic, Google, Meta, and more.

Download the CoreAI app for iOS, Android, or use the Web App
Select NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 from the model selector
Start chatting, comparing, or creating with AI

More NVIDIA Models

NVIDIA

Try NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 Now

Chat with NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 and 300+ other AI models — all in one app.

Download App → Try on Web App

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

How to Use NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

More NVIDIA Models

NVIDIA: Nemotron 3 Super (free)

NVIDIA: Nemotron 3 Super

NVIDIA: Nemotron 3 Nano 30B A3B (free)

NVIDIA: Nemotron 3 Nano 30B A3B

NVIDIA: Nemotron Nano 12B 2 VL (free)

NVIDIA: Nemotron Nano 12B 2 VL

Try NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 Now