NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

Name: NVIDIA: Llama 3.1 Nemotron Ultra 253B v1
Price: 0.60 USD
Author: NVIDIA

Llama-3.1-Nemotron-Ultra-253B-v1 is a large language model (LLM) optimized for advanced reasoning, human-interactive chat, retrieval-augmented generation (RAG), and tool-calling tasks. Derived from Meta’s Llama-3.1-405B-Instruct, it has been significantly customized using Neural Architecture Search (NAS), resulting in enhanced efficiency, reduced memory usage, and improved inference latency. The model supports a context length of up to 128K tokens and can operate efficiently on an 8x NVIDIA H100 node. Note: you must include `detailed thinking on` in the system prompt to enable reasoning. Please see [Usage Recommendations](https://huggingface.co/nvidia/Llama-3_1-Nemotron-Ultra-253B-v1#quick-start-and-usage-recommendations) for more.

Context Window

131K tokens

Parameters

253B

Input Price

$0.60/1M

Output Price

$1.80/1M

Price Tier

standard

Provider

NVIDIA

How to Use NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

With CoreAI, you can start chatting with NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 instantly — no separate subscription needed. CoreAI bundles access to NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 along with 300+ other AI models from NVIDIA and other providers like OpenAI, Anthropic, Google, Meta, and more.

Download the CoreAI app for iOS, Android, or use the Web App
Select NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 from the model selector
Start chatting, comparing, or creating with AI

More NVIDIA Models

NVIDIA

Try NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 Now

Chat with NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 and 300+ other AI models — all in one app.

Download App → Try on Web App

NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

How to Use NVIDIA: Llama 3.1 Nemotron Ultra 253B v1

More NVIDIA Models

NVIDIA: Nemotron 3 Super (free)

NVIDIA: Nemotron 3 Super

NVIDIA: Nemotron 3 Nano 30B A3B (free)

NVIDIA: Nemotron 3 Nano 30B A3B

NVIDIA: Nemotron Nano 12B 2 VL (free)

NVIDIA: Nemotron Nano 12B 2 VL

Try NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 Now