Galaxy.ai Logo

Llama 3.3 Nemotron Super 49B V1.5 Model Specs, Costs & Benchmarks (November 2025)

Llama 3.3 Nemotron Super 49B V1.5, developed by NVIDIA, features a context window of 131.1K tokens. The model costs $0.10 per million tokens for input and $0.40 per million tokens for output. It was released on October 10, 2025, and has achieved impressive scores in various benchmarks.
Access Llama 3.3 Nemotron Super 49B V1.5 & 210+ other AI models all in one platformTry Galaxy.ai for free

Overview

Llama 3.3 Nemotron Super 49B V1.5Llama 3.3 Nemotron Super 49B V1.5
Model Provider
The organization behind this AI's development
NVIDIA logoNVIDIA
Input Context Window
Maximum input tokens this model can process at once
131.1K
tokens
Output Token Limit
Maximum output tokens this model can generate at once
Not specified
tokens
Release Date
When this model first became publicly available
October 10th, 2025

Pricing

Llama 3.3 Nemotron Super 49B V1.5Llama 3.3 Nemotron Super 49B V1.5
Input Token Cost
Cost per million input tokens
$0.10
per million tokens
Output Token Cost
Cost per million output tokens
$0.40
per million tokens

Capabilities & Features

Llama 3.3 Nemotron Super 49B V1.5Llama 3.3 Nemotron Super 49B V1.5
Input Types
Supported input formats
text
Output Types
Supported output formats
text
Tokenizer
Text encoding system
Llama3
Key Features
Advanced capabilities
✓ Function Calling✓ Structured Output✓ Reasoning Mode
Open Source
Model availability

Benchmarks

Llama 3.3 Nemotron Super 49B V1.5Llama 3.3 Nemotron Super 49B V1.5
MMLU
Measures knowledge across 57 subjects like law, math, history, and science
Not available
MMMU
Measures understanding of combined text and images across various domains
Not available
HellaSwag
Measures common sense reasoning by having models complete sentences about everyday situations
Not available

Compare This Model

See how Llama 3.3 Nemotron Super 49B V1.5 compares with other top models