DeepSeek R1 0528 Qwen3 8B | |
|---|---|
Model Provider The organization behind this AI's development | DeepSeek |
Input Context Window Maximum input tokens this model can process at once | 32.8K tokens |
Output Token Limit Maximum output tokens this model can generate at once | 32.8K tokens |
Release Date When this model first became publicly available | May 29, 2025 5 months ago May 29th, 2025 |
DeepSeek R1 0528 Qwen3 8B | |
|---|---|
Input Token Cost Cost per million input tokens | $0.03 per million tokens |
Output Token Cost Cost per million output tokens | $0.11 per million tokens |
DeepSeek R1 0528 Qwen3 8B | |
|---|---|
Input Types Supported input formats | text |
Output Types Supported output formats | text |
Tokenizer Text encoding system | Qwen |
Key Features Advanced capabilities | ✓ Structured Output✓ Reasoning Mode |
Open Source Model availability |
DeepSeek R1 0528 Qwen3 8B | |
|---|---|
MMLU Measures knowledge across 57 subjects like law, math, history, and science | Not available |
MMMU Measures understanding of combined text and images across various domains | Not available |
HellaSwag Measures common sense reasoning by having models complete sentences about everyday situations | Not available |
See how DeepSeek R1 0528 Qwen3 8B compares with other top models
Best models for coding and development
Models optimized for creative writing
Content creation and marketing tasks
Technical analysis and explanations
Scientific research and analysis
Multilingual translation tasks