R1 Distill Qwen 14B | Gemma 3 4B | |
|---|---|---|
Model Provider The organization behind this AI's development | DeepSeek | Google |
Input Context Window Maximum input tokens this model can process at once | 32.8K tokens | 96K tokens |
Output Token Limit Maximum output tokens this model can generate at once | 16.4K tokens | Not specified tokens |
Release Date When this model first became publicly available | January 29, 2025 9 months ago January 29th, 2025 | March 13, 2025 8 months ago March 13th, 2025 |
R1 Distill Qwen 14B | Gemma 3 4B | |
|---|---|---|
Input Types Supported input formats | 📝Text | 📝Text🖼️Image |
Output Types Supported output formats | 📝Text | 📝Text |
Tokenizer Text encoding system | Qwen | Gemini |
Key Features Advanced capabilities | Function Calling✓Structured Output✓Reasoning ModeContent Moderation | Function Calling✓Structured OutputReasoning ModeContent Moderation |
Open Source Model availability | Available on HuggingFace → | Available on HuggingFace → |
R1 Distill Qwen 14B | Gemma 3 4B | |
|---|---|---|
Input Token Cost Cost per million input tokens | $0.15 per million tokens | $0.02 per million tokens |
Output Token Cost Cost per million outut tokens | $0.15 per million tokens | $0.07 per million tokens |
R1 Distill Qwen 14B | Gemma 3 4B | |
|---|---|---|
MMLU Measures knowledge across 57 subjects like law, math, history, and science | Benchmark not available. | Benchmark not available. |
MMMU Measures understanding of combined text and images across various domains | Benchmark not available. | Benchmark not available. |
HellaSwag Measures common sense reasoning by having models complete sentences about everyday situations | Benchmark not available. | Benchmark not available. |
R1 Distill Qwen 14B by DeepSeek offers advanced reasoning, generates structured data. It can handle standard conversations with its 32.8K token context window. Very affordable at $0.15/M input and $0.15/M output tokens. Released January 29th, 2025.
Gemma 3 4B by Google understands both text and images, generates structured data. It can handle standard conversations with its 96K token context window. Very affordable at $0.02/M input and $0.07/M output tokens. Released March 13th, 2025.