← Back to all comparisons
Side-by-side
Gemini 2.5 Pro vs Llama 3.3 70B (Groq)
Google Gemini vs Groq — head-to-head specs and pricing.
Google Gemini
Gemini 2.5 Pro
Google's top multimodal model.
Gemini 2.5 Pro — strong reasoning, 2M context, native multimodal (text, image, video, audio).
Groq
Llama 3.3 70B (Groq)
Llama at Groq speed.
Meta's Llama 3.3 70B served on Groq's LPUs — hundreds of tokens/sec.
| Spec | Gemini 2.5 Pro | Llama 3.3 70B (Groq) |
|---|---|---|
| Provider | Google Gemini | Groq |
| Input cost / 1M tokens | $1.50 | $0.71 |
| Output cost / 1M tokens | $6.00 | $0.95 |
| Context window | 2,000,000 tokens | 128,000 tokens |
| Max output tokens | 8,000 | — |
| Streaming support | ✓ | ✓ |
| Tool calling | ✓ | ✓ |
| Vision input | ✓ | — |
| JSON mode | — | — |
| Status | ACTIVE | ACTIVE |
Use both via OneAPIKey — one key, one bill
OneAPIKey aggregates Google Gemini, Groq, and 10 more providers behind a single API. Smart Routing automatically picks the best model per request — or you choose explicitly.