The Gemini API "free tier" is offered through the API service with lower rate limits for testing purposes. Google AI Studio usage is completely free in all available countries. The Gemini API "paid tier" comes with higher rate limits, additional features, and different data handling.
Gemini 2.0 Flash
Our most capable multi-modal model with great performance across all tasks, with a 1 million token context window, and built for the era of Agents.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.10 (text / image / video) $0.70 (audio) |
Output price | Free of charge | $0.40 |
Context caching price | Free of charge | $0.025 / 1,000,000 tokens (text/image/video) $0.175 / 1,000,000 tokens (audio) Available February 24, 2025 |
Context caching (storage) | Free of charge, up to 1,000,000 tokens of storage per hour Available February 24, 2025 |
$1.00 / 1,000,000 tokens per hour Available February 24, 2025 |
Tuning price | Not available | Not available |
Grounding with Google Search | Free of charge, up to 500 RPD | 1,500 RPD (free), then $35 / 1,000 requests |
Used to improve our products | Yes | No |
Gemini 2.0 Flash-Lite
Our smallest and most cost effective model, built for at scale usage.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.075 |
Output price | Free of charge | $0.30 |
Context caching price | Free of charge | $0.01875 |
Context caching (storage) | Free of charge, up to 1,000,000 tokens of storage per hour | $1.00 / 1,000,000 tokens per hour |
Tuning price | Not available | Not available |
Grounding with Google Search | Not available | Not available |
Used to improve our products | Yes | No |
Imagen 3
Our state-of-the-art image generation model, available to developers on the paid tier of the Gemini API.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Image price | Not available | $0.03 |
Used to improve our products | Yes | No |
Gemini 1.5 Flash
Our fastest multi-modal model with great performance for diverse, repetitive tasks and a 1 million token context window.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.075, prompts <= 128k tokens $0.15, prompts > 128k tokens |
Output price | Free of charge | $0.30, prompts <= 128k tokens $0.60, prompts > 128k tokens |
Context caching price | Free of charge, up to 1 million tokens of storage per hour | $0.01875, prompts <= 128k tokens $0.0375, prompts > 128k tokens |
Context caching (storage) | Free of charge | $1.00 per hour |
Tuning price | Token prices are the same for tuned models Tuning service is free of charge. |
Token prices are the same for tuned models Tuning service is free of charge. |
Grounding with Google Search | Not available | $35 / 1K grounding requests (for up to 5K requests per day). |
Used to improve our products | Yes | No |
Gemini 1.5 Flash-8B
Our smallest model for lower intelligence use cases, with a 1 million token context window.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $0.0375, prompts <= 128k tokens $0.075, prompts > 128k tokens |
Output price | Free of charge | $0.15, prompts <= 128k tokens $0.30, prompts > 128k tokens |
Context caching price | Free of charge, up to 1 million tokens of storage per hour | $0.01, prompts <= 128k tokens $0.02, prompts > 128k tokens |
Context caching (storage) | Free of charge | $0.25 per hour |
Tuning price | Token prices are the same for tuned models Tuning service is free of charge. |
Token prices are the same for tuned models Tuning service is free of charge. |
Grounding with Google Search | Not available | $35 / 1K grounding requests (for up to 5K requests per day). |
Used to improve our products | Yes | No |
Gemini 1.5 Pro
Our highest intelligence Gemini 1.5 series model, with a breakthrough 2 million token context window.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | $1.25, prompts <= 128k tokens $2.50, prompts > 128k tokens |
Output price | Free of charge | $5.00, prompts <= 128k tokens $10.00, prompts > 128k tokens |
Context caching price | Not available | $0.3125, prompts <= 128k tokens $0.625, prompts > 128k tokens |
Context caching (storage) | Not available | $4.50 per hour |
Tuning price | Not available | Not available |
Grounding with Google Search | Not available | $35 / 1K grounding requests (for up to 5K requests per day). |
Used to improve our products | Yes | No |
Text Embedding 004
Our state-of-the-art text embedding model.
Free Tier | Paid Tier, per 1M tokens in USD | |
---|---|---|
Input price | Free of charge | Not available |
Output price | Free of charge | Not available |
Tuning price | Not available | Not available |
Used to improve our products | Yes | No |
[*] Google AI Studio usage is free of charge in all available regions. See Billing FAQs for details.
[**] Prices may differ from the prices listed here and the prices offered on Vertex AI. For Vertex prices, see the Vertex AI pricing page.
[***] If you are using dynamic retrieval to optimize costs, only requests that contain at least one grounding support URL from the web in their response are charged for Grounding with Google Search. Costs for Gemini always apply. Rate limits are subject to change.