Google Gemini Embedding Pricing: gemini-embedding-001, 2-preview & text-embedding-005 (June 2026)

Q: How much does Google Gemini embedding cost?

Google gemini-embedding-001 costs $0.15 per million tokens (GA, stable). The newer gemini-embedding-2-preview costs $0.20 per million tokens during preview. The legacy text-embedding-005 is available through Vertex AI at similar rates. Both Gemini models support MRL dimension reduction, with output sizes of 3072, 1536, or 768 dimensions.

Q: What is the difference between Gemini API and Vertex AI for embeddings?

Gemini API (Google AI Studio) is the simpler consumer-facing API with a free tier and straightforward token pricing. Vertex AI is the enterprise platform with advanced quotas, VPC controls, and enterprise billing. Pricing is similar between both, but Vertex AI adds first-party compute charges and has a 90-day free trial with $300 in credits.

Q: What is the Google Gemini embedding free tier?

Google AI Studio offers free access to embedding models with rate-limited quotas (approximately 1,500 requests per day). Vertex AI provides $300 in free credits valid for 90 days for new accounts. Neither offers a token-based free allowance comparable to Voyage's 200M lifetime tokens.

Google's embedding pricing is spread across Gemini API and Vertex AI docs. This page consolidates the rates, explains Matryoshka dimension options, and clarifies the Gemini API vs Vertex AI billing differences.

Verified June 2026

Current Pricing

Model	$/M tokens	Dims (MRL)	Context	Status
gemini-embedding-2-preview	$0.20	768/1536/3072	8,192 tokens	Preview
gemini-embedding-001	$0.15	768/1536/3072	2,048 tokens	GA (stable)
text-embedding-005	$0.15	up to 768	2,048 tokens	Legacy

The Gemini models are accessed via the Gemini API or Vertex AI; text-embedding-005 is available through Vertex AI. Preview pricing is subject to change at GA launch.
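As a rough illustration of how the per-token rates above turn into a bill, the sketch below multiplies a token count by the listed rate. The `RATES_USD_PER_M` dictionary and the `embedding_cost` helper are names made up for this example, and the corpus size is hypothetical; the rates are copied from the table and should be re-checked against Google's pricing page before use.

```python
# Rates copied from the pricing table above (USD per 1M input tokens).
# Assumption: billing is purely per input token, with no minimums or rounding.
RATES_USD_PER_M = {
    "gemini-embedding-2-preview": 0.20,
    "gemini-embedding-001": 0.15,
    "text-embedding-005": 0.15,
}


def embedding_cost(model: str, tokens: int) -> float:
    """Return the estimated USD cost of embedding `tokens` input tokens."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return RATES_USD_PER_M[model] * tokens / 1_000_000


if __name__ == "__main__":
    # Hypothetical corpus: 2 million chunks averaging 500 tokens each.
    corpus_tokens = 2_000_000 * 500
    for model in RATES_USD_PER_M:
        print(f"{model}: ${embedding_cost(model, corpus_tokens):,.2f}")
```

Embedding is usually a one-time cost per document, so this figure mostly matters for the initial backfill and for any later re-embedding when switching models.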

Matryoshka Dimensions: Storage Cost Impact

Both gemini-embedding-001 and gemini-embedding-2-preview support Matryoshka Representation Learning. You can request 768, 1536, or 3072-dimension vectors. The API token price is identical regardless of dimension count - savings are in downstream storage. For 100 million vectors:

Dimensions	Bytes/vector (float32)	GiB per 100M vecs	Storage ratio
3,072	12,288	~1,144 GiB	4x (full size)
1,536	6,144	~572 GiB	2x
768	3,072	~286 GiB	1x (cheapest)

Using 768 dimensions instead of 3072 reduces raw vector storage by 4x. Quality loss on MTEB Retrieval is typically 2-5 points, which is a good trade-off for high-scale applications where storage costs dominate.
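The storage arithmetic can be reproduced from first principles. The sketch below assumes each dimension is stored as a 4-byte float32 and counts only the raw vectors, ignoring index overhead, metadata, and replication; the helper names are illustrative, not part of any Google SDK.

```python
BYTES_PER_FLOAT32 = 4  # assumption: vectors are stored as float32


def raw_vector_bytes(num_vectors: int, dims: int,
                     bytes_per_value: int = BYTES_PER_FLOAT32) -> int:
    """Raw bytes needed to store `num_vectors` dense vectors of `dims` values."""
    return num_vectors * dims * bytes_per_value


def to_gib(num_bytes: int) -> float:
    """Convert bytes to gibibytes (2**30 bytes)."""
    return num_bytes / 2**30


if __name__ == "__main__":
    num_vectors = 100_000_000
    smallest = raw_vector_bytes(num_vectors, 768)
    for dims in (3072, 1536, 768):
        size = raw_vector_bytes(num_vectors, dims)
        print(f"{dims:>5} dims: {to_gib(size):8.1f} GiB "
              f"({size // smallest}x the 768-dim size)")
```

Quantized storage (for example int8 or binary vectors) would shrink these numbers further, but that is a property of the vector database rather than of the embedding API.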

Vertex AI vs Gemini API: Which to Use

Gemini API (Google AI Studio)

- Free tier: ~1,500 requests/day
- Simple token-based billing
- Quick setup, great for prototypes
- Rate limits lower than Vertex
- Data processed in Google's infra
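To get a feel for what the free-tier quota means for a backfill job, the sketch below estimates how many days a given number of embedding requests would take at roughly 1,500 requests per day. The quota is the approximate figure quoted above, and `days_to_backfill` is a hypothetical helper; real limits may also include per-minute and per-token caps that this ignores.

```python
import math

FREE_TIER_REQUESTS_PER_DAY = 1_500  # approximate figure quoted above


def days_to_backfill(num_requests: int,
                     requests_per_day: int = FREE_TIER_REQUESTS_PER_DAY) -> int:
    """Whole days needed to send `num_requests` under a daily request quota."""
    if num_requests < 0:
        raise ValueError("num_requests must be non-negative")
    if requests_per_day <= 0:
        raise ValueError("requests_per_day must be positive")
    return math.ceil(num_requests / requests_per_day)


if __name__ == "__main__":
    # Hypothetical prototype corpus: 100,000 single-text requests.
    print(days_to_backfill(100_000), "days on the free tier")
```

If the estimate runs to weeks, that is usually the point at which a paid tier or Vertex AI becomes the more practical choice.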

Vertex AI

- $300 free trial credits (90 days)
- Enterprise quotas and SLAs
- VPC controls, data residency
- Google Cloud billing integration
- Best for production GCP workloads

gemini-embedding-2: The Multimodal Upgrade

The gemini-embedding-2-preview model is natively multimodal: it can embed text, images, and video in a shared vector space. At $0.20/M tokens (text), it is Google's premium offering, priced at 10x OpenAI's text-embedding-3-small ($0.02/M) for similar accuracy on English-only retrieval. Because the model is in preview, pricing may change at general availability. The 8,192-token context window (vs 2,048 for gemini-embedding-001) is a significant upgrade for long-document RAG applications.
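The practical effect of the larger context window is fewer chunks per long document. The sketch below counts how many fixed-size windows are needed to cover a document at each context limit; `chunks_needed` and the optional overlap parameter are illustrative assumptions, and real chunkers typically split on sentence or section boundaries rather than raw token counts.

```python
import math

# Context limits taken from the pricing table (tokens per input).
CONTEXT_TOKENS = {
    "gemini-embedding-001": 2_048,
    "gemini-embedding-2-preview": 8_192,
}


def chunks_needed(doc_tokens: int, context_tokens: int,
                  overlap_tokens: int = 0) -> int:
    """Number of fixed-size windows needed to cover a document."""
    if not 0 <= overlap_tokens < context_tokens:
        raise ValueError("overlap must be in [0, context_tokens)")
    if doc_tokens <= 0:
        return 0
    if doc_tokens <= context_tokens:
        return 1
    # Each window after the first advances by (context - overlap) tokens.
    stride = context_tokens - overlap_tokens
    return 1 + math.ceil((doc_tokens - context_tokens) / stride)


if __name__ == "__main__":
    doc_tokens = 20_000  # hypothetical long document
    for model, limit in CONTEXT_TOKENS.items():
        print(f"{model}: {chunks_needed(doc_tokens, limit)} chunks")
```

Fewer chunks means fewer vectors to store and search, which can partly offset the higher per-token price for long-document workloads.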

Frequently Asked Questions

How much does Google Gemini embedding cost?

gemini-embedding-001 costs $0.15 per million tokens (GA, stable). gemini-embedding-2-preview costs $0.20 per million tokens. Both support MRL dimensions of 768/1536/3072 with no additional cost.

What is the difference between Gemini API and Vertex AI for embeddings?

Gemini API is simpler with a free rate-limited tier. Vertex AI is the enterprise platform with VPC controls, data residency, and Google Cloud billing. Pricing is similar but Vertex adds enterprise SLAs and quota flexibility.

Does Google support Matryoshka dimensions for embeddings?

Yes. Both gemini-embedding-001 and gemini-embedding-2-preview support MRL with output dimensions of 3072, 1536, or 768. Using 768 dimensions instead of 3072 reduces storage cost by 4x.
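For vectors that are already stored at full size, MRL-trained embeddings are generally designed so that a leading prefix of the vector remains usable. The sketch below shows that idea: keep the first `dims` values and rescale to unit length so cosine and dot-product scores stay comparable. `truncate_and_normalize` is a hypothetical helper, and whether client-side truncation matches what the API returns for a reduced output size is an assumption to confirm against Google's documentation.

```python
import math
from typing import Sequence


def truncate_and_normalize(vec: Sequence[float], dims: int) -> list[float]:
    """Keep the first `dims` values of `vec` and L2-normalize the result."""
    if not 0 < dims <= len(vec):
        raise ValueError("dims must be between 1 and len(vec)")
    head = list(vec[:dims])
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head  # an all-zero prefix cannot be normalized
    return [x / norm for x in head]


if __name__ == "__main__":
    # Toy 4-dimensional vector standing in for a 3072-dimensional embedding.
    full = [3.0, 4.0, 12.0, 0.5]
    print(truncate_and_normalize(full, 2))
```

Requesting the smaller size directly from the API avoids this step at write time; truncating locally is mainly useful for experimenting with several sizes from one set of stored vectors.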

What is the Google Gemini embedding free tier?

Google AI Studio offers approximately 1,500 free embedding requests per day. Vertex AI provides $300 in free credits valid for 90 days for new GCP accounts.

Disclaimer: Independent resource. Not affiliated with Google. Pricing from Google AI and Vertex AI public pricing pages, verified June 2026. Always verify at ai.google.dev/pricing.