Independent resource. Not affiliated with any provider. Always verify pricing on provider sites.
$embeddingcost

Embedding Model Comparison: OpenAI vs Cohere vs Voyage vs Google vs Self-Hosted (May 2026)

Side-by-side comparison of every major embedding model. Price, dimensions, MTEB score, context window, batch discount, free tier, and best-fit use case.

Verified May 2026

Full Model Comparison Table

ModelProvider$/M std$/M batchDimsContextMTEBFree tierBest for
text-embedding-3-smallOpenAI$0.020$0.0101,5368,19162.3-General-purpose RAG, low cost
text-embedding-3-largeOpenAI$0.130$0.0653,0728,19164.6-High-accuracy retrieval
text-embedding-ada-002OpenAI$0.100No batch1,5368,19160.5-Legacy - migrate to 3-small
embed-v4Cohere$0.100No batch1,02451255100 calls/minMultilingual content (100+ languages)
voyage-3.5Voyage AI$0.060$0.0401,02432,00067.1200M tokensBest price-to-accuracy ratio
voyage-3-largeVoyage AI$0.180$0.1202,04832,00068.9200M tokensMaximum accuracy
voyage-3-liteVoyage AI$0.020$0.01351232,00061.7200M tokensHigh-volume, cost-sensitive
gemini-embedding-2-previewGoogle$0.200No batch3,0728,19268-Multimodal, Google ecosystem
gemini-embedding-001Google$0.150No batch3,0722,04865.4-Stable GA, Google ecosystem
Titan Text Embeddings V2Amazon Bedrock$0.200No batch1,0248,19262.8-AWS-native apps, compliance
BGE-M3 (self-hosted)Self-Hosted$0.001$0.0011,0248,19266.5-High-volume, privacy-sensitive

Green = best in category, amber = watch out, purple = top MTEB. MTEB Retrieval average where publicly available. Context window in tokens. Verified May 2026.

By Scenario: Which Model Should You Pick?

Small RAG bot, under 100k documents, budget-first
OpenAI text-embedding-3-small + pgvector

Under $5/month total. One-time embedding cost for 50M tokens is $1.00. pgvector is near-zero marginal cost on existing Postgres.

Provider details
Best accuracy for production RAG
Voyage voyage-3.5

MTEB 67.1 vs OpenAI small at 62.3. At $0.06/M standard or $0.04/M batch, the accuracy premium is affordable for most production apps.

Provider details
Multilingual content, 100+ languages
Cohere embed-v4

15-20% quality improvement for non-Latin scripts over OpenAI. The 512-token context limit requires tighter chunking.

Provider details
Maximum retrieval accuracy, cost secondary
Voyage voyage-3-large

MTEB 68.9 - highest of any commercial API. At $0.18/M, index with large and query with voyage-3-lite ($0.02/M) using the same-vector-space feature.

Provider details
AWS-native application, compliance required
Amazon Titan V2 via Bedrock

Keeps data within AWS VPC. Binary embedding option for 4x storage reduction. Integrates with OpenSearch Serverless and SageMaker.

Provider details
Over 15 million tokens per month
Self-hosted BGE-M3

At A100 spot rates, $0.001/M tokens vs $0.02/M for OpenAI small. Breaks even at roughly 15M tokens/month. Requires DevOps investment.

Provider details
Code search or documentation RAG
Voyage voyage-code-3

Purpose-trained on code repositories. Meaningfully better than general-purpose models for code retrieval at the same price as voyage-3.5 ($0.06/M).

Provider details
Google ecosystem, GCP billing
Google gemini-embedding-001

Stable GA model, $0.15/M, strong MTEB at 65.4. MRL support for dimension reduction. Integrates naturally with Vertex AI, BigQuery, and Cloud Storage.

Provider details

Head-to-Head Comparisons

OpenAI small vs Cohere embed-v4

text-embedding-3-small
Price: $0.02/M
Dims: 1536
MTEB: 62.3
Context: 8,191
Free: Trial only
Cheaper; better for English
embed-v4
Price: $0.10/M
Dims: 1024
MTEB: 55.0
Context: 512
Free: 100 calls/min
Better multilingual

OpenAI large vs Voyage large

text-embedding-3-large
Price: $0.13/M
Dims: 3072
MTEB: 64.6
Context: 8,191
Free: Trial only
Mature ecosystem
voyage-3-large
Price: $0.18/M
Dims: 2048
MTEB: 68.9
Context: 32,000
Free: 200M lifetime
Higher MTEB; longer context

Voyage 3.5 vs Gemini embedding-001

voyage-3.5
Price: $0.06/M
Dims: 1024
MTEB: 67.1
Context: 32,000
Free: 200M lifetime
Best accuracy-to-cost ratio
gemini-embedding-001
Price: $0.15/M
Dims: 3072
MTEB: 65.4
Context: 2,048
Free: AI Studio free tier
GCP ecosystem; MRL dims

ada-002 vs text-embedding-3-small (migration)

ada-002 (legacy)
Price: $0.10/M
Dims: 1536
MTEB: 60.5
Context: 8,191
Free: N/A
No - expensive, worse quality
text-embedding-3-small
Price: $0.02/M
Dims: 1536
MTEB: 62.3
Context: 8,191
Free: Trial only
Yes - 5x cheaper, better quality

Frequently Asked Questions

Which embedding model is cheapest?
Self-hosted BGE-M3 at ~$0.001/M tokens is cheapest overall. Among commercial APIs, OpenAI text-embedding-3-small batch at $0.01/M is cheapest, followed by OpenAI small standard and Voyage-3-lite both at $0.02/M.
Which embedding model has the best accuracy?
As of April 2026, voyage-3-large leads at MTEB 68.9, followed by gemini-embedding-2-preview at ~68.0 and voyage-3.5 at 67.1. OpenAI text-embedding-3-large scores 64.6.
Do Cohere and Voyage have free tiers?
Cohere has a rate-limited free tier (100 API calls/minute). Voyage AI offers 200 million tokens free for life across all Voyage models. OpenAI has no free embedding tier after trial credits expire.
Which model is best for multilingual content?
Cohere embed-v4 is strongest for multilingual content, especially non-Latin scripts. It shows 15-20% quality improvement over OpenAI for Arabic, Hindi, Japanese, and Chinese.
Which model should I use for code search?
Voyage-code-3 is purpose-trained for code search at $0.06/M tokens. For code with OpenAI, text-embedding-3-small works but is not specialized.
Should I migrate from ada-002 to text-embedding-3-small?
Yes. ada-002 costs $0.10/M vs $0.02/M for text-embedding-3-small - 5x cheaper with better quality. Re-embedding 1B tokens costs $20 and pays back in 2-3 months at equivalent volume.
What embedding model is best for a small RAG application?
OpenAI text-embedding-3-small with pgvector on an existing Postgres database costs under $5/month for a 100k-document knowledge base with low query volume.
Is ada-002 vs text-embedding-3-small really that different?
text-embedding-3-small outperforms ada-002 (MTEB 62.3 vs 60.5) while costing 5x less. The only reason to stay on ada-002 is to defer a re-embedding pass on an existing index.
Full cost calculator
See Year 1 total including storage for every provider
Optimization techniques
Batch API, MRL dimensions, chunking, caching
Disclaimer: Independent resource. Not affiliated with any provider listed. Prices and MTEB scores verified May 2026. MTEB scores from the public MTEB leaderboard (Retrieval task average). Always verify pricing on each provider's own site before making decisions.

Updated May 2026