Embedding Model Comparison: OpenAI vs Cohere vs Voyage vs Google vs Self-Hosted (May 2026)
Side-by-side comparison of every major embedding model. Price, dimensions, MTEB score, context window, batch discount, free tier, and best-fit use case.
Full Model Comparison Table
| Model | Provider | $/M std | $/M batch | Dims | Context | MTEB | Free tier | Best for |
|---|---|---|---|---|---|---|---|---|
| text-embedding-3-small | OpenAI | $0.020 | $0.010 | 1,536 | 8,191 | 62.3 | - | General-purpose RAG, low cost |
| text-embedding-3-large | OpenAI | $0.130 | $0.065 | 3,072 | 8,191 | 64.6 | - | High-accuracy retrieval |
| text-embedding-ada-002 | OpenAI | $0.100 | No batch | 1,536 | 8,191 | 60.5 | - | Legacy - migrate to 3-small |
| embed-v4 | Cohere | $0.100 | No batch | 1,024 | 512 | 55 | 100 calls/min | Multilingual content (100+ languages) |
| voyage-3.5 | Voyage AI | $0.060 | $0.040 | 1,024 | 32,000 | 67.1 | 200M tokens | Best price-to-accuracy ratio |
| voyage-3-large | Voyage AI | $0.180 | $0.120 | 2,048 | 32,000 | 68.9 | 200M tokens | Maximum accuracy |
| voyage-3-lite | Voyage AI | $0.020 | $0.013 | 512 | 32,000 | 61.7 | 200M tokens | High-volume, cost-sensitive |
| gemini-embedding-2-preview | $0.200 | No batch | 3,072 | 8,192 | 68 | - | Multimodal, Google ecosystem | |
| gemini-embedding-001 | $0.150 | No batch | 3,072 | 2,048 | 65.4 | - | Stable GA, Google ecosystem | |
| Titan Text Embeddings V2 | Amazon Bedrock | $0.200 | No batch | 1,024 | 8,192 | 62.8 | - | AWS-native apps, compliance |
| BGE-M3 (self-hosted) | Self-Hosted | $0.001 | $0.001 | 1,024 | 8,192 | 66.5 | - | High-volume, privacy-sensitive |
Green = best in category, amber = watch out, purple = top MTEB. MTEB Retrieval average where publicly available. Context window in tokens. Verified May 2026.
By Scenario: Which Model Should You Pick?
Under $5/month total. One-time embedding cost for 50M tokens is $1.00. pgvector is near-zero marginal cost on existing Postgres.
Provider detailsMTEB 67.1 vs OpenAI small at 62.3. At $0.06/M standard or $0.04/M batch, the accuracy premium is affordable for most production apps.
Provider details15-20% quality improvement for non-Latin scripts over OpenAI. The 512-token context limit requires tighter chunking.
Provider detailsMTEB 68.9 - highest of any commercial API. At $0.18/M, index with large and query with voyage-3-lite ($0.02/M) using the same-vector-space feature.
Provider detailsKeeps data within AWS VPC. Binary embedding option for 4x storage reduction. Integrates with OpenSearch Serverless and SageMaker.
Provider detailsAt A100 spot rates, $0.001/M tokens vs $0.02/M for OpenAI small. Breaks even at roughly 15M tokens/month. Requires DevOps investment.
Provider detailsPurpose-trained on code repositories. Meaningfully better than general-purpose models for code retrieval at the same price as voyage-3.5 ($0.06/M).
Provider detailsStable GA model, $0.15/M, strong MTEB at 65.4. MRL support for dimension reduction. Integrates naturally with Vertex AI, BigQuery, and Cloud Storage.
Provider details