Weaviate is an open-source vector database with a managed cloud service. It is built for AI retrieval patterns: semantic search, hybrid search, RAG, metadata filtering, embeddings, multi-tenancy, and agent-facing query workflows.
It competes with Pinecone, Qdrant, Milvus, Chroma, and Postgres plus pgvector.
Recent developments
- June 10, 2026: Pricing and positioning reverified. Weaviate Cloud now shows Free, Flex from $45/mo, Plus from $280/mo, and Premium from $400/mo; Weaviate Embeddings remain $0.025-$0.065 per 1M tokens; Query Agent has a 1,000-request/month free trial path and a $30/org/month plan with 4,000 included requests plus usage-based overage.
- June 3, 2026: Weaviate made Engram generally available as a managed memory and context service for agents in Weaviate Cloud, with a free tier. This widens Weaviate from vector database/retrieval infrastructure into the agent-memory lane for teams that need durable, scoped memories.
System Verdict
Pick Weaviate if you want an open vector database with a mature cloud path. It is one of the strongest choices when self-hosting optionality and managed operations both matter.
Skip it when vector search is a small side feature. A general database with vector support may be enough.
Weaviate’s appeal is balance: open-source core, managed cloud, hybrid retrieval, and enterprise features without being cloud-only.
Key Facts
| Core product | Open-source vector database |
| Cloud | Weaviate Cloud managed service |
| Use cases | RAG, semantic search, hybrid retrieval, recommendations |
| Search modes | Vector, keyword, hybrid, filters |
| AI services | Embeddings and query agent features in cloud plans |
| Agent memory | Engram managed memory/context service, GA June 3, 2026, with free tier |
| Cloud plans | Free cloud entry, Flex from $45/month, Plus from $280/month, Premium from $400/month |
| Cloud uptime SLAs | Flex 99.5% · Plus 99.9% · Premium up to 99.95%, 1-hour Severity 1 response |
| Add-ons | Weaviate Embeddings $0.025 to $0.065/1M tokens · Query Agent free to try at 1,000 requests/month or $30/org/mo with 4,000 requests included |
| Pricing | Free self-host/free cloud entry plus paid cloud plans and usage-based AI services |
| Best fit | Teams wanting control plus managed upgrade path |
When to pick Weaviate
- Open-source matters. You can self-host and inspect the core system.
- Hybrid search matters. Combining keyword and semantic retrieval is first-class.
- You need multi-tenancy. Weaviate is often chosen for SaaS retrieval systems.
- You want a managed path. Start local or self-hosted, then move production to Weaviate Cloud.
- You care about retrieval features, not just storage. Schema, modules, filters, query patterns, Query Agent, hosted embeddings, and Engram memory can be part of the product.
When to pick something else
- Fully managed default: Pinecone is easier for teams that do not want to operate anything.
- Rust-native open-source vector DB: Qdrant.
- Postgres-first apps: pgvector through your existing database.
- Workplace search: Glean adds connectors, permissions, and employee-facing UX.
How to evaluate it
Evaluate Weaviate as a retrieval platform, not just a nearest-neighbor index. The deciding factors are hybrid search quality, schema design, multi-tenancy needs, embedding stack. If you already know your workload is simple vector lookup, a narrower database or Postgres with pgvector may be easier to run.
Weaviate’s strongest buyer fit is a team that wants open-source optionality without giving up a managed cloud path. That matters when procurement, data residency, or architecture reviews make cloud lock-in a concern. It also matters for teams that expect retrieval to become a core product surface rather than a small feature hidden behind a chatbot.
Compare it directly with Qdrant and Pinecone. Qdrant is appealing for focused open-source vector search and self-hosting. Pinecone is appealing for managed cloud simplicity. Weaviate sits in the middle: broader retrieval features, open deployment choices, and enough product surface for teams building search or RAG as a durable capability.
Pricing
Weaviate can be self-hosted for free. As verified on 2026-06-12, Weaviate Cloud lists a free entry plan, Flex from $45/month, Plus from $280/month, and Premium from $400/month. Flex is the low-commit shared-cloud pay-as-you-go path with 99.5% availability; Plus raises the shared-cloud floor and SLA; Premium adds predictable spend, enhanced reliability, support, and shared or dedicated deployment options with up to 99.95% availability and 1-hour Severity 1 response.
Pricing dimensions include vector dimensions, storage, backup, index type, compression, region, and cloud plan. The public calculator defaults to $45/month for a small Flex-shaped estimate, but production buyers should model their own vector dimensions, object count, object size, backup retention, compression profile, cloud provider, and region before migrating.
The practical advice: use the calculator before migration, and compare against pgvector if your index is small.
Weaviate lists database AI services that run inside Weaviate Cloud. Weaviate Embeddings is hosted embedding generation priced $0.025 to $0.065 per 1M tokens depending on model. Query Agent is free to try at 1,000 requests/month or available as a $30/organization monthly plan with 4,000 included requests plus usage-based pricing for additional requests. These are useful, but they mean the full bill can include database, backup, AI service, support, and agent usage rather than only vector storage.
Engram is a separate strategic signal. Weaviate describes it as a managed memory and context service for agents, generally available in Weaviate Cloud with a free tier. Treat it as an agent-architecture decision, not a generic vector-table feature.
Evaluation checklist
- Estimate vector dimensions, object count, object size, and backup volume before choosing a plan.
- Test hybrid retrieval quality against plain vector search and keyword search.
- Decide whether multi-tenancy belongs inside Weaviate or in your application layer.
- Check whether compression changes recall enough to matter.
- Model Premium or dedicated cloud if HIPAA, SSO/SAML, PrivateLink, customer-managed keys, or stronger SLAs are required.
- Treat Query Agent, hosted embeddings, and Engram as separate cost and architecture decisions.
Failure Modes
- Operational learning curve. Self-hosting search infrastructure is real work.
- Pricing dimensions need modeling. Vector dimensions, storage, backups, support, and region choices all matter.
- Schema decisions linger. Poor chunking and metadata design can make retrieval worse than the database deserves.
- Not an enterprise-search app. Weaviate is infrastructure. It does not solve workplace permissions and UX by itself.
- Migration can be expensive. Re-indexing embeddings and changing retrieval semantics require testing.
- Add-ons change scope. Embeddings, Query Agent, and Engram can make Weaviate more than a vector DB, but they also add procurement and evaluation work.
Methodology
Last verified 2026-06-12 against weaviate.io/pricing, the Weaviate homepage, the Engram GA announcement, Weaviate documentation, and the Weaviate GitHub. Scoring reflects open-source leverage, feature depth, cloud maturity, agent-memory optionality, and operational complexity.
FAQ
Is Weaviate open source? Yes. The core database is open source, with managed cloud services available.
Weaviate vs Pinecone? Weaviate gives more open-source control. Pinecone is more managed-first.
Can Weaviate do hybrid search? Yes. Hybrid semantic and keyword retrieval is a core use case.
Sources
- Weaviate pricing (verified 2026-06-12)
- Weaviate homepage (verified 2026-06-12)
- Weaviate Engram GA announcement (verified 2026-06-12)
- Weaviate documentation (verified 2026-06-12)
- Weaviate GitHub (verified 2026-06-12)
Related
- Category: AI Infrastructure · AI Search
- See also: Pinecone · Qdrant · Glean · Hugging Face