Letta (formerly MemGPT): Features, Pricing & Review (April 2026)

Letta is the open-source stateful agent platform originally released as MemGPT at UC Berkeley. The core is Apache-2.0 licensed. Letta Cloud hosts the same platform with tiered plans; Letta Code ships the platform as a memory-first coding CLI (docs).

The differentiator is typed memory blocks (persona, human context, archival) that persist across sessions and port between LLM providers.

System Verdict

Pick Letta if your agent must remember users, carry state across sessions, or survive a model swap without losing context. The memory architecture is the real product. Core memory, archival memory, and background memory subagents give agents an editable world model rather than a fresh context window per turn.

Skip it if your workload is stateless. Simple RAG, one-shot chat, or deterministic pipelines ship faster with LangChain or direct API calls. Letta’s memory layer is overhead if you never retrieve from it.

Who pays which tier: Free self-hosted for research and prototypes. Letta Cloud Professional for individual developers shipping stateful agents. Scale for multi-agent production workloads. Enterprise for private deployments with SSO and dedicated quotas.

Key Facts


Former name	MemGPT (UC Berkeley, 2023 paper)
License	Apache-2.0 (open-source core)
GitHub	letta-ai/letta · 22K+ stars, 100+ contributors
Memory model	Typed blocks: core (persona, human) · archival (long-form)
Memory portability	Cross-provider: OpenAI, Anthropic, Google, Ollama
SDKs	Python · TypeScript
REST API	Yes, full agent lifecycle
Letta Code	`npm install -g @letta-ai/letta-code` · memory-first coding CLI
Hosted tiers	Free · Professional · Scale · Max · Enterprise
Free tier quota	50 premium + 500 standard requests/month

Every data point above was verified against vendor sources on 2026-04-17. See Sources.

What it actually is

A stateful agent runtime. Every agent owns typed memory blocks that it reads and writes. The agent loop runs before each response, retrieves relevant archival context, and can hand off to background subagents that compress and improve its own prompts.

Memory is the portable layer. Swap the underlying LLM from OpenAI to Anthropic and the agent keeps its history, persona, and learned facts. That portability is the product-level claim that sets Letta apart from LangGraph or CrewAI.

Letta Code is the most interesting recent ship. A Terminal-Bench top-ranked OSS harness, it puts a persisted agent behind a coding CLI so sessions accumulate context over days instead of starting cold.

When to pick Letta

Personalized AI assistants. Agents that must remember user preferences, history, and ongoing projects across many sessions.
Avoiding vendor lock-in. Memory that survives a migration from OpenAI to Anthropic or a local Ollama model.
Memory-first coding. Letta Code CLI keeps a single persisted agent across multi-day coding sessions.
Research on agent architectures. Open-source, transparent memory blocks, extensible subagent patterns.
Regulated enterprises. Self-host on your own infrastructure with full control over data residency.

When to pick something else

Multi-agent task pipelines: CrewAI. Role-based crews are faster for hierarchical delegation.
Visual agent builder on LangChain: Langflow.
No-code business agents: Relevance AI.
Voice-first agent UX: Voiceflow.
General workflow automation: n8n or Zapier.
Deterministic production graphs: LangGraph. More control, less memory tooling.

Pricing

Plan	Price	Key limits
Open-source (self-host)	Free	Apache-2.0, BYO compute and API keys
Free (Letta Cloud)	$0/mo	50 premium + 500 standard requests/mo
Professional	Paid tier	500 premium + 5,000 standard requests/mo
Scale	Paid tier	5,000 premium + 50,000 standard requests/mo
Max	Power-user tier	High-throughput agentic coding workloads
Enterprise	Custom	SAML/OIDC SSO, private models, dedicated quotas
API Plan	Usage-based	Unlimited agents; billed per active agent + tool-execution seconds

Prices verified 2026-04-17 via Letta pricing tokens through your provider.

Against the alternatives

	Letta	LangGraph	CrewAI
Primary abstraction	Stateful agents with typed memory	State graphs	Role-based crews
Cross-session memory	Native, portable, editable	Manual wiring	Basic context sharing
Model-switch resilience	High, memory ports cleanly	Low, bound to graph impl	Mid
Production state control	Mid	Highest	Mid
Language support	Python + TS	Python + JS	Python only
Coding CLI	Letta Code	None native	None native
Best viewed as	Memory-first agent platform	Deterministic runtime	Fast multi-agent prototyping

Failure modes

Memory overhead for stateless jobs. If you never retrieve archival memory, Letta’s architecture is weight without benefit. Use plain LangChain.
Self-host setup has moving parts. Production self-hosting wants Postgres, a persistent volume, and careful upgrade discipline.
Category convergence. LangGraph, CrewAI, and ChatGPT Projects are all adding memory features. Letta’s advantage narrows as frontier models expand context.
Python-first ecosystem. TypeScript SDK exists but the examples, templates, and community content lean Python.
Moat at 6/10. Research pedigree and memory architecture are real, but the patterns are documented and copyable.
Hosted free tier is tight. 50 premium requests a month is evaluation-scale. Real usage requires Professional or self-hosting.

Methodology

This page was produced by the aipedia.wiki editorial pipeline, an automated system that ingests vendor documentation, verifies pricing and model details against primary sources, and generates the editorial analysis you are reading. No individual human wrote this review. Scoring follows the four-dimension rubric at /about/scoring/ (Utility × Value × Moat × Longevity, unweighted average). Last verified 2026-04-17 against Letta pricing, Letta Code docs, the Letta Code blog, and the letta-ai/letta GitHub repo.

FAQ

Is Letta the same project as MemGPT? Yes. MemGPT was the original 2023 UC Berkeley research prototype. The team renamed it Letta as it matured into a production-ready platform with hosted tiers, SDKs, and Letta Code.

Is Letta free? Yes. The core is Apache-2.0 open-source and free to self-host. You pay only LLM API costs through your chosen provider. Letta Cloud also offers a free hosted tier with 50 premium plus 500 standard requests per month.

What is Letta Code? A memory-first coding CLI built on the Letta API. Install with npm install -g @letta-ai/letta-code. Unlike session-based coding assistants, Letta Code keeps a persisted agent that learns across days and is portable across models (docs).

How does Letta memory compare to RAG? RAG retrieves documents at query time and discards them. Letta memory is typed, editable, and agent-owned. The agent reads and writes its own memory blocks, compresses old conversations to archival storage, and can introspect what it knows.

Can Letta swap between LLM providers mid-agent? Yes. Memory lives outside the model. Switch from OpenAI to Anthropic or a local Ollama endpoint without losing persona, user facts, or conversation history.

Sources

Letta pricing page: Current Cloud tiers and quotas
Letta Code docs: CLI install and memory-first coding architecture
Letta Code announcement: Terminal-Bench results and positioning
letta-ai/letta GitHub: Apache-2.0 source and release history
Letta documentation: Memory block architecture reference

Category: AI Automation

Share LinkedIn

Was this review helpful?

Embed this score on your site Free. Links back.

HTML

<a href="https://aipedia.wiki/tools/letta/" target="_blank" rel="noopener"><img src="https://aipedia.wiki/badges/letta.svg" alt="Letta on aipedia.wiki" width="260" height="72" /></a>

Markdown

[![Letta on aipedia.wiki](https://aipedia.wiki/badges/letta.svg)](https://aipedia.wiki/tools/letta/)

Badge value auto-updates if the editorial score changes. Attribution via the link is required.

Cite this page For journalists, researchers, and bloggers

News writers

According to aipedia.wiki Editorial at aipedia.wiki (https://aipedia.wiki/tools/letta/)

APA

aipedia.wiki Editorial. (2026). Letta — Editorial Review. aipedia.wiki. Retrieved May 8, 2026, from https://aipedia.wiki/tools/letta/

MLA 9

aipedia.wiki Editorial. "Letta — Editorial Review." aipedia.wiki, 2026, https://aipedia.wiki/tools/letta/. Accessed May 8, 2026.

Chicago

aipedia.wiki Editorial. 2026. "Letta — Editorial Review." aipedia.wiki. https://aipedia.wiki/tools/letta/.

BibTeX

@misc{letta-editorial-review-2026,
  author = {{aipedia.wiki Editorial}},
  title = {Letta — Editorial Review},
  year = {2026},
  publisher = {aipedia.wiki},
  url = {https://aipedia.wiki/tools/letta/},
  note = {Accessed: 2026-05-08}
}

Spotted an error or want to share your experience with Letta?

Every tool page is re-verified on a recurring cycle, and corrections land faster when readers flag them directly. If you spot a stale fact, a missing capability, or have used Letta and want to share what worked or didn't, the editorial desk reviews every message sent through this form.

Email editorial@aipedia.wiki

Report outdated info Help us keep this page accurate