Your agents, now with object permanence

Shared memory
across every agent.

Every new session, your agent meets you for the first time. Again. claude-mem takes notes while it works — decisions, bugfixes, dead ends — and cmem hands that memory to every agent you run through one private MCP link. Claude Code, Cursor, Codex: one memory, zero re-introductions.

No credit card. One confirmation email — the last thing you’ll have to remember yourself.

Or start local — the open-source engine is one line away:

$ /plugin marketplace add thedotmack/claude-mem
zsh — ~/work
# connect cmem as an MCP server
claude mcp add cmem https://mcp.cmem.ai/u/9f3a2c7e
✓ cmem connected · 0 observations synced
# same memories, every machine
Works with the tools you already use
Claude CodeCursorWindsurfOpenCodeCodex CLIGemini CLI
012×
cheaper codebase exploration
0×
cheaper file understanding vs full reads
~0%
lower agent cost via model tier routing
0
bundled skills
0
chat integrations
How memory forms

One AI works. Another takes notes.

While your agent ships, a second model sits in the back of the room writing everything down — categorized, causal, searchable. Nothing from a session is lost. Including the parts you forgot.

decision bugfix feature discovery security

Each note lands in a taxonomy, so your agent can recall the right kind of context on demand. A progressive-disclosure session index ~800 tokens for 50 observations keeps recall cheap — up to 95% fewer tokens in our docs' worked example (18,000 → 970 tokens).

Causality thread
  • BeforePicked Postgres over SQLite for write concurrency.
  • CurrentAuth middleware now rejects expired tokens (bugfix).
  • NextWire the rate limiter into the same middleware.
How memory travels

Three steps to memory that travels

No infrastructure, no YAML, nothing to babysit. Connect once and forget about it — it won’t.

01

Install claude-mem

Already running it? You’re set. It quietly builds an observations DB as you work.

02

Join the waitlist

Drop your email. We’ll confirm you’re in line and email your invitation.

03

Connect CMEM Cloud

Paste one URL into any agent. Your memory follows you to every machine you’ll ever own.

Open source

Memory that compounds

The claude-mem engine runs locally and gets smarter with every session you ship.

Automatic capture

A second model observes every session and writes notes out-of-band — no prompting, no slowdown.

Auto-categorization

Every observation lands in a taxonomy: decision, bugfix, feature, discovery, security.

Search three ways

Semantic, keyword, and timeline search over everything your agents have ever done.

Causality threading

Before/after links connect why a change happened to what came next — not just isolated facts.

Multi-project isolation

Each project keeps its own memory. Privacy stripping runs as notes are written, on local-first SQLite.

Smart-explore

Structural codebase exploration that is 6–12× cheaper than reading files in full.

Cloud

Boring in all the right ways

Quiet, durable, and deeply uninterested in your attention. Exciting memory infrastructure is how you end up with neither.

Offline by default

A local copy that always works. We sync in the background the moment you’re online.

Private & revocable

One link, scoped to you. Rotate or revoke it anytime — instantly.

Reach it anywhere

Laptop, server, CI, a friend’s machine. Same memory, one URL.

Early access

Be first in line — invitations roll out daily.

Synced in under a second

Incremental sync keeps the cloud copy fresh without slowing you down.

Your DB, untouched

We mirror the claude-mem schema exactly. Nothing proprietary, nothing locked in.

Skills

11 skills, bundled in

Every install ships a kit of slash commands. Search memory, build brains, explore code, and orchestrate work — all from the agent you already use.

/mem-search

Search past sessions and observations

/knowledge-agent

Build a queryable brain from corpora

/smart-explore

Structural codebase exploration, 6–12× cheaper

/learn-codebase

Deep-learn a repo into memory

/make-plan

Research and write implementation plans

/do

Orchestrate plan execution with subagents

/pathfinder

Trace code paths and flows

/timeline-report

Report what happened across sessions

/babysit

Supervise a long-running agent task

/how-it-works

Explain how claude-mem works

/version-bump

Bump versions and changelog

Knowledge agents

Build a brain from
your own work

Compile a focused corpus from your own sessions, docs, and observations — then ask it questions in plain language. Answers come back sourced, across multi-turn conversations, with a 1M-token context window behind them.

  • Build a corpus from your sessions, docs, and observations
  • Prime it once, then ask multi-turn questions
  • Query across a 1M-token context window
  • Rebuild as your work grows — your brain stays current
Ask your corpus
# multi-turn Q&A over your own work
you › How did we fix the auth race condition?
corpus › You debounced the token refresh and gated it behind a mutex.
# sourced from 3 sessions → session #4182, #4190, #4203
Real-time dreaming

Your agent dreams while you work

While one AI builds, a second one watches — turning what just happened into durable memory, in real time.

Live observation feedDreaming · now
  • 14:02Located the video source file in src/lib/
  • 14:05Refactored the sync thread into Motion.tsx
  • 14:09Tests pass — committed as a120a9b
  • 14:14Mapped the component layout pre-Phase 2
  • Observations generated live as you work
  • Before / after threading keeps cause and effect linked
  • Watch it dream in the live viewer
  • Session consolidated the moment you stop
RADSoon

RAG is how agents retrieve knowledge. RAD is how they remember experience.

Live ingestion

Every app you use becomes a memory

Memory shouldn't stop at the terminal. cmem pulls in the conversations and transcripts where your real work happens.

LiveShipped now
TelegramDiscordSlackSignalWhatsAppLINEMulti-machine syncTranscript capture (Claude Code, Cursor, Codex, Gemini CLI, OpenClaw)
SoonOn the way
Slack ingestionDiscord ingestionEmailVideo + audio via Gemini Live
Compatibility

Works where you work

If it speaks MCP, it remembers.

Agents & IDEs
Claude CodeCursorWindsurfOpenCodeOpenClawCodex CLIGemini CLIVS Code
Providers & gateways
Claude / Agent SDKGeminiOpenRouterLiteLLMany ANTHROPIC_BASE_URL gateway
Protocol
any MCP client
OS & runtimes
macOS (Intel + Apple Silicon)LinuxWindows 10/11WSLDockerBunNode 20+
Languages
JavaScriptTypeScriptTSX/JSXPythonGoRustRubyJavaCC+++ custom grammars

10 languages + custom grammars

One private link

Your memory,
one URL away

When your invitation arrives, you get a private MCP endpoint. Paste it into Claude, your editor, or any MCP-aware agent and your claude-mem history is right there — read and write, synced live. Like the agent never left.

  • Scoped to your account, never shared
  • Works with any MCP client
  • Revoke and regenerate in one click
Your private MCP link
Synced · now
https://mcp.cmem.ai/u/9f3a2c7e-d41b
# observations available to every agent
GET /memory/recent 0 rows
POST /memory/observe → ok
Pricing

Free during early access

The engine is open source and free to install today. Cloud sync is $0 during early access — join the waitlist and start syncing the moment your invitation lands.

Open source

claude-mem

Free & open source

Local-first memory for your agents. Install today — runs on your machine, your data stays yours.

Early access

CMEM Cloud

$0during early access

Unlimited observations · unlimited devices · one private MCP link

FAQ

Questions, answered

What is claude-mem?

claude-mem is an open-source memory plugin for Claude Code and other AI agents. While one AI works, a second AI takes notes about what it does — building a searchable memory that persists across every session.

Is my data private?

Yes. claude-mem is local-first: your observations live in a SQLite database on your own machine, with privacy stripping applied as memory is generated. Cloud sync is strictly opt-in.

What's an MCP link?

An MCP link is a personal MCP server URL that gives any MCP client — Claude Code, Cursor, Codex CLI, and more — secure access to your memory from anywhere.

Does it slow my agent down?

No. Observation generation runs out-of-band, so it doesn't block your agent. Thanks to progressive disclosure, sessions actually get cheaper as your memory grows.

What is RAD?

RAG is how agents retrieve knowledge. RAD is how they remember experience. RAD — Retrieval-Augmented Dreaming — is our term for live observation generation that captures what happened while you work. Dreaming ships in claude-mem today; RAD as an open standard is on the way.

What does cloud cost?

$0 during early access. Get CMEM Cloud and start syncing your memory to the cloud while we're in early access.

Your agents will
remember this.

One waitlist, one email, one private link. Every agent you run starts acting like it’s been here the whole time — because now it has.