Leave an imprint on your IDEs.
One persistent memory layer for Claude Code, Cursor, Codex, Antigravity — and every MCP-capable IDE. Automatic capture. Semantic recall. Real-time contradiction detection. Your full context, everywhere you build.
Quick start
Up and running
in under 2 minutes.
Sign in
Create your free Imprint account. No credit card, no install yet.
Connect your IDE
Paste one config block into Claude Code, Cursor, or Codex. MCP activates instantly.
AI remembers you
Switch IDEs, start a new chat — your full context is always there, automatically.
Claude Code · Cursor · Codex · Antigravity · VS Code · Windsurf
THE COGNITIVE GAP
LLMs forget conversations when
the context window resets. We fixed that.
One memory layer, every coding agent.
One store, every tool
Imprint sits beneath every coding agent and browser AI you use. Claude Code, Cursor, Codex, and Antigravity all draw from the same memory — your projects, preferences, and context travel with you, no matter which IDE you open.
Every coding agent
Install the MCP server once. Register it with Claude Code, Cursor, Codex, Antigravity — or any IDE that speaks MCP. Set IMPRINT_PLATFORM and every agent silently recalls your full context at session start.
Get started
Works in every AI coding tool.
One MCP server. Persistent memory across Claude Code, Cursor, Codex, Antigravity, and any other MCP-capable IDE.
Clone & install
git clone https://github.com/YashasviThakur/imprint.git
cd imprint/mcp && npm install
Register with Claude Code
claude mcp add imprint --scope user \
  -- node /path/to/imprint/mcp/server.js
Or add manually to ~/.claude.json
{
"mcpServers": {
"imprint": {
"command": "node",
"args": [
"/path/to/imprint/mcp/server.js"
],
"env": {
"IMPRINT_USER_ID": "your-user-id",
"IMPRINT_PLATFORM": "claude-code"
}
}
}
}
Create ~/.claude/CLAUDE.md
Call get_memories at session start. Call save_memory when you learn something. Never announce you're doing this.
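To illustrate the recall-then-save flow that this instruction asks the agent to follow, here is a minimal sketch using an in-memory stand-in for the backend. The tool names get_memories and save_memory come from the instruction above; the class FakeMemoryTools, the argument and return shapes, and the "cursor" platform string are assumptions made for illustration, not Imprint's actual interface.

```typescript
interface Memory {
  text: string;
  platform: string;
  savedAt: number;
}

// Hypothetical in-memory stand-in for the Imprint backend. Only the tool
// names are taken from the page; argument and return shapes are assumed.
class FakeMemoryTools {
  private store: Memory[] = [];

  // Stand-in for get_memories: returns a copy of everything saved so far.
  getMemories(): Memory[] {
    return [...this.store];
  }

  // Stand-in for save_memory: appends one fact, tagged with its source IDE.
  saveMemory(text: string, platform: string): Memory {
    const memory: Memory = { text, platform, savedAt: Date.now() };
    this.store.push(memory);
    return memory;
  }
}

// One session: recall first, then save any fact that is not already known.
// Returns what was recalled at session start.
function runSession(
  tools: FakeMemoryTools,
  platform: string,
  learned: string[],
): string[] {
  const known = tools.getMemories().map((m) => m.text);
  const recalled = [...known];
  for (const fact of learned) {
    if (known.indexOf(fact) === -1) {
      tools.saveMemory(fact, platform);
      known.push(fact);
    }
  }
  return recalled;
}

const demoTools = new FakeMemoryTools();
runSession(demoTools, "claude-code", ["Prefers pnpm over npm"]);
// A later session in another IDE starts with what the first one saved.
const demoContext = runSession(demoTools, "cursor", []);
console.log(demoContext);
```

In the real setup the agent makes these calls through MCP rather than through a local class; the sketch only shows the ordering the CLAUDE.md instruction asks for.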
Verify connection
claude mcp list # imprint ✓ Connected
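The manual config entry shown above appears to differ between IDEs only in the IMPRINT_PLATFORM value, so one way to avoid hand-editing it is to generate the block. The sketch below is a hypothetical helper, not part of Imprint: buildImprintEntry and renderConfig are illustrative names, and "claude-code" is the only platform string confirmed by the page.

```typescript
// Shape of one entry under "mcpServers", mirroring the JSON shown above.
interface McpServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

// Hypothetical helper: builds the "imprint" entry for a given IDE.
// Only "claude-code" appears in the original config; any other platform
// string passed in here is an assumption.
function buildImprintEntry(
  serverPath: string,
  userId: string,
  platform: string,
): McpServerEntry {
  if (serverPath.trim() === "" || userId.trim() === "" || platform.trim() === "") {
    throw new Error("serverPath, userId, and platform must be non-empty");
  }
  return {
    command: "node",
    args: [serverPath],
    env: { IMPRINT_USER_ID: userId, IMPRINT_PLATFORM: platform },
  };
}

// Wraps the entry in the top-level object and serializes it for pasting.
function renderConfig(entry: McpServerEntry): string {
  return JSON.stringify({ mcpServers: { imprint: entry } }, null, 2);
}

console.log(
  renderConfig(
    buildImprintEntry("/path/to/imprint/mcp/server.js", "your-user-id", "claude-code"),
  ),
);
```

Where each IDE expects this block to live, and whether it uses the same top-level "mcpServers" key, should be checked against that IDE's own MCP documentation.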
Your IDE not listed?
REAL-TIME EDGE SYNCHRONIZATION
As you work with your AI assistant, Imprint's background engine streams message vectors directly into an isolated AWS serverless database, creating a permanent recall layer.
Infrastructure x Intelligence.
Decoupled Architecture
By separating state from the raw model layer, Imprint makes your AI assistant natively stateful. We deploy specialized database nodes inside your private cloud, ensuring your operational context never enters public training loops.
Serverless Hydration
When a prompt is initialized, our Vercel Edge middleware runs a semantic vector lookup against the AWS memory store and injects the most relevant stored context into the prompt before the model generates a response.
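A semantic vector lookup of this kind is commonly implemented as a cosine-similarity ranking over stored embeddings. The sketch below shows that general technique under stated assumptions: the type and function names are illustrative, the vectors are toy 3-dimensional values rather than real embeddings, and the "Known context" prompt format is invented for the example. It is not Imprint's actual middleware.

```typescript
interface StoredMemory {
  text: string;
  embedding: number[];
}

// Cosine similarity between two vectors of equal length.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("embedding dimensions differ");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    const x = a[i] ?? 0;
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  }
  if (normA === 0 || normB === 0) return 0; // a zero vector matches nothing
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Returns the k memories most similar to the query embedding, best first.
function topKMemories(
  query: number[],
  memories: StoredMemory[],
  k: number,
): StoredMemory[] {
  return memories
    .map((memory) => ({ memory, score: cosineSimilarity(query, memory.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, Math.max(0, k))
    .map((scored) => scored.memory);
}

// Prepends recalled memories to the prompt (the format is an assumption).
function hydratePrompt(prompt: string, recalled: StoredMemory[]): string {
  if (recalled.length === 0) return prompt;
  const context = recalled.map((m) => `- ${m.text}`).join("\n");
  return `Known context:\n${context}\n\n${prompt}`;
}

const sampleMemories: StoredMemory[] = [
  { text: "Project uses DynamoDB", embedding: [1, 0, 0] },
  { text: "Prefers tabs over spaces", embedding: [0, 1, 0] },
  { text: "Tables are provisioned on demand", embedding: [0.9, 0.1, 0] },
];
const recalledSample = topKMemories([1, 0, 0], sampleMemories, 2);
console.log(hydratePrompt("How should I model this table?", recalledSample));
```

A linear scan like this is fine for a small per-user store; a larger store would typically need an approximate nearest-neighbor index instead.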
Why we built this
Your AI is brilliant.
It just forgets you exist.
The Problem
AI coding assistants forget everything the moment the context window resets. Every new session starts blind — no memory of who you are, what you're building, or how you think.
The Fix
Imprint captures lasting facts from every session, across every IDE, and stores them in a serverless AWS database. Next session, your assistant already knows you.
The Edge
Real-time contradiction detection flags when you say something that conflicts with your own memory — no other memory tool does this.
The Stack
Production-grade infrastructure, zero ops.
Next.js 16
App + API layer
Vercel Edge
Global deployment
AWS DynamoDB
Memory storage
Groq API
llama-3.3-70b · fast extraction
Jina Embeddings
1024-dim semantic retrieval
AES-256 BYOK
End-to-end key encryption
Core Pipeline
AWS Native · Jina AI · Groq
AWS DynamoDB + Groq Extraction
Memories are extracted by Groq's llama-3.3-70b in real-time and stored in serverless DynamoDB — pinned facts, topic filters, and contradiction detection built in.
Amazon DynamoDB Isolation
State is stored in low-latency, serverless Amazon DynamoDB tables. You keep full control over encryption keys and memory pruning.
Cross-IDE Memory Graph
Every memory saved — whether from Claude Code, Cursor, Codex, or Antigravity — lands in the same DynamoDB table. A live dashboard polls every 3 seconds, animating new memories as they arrive.
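A dashboard that polls every 3 seconds needs to work out which records are new on each tick so it can animate only those. The sketch below shows one way to do that, assuming each record carries a unique id field and that a caller supplies the fetch function; MemoryRecord, takeNewRecords, and startPolling are hypothetical names, not Imprint's dashboard code.

```typescript
interface MemoryRecord {
  id: string; // assumed unique per memory
  text: string;
  platform: string;
}

// Returns the records whose id has not been seen before and marks them seen.
function takeNewRecords(seen: Set<string>, latest: MemoryRecord[]): MemoryRecord[] {
  const fresh: MemoryRecord[] = [];
  for (const record of latest) {
    if (!seen.has(record.id)) {
      seen.add(record.id);
      fresh.push(record);
    }
  }
  return fresh;
}

// Polls fetchAll on an interval (3000 ms by default) and hands new records
// to onNew. Returns a function that stops the polling.
function startPolling(
  fetchAll: () => Promise<MemoryRecord[]>,
  onNew: (records: MemoryRecord[]) => void,
  intervalMs = 3000,
): () => void {
  const seen = new Set<string>();
  let inFlight = false;

  const tick = async (): Promise<void> => {
    if (inFlight) return; // skip this tick if the previous fetch is still running
    inFlight = true;
    try {
      const fresh = takeNewRecords(seen, await fetchAll());
      if (fresh.length > 0) onNew(fresh);
    } catch (err) {
      console.error("poll failed", err);
    } finally {
      inFlight = false;
    }
  };

  const timer = setInterval(() => {
    void tick();
  }, intervalMs);
  void tick(); // fetch once immediately instead of waiting for the first interval
  return () => clearInterval(timer);
}
```

A caller would pass a function that reads the user's memories and a callback that animates each new record, then call the returned function when the dashboard unmounts.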
Jina Embeddings + Relevance Ranking
Jina AI embeds every memory at 1024 dimensions, so each session pulls the memories relevant to what you're asking — not just the most recent. Pinned facts are always injected, and duplicate saves are merged automatically.
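The selection rule described here (pinned facts always included, the rest ranked by relevance, duplicates merged) can be sketched as follows. This is an illustration under assumptions: duplicates are detected by normalized exact text, which is a simplification swapped in for whatever matching Imprint actually uses, and the score field stands for a precomputed relevance value such as cosine similarity. All names are hypothetical.

```typescript
interface RankedMemory {
  text: string;
  pinned: boolean;
  score: number; // relevance to the current query, computed elsewhere
}

// Normalizes text so trivially different saves compare equal.
function normalizeText(text: string): string {
  return text.trim().toLowerCase().replace(/\s+/g, " ");
}

// Keeps one entry per normalized text. The merged entry is pinned if any
// copy was pinned and carries the highest score among the copies.
function mergeDuplicates(memories: RankedMemory[]): RankedMemory[] {
  const byKey = new Map<string, RankedMemory>();
  for (const m of memories) {
    const key = normalizeText(m.text);
    const existing = byKey.get(key);
    if (existing === undefined) {
      byKey.set(key, { ...m });
    } else {
      existing.pinned = existing.pinned || m.pinned;
      existing.score = Math.max(existing.score, m.score);
    }
  }
  return Array.from(byKey.values());
}

// Pinned facts always go in; k further slots go to the top-scoring rest.
function selectForSession(memories: RankedMemory[], k: number): RankedMemory[] {
  const merged = mergeDuplicates(memories);
  const pinned = merged.filter((m) => m.pinned);
  const rest = merged
    .filter((m) => !m.pinned)
    .sort((a, b) => b.score - a.score)
    .slice(0, Math.max(0, k));
  return [...pinned, ...rest];
}

const sessionMemories = selectForSession(
  [
    { text: "Uses Postgres", pinned: true, score: 0.1 },
    { text: "uses  postgres ", pinned: false, score: 0.5 },
    { text: "Likes tabs", pinned: false, score: 0.9 },
    { text: "Deploys on Vercel", pinned: false, score: 0.3 },
  ],
  1,
);
console.log(sessionMemories.map((m) => m.text));
```

Treating k as the number of non-pinned slots means pinned facts never crowd out relevant memories; capping the total instead would be an equally reasonable choice.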