CEMS

Continuous Evolving Memory System

Persistent memory for AI coding assistants. Store, retrieve, and evolve memories to make Claude Code and Cursor smarter over time.

Quick Start (Client Install)

You need two things from your CEMS admin: a server URL and an API key.

Option A: Interactive install (recommended)

curl -fsSL https://getcems.com/install.sh | bash

It will ask for your API URL and key interactively.

Option B: Non-interactive install

CEMS_API_KEY=your-key-here CEMS_API_URL=https://cems.example.com \
  curl -fsSL https://getcems.com/install.sh | bash

Option C: Install from source

git clone https://github.com/chocksy/cems.git && cd cems
./install.sh

What the installer does

Installs uv if missing
Installs the CEMS CLI (cems, cems-server, cems-observer) via uv tool install
Runs cems setup --claude which:
- Copies 6 hook scripts to ~/.claude/hooks/
- Copies 5 skill files to ~/.claude/skills/cems/
- Merges CEMS config into ~/.claude/settings.json (preserves existing settings)
- Saves credentials to ~/.cems/credentials (chmod 600)

After install

cems --version    # Verify CLI is installed
cems health       # Check server connection
cems update       # Pull latest version + re-deploy hooks/skills
cems setup        # Re-run setup (reconfigure credentials, re-install hooks)
cems uninstall    # Remove hooks/skills (keeps credentials by default)

Updating

CEMS auto-updates when you start a new Claude Code session — no action needed. If your install is more than 24 hours old, the SessionStart hook pulls the latest version in the background.

To update manually:

cems update          # Pull latest + re-deploy hooks/skills
cems update --hooks  # Re-deploy hooks only (skip package upgrade)

Auto-update can be disabled by setting CEMS_AUTO_UPDATE=0 in your environment or ~/.cems/credentials.

Credentials

Stored in ~/.cems/credentials (chmod 600). Checked in order:

CLI flags: --api-url, --api-key
Environment: CEMS_API_URL, CEMS_API_KEY
Credentials file: ~/.cems/credentials

How It Works

CEMS hooks into your IDE (Claude Code or Cursor) and provides persistent memory across sessions:

Memory Injection -- On every prompt, the UserPromptSubmit hook searches your memories and injects relevant context
Session Learning -- On session end, the Stop hook extracts learnings from your session and stores them
Observational Memory -- The observer daemon watches session transcripts and extracts high-level observations about your workflow
Scheduled Maintenance -- Nightly/weekly/monthly jobs deduplicate, compress, and prune memories automatically

What Gets Installed

~/.claude/
├── settings.json           # Hooks config (merged, not overwritten)
├── hooks/
│   ├── cems_session_start.py        # Profile + context injection
│   ├── cems_user_prompts_submit.py  # Memory search + observations
│   ├── cems_post_tool_use.py        # Tool learning extraction
│   ├── cems_pre_tool_use.py         # Gate rules enforcement
│   ├── cems_stop.py                 # Session analysis + observer
│   ├── cems_pre_compact.py          # Pre-compaction hook
│   └── utils/                       # Shared utilities
└── skills/
    └── cems/
        ├── remember.md     # /remember - Add personal memory
        ├── recall.md       # /recall - Search memories
        ├── share.md        # /share - Add team memory
        ├── forget.md       # /forget - Delete memory
        └── context.md      # /context - Show status

~/.cems/
└── credentials             # API URL + key (chmod 600)

Usage

Skills (Claude Code slash commands)

/remember I prefer Python for backend development
/remember The database uses snake_case column names
/recall What are my coding preferences?
/share API endpoints follow REST conventions with /api/v1/...
/forget abc123
/context

CLI

cems status                          # System status
cems health                          # Server health check
cems add "I prefer dark mode"        # Add a memory
cems search "coding preferences"     # Search memories
cems list                            # List all memories
cems rule add                        # Interactive constitution/playbook rule wizard
cems rule load --kind constitution   # Load default constitution rule bundle
cems update                          # Update to latest version
cems update --hooks                  # Re-deploy hooks only (no package upgrade)
cems maintenance --job consolidation # Run maintenance
cems uninstall                       # Remove hooks/skills
cems uninstall --all                 # Remove everything including credentials

Server Deployment

For team usage, deploy CEMS as a server. Requires Docker Compose.

Services

Service	Image	Port	Purpose
postgres	`pgvector/pgvector:pg16`	5432	PostgreSQL + pgvector (vectors + metadata + auth)
cems-server	Built from `Dockerfile`	8765	Python REST API (Starlette + uvicorn)
cems-mcp	Built from `mcp-wrapper/`	8766	MCP wrapper (Express.js, Streamable HTTP)

Quick Start

Clone and configure:

git clone https://github.com/chocksy/cems.git && cd cems
cp .env.example .env
# Edit .env with your OPENROUTER_API_KEY and CEMS_ADMIN_KEY

Start services:
```
docker compose up -d
```

Create your first user:

source .env
curl -X POST http://localhost:8765/admin/users \
  -H "Authorization: Bearer $CEMS_ADMIN_KEY" \
  -H "Content-Type: application/json" \
  -d '{"username": "yourname"}'
# Returns: {"api_key": "cems_usr_abc123..."}

Give the API key to your team member -- they run the client install above.

Environment Variables

Required (set in .env):

Variable	Description
`OPENROUTER_API_KEY`	Get from https://openrouter.ai/keys
`CEMS_ADMIN_KEY`	Generate with `openssl rand -hex 32`

Optional:

Variable	Default	Description
`POSTGRES_PASSWORD`	`cems_secure_password`	Change in production
`CEMS_EMBEDDING_BACKEND`	`openrouter`	Embedding provider
`CEMS_EMBEDDING_DIMENSION`	`1536`	Embedding dimension
`CEMS_RERANKER_BACKEND`	`disabled`	Reranker (disabled by default)

Architecture

Storage

Everything lives in PostgreSQL with pgvector:

memory_documents -- Documents with content, user/team scoping, categories, tags, soft-delete
memory_chunks -- Chunked content with 1536-dim vector embeddings (HNSW index) and full-text search (tsvector)
users / teams -- Authentication via bcrypt-hashed API keys

Embeddings

text-embedding-3-small via OpenRouter (1536 dimensions). Batch support for bulk operations.

Search Pipeline

CEMS uses a 9-stage retrieval pipeline:

Query → Understanding → Synthesis → HyDE → Retrieval → RRF Fusion → Filtering → Scoring → Assembly → Results

Stage	What it does
1. Query Understanding	LLM routes to vector or hybrid strategy
2. Query Synthesis	LLM expands query into 2-5 search terms
3. HyDE	Generates hypothetical ideal answer for better matching
4. Candidate Retrieval	pgvector HNSW (vector) + tsvector (BM25 full-text)
5. RRF Fusion	Reciprocal Rank Fusion combines result lists
6. Relevance Filtering	Removes results below threshold
7. Scoring Adjustments	Time decay, priority boost, project-scoped boost
8. Token-Budgeted Assembly	Greedy selection within token budget (default: 2000)

Search modes: vector (fast, 0 LLM calls), hybrid (thorough, 3-4 LLM calls), auto (smart routing).

Maintenance

Scheduled via APScheduler:

Job	Schedule	Purpose
Consolidation	Nightly 3 AM	Merge semantic duplicates (cosine >= 0.92)
Observation Reflection	Nightly 3:30 AM	Condense observations per project
Summarization	Weekly Sun 4 AM	Compress old memories, prune stale
Re-indexing	Monthly 1st 5 AM	Rebuild embeddings, archive dead memories

Observer Daemon

The observer (cems-observer) runs as a background process on the client machine:

Polls ~/.claude/projects/*/ JSONL transcript files every 30 seconds
When 50KB of new content accumulates, sends it to the server
Server extracts high-level observations via Gemini 2.5 Flash
Observations like "User deploys via Coolify" or "Project uses PostgreSQL" are stored as memories

MCP Integration

The MCP wrapper on port 8766 exposes CEMS as an MCP server with 6 tools:

Tool	Description
`memory_add`	Store a memory
`memory_search`	Search with the full retrieval pipeline
`memory_forget`	Delete or archive a memory
`memory_update`	Update memory content
`memory_maintenance`	Trigger maintenance jobs
`session_analyze`	Analyze session transcripts

API Endpoints

Full API reference

Public API (Bearer token auth):

Method	Endpoint	Purpose
POST	`/api/memory/add`	Add a memory
POST	`/api/memory/search`	Search memories
POST	`/api/memory/forget`	Delete memory
POST	`/api/memory/update`	Update memory
POST	`/api/memory/log-shown`	Feedback tracking
POST	`/api/memory/maintenance`	Run maintenance
GET	`/api/memory/list`	List memories
GET	`/api/memory/status`	System status
GET	`/api/memory/profile`	Session profile context
GET	`/api/memory/gate-rules`	Gate rules by project
POST	`/api/session/summarize`	Session summary (observer daemon)
POST	`/api/tool/learning`	Tool learning
POST	`/api/index/repo`	Index git repo

Admin API (CEMS_ADMIN_KEY auth):

Method	Endpoint	Purpose
GET/POST	`/admin/users`	List/create users
GET/PATCH/DELETE	`/admin/users/{id}`	Manage user
POST	`/admin/users/{id}/reset-key`	Reset API key
GET/POST	`/admin/teams`	List/create teams

Troubleshooting

Memory not being recalled

Check credentials: cat ~/.cems/credentials
Test connection: cems health
Test search: cems search "test"
Check hook output: echo '{"prompt": "test"}' | uv run ~/.claude/hooks/cems_user_prompts_submit.py

Skills not appearing

Verify: ls ~/.claude/skills/cems/
Restart Claude Code
Type / and look for remember, recall, etc.

Re-install hooks

cems setup    # Re-runs the full setup

Development

git clone https://github.com/chocksy/cems.git && cd cems
uv pip install -e ".[dev]"
pytest                    # Run tests
mypy src/cems             # Type checking

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 126 Commits
assets		assets
bin		bin
cursor-plugin		cursor-plugin
deploy		deploy
docs		docs
examples		examples
hooks		hooks
mcp-wrapper		mcp-wrapper
research		research
scripts		scripts
src/cems		src/cems
tests		tests
.gitignore		.gitignore
.gitleaks.toml		.gitleaks.toml
COOLIFY_DEPLOYMENT.md		COOLIFY_DEPLOYMENT.md
Dockerfile		Dockerfile
ENHANCED_RETRIEVAL_REPORT.md		ENHANCED_RETRIEVAL_REPORT.md
IMPLEMENTATION_STATUS.md		IMPLEMENTATION_STATUS.md
README.md		README.md
docker-compose.coolify.yml		docker-compose.coolify.yml
docker-compose.yml		docker-compose.yml
findings.md		findings.md
install.sh		install.sh
mem0-tech-spec.md		mem0-tech-spec.md
plan-rag-pageindex.md		plan-rag-pageindex.md
plan.md		plan.md
progress.md		progress.md
pyproject.toml		pyproject.toml
qmd_eval_results.json		qmd_eval_results.json
remote-install.sh		remote-install.sh
research-main.md		research-main.md
research-progress.md		research-progress.md
task_plan.md		task_plan.md
task_plan_mmr.md		task_plan_mmr.md
test_complex_memories.py		test_complex_memories.py
test_hooks.py		test_hooks.py
test_hooks_e2e.py		test_hooks_e2e.py
test_integration.py		test_integration.py
test_performance.py		test_performance.py
uv.lock		uv.lock

Chocksy/cems

Folders and files

Latest commit

History

Repository files navigation