Deep Dive

Four mechanisms. Four trust models. One word.

memory.md and 'Claude Memory': What It Actually Is

memory.md and 'Claude memory' get used loosely. Four mechanically distinct things hide behind the word across Claude Code, Claude.ai, ChatGPT, and Cursor. The taxonomy.

Michael NourielPlatform Engineer & Founder, Scaletific + Automation Switch

2 May 202615 min readDeep Dive

AIagent skillsClaude Codedeveloper tools

Editorial hero card showing memory.md as the agent-written notes file, with sub-headline indicating four mechanically distinct things get called memory across AI tools and the article reframes the taxonomy.

Key takeaways

'Memory' in AI tools is a marketing label that covers four mechanically distinct mechanisms.
Author-written contract files (CLAUDE.md), agent-written notes files (memory.md / auto memory), server-side conversation summaries (Claude.ai memory, ChatGPT chat history), and structured external stores (Memory tool API, Mem0) all get called the same word.
The trust models differ: author-written is auditable, server-side is opaque, agent-written is on disk and visible to anyone with file access.
memory.md is checked into git by default in most setups. It is a public file.
Deleting memory propagates over days, and current sessions retain whatever is already loaded.

memory.md. Claude memory. ChatGPT memory. Cursor memory. Mem0. Memory bank. Auto memory.

The word does a lot of work in the AI tooling world right now. It covers four mechanically distinct things, the four resemble each other only at the marketing layer, and the gap between the marketing and the mechanism is where teams get in trouble.

This article is for the operator who has heard their AI tool described as "having memory" and wants to know what is happening when the agent claims to remember something. It is also for the platform team making procurement decisions where "has memory" shows up on the spec sheet. For the live editorial scoring of every coding agent referenced here, see our AI Coding Assistants directory.

Here is the four-mechanism taxonomy, the trust model for each, and the three misconceptions worth naming explicitly.

The misconception

In human terms, memory has four properties: it is continuous (yesterday flows into today), it is associative (one cue retrieves a cluster of related material), it is salient (important things rise to the top), and it forgets (left alone, fidelity decays).

In AI tools, the word memory describes a different set of mechanisms entirely. Each pattern provides a partial substitute for one or two of the human properties, and each one comes with a different trust model, a different governance shape, and a different failure mode.

The gap matters because operators procure on the human framing and operate on the engineering reality. A platform team that hears "the agent remembers our conversations" and assumes continuous, associative, salient memory will be surprised by what shipped. A compliance team asked "is this auditable?" will get four different answers depending on which mechanism is in use.

The rest of this article unpacks the four mechanisms, the trust model for each, and the three confusions that come from collapsing two or more of them into one bucket.

Four-mechanism taxonomy diagram showing the engineering patterns behind the word memory in AI tools: author-written contract files like CLAUDE.md, agent-written notes files like memory.md and auto memory, server-side conversation summaries like Claude.ai memory and ChatGPT chat history, and structured external memory stores like the Anthropic Memory Tool API and Mem0. Each mechanism shows examples, location, who writes it, and trust model.

The four mechanisms

2.1 Author-written contract files

Examples: CLAUDE.md, AGENTS.md, .cursorrules, .clinerules.

What it is. A Markdown file you write and check into the repository. The agent reads it as context at session start. The file is a config artifact in the same way that .eslintrc or tsconfig.json is a config artifact.

Trust model. Same as any config file you write. You audit it because you wrote it. You version it because it sits in git. You review it in pull requests because it changes the agent's behaviour for everyone on the team.

Why it gets called memory. Vendors describe these files as "agent memory" because they persist context across sessions. The label is misleading. The file is a contract. You and your team write every word. The agent is a reader of the file.

Failure mode. The file rots. Someone added a rule six months ago that has been overridden in practice; the rule stays in the file and the agent keeps following it. The fix is the same as for any rotting config: review it, prune it, ship the change in a pull request.

The mechanics of this layer are covered in the AGENTS.md and CLAUDE.md sibling explainers in this series.

2.2 Agent-written notes files

Examples: Claude Code's auto memory (the directory the agent writes to during a session), ChatGPT's Saved Memories (notes the model writes about you when you say "remember this" or when ChatGPT decides a fact is worth saving).

What it is. A file or set of files the agent writes itself, based on what happened in your sessions and what it decided was worth keeping. You can audit it. The agent can rewrite it.

Trust model. The agent decides what to write. You have to read what is there. The file sits on disk and is visible to anyone with file access. For ChatGPT, it sits in the account and is visible to anyone with account access.

Anthropic's load behaviour for Claude Code auto memory : the first 200 lines or 25 KB of the auto-memory directory load into context at session start. Mem0's analysis of leaked Claude Code source code confirms this from the implementation side and frames it as a 200-line cap that "silently forgets at line 201." The behaviour is the same fact under both descriptions; the framing is editorial. Operators do still hit the cap unexpectedly when the agent has been writing for a while, and surfacing the limit deliberately matters.

ChatGPT's Saved Memories : stored in a notepad separate from chat history. Even if you delete a chat, any saved memories from that chat persist in the notepad. ChatGPT can manage them on its own, adding, combining, and removing them across sessions. ChatGPT Plus and Pro plans now include automatic memory management that prioritises recent and frequently referenced memories.

Failure mode. The agent writes things you would have wanted to vet, or it stops writing things it should be writing. The first failure mode is privacy and accuracy. The second is silent regression in agent behaviour over time.

TIP

What memory.md actually is

memory.md and similar agent-written files are notes the agent has decided to keep, on disk, in a format you can open and read. They differ from memory in the human sense. The agent treats its own notes as suggestions only. Your job is to audit the file periodically and prune what should sit elsewhere.

2.3 Server-side conversation summaries

Examples: Claude.ai memory (Anthropic launched a free memory feature with a ChatGPT import tool on March 2, 2026), ChatGPT's Reference chat history setting.

What it is. The vendor's servers maintain a synthesis or retrieval index of your past conversations. When you start a new conversation, the synthesis is consulted and relevant material is brought into context. You see only the effects on the model's responses; the synthesis itself is server-side.

Trust model. Opaque. The synthesis is a vendor-side artifact. You can ask the model what it remembers about you. What you get back is the synthesis re-narrated, with the synthesis itself remaining server-side.

Anthropic's Claude.ai memory is project-scoped, which limits cross-conversation reach. The launch on March 2, 2026 made the feature free across all plans and shipped a ChatGPT import tool to lower the switching cost for users moving from ChatGPT to Claude.

ChatGPT's Reference chat history reaches across all of your conversations by default. It can be turned off independently of Saved Memories, and the help docs note that data retention for some services may be affected by recent legal developments (the ongoing NYT data demands).

Failure mode. The synthesis includes something that should have stayed in one conversation, or it omits something the user expected to be remembered. Operators must rely on vendor disclosures to debug either failure.

2.4 Structured external memory stores

Examples: Anthropic Memory Tool API, Mem0, Cline Memory Bank, claude-mem, Basic Memory, Recallium.

What it is. A dedicated storage system the agent reads from and writes to through tool calls. The developer controls the storage backend (file system, database, encrypted store, cloud blob, vector database). The agent treats the store as an external resource accessed through a defined API.

Anthropic's Memory Tool uses a filesystem metaphor under /memories with six commands: view, create, str_replace, insert, delete, rename. The tool name is memory_20250818. The launch was September 29, 2025. Anthropic auto-injects this system prompt when the tool is enabled:

IMPORTANT: ALWAYS VIEW YOUR MEMORY DIRECTORY BEFORE DOING ANYTHING ELSE. ASSUME INTERRUPTION: Your context window might be reset at any moment, so you risk losing any progress that is not recorded in your memory directory.

Cline Memory Bank is a documentation methodology. It describes a structured Markdown folder the agent reads at session start, with custom instructions guiding what gets written where. It works with any agent that can read documentation; the discipline is in the structure.

Trust model. Whatever you choose for the storage backend. File on disk: standard filesystem auditing applies. Database: standard database auditing. Encrypted store: standard key management applies. The mechanism is governable in a way the previous three are because the developer owns the surface.

Failure mode. The store accumulates noise the agent reads on every turn, or the store gets corrupted by indirect prompt injection (Section 6 covers this). Both are addressable through normal data hygiene plus injection defences.

Cross-tool comparison table showing where memory sits in the taxonomy for ten specific tool/feature combinations: Claude Code CLAUDE.md and auto memory, Claude.ai memory, ChatGPT Saved Memories and Reference chat history, Cursor rules and Memory Bank, Cline Memory Bank, Anthropic Memory Tool API, and third-party stores Mem0 / Basic Memory / claude-mem. Columns show what is called memory, where it lives, who writes it, and which mechanism category it occupies.

The cross-tool comparison

Two tools that both market memory can sit in different categories with different trust models. The column that matters for procurement is the rightmost one: which mechanism category each tool occupies.

Where each tool's memory sits in the taxonomy

Tool	What's called memory	Where it lives	Who writes it	Mechanism category
Claude Code	CLAUDE.md	Repo, in git	You	1: author-written contract
Claude Code	Auto memory	~/.claude/projects/<project>/memory/	The agent	2: agent-written notes
Claude.ai	Memory (free, March 2026)	Anthropic servers, project-scoped	The agent	3: server-side summary
ChatGPT	Saved Memories	OpenAI servers, account-scoped notepad	You + the agent	2: agent-written notes
ChatGPT	Reference chat history	OpenAI servers, cross-conversation	The agent	3: server-side summary
Cursor	.cursor/rules/*.mdc	Repo, in git	You	1: author-written contract
Cursor + Memory Bank	Memory Bank	Repo, structured Markdown folder	You (agent-prompted)	4: structured external store
Cline + Memory Bank	Memory Bank	Repo, structured Markdown folder	You (agent-prompted)	4: structured external store
Anthropic Memory Tool API	/memories directory	Developer-controlled backend	The agent (via tool calls)	4: structured external store
Mem0 / Basic Memory / claude-mem	Vendor-specific stores	Vendor or self-hosted	The agent (via tool calls)	4: structured external store

Cursor's .mdc files (category 1) and Cursor with Memory Bank (category 4) live in the same IDE, are both checked into git, and behave entirely differently at runtime.

The three misconceptions worth naming

4.1 "My agent remembers our conversations"

It has a summary it wrote, a search index, or pattern matches. None of those qualify as memory in the cognitive sense.

ChatGPT's own help docs are explicit: "Memory is intended for high-level preferences and details, and should not be relied on to store exact templates or large blocks of verbatim text."

The practical implication is that operators who treat memory as durable storage of conversation content are setting themselves up for surprise. Memory is a loose, lossy summary. For exact templates, use a structured external store (mechanism 4) or a regular database.

4.2 "memory.md (or CLAUDE.md) is private"

For mechanisms 1 and 2, the file lives on disk. It is in git. It is in CI. It syncs through cloud drives. Anyone with repo access reads it.

Anthropic's Memory Tool documentation includes explicit security guidance on path traversal, sensitive information handling, and storage size. The default assumption is that the storage is auditable across the team.

The practical implication is that operators should treat agent-written notes the way they treat any other file that lives next to the code: review before committing, rotate anything that looks like a credential, and audit periodically.

WARNING

memory.md is a public file

memory.md ships in git by default. It loads on every session, by every agent that reads it, for every developer on the team. Putting a credential or sensitive identifier in memory.md is the same as putting it in the repository: assume it is public, treat it accordingly.

Alert visual on dark canvas: memory.md filename rendered as a card surrounded by visualisations of its visibility surfaces (git, CI, cloud sync, team members). Strap line reads "It is in git. It is in CI. Anyone with repo access reads it." — memory.md ships in git by default. It loads on every session, by every agent that reads it, for every developer on the team. Treat it accordingly.

4.3 "Deleting memory makes the agent forget"

The ChatGPT Memory FAQ is explicit on three points worth naming:

First, deleted Saved Memories enter a retention log for up to 30 days for safety and debugging purposes. The deletion is logged before it propagates.

Second, a memory deleted from the Saved Memories notepad can still appear in past chats unless those chats are also deleted. The memory store and the chat archive are separate; deleting from one leaves the other intact.

Third, anything already in the current session's context window is unaffected by a memory deletion. The session retains what is already loaded.

The practical implication is that "deleting memory" is a multi-step operation. It removes the future-reference of the fact while leaving the historical references and the in-session context intact. Reset is real and takes time.

30 days

ChatGPT retention log for deleted Saved Memories

Deleted Saved Memories enter a 30-day retention log for safety and debugging purposes before propagating out of OpenAI systems.

Source: OpenAI Memory FAQ [4]

How to use these mechanisms well

Five steps for using memory mechanisms well

01
Name the mechanism.
Identify which of the four you are using. Vendors will say memory. You should be able to say author-written contract or server-side summary or whichever applies.
02
Audit what is written.
Agent-authored notes (mechanism 2) and server-side summaries (mechanism 3) are decided by the agent or the vendor. Open the file or run the audit command and read what is there.
03
Treat memory files like config.
Mechanisms 1, 2, and 4 all sit on disk. Review them before committing. Pull-request workflow applies.
04
Establish private versus shared at the file system layer.
CLAUDE.local.md (gitignored) is the private overlay; CLAUDE.md is the team-shared file. Mirror the same discipline for memory.md and equivalents.
05
For multi-tenant or compliance-sensitive use, demand structured external stores with proper isolation.
Mechanism 4 (Anthropic Memory Tool, Mem0, Cline Memory Bank) is the only category where the developer owns the storage backend and can apply standard isolation, encryption, and access-control patterns.

Three-card synthesis showing the misconceptions the article corrects: my agent remembers our conversations (it has a summary, search index, or pattern matches; none qualify as memory in the cognitive sense); memory.md or CLAUDE.md is private (the file is in git, in CI, on disk, visible to anyone with repo access); deleting memory makes the agent forget (deletion propagates over up to 30 days, current sessions retain whatever is loaded, and the historical references can persist).

For the visual map of where memory.md sits alongside SKILL.md, AGENTS.md, and CLAUDE.md, see the Agent Skills Landscape infographic.

Why this matters for governance

Multi-tenant isolation requires knowing which mechanism is in use. The four mechanisms have four different governance shapes:

Mechanism 1 (author-written contract): standard secret-management and source-control auditing apply. Everything is in git.
Mechanism 2 (agent-written notes): auditable in principle (the file is on disk), opaque in practice unless someone reads it. Plan for periodic audits.
Mechanism 3 (server-side summary): opaque. Vendor disclosures matter. Data-residency complications apply. Compliance frameworks treat this category most cautiously.
Mechanism 4 (structured external store): governance follows the storage backend the developer chose. The most flexible and the most the developer's responsibility.

Indirect prompt injection on memory : Palo Alto Unit 42 published research demonstrating that injected instructions can poison long-term agent memory across sessions, persist after the original conversation ends, and be used to exfiltrate data. The risk surface scales with mechanisms 2 and 4, where the agent or a connected tool writes to the store on the operator's behalf.

Three-stage horizontal flow on dark canvas. Stage 1: a content source (web page, document, email) with a hidden injected instruction in red. Stage 2: the agent reads the source and writes a poisoned entry to its memory store, with the injected instruction now indistinguishable from legitimate notes. Stage 3: a future session, the agent reads memory at session start, the poisoned entry activates, and the payload influences the agent's behaviour. — Indirect prompt injection on memory: an attacker plants instructions in content the agent reads, the agent writes the malicious payload to its memory store, and the payload persists across sessions and influences future behaviour. Mechanisms 2 and 4 carry this risk.

The practical implication for operators running anything beyond a single-user prototype: the four mechanisms each need a governance posture, and the posture differs by category. The harness around the agent is the layer that enforces these postures consistently.

For the upstream layer — which API actually feeds content into mechanisms 2 and 4, with what output shape and what anti-bot posture — the data acquisition rubric carries the same operator framing as this taxonomy. See our comparison of the six web scraping APIs AI agents use for the path-to-MCP scoring on which it is built.

You have probably already touched one

If you have used Claude Code, ChatGPT, Claude.ai, Cursor, or Cline at all in the past year, you have already worked with at least one of these four mechanisms. You may have written a CLAUDE.md. You may have noticed ChatGPT remembering that you live in San Francisco. You may have hit the auto-memory cap on Claude Code wondering what was happening. The mechanisms are already in your workflow; the question is whether you can name which one you are using when something surprises you.

The useful discipline is small. Name the mechanism. Audit the file. Treat the storage like the config artifact it is. Pick mechanism 4 when the governance posture demands it.

The word "memory" will keep doing too much work for a while yet. The taxonomy is what survives.

Four mechanisms. Four trust models. One word doing the work for all of them.

Frequently asked questions

It has a summary it wrote, a search index, or pattern matches. None of those qualify as memory in the cognitive sense. The ChatGPT FAQ states the design goal explicitly: memory is for high-level preferences and details, and should not be relied on for verbatim recall.

For mechanisms 1 and 2, the file lives on disk. It is in git, in CI, syncing through cloud drives. Anyone with repo access reads it. Treat it as public.

Yes. For Claude Code's auto memory, the file is editable in any text editor and the agent re-reads it at session start. Editing is the standard discipline; auditing without editing is incomplete.

Deletion takes effect over time. ChatGPT retains a 30-day log of deleted Saved Memories for safety and debugging purposes. A deleted memory can still appear in past chats unless those chats are also deleted. Anything already in the current session's context is unaffected by deletion.

Four likely causes: (a) the auto-memory file hit a load cap (200 lines or 25 KB for Claude Code); (b) the server-side summary deprioritised the fact (ChatGPT's Plus/Pro automatic memory management does this); (c) the fact lived in chat history that was deleted; (d) the fact was in a session whose context window has rotated past it.

For agent-written notes that you want the team to share, yes. For personal overrides, use the .local variant (e.g. CLAUDE.local.md) and add it to .gitignore. Most setups want both.

No. Claude.ai memory is mechanism 3 (server-side summary), launched as a free feature on March 2, 2026 with a ChatGPT import tool. Claude Code memory is mechanism 1 (CLAUDE.md) plus mechanism 2 (auto memory in ~/.claude/). Same vendor, different products, different mechanisms.

They occupy the same category in the taxonomy (mechanism 3 for the server-side variants), but the implementations differ. Claude.ai memory is project-scoped. ChatGPT's Reference chat history reaches across all conversations by default. Compliance reviews need to distinguish the two.

No. Mem0, Cline Memory Bank, and similar tools are mechanism 4 (structured external store). memory.md is mechanism 2 (agent-written notes file). The categories are distinct. Mem0 markets itself as a memory layer; Cline Memory Bank markets itself as a documentation methodology; the underlying engineering pattern is the same: a developer-owned store the agent reads from and writes to through a defined interface.

The feature went from paid-tier to free across all Claude.ai plans, and shipped with a ChatGPT import tool. The strategic intent was lowering the switching cost for ChatGPT users. The mechanism is the same as ChatGPT's: server-side summary of past conversations, opaque to the user.

Article Sources10 referencesShow referencesHide references

We reviewed the sources below to support the claims, pricing, and benchmarks referenced in this article.

Memory tool
Anthropic (Claude API Docs)official
Source for the Anthropic Memory Tool API: /memories directory, six commands (view/create/str_replace/insert/delete/rename), tool name memory_20250818, the auto-injected system prompt about memory directory checks, and the security guidance on path traversal cited in Section 4.2.
How Claude remembers your project
Anthropic (Claude Code Docs)official
Source for the Claude Code auto-memory load behaviour (first 200 lines or 25 KB at session start) cited in Section 2.2 and the FAQ on the auto-memory cap.
Anthropic Adds Free Memory Feature and Import Tool to Lure ChatGPT Users to Claude
MacRumors (Tim Hardwick)community
Source for the March 2, 2026 Claude.ai memory free-tier launch and the ChatGPT import tool cited in Section 2.3 and the FAQ on what changed in March 2026.
Memory FAQ
OpenAI Help Centerofficial
Source for ChatGPT Saved Memories vs Reference chat history, the 30-day retention log for deleted memories, the verbatim quote on memory intent (high-level preferences, not verbatim text), and the deletion-propagation behaviour cited in Sections 2.2, 2.3, 4.1, and 4.3 plus the StatHighlight.
How ChatGPT Remembers You: A Deep Dive into Its Memory and Preferences
Embracethered (Johann Rehberger)community
Practitioner deep-dive on ChatGPT memory and preferences referenced for citation density on the server-side summary mechanism.
How Memory Works in Claude Code
Mem0 (Livia Ellen)community
Source for the implementation-side analysis of Claude Code auto memory framed as the 200-line cap and the silently-forgets-at-line-201 framing cited in Section 2.2.
Exploring Anthropic's Memory Tool
Leonie Monigatticommunity
Source for the Anthropic Memory Tool September 29, 2025 launch date cited in Section 2.4.
Memory Bank
Clineofficial
Source for the Cline Memory Bank methodology cited in Section 2.4 as a structured external store.
Memory Bank: How to Make Cline an AI Agent That Never Forgets
Clinecommunity
Companion source on the Cline Memory Bank pattern cited in Section 2.4.
When AI Remembers Too Much: Persistent Behaviors in Agents via Indirect Prompt Injection
Palo Alto Networks Unit 42research
Source for the indirect prompt injection on memory threat covered in Section 6 governance.

Written by

Michael Nouriel

Platform Engineer & Founder, Scaletific + Automation Switch

Michael Nouriel is a platform engineer and founder of Scaletific and Automation Switch. He builds governed AI execution infrastructure, including GoldenPath IDP and AEP, a runtime enforcement layer for AI-assisted software delivery. He writes about automation engineering, cloud infrastructure, and what it actually takes to run AI agents in production.