Human-Like Lifelong Memory: A Neuroscience-Grounded Architecture for Infinite Interaction

AuthorsDiego C. Lerma-Torres

2026

TL;DR

Human-Like Lifelong Memory uses a thalamic gateway plus valence-vector knowledge graph to turn long interaction histories into cheaper, System 1–style processing over time.

SharePost on X LinkedIn

Read our summary here, or open the publisher PDF on the next tab.

THE PROBLEM

Long context degrades reasoning by up to 85%

Du et al. (2025) show that context length alone degrades performance by up to 85%, even when retrieval is perfect.

LLMs that stuff everything into one context window lose structure, causing brittle long-term interaction and expensive, unreliable reasoning over long histories.

HOW IT WORKS

Human-Like Lifelong Memory — complementary stores, valence, and a gateway

Human-Like Lifelong Memory centers on Executive Function and Working Memory, a Memory Service Knowledge Graph, and a Thalamic Gateway that tags, gates, and routes information.

You can think of Human-Like Lifelong Memory like a brain-inspired split between RAM and disk, with a hippocampal-style graph and a thalamus-like router in front.

This architecture lets Human-Like Lifelong Memory use valence vectors and dual System 1/System 2 routing to gain identity, graded confidence, and cheaper expertise that a plain context window cannot achieve.

DIAGRAM

System 1 and System 2 retrieval flow in Human-Like Lifelong Memory

This diagram shows how Human-Like Lifelong Memory routes a single query through System 1 automatic retrieval, then escalates to System 2 deliberate search when needed.

DIAGRAM

Seven functional properties that constrain implementations

This diagram shows how Human-Like Lifelong Memory’s seven functional properties organize around working memory, the gateway, and the knowledge graph.

PROCESS

How Human-Like Lifelong Memory Handles a Lifelong Interaction Session

01
Executive Function and Working Memory
Human-Like Lifelong Memory uses Executive Function and Working Memory as a capacity limited workspace where high weight gists persist as emergent identity and current context.
02
Thalamic Gateway Tagging and Gating
The Thalamic Gateway tags each segment with multi channel salience and decides which items to promote to the Memory Service Knowledge Graph or inject back as priming.
03
System 1 and System 2 Routing
Human-Like Lifelong Memory defaults to System 1 automatic retrieval from gists and escalates to System 2 deliberate search over the full graph when novelty, stakes, or sparsity demand it.
04
Cathartic Update and Active Gist Formation
Through curiosity driven investigation, Human-Like Lifelong Memory forms new gists and uses cathartic update to modify existing gists when strong contradictions and precision thresholds are met.

KEY CONTRIBUTIONS

Key Contributions

01
Neuroscience grounded dual store architecture
Human-Like Lifelong Memory formalizes a dual store system with Executive Function and Working Memory plus a Memory Service Knowledge Graph, mirroring complementary learning systems.
02
Valence vectors and belief hierarchy
Human-Like Lifelong Memory attaches valence vectors with precision scalars to gists and maps Beck’s CBT belief hierarchy onto an emergent weight distribution in the knowledge graph.
03
Seven functional properties and predictions
Human-Like Lifelong Memory specifies seven functional properties, including graded epistemic self awareness and monotonic convergence toward System 1, plus eight concrete experimental predictions.

RESULTS

By the Numbers

Context length degradation

85% performance drop

Du et al. (2025) vs short context

Context window pricing

$0.30 to $5.00 per 1M tokens

Gemini 2.5 Flash vs Claude Opus 4.6

Salience channels

6 tagging dimensions

thematic, emotional, urgency, novelty, trust, goal

Functional properties

7 required properties

architecture constraints for implementations

Human-Like Lifelong Memory is motivated by Du et al. (2025), where long contexts cause up to 85% performance degradation even with perfect retrieval. The framework argues that explicit lifelong memory with valence vectors and gating is needed to avoid paying $0.30–$5.00 per million tokens while still maintaining reasoning quality.

BENCHMARK

By the Numbers

BENCHMARK

Economic cost of naive context expansion (Q1 2026 pricing)

Dollar cost per 1M input tokens for different LLMs, motivating Human-Like Lifelong Memory’s emphasis on cheaper System 1 processing.

KEY INSIGHT

The Counterintuitive Finding

Du et al. (2025) show that simply increasing context length can degrade reasoning performance by up to 85%, even with perfect retrieval.

This breaks the common assumption that more context is always better and reveals that undifferentiated long windows can actively harm reasoning instead of helping.

WHY IT MATTERS

What this unlocks for the field

Human-Like Lifelong Memory unlocks agents that develop stable identities, graded epistemic confidence, and cheaper expert like System 1 behavior over long lived interactions.

Builders can now design memory systems around valence vectors, belief hierarchies, and thalamic style gateways instead of just scaling context windows and naive retrieval.

~14 min read← Back to papers

Related papers

Cognitive ArchitectureAgent Memory

Aeon: High-Performance Neuro-Symbolic Memory Management for Long-Horizon LLM Agents

Mustafa Arslan

· 2026

Aeon restructures LLM memory using the Atlas, Trace, Semantic Lookaside Buffer, Write Ahead Log, and Sidecar Blob Arena inside a zero copy Core Shell kernel. Aeon achieves 4.70 ns INT8 dot products, 3.09 µs Atlas traversal at 100K nodes, 3.1× compression, and P99 read latency of 750 ns under 16 thread contention compared to FP32 and flat scan baselines.

arXiv:2601.15311 Read explainer

Cognitive ArchitectureAgent Memory

Aligning Progress and Feasibility: A Neuro-Symbolic Dual Memory Framework for Long-Horizon LLM Agents

Bin Wen, Ruoxuan Zhang et al.

· 2026

Neuro-Symbolic Dual Memory Framework uses Progress Memory, Feasibility Memory, a Blueprint Planner Agent, a Progress Monitor Agent, and an Actor Agent to decouple semantic progress guidance from executable feasibility checks. On ALFWorld, Neuro-Symbolic Dual Memory Framework achieves 94.78% success rate versus 88.81% for AWM, and on WebShop reaches 0.7132 score versus 0.5998 for WALL-E 2.0.

arXiv:2604.02734 Read explainer

Cognitive ArchitectureAgent Memory

D-Mem: A Dual-Process Memory System for LLM Agents

Zhixing You, Jiachen Yuan, Jason Cai

· 2026

D-Mem combines Mem0∗, Quality Gating, and Full Deliberation into a dual-process memory system that incrementally stores vector memories and selectively scans raw history. On LoCoMo with GPT-4o-mini, D-Mem’s Quality Gating reaches 53.5 F1 versus the Mem0∗ baseline’s 51.2 F1, recovering 96.7% of the 55.3 F1 Full Deliberation performance with far fewer tokens.

arXiv:2603.18631 Read explainer

Questions about this paper?

Answers use this explainer on Memory Papers.

Checking…