Agent Memory · Long-Term Memory
Yi Yu, Liuyi Yao et al.
arXiv 2026 · 2026
Agentic Memory (AgeMem) exposes **memory management tools**, a **three-stage progressive RL strategy**, and **step-wise GRPO** directly inside the agent policy to jointly control long-term and short-term memory. On Qwen3-4B-Instruct, AgeMem attains 54.31% average performance across ALFWorld, SciWorld, PDDL, BabyAI, and HotpotQA, exceeding the best baseline A-Mem at 45.74%.
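A minimal sketch of what tool-based memory control by the policy could look like; class and method names (`MemoryToolbox`, `mem_add`, `mem_search`, `mem_delete`) are invented for illustration and the lexical scoring is a toy stand-in for behavior AgeMem learns with RL:

```python
class MemoryToolbox:
    """Long-term store the agent policy edits via tool calls; the
    short-term memory is the rolling context window the caller keeps."""

    def __init__(self):
        self.long_term = []  # list of (key, content) entries

    def mem_add(self, key, content):
        self.long_term.append((key, content))
        return f"stored:{key}"

    def mem_search(self, query, top_k=3):
        # Toy word-overlap ranking; the paper trains when/what to
        # retrieve via step-wise GRPO rather than a fixed heuristic.
        scored = sorted(
            self.long_term,
            key=lambda kv: -len(set(query.split()) & set(kv[1].split())),
        )
        return scored[:top_k]

    def mem_delete(self, key):
        self.long_term = [kv for kv in self.long_term if kv[0] != key]
        return f"deleted:{key}"


tools = MemoryToolbox()
tools.mem_add("goal", "find the mug in the kitchen")
tools.mem_add("fact", "the kitchen drawer holds a spoon")
hits = tools.mem_search("where is the mug")
```

The point of exposing these as tools is that the same policy that acts in ALFWorld-style environments also decides when to write, read, or prune memory.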
Benchmark · Long-Term Memory
Mohammad Tavakoli, Alireza Salemi et al.
arXiv 2025 · 2025
LIGHT augments LLMs with **Retrieval from the Conversation**, **Scratchpad Formation and Utilization**, and a **Working Memory** buffer plus noise filtering to answer BEAM’s long-context probing questions. On the BEAM benchmark, LIGHT raises GPT-4.1-nano’s average score at 10M-token conversations from 0.109 to 0.226, a +107.3% gain over the vanilla long-context baseline.
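The three complementary memories can be sketched as follows; the class name `LightMemory`, the overlap-based retriever, and the simple noise filter are illustrative assumptions, not the paper's implementation:

```python
from collections import deque


class LightMemory:
    def __init__(self, window=2):
        self.history = []                    # full conversation, for retrieval
        self.scratchpad = []                 # distilled salient facts
        self.working = deque(maxlen=window)  # bounded recent-turn buffer

    def observe(self, turn, salient_fact=None):
        self.history.append(turn)
        self.working.append(turn)
        if salient_fact:                     # scratchpad formation
            self.scratchpad.append(salient_fact)

    def retrieve(self, query, top_k=2):
        # Toy overlap score standing in for a learned retriever; the
        # filter drops hits sharing no words with the query (noise filtering).
        scored = sorted(self.history,
                        key=lambda t: -len(set(query.split()) & set(t.split())))
        return [t for t in scored[:top_k]
                if set(query.split()) & set(t.split())]

    def build_context(self, query):
        return {"retrieved": self.retrieve(query),
                "scratchpad": list(self.scratchpad),
                "working_memory": list(self.working)}


mem = LightMemory(window=2)
mem.observe("user: my cat is named Milo", salient_fact="cat name = Milo")
mem.observe("user: I moved to Oslo last year", salient_fact="lives in Oslo")
mem.observe("user: what a rainy day")
ctx = mem.build_context("what is my cat named")
```

The division of labor mirrors the paper: retrieval covers the arbitrarily long past, the scratchpad holds distilled facts, and working memory keeps only the freshest turns.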
RAG · Memory Architecture · Long-Term Memory
Bernal Jiménez Gutiérrez, Yiheng Shu et al.
ICML 2025 · 2025
HippoRAG 2 combines **Offline Indexing**, a schema-less **Knowledge Graph**, **Dense-Sparse Integration**, **Deeper Contextualization**, and **Recognition Memory** into a neuro-inspired non-parametric memory system for LLMs. On the joint RAG benchmark suite, HippoRAG 2 achieves 59.8 average F1 versus 57.0 for NV-Embed-v2, including 71.0 F1 on 2Wiki compared to 61.5 for NV-Embed-v2.
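The offline-indexing idea can be sketched with a toy passage-entity graph; `GraphMemory` and its spreading-activation query are invented stand-ins for the LLM-built schema-less knowledge graph and Personalized PageRank retrieval of the real system:

```python
from collections import defaultdict


class GraphMemory:
    def __init__(self):
        self.entity_to_passages = defaultdict(set)
        self.passages = {}

    def index(self, pid, text, entities):
        # Offline indexing: in the real system an LLM extracts
        # entities/triples; here they are supplied by hand.
        self.passages[pid] = text
        for e in entities:
            self.entity_to_passages[e.lower()].add(pid)

    def query(self, question_entities):
        # Crude spreading activation over the graph, standing in for
        # Personalized PageRank seeded at query-matched entity nodes.
        scores = defaultdict(float)
        for e in question_entities:
            for pid in self.entity_to_passages.get(e.lower(), ()):
                scores[pid] += 1.0
        return sorted(scores, key=scores.get, reverse=True)


gm = GraphMemory()
gm.index("p1", "Stanford is in Palo Alto.", ["Stanford", "Palo Alto"])
gm.index("p2", "Fei-Fei Li is a professor at Stanford.", ["Fei-Fei Li", "Stanford"])
ranked = gm.query(["Fei-Fei Li", "Stanford"])
```

Passages sharing more query entities surface first, which is the intuition behind the system's multi-hop gains on datasets like 2Wiki.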
Long-Term Memory · Agent Memory
Zhen Tan, Jun Yan et al.
ACL 2025 · 2025
Reflective Memory Management (RMM) uses a **memory bank**, **retriever**, **reranker**, and **LLM** to implement Prospective Reflection and Retrospective Reflection for topic-based storage and RL-based retrieval refinement. On LongMemEval, RMM with GTE achieves 69.8% Recall@5 and 70.4% accuracy, compared to 62.4% Recall@5 and 63.6% accuracy for GTE RAG.
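A hedged sketch of the two reflection loops, with invented names: prospective reflection stores sessions as topic-keyed summaries, and retrospective reflection nudges per-memory weights up when the answer actually cited a retrieved memory (a toy stand-in for the paper's RL-based refinement):

```python
class ReflectiveMemory:
    def __init__(self, lr=0.5):
        self.bank = {}     # topic -> summary (the memory bank)
        self.weight = {}   # topic -> learned retrieval weight
        self.lr = lr

    def prospective_store(self, topic, summary):
        self.bank[topic] = summary
        self.weight.setdefault(topic, 1.0)

    def retrieve(self, query, top_k=2):
        # Retriever + reranker collapsed into one weighted overlap score.
        def score(topic):
            overlap = len(set(query.split()) & set(self.bank[topic].split()))
            return overlap * self.weight[topic]
        return sorted(self.bank, key=score, reverse=True)[:top_k]

    def retrospective_update(self, retrieved, cited):
        # Reward memories the answer cited; penalize retrieved-but-ignored ones.
        for topic in retrieved:
            delta = self.lr if topic in cited else -self.lr
            self.weight[topic] = max(0.1, self.weight[topic] + delta)


rmm = ReflectiveMemory()
rmm.prospective_store("pets", "user adopted a beagle puppy")
rmm.prospective_store("travel", "user visited Kyoto in spring")
hits = rmm.retrieve("tell me about the beagle puppy")
rmm.retrospective_update(retrieved=hits, cited=["pets"])
```

The citation signal is attractive because it needs no extra labels: whether the generator used a memory is observable from its own output.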
Agent Memory · Long-Term Memory · Memory Architecture
Prateek Chhikara, Dev Khant et al.
arXiv 2025 · 2025
Mem0 incrementally processes conversations using the **extraction phase**, **update phase**, **asynchronous summary generation module**, **tool call mechanism**, and a **vector database** to build scalable long-term memory. On the LOCOMO benchmark, Mem0 attains a J score of 67.13 on single-hop questions versus 63.79 for OpenAI and cuts p95 latency from 17.117s to 1.440s compared to the full-context baseline.
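An illustrative sketch of the extract-then-update pipeline; the class `Mem0Like` is invented, a dict plus string similarity stands in for the vector database, and the sentence splitter stands in for the LLM extractor:

```python
import difflib


class Mem0Like:
    def __init__(self, threshold=0.6):
        self.store = {}        # memory_id -> fact text (vector DB stand-in)
        self.next_id = 0
        self.threshold = threshold

    def extract(self, message):
        # Extraction phase: treat each sentence as a candidate fact.
        return [s.strip() for s in message.split(".") if s.strip()]

    def update(self, fact):
        # Update phase: compare against the most similar stored memory
        # and choose an operation (ADD vs UPDATE here; the real system
        # also supports DELETE and NOOP via a tool-call mechanism).
        best_id, best_sim = None, 0.0
        for mid, text in self.store.items():
            sim = difflib.SequenceMatcher(None, fact, text).ratio()
            if sim > best_sim:
                best_id, best_sim = mid, sim
        if best_sim >= self.threshold:
            self.store[best_id] = fact      # UPDATE: refresh a stale memory
            return "UPDATE", best_id
        self.store[self.next_id] = fact     # ADD: genuinely new information
        self.next_id += 1
        return "ADD", self.next_id - 1

    def ingest(self, message):
        return [self.update(f) for f in self.extract(message)]


m = Mem0Like()
ops1 = m.ingest("Alice lives in Paris.")
ops2 = m.ingest("Alice lives in Berlin.")
```

Keeping one consolidated fact per topic instead of raw transcripts is what lets the real system avoid full-context reads and cut tail latency.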
Benchmark · Long-Term Memory
Di Wu, Hongwei Wang et al.
ICLR 2025 · 2024
LongMemEval evaluates long-term interactive memory by running chat assistants through **indexing**, **retrieval**, and **reading** over 50k sessions with fact-augmented keys and time-aware query expansion. On LongMemEval_S, long-context LLMs like GPT-4o, Llama 3.1, and Phi-3 suffer 30%–60% accuracy drops compared to oracle evidence-only reading, revealing severe limitations in current long-context designs.
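The three-stage pipeline the benchmark evaluates can be sketched as follows; `MemoryPipeline` and its token-overlap retrieval are illustrative assumptions, with fact-augmented keys and a date-expanded query approximating the paper's proposed optimizations:

```python
class MemoryPipeline:
    def __init__(self):
        self.index_store = []   # (key_tokens, session_text, date)

    def index(self, session_text, facts, date):
        # Fact-augmented key: raw session tokens plus distilled-fact tokens.
        key = (set(session_text.lower().split())
               | set(" ".join(facts).lower().split()))
        self.index_store.append((key, session_text, date))

    def retrieve(self, question, dates=(), top_k=1):
        # Time-aware query expansion: fold referenced dates into the query.
        q = set(question.lower().split()) | set(dates)
        ranked = sorted(self.index_store,
                        key=lambda e: -len(q & (e[0] | {e[2]})))
        return [e[1] for e in ranked[:top_k]]

    def read(self, question, evidence):
        # Reading stage: answer from retrieved evidence only (an LLM in
        # practice; returning the top session keeps the sketch runnable).
        return evidence[0] if evidence else ""


pipe = MemoryPipeline()
pipe.index("we talked about my marathon plan",
           ["user trains for a marathon"], "2023-05-01")
pipe.index("we discussed sourdough baking",
           ["user bakes sourdough"], "2023-06-10")
evidence = pipe.retrieve("when did I mention the marathon",
                         dates=("2023-05-01",))
answer = pipe.read("when did I mention the marathon", evidence)
```

The benchmark's headline finding is exactly the gap this pipeline targets: models read well when handed the right evidence but degrade sharply when they must find it in a huge history themselves.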