MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Hongli Yu, Tinghong Chen, Jiangtao Feng, Jiangjie Chen, Weinan Dai, Qiying Yu, Ya-Qin Zhang, Wei-Ying Ma, Jingjing Liu, Mingxuan Wang, Hao Zhou

July, 2025

Go to Project Site

MemAgent: A multi-conv RL-based memory agent for handling extremely long documents with linear complexity.

Abstract

The paper introduces MemAgent, a novel approach to handling extremely long documents in language models. The system reads text in segments and updates the memory using an overwrite strategy and extends the DAPO algorithm for training. Key achievements include extrapolating from an 8K context trained on 32K text to a 3.5M QA task with performance loss < 5% and achieving 95%+ in 512K RULER test. This represents significant progress in long-context language model capabilities, demonstrating substantial scalability improvements through reinforcement learning-based memory management with multi-conversation generation training.

Type

Publication

Preprint

MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent

Abstract

Jiangjie Chen

Researcher