MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

👋 Introduction

This repo is the official implementation of MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

Inspired by the human brain’s ability to dynamically integrate memory and reasoning, MemGen introduces a novel framework that empowers AI agents to evolve through experience—without relying on rigid parameter updates or external databases.

Unlike traditional approaches, MemGen generates latent memory tokens directly within the model’s reasoning stream. It features:

A Memory Trigger that decides when to recall memory.
A Memory Weaver that synthesizes past experiences into compact, latent sequences—seamlessly enriching ongoing reasoning.

🌎 Setup

Create and activate the MemGen environment:
Option 1: Install via requirements.txt

conda create -n memgen python=3.10
conda activate memgen
pip install -r requirements.txt

Option 2: Install via memgen.yml

conda env create -f memgen.yml
conda activate memgen

🚀 Quick Start

🔧 Installation: Set Up Search Environment

Please follow the instructions in the Search-R1 to configure the retriever environment (optional).

▶️ How to Run

MemGen consists of two modules: Weaver and Trigger.
We follow a two-stage training approach, training each module separately.

Weaver Model

Train the Weaver model
```
bash weaver_train.sh
```
Evaluate the Weaver model
Before running, make sure to update LOAD_MODEL_PATH in eval.sh to point to the trained checkpoint: <weaver_dir>
```
bash eval.sh
```

Trigger Model

Train the Trigger model
```
bash trigger_train.sh
```
Evaluate the Trigger model
Before running, make sure to update LOAD_MODEL_PATH in eval.sh to point to the trained checkpoint: <trigger_dir>
```
bash eval.sh
```

🕒 Plans

The current repository supports the following features:

Basic MemGen model implementation
Single/Multi-turn SFT weaver training
Trigger RL training

Additional features are planned and will be introduced gradually as they are finalized for public release.

Single/Multi-turn GRPO weaver training
Integration with retrieval-based memory systems
Baseline suite

We sincerely appreciate your patience, interest, and support as we continue to enhance the project and make components more efficient.

🫡 Citation

If you find this repository helpful, a citation to our paper would be greatly appreciated:

@article{zhang2025memgen,
  title={MemGen: Weaving Generative Latent Memory for Self-Evolving Agents},
  author={Zhang, Guibin and Fu, Muxin and Yan, Shuicheng},
  journal={arXiv preprint arXiv:2509.24704},
  year={2025}
}

🙏 Acknowledgement

We sincerely thank Search-R1 for open-sourcing their search web environment.
We also extend our heartfelt thanks to LAVIS for their code framework design.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
common		common
configs		configs
data		data
interactions		interactions
memgen		memgen
scripts		scripts
README.md		README.md
main.py		main.py
memgen.yml		memgen.yml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

👋 Introduction

🌎 Setup

🚀 Quick Start

🔧 Installation: Set Up Search Environment

▶️ How to Run

Weaver Model

Trigger Model

🕒 Plans

🫡 Citation

🙏 Acknowledgement

About

Uh oh!

Releases

Packages

Languages

KANABOON1/MemGen

Folders and files

Latest commit

History

Repository files navigation

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

👋 Introduction

🌎 Setup

🚀 Quick Start

🔧 Installation: Set Up Search Environment

▶️ How to Run

Weaver Model

Trigger Model

🕒 Plans

🫡 Citation

🙏 Acknowledgement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages