Skip to content

Moore-Tian/SteeM-Memory-Control

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SteeM logo   Controllable Memory Usage: Balancing
Anchoring and Innovation in Long-Term Human-Agent Interaction

arXiv Paper PDF License ACL 2026 Main Conference

Overview | Assets | Quick Start | Citation

Models memory reliance as a user-controllable dimension—enabling regulation from fresh-start innovation to history-adherent personalization.

📖 Overview

As LLM-based agents are increasingly used in long-term interactions, cumulative memory is critical for personalization and stylistic consistency. However, most systems adopt an "all-or-nothing" approach—incorporating all past information can lead to Memory Anchoring (the agent is trapped by past interactions), while excluding memory entirely results in under-utilization.

SteeM introduces a framework that models memory reliance as an explicit, user-controllable dimension, allowing dynamic regulation from a fresh-start mode (promoting innovation) to a high-fidelity mode (closely following interaction history).

SteeM Framework Overview

SteeM introduces rubric-based memory-dependence scoring, preference-aligned data generation, training, and controllable generation modes ranging from fresh-start to history-adherent behavior.

✨ Key Contributions

  • Memory Dependence Metric — A behavioral metric to quantify the influence of past interactions on current outputs
  • Steerable Generation — Users can dynamically regulate memory reliance at inference time
  • Data Pipeline — A complete pipeline for constructing long-term interaction contexts with timelines, artifacts, concepts, and cross-session summaries
  • Consistent Improvements — Outperforms conventional prompting and rigid memory masking strategies across scenarios

📦 Released Assets

Asset Description
timeline_generation/output_gemini 194 research + 194 tutoring cases (events.json & stats.json)
context_merge/all_contexts.json.gz 11,541 merged query-context examples (gzip)
context_merge/sampled_contexts.json 400 sampled context examples
memory_control_method/ Complete method code for controllable memory

🚀 Quick Start

Installation

git clone https://github.com/Moore-Tian/SteeM-Memory-Control.git
cd SteeM-Memory-Control
pip install -r requirements.txt

Environment Variables

export OPENAI_API_KEY="your-api-key"
export OPENAI_BASE_URL="https://..."  # optional

Data Pipeline

Run the stages sequentially from the repository root:

# Step 1: Generate interaction timelines
bash data_pipeline/timeline_generation/run_generate.sh

# Step 2: Generate artifacts
bash data_pipeline/artifact_generation/run_generate.sh

# Step 3: Extract concepts
bash data_pipeline/concept_generation/run_generate.sh

# Step 4: Build cross-session summaries
bash data_pipeline/cross_session_generation/run_generate.sh

Evaluation

MODEL_PATH=/path/to/model \
TEST_DATA_PATH=/path/to/test_data.jsonl \
RUN_JUDGE=1 \
RUN_METRIC=1 \
bash memory_control_method/evaluation/run_evaluation_pipeline.sh

📝 Citation

If you find this work useful, please cite our paper:

@article{tian2026controllable,
  title={Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction},
  author={Tian, Muzhao and Huang, Zisu and Wang, Xiaohua and Xu, Jingwen and Guo, Zhengkang and Qian, Qi and Shen, Yuanzhe and Song, Kaitao and Yuan, Jiakang and Lv, Changze and Zheng, Xiaoqing},
  journal={arXiv preprint arXiv:2601.05107},
  year={2026}
}

About

Code, prompts, evaluation, and curated examples for "Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction".

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors