Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
105 commits
Select commit Hold shift + click to select a range
1220572
feat: check nodes existence
Nyakult Jul 25, 2025
1396919
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Jul 25, 2025
85b89bb
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Jul 28, 2025
c8c1488
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Jul 28, 2025
8baf5c6
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Jul 29, 2025
9982782
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Jul 30, 2025
f3dd6e7
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Jul 30, 2025
4471790
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 1, 2025
0f9ccd4
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 1, 2025
27196ef
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 1, 2025
70d0a4a
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 5, 2025
5dd9662
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 6, 2025
27203e7
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 6, 2025
b2cd7f0
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 8, 2025
6fa6af7
feat: use different template for different language input
Nyakult Aug 8, 2025
b641c51
feat: use different template for different language input
Nyakult Aug 8, 2025
9f5aca1
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 12, 2025
5eafce4
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 13, 2025
5c2e637
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 18, 2025
332bab6
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 26, 2025
d3dca58
fix: eval script
Nyakult Aug 26, 2025
45cee24
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Aug 27, 2025
b1b448e
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Sep 1, 2025
ffb034e
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Sep 3, 2025
4551297
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Sep 4, 2025
204b545
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Sep 5, 2025
298d155
Merge remote-tracking branch 'upstream/dev' into dev
Nyakult Sep 25, 2025
84d421d
feat: memos-api eval scripts
Nyakult Sep 25, 2025
6956cf0
Merge remote-tracking branch 'upstream/main' into eval/0925
Nyakult Sep 25, 2025
6a465f4
feat: mem reader
Nyakult Sep 25, 2025
bcfa7c9
feat: 实现äºprefeval memos-api evaluation scripts
2Rant Sep 25, 2025
eac7984
Merge pull request #2 from 2Rant/prefeval
Nyakult Sep 25, 2025
035d1a1
refactor:format code
Nyakult Sep 25, 2025
9c2ab81
feat: add PersonaMem eval scripts
Nyakult Sep 25, 2025
92b78c1
docs(evaluation): update PersonaMem eval readme
Nyakult Sep 25, 2025
82fecee
feat:memos-api ingest batch message
Nyakult Sep 25, 2025
1ca5ead
feat: refactor search
Nyakult Sep 28, 2025
405a162
feat: refactor search
Nyakult Sep 28, 2025
7628da8
update: add api for memory
fridayL Sep 28, 2025
b12db2f
Merge pull request #4 from fridayL/searchupdate
Nyakult Sep 28, 2025
235125a
feat: add memory api return memory and memory type
Nyakult Sep 28, 2025
81bc1f6
refactor(server):重构服务器路由模块以优化内存管理
Nyakult Sep 28, 2025
71f357a
format: ruff format code
Nyakult Sep 29, 2025
c04ed79
feat(server): 增加LLM最大令牌数
Nyakult Sep 29, 2025
aaa5d18
test
Nyakult Sep 29, 2025
880f60c
fix: user query embedding for search
Nyakult Sep 29, 2025
cce9f6c
count memory_size by user
Nyakult Sep 29, 2025
9b46589
fix(server):修复记忆读取逻辑中的列表展开问题
Nyakult Sep 29, 2025
9a45f60
feat(nebular):优化图数据库查询性能
Nyakult Oct 14, 2025
0887a6b
Merge branch 'feat/search1' into eval/0929-test
Nyakult Oct 15, 2025
e53e810
refactor(memory):
Nyakult Oct 15, 2025
4936907
feat: remove user idx_memory_user_name
Nyakult Oct 15, 2025
c74fe37
Merge branch 'feat/search1' into eval/0929-test
Nyakult Oct 15, 2025
d51885b
feat(graph):优化Nebula图数据库查询性能
Nyakult Oct 15, 2025
50163bb
Merge branch 'feat/search1' into eval/0929-test
Nyakult Oct 15, 2025
1e5021b
feat: rollback remove_oldest_memory
Nyakult Oct 15, 2025
b69a35e
Merge remote-tracking branch 'upstream/test' into eval/0929-test
Nyakult Oct 15, 2025
b300cd2
Merge remote-tracking branch 'upstream/test' into feat/search1
Nyakult Oct 15, 2025
2bfde7d
feat:nebula gql add index
Nyakult Oct 15, 2025
3499003
Merge branch 'feat/search1' into eval/0929-test
Nyakult Oct 15, 2025
07751cd
feat: align code
Nyakult Oct 15, 2025
0857bb1
Merge branch 'feat/search1' into eval/0929-test
Nyakult Oct 15, 2025
756550d
feat: update memos_api
Nyakult Oct 16, 2025
59ceadf
feat: update memos_api
Nyakult Oct 16, 2025
1567987
feat: 更新默认选项
Nyakult Oct 16, 2025
1edfebe
feat:memory client
Nyakult Oct 16, 2025
18a63e2
feat:refactor lme
Nyakult Oct 16, 2025
de11c9b
feat: memu & supermemory client
Nyakult Oct 16, 2025
279c4d9
feat: locomo memu
Nyakult Oct 17, 2025
55515c3
feat: locomo supermemory
Nyakult Oct 17, 2025
b810bd9
New 'add' and 'process' modes.
2Rant Oct 17, 2025
4980c62
Merge pull request #5 from 2Rant/eval/0929-test
Nyakult Oct 17, 2025
94c6661
feat: lme supermemory & memu
Nyakult Oct 17, 2025
f32dabd
feat: default args
Nyakult Oct 17, 2025
9ef548e
api and local
2Rant Oct 17, 2025
043d260
api and local
2Rant Oct 17, 2025
5c66068
memobase fix
Nyakult Oct 20, 2025
ccc865e
Merge remote-tracking branch 'upstream/test' into eval/0929-test
Nyakult Oct 20, 2025
7715343
memos fix
Nyakult Oct 20, 2025
f19af21
default args
Nyakult Oct 20, 2025
e9fa1ed
fix memos-api search data
Nyakult Oct 20, 2025
496387f
Merge pull request #6 from 2Rant/eval/0929-test
Nyakult Oct 20, 2025
8babcad
Merge remote-tracking branch 'upstream/dev' into eval/0929-test
Nyakult Oct 20, 2025
9b0e7ef
prefeval pipeline
2Rant Oct 20, 2025
b1f8d4d
fix lme memos-api
Nyakult Oct 21, 2025
697fc60
Merge pull request #7 from 2Rant/eval/1020
Nyakult Oct 21, 2025
627ee18
personamem pipeline
2Rant Oct 21, 2025
23404c8
personamem pipeline
2Rant Oct 21, 2025
80240dd
Merge pull request #8 from 2Rant/eval/1020
Nyakult Oct 21, 2025
750afad
lme scrips
Nyakult Oct 21, 2025
c2c9246
Merge remote-tracking branch 'upstream/dev' into eval/0929-test
Nyakult Oct 21, 2025
1b03c14
align dev
Nyakult Oct 21, 2025
78f0e99
format code
Nyakult Oct 21, 2025
5ed5d56
refactor: remove old files
Nyakult Oct 21, 2025
cdd9447
format code
Nyakult Oct 21, 2025
6109d96
pm and prefeval pipeline
2Rant Oct 21, 2025
4af98e6
format code
Nyakult Oct 21, 2025
b703fa8
pm and prefeval pipeline
2Rant Oct 21, 2025
28ecba5
format code
Nyakult Oct 21, 2025
a2e7b02
pm and prefeval pipeline
2Rant Oct 21, 2025
21d9d37
pm and prefeval pipeline
2Rant Oct 21, 2025
35963cf
pm and prefeval pipeline
2Rant Oct 21, 2025
67b6eec
Merge pull request #11 from 2Rant/eval/1020
Nyakult Oct 21, 2025
d58cd0d
format code
Nyakult Oct 21, 2025
f7a229f
format code
Nyakult Oct 21, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
11 changes: 10 additions & 1 deletion evaluation/.env-example
Original file line number Diff line number Diff line change
@@ -1,3 +1,4 @@
# memory process model
MODEL="gpt-4o-mini"
OPENAI_API_KEY="sk-***REDACTED***"
OPENAI_BASE_URL="http://***.***.***.***:3000/v1"
Expand All @@ -6,10 +7,18 @@ MEM0_API_KEY="m0-***REDACTED***"

ZEP_API_KEY="z_***REDACTED***"

# response model
CHAT_MODEL="gpt-4o-mini"
CHAT_MODEL_BASE_URL="http://***.***.***.***:3000/v1"
CHAT_MODEL_API_KEY="sk-***REDACTED***"

MEMOS_KEY="Token mpg-xxxxx"
MEMOS_URL="https://apigw-pre.memtensor.cn/api/openmem/v1"
PRE_SPLIT_CHUNK=false # pre split chunk in client end

MEMOBASE_API_KEY="xxxxx"
MEMOBASE_PROJECT_URL="http://xxx.xxx.xxx.xxx:8019"

# Configuration Only For Scheduler
# RabbitMQ Configuration
MEMSCHEDULER_RABBITMQ_HOST_NAME=rabbitmq-cn-***.cn-***.amqp-32.net.mq.amqp.aliyuncs.com
Expand All @@ -29,4 +38,4 @@ MEMSCHEDULER_GRAPHDBAUTH_URI=bolt://localhost:7687
MEMSCHEDULER_GRAPHDBAUTH_USER=neo4j
MEMSCHEDULER_GRAPHDBAUTH_PASSWORD=***
MEMSCHEDULER_GRAPHDBAUTH_DB_NAME=neo4j
MEMSCHEDULER_GRAPHDBAUTH_AUTO_CREATE=true
MEMSCHEDULER_GRAPHDBAUTH_AUTO_CREATE=true
18 changes: 18 additions & 0 deletions evaluation/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -34,3 +34,21 @@ This repository provides tools and scripts for evaluating the LoCoMo dataset usi
```

✍️ For evaluating OpenAI's native memory feature with the LoCoMo dataset, please refer to the detailed guide: [OpenAI Memory on LoCoMo - Evaluation Guide](./scripts/locomo/openai_memory_locomo_eval_guide.md).

### LongMemEval Evaluation
First prepare the dataset `longmemeval_s` from https://huggingface.co/datasets/xiaowu0162/longmemeval-cleaned
, and save it as `data/longmemeval/longmemeval_s.json`

```bash
# Edit the configuration in ./scripts/run_lme_eval.sh
# Specify the model and memory backend you want to use (e.g., mem0, zep, etc.)
./scripts/run_lme_eval.sh
```

### prefEval Evaluation

### personaMem Evaluation
get `questions_32k.csv` and `shared_contexts_32k.jsonl` from https://huggingface.co/datasets/bowen-upenn/PersonaMem and save them at `data/personamem/`
```bash
./scripts/run_pm_eval.sh
```
Empty file.
13,870 changes: 13,870 additions & 0 deletions evaluation/scripts/PrefEval/irrelevant_conv.py

Large diffs are not rendered by default.

Loading