Display short-term and long-term memory usage #5
Comments
From what I was reading, you can take the context window and compress chunks at the rear into summaries.
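A minimal sketch of that compress-the-rear idea, assuming the pre-1.0 `openai` Python package and `tiktoken` for counting; the token limit, chunk size, and message format here are illustrative, not Auto-GPT's actual ones:

```python
import openai
import tiktoken

MAX_CONTEXT_TOKENS = 4000   # illustrative limit
CHUNK_SIZE = 6              # how many of the oldest messages to fold into one summary

enc = tiktoken.encoding_for_model("gpt-3.5-turbo")

def count_tokens(messages):
    return sum(len(enc.encode(m["content"])) for m in messages)

def compress_rear(history):
    """While over budget, replace the oldest chunk of messages with a short summary."""
    while count_tokens(history) > MAX_CONTEXT_TOKENS and len(history) > CHUNK_SIZE:
        chunk, history = history[:CHUNK_SIZE], history[CHUNK_SIZE:]
        resp = openai.ChatCompletion.create(
            model="gpt-3.5-turbo",
            messages=[{
                "role": "user",
                "content": "Summarize this conversation fragment in a few sentences:\n"
                           + "\n".join(m["content"] for m in chunk),
            }],
        )
        summary = resp["choices"][0]["message"]["content"]
        history = [{"role": "system", "content": "Summary of earlier context: " + summary}] + history
    return history
```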
Interesting idea! This would expand short-term memory. Currently Auto-GPT manages its own "Long-Term Memory" which is "pinned" to the start of the context.
Another approach could be to run history through an embeddings API, save the embeddings to a Vector DB, then do a lookup for relevant memories on each step.
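A rough sketch of that embeddings approach, assuming the pre-1.0 `openai` package and `numpy`; a tiny in-memory list stands in for a real vector DB (Pinecone, FAISS, etc.), and the function names are made up for illustration:

```python
import numpy as np
import openai

memory = []  # list of (embedding, text) pairs; a real vector DB would replace this

def embed(text):
    resp = openai.Embedding.create(model="text-embedding-ada-002", input=text)
    return np.array(resp["data"][0]["embedding"])

def save_memory(text):
    memory.append((embed(text), text))

def relevant_memories(query, k=5):
    """Return the k stored texts most similar to the query (cosine similarity)."""
    q = embed(query)

    def score(item):
        v, _ = item
        return float(np.dot(v, q) / (np.linalg.norm(v) * np.linalg.norm(q)))

    return [text for _, text in sorted(memory, key=score, reverse=True)[:k]]
```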
I've been meaning to look into this. Forgive my ignorance; I've never used them.
All good! Thanks for your reply. In my (limited) understanding, adding embeddings is no more than adding a row to a DB (but with vector data).
I really think this is an excellent idea. In fact it might be a huge win. This would basically give you an indefinite context window in effect, in terms of "long term" memory. Of course the discarding of "irrelevant" info in any given call to the model will be imperfect, but I'd bet it'll work pretty well. I was thinking about this myself this morning and wondered if anybody else already mentioned it. Basically I see it as an "associative memory", much like what we have in our own minds. You could perhaps have the GPT model generate a few orthogonal short summaries of what it just output and responded to (top 5?), store these in the vector db, and then get the most relevant "memories" for subsequent calls based on this same process. So combine these "N closest" memories with the most recent and I think you'll get a very effective long term memory mechanism. Is there anyone out there that sees problems with this idea or has a way to improve upon it? It seems super awesome to me...
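Building on the sketch above (reusing the hypothetical `save_memory` / `relevant_memories` helpers), the "top 5 summaries, then N closest plus most recent" idea could look roughly like this; the prompt wording and function names are invented for illustration:

```python
import openai

def summarize_exchange(prompt, response, n=5):
    """Ask the model for a few short, non-overlapping summaries of the last exchange."""
    resp = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[{
            "role": "user",
            "content": f"Write {n} short, non-overlapping summaries (one per line) of this exchange:\n"
                       f"User: {prompt}\nAssistant: {response}",
        }],
    )
    lines = resp["choices"][0]["message"]["content"].splitlines()
    return [line.strip() for line in lines if line.strip()]

def remember_and_recall(prompt, response, current_goal, recent_messages, k=5):
    # Store fresh summaries, then combine the k closest memories with the most recent messages.
    for summary in summarize_exchange(prompt, response):
        save_memory(summary)
    memories = relevant_memories(current_goal, k=k)
    memory_block = {"role": "system", "content": "Relevant memories:\n" + "\n".join(memories)}
    return [memory_block] + recent_messages
```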
@Torantulino I'm going to pick this up if it is ok with you.
Let me know if there is anything here you'd like me to change. I should have a working version of this by EOD tomorrow EST. I would hope to then be able to extend this to processing files in large repos too, and eventually I want to make this feed into the self-improvement pipeline with respect to remembering where relevant local files are for large tasks.
I believe it's possible to simply use a key-value store as memory and make it available to Auto-GPT as a tool, letting the model itself decide when and what to read from and write to the memory. Auto-GPT already has code execution implemented, so it has all Python functions available as tools, and this is just one more tool. To make the model aware of the memory tool and good at utilizing it, we would have to finetune it (e.g. using the Toolformer approach; there are two open-source implementations, one of which is more popular than the official one), and would need to collect some usage data (there isn't any paper or implementation that uses a memory tool yet, AFAIK). Finetuning is available for ChatGPT-3.5 but not GPT-4, but I think we'll need to finetune anyway if we want Auto-GPT to create new tools and self-improve. We could also use an open model (many of them have LoRA finetuning implementations), which would be less powerful, but we could expose the GPT-4 API to it and train it to use the API as a tool, so the whole system would not be less powerful.
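A minimal sketch of what such a key-value memory tool might look like; the file name and function names are hypothetical, and Auto-GPT's actual command registry is not shown:

```python
import json

MEMORY_FILE = "memory.json"  # hypothetical location for the persistent store

def _load():
    try:
        with open(MEMORY_FILE) as f:
            return json.load(f)
    except FileNotFoundError:
        return {}

def memory_write(key, value):
    """Tool the model can call to persist a fact under a key."""
    store = _load()
    store[key] = value
    with open(MEMORY_FILE, "w") as f:
        json.dump(store, f)
    return f"Stored '{key}'."

def memory_read(key):
    """Tool the model can call to recall a fact by key."""
    return _load().get(key, f"No memory stored under '{key}'.")

# These two functions would be advertised to the model alongside the existing
# commands, and the model itself decides when to call them.
```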
Actually, maybe we can make GPT models aware of the memory tool using the system message, without the need for finetuning, since it's just a single simple tool. Something like the sketch below.
I'm not experienced in prompt engineering, so there's definitely room for improvement. Note that GPT-3.5 tends to pay less attention to the system message than GPT-4 does, so this should work better with GPT-4 than 3.5. If you have access, please try!
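The exact prompt isn't shown here; as a hypothetical example, a system message along those lines, paired with the key-value tool sketched earlier, might read:

```python
# Hypothetical system message making the model aware of the memory tool.
SYSTEM_MESSAGE = """You have access to a persistent key-value memory.
To save information, issue the command: memory_write(<key>, <value>)
To recall information, issue the command: memory_read(<key>)
Store facts, plans and intermediate results you may need later,
and check the memory before answering questions about past work."""

messages = [
    {"role": "system", "content": SYSTEM_MESSAGE},
    {"role": "user", "content": "Continue with the current task."},
]
```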
This works. It's hard to test this kind of thing concretely, but anecdotally it seems much smarter now.
See pull: #122
Is this resolved with the output of
This would ideally be a part of a "quota"-like system so that sub-agents could be managed by agents higher up in the chain whenever there is a quota/constraint violation (soft/hard), as per #3466
This issue was closed automatically because it has been stale for 10 days with no activity. |
Auto-GPT currently pins its Long-Term Memory to the start of its context window. It is able to manage this through commands.
Auto-GPT should be aware of its short- and long-term memory usage so that it knows when something is going to be deleted from its memory due to context limits, e.g. memory usage: (2555/4000 tokens)
This may lead to some interesting behaviour where it is less inclined to read long strings of text, or is more meticulous about saving information to long-term memory when it sees it's running low on tokens.
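A small sketch of what surfacing that usage could look like, assuming `tiktoken` for counting and an illustrative 4,000-token limit; the function name and where the line gets injected into the prompt are assumptions:

```python
import tiktoken

CONTEXT_LIMIT = 4000  # illustrative; the real limit depends on the model in use
enc = tiktoken.encoding_for_model("gpt-3.5-turbo")

def memory_usage_line(pinned_memory: str, limit: int = CONTEXT_LIMIT) -> str:
    """Render a usage line like 'memory usage: (2555/4000 tokens)' for the prompt."""
    used = len(enc.encode(pinned_memory))
    return f"memory usage: ({used}/{limit} tokens)"
```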