Add PostgreSQL support for long-term memory storage #2892

Open · tibbon wants to merge 1 commit into main from feature/postgres-long-term-memory
Conversation

tibbon commented May 23, 2025

This PR adds PostgreSQL integration for CrewAI's long-term memory system. I added this because I'm using CrewAI in AWS Lambda and can't easily share an on-disk SQLite3 database between runs for long-term memory persistence. With this, you can use RDS/PostgreSQL. You still need mem0 for Entity storage.

This is my first contribution here, but I've tried to ensure good test coverage, tried everything out locally, added documentation, and ensured that everything passed the CI tests and linting. I'm not much of a Python programmer, and I'm happy to get feedback if my code isn't properly idiomatic. I'll amend it as needed.

  • Create LTMPostgresStorage class supporting PostgreSQL 16+ database backend
  • Implement connection pooling with psycopg for improved performance
  • Add storage factory pattern via LTMStorageFactory for backend selection
  • Support environment variable configuration for simplified deployment
  • Add context manager pattern for improved resource management
  • Implement robust validation and security measures:
    • Input validation for all database parameters
    • Protection against SQL injection
    • Secure connection string handling
    • Comprehensive error types for better debugging
  • Add CI-compatible tests with mocks
  • Create detailed documentation with examples and best practices
  • Add optional dependency for psycopg[pool] in pyproject.toml for connection pooling
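
A minimal usage sketch of the above, based on the parameter names in this PR's `LongTermMemory.__init__` (the `"postgres"` value for `storage_type` and the connection details are illustrative assumptions, not confirmed API):

```python
from crewai.memory import LongTermMemory

# Sketch only: parameter names follow this PR's LongTermMemory signature;
# the "postgres" storage_type value and connection string are assumptions.
long_term_memory = LongTermMemory(
    storage_type="postgres",
    postgres_connection_string="postgresql://user:password@db.example.com:5432/crewai",
    postgres_schema="public",
    postgres_table_name="long_term_memories",
    postgres_min_pool_size=1,
    postgres_max_pool_size=5,
    postgres_use_connection_pool=True,
)
```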

tibbon force-pushed the feature/postgres-long-term-memory branch from fc7fe6d to fb099b8 on May 23, 2025 at 02:14
tibbon (Author) commented May 23, 2025

@joaomdmoura can your review crew do this one too?

tibbon (Author) commented May 28, 2025

@gvieira Do you have any feedback on this one?

Comment on lines +56 to +58
```python
memory=True,  # Enable memory system
long_term_memory=long_term_memory,  # Use PostgreSQL for long-term memory
entity_memory=EntityMemory()  # Required for automatic memory saving
```
Contributor:
For your use case, I'd recommend encouraging users to set only long_term_memory instead of configuring all the individual memory attributes.
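
Roughly like this (sketch; `my_agents`, `my_tasks`, and the Postgres-backed `long_term_memory` are assumed to be defined elsewhere):

```python
from crewai import Crew

# Sketch of the suggested minimal configuration: set only long_term_memory
# and let the framework default the remaining memory types.
crew = Crew(
    agents=my_agents,  # assumed defined elsewhere
    tasks=my_tasks,    # assumed defined elsewhere
    memory=True,
    long_term_memory=long_term_memory,
)
```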


```markdown
### Automatic Memory Saving

**Requires both `long_term_memory` AND `entity_memory` configured**
```
Contributor:
Is there any reason to make `entity_memory` required?

```python
)

# Save manually
crew._long_term_memory.save(memory_item)
```
Contributor:
As this is a private attribute, we advise against relying on it directly. We can think about adding a public accessor, but not a private one.
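
For illustration, a public accessor could look something like this (hypothetical sketch; no such property exists in CrewAI today):

```python
# Hypothetical sketch only: this property is not part of CrewAI's API.
class Crew:
    @property
    def long_term_memory_storage(self):
        """Read-only public access to the long-term memory storage backend."""
        if self._long_term_memory is None:
            return None
        return self._long_term_memory.storage
```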


```python
# Get the storage from your memory instance
postgres_storage = crew.memory.storage
```
Contributor:
Are you assuming here that `memory` is the long-term memory? It might be any other memory type. If you want to get the long-term memory's storage, I'd recommend exposing it in our API instead.

```python
    result = crew.kickoff()
finally:
    # Always clean up resources, even if an error occurs
    memory.cleanup()  # You can also use memory.close() for backward compatibility
```
Contributor:
I think you mean `reset`, don't you?

Comment on lines +18 to +29
```python
def __init__(
    self,
    storage=None,
    storage_type: str = "sqlite",
    path: Optional[str] = None,
    postgres_connection_string: Optional[str] = None,
    postgres_schema: Optional[str] = None,
    postgres_table_name: Optional[str] = None,
    postgres_min_pool_size: Optional[int] = None,
    postgres_max_pool_size: Optional[int] = None,
    postgres_use_connection_pool: Optional[bool] = None,
):
```
Contributor:
I like that `storage_type` can be used as a factory parameter. However, I think this class should remain storage-type agnostic. We could rely on `**kwargs` instead of mapping all supported attributes for each storage type here. IMO, the factory (e.g. the `PostgresStorageFactory`) should be responsible for handling those mappings, not this class.
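
Something along these lines (sketch of the suggestion, not the PR's current code; `LTMStorageFactory.create` and its import path are assumed names):

```python
from typing import Any, Optional

# Assumed import path for this PR's factory; "create" is an assumed method name.
from crewai.memory.storage.ltm_storage_factory import LTMStorageFactory


class LongTermMemory:
    def __init__(
        self,
        storage=None,
        storage_type: str = "sqlite",
        path: Optional[str] = None,
        **storage_kwargs: Any,
    ):
        # Backend-specific options flow through **storage_kwargs; the factory,
        # not this class, owns the mapping for each storage type.
        self.storage = storage or LTMStorageFactory.create(
            storage_type=storage_type, path=path, **storage_kwargs
        )
```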

Comment on lines +71 to +78
```python
# Ensure quality is in metadata (from item.quality if available)
if "quality" not in metadata and item.quality is not None:
    metadata["quality"] = item.quality

# Check if quality is available
if "quality" not in metadata:
    raise ValueError("Memory quality must be provided either in item.quality or item.metadata['quality']")
```

Contributor:
Are we sending this info as metadata? Is it exclusive to the Postgres LTM storage?
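
For context, a sketch of how quality travels on the item (field values are illustrative; the import path reflects CrewAI's source layout as I understand it):

```python
from crewai.memory.long_term.long_term_memory_item import LongTermMemoryItem

# Setting quality directly on the item is the primary path; the diff above
# copies it into metadata only when it is missing there.
item = LongTermMemoryItem(
    agent="analyst",
    task="Summarize the quarterly report",
    expected_output="A concise summary",
    datetime="2025-05-23T02:14:00",
    quality=0.9,
    metadata={},
)
```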

```python
    self.storage.reset()

def cleanup(self) -> None:
```
Contributor:
I misunderstood what this method was doing. At first, I thought it was meant to reset created entries, but I realized it's actually for closing any open connections.

That said, I do have some concerns about the current API design; it feels a bit confusing. Here are a few thoughts off the top of my head:

  • If users want to use long-term memory with Postgres, we should always encourage them to use context managers (see the sketch below).
  • Let's consider removing this cleanup method, as its name could easily be confused with something like reset. If we decide to keep it, we should rename it to something clearer, maybe just close.
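
For example (sketch; assumes `LongTermMemory` gains `__enter__`/`__exit__` via this PR's context-manager support, and that `my_agents`/`my_tasks` are defined elsewhere):

```python
from crewai import Crew
from crewai.memory import LongTermMemory

# Sketch of the preferred context-manager usage; connection details illustrative.
with LongTermMemory(
    storage_type="postgres",  # assumed value, as in the earlier sketch
    postgres_connection_string="postgresql://user:password@db.example.com:5432/crewai",
) as memory:
    crew = Crew(agents=my_agents, tasks=my_tasks, memory=True, long_term_memory=memory)
    result = crew.kickoff()
# Connections are closed on exit; no explicit cleanup()/close() call needed.
```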

Contributor:
I'm open to discussing the API usage if you want.

lucasgomide (Contributor) left a comment:
@tibbon I love your suggestion!

I just dropped some comments; I haven't finished the review yet. I'm looking forward to the next steps!
