Update RAG metadata regeneration logic #13
thepetk
left a comment
LGTM in general. I feel it's much better now with the non-random naming of the temp files.
I have only a tiny comment :D
There's also an issue with ty; once that's resolved, I'll approve.
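For readers who have not seen the diff, the "non-random naming" idea can be sketched as below. This is only an illustration, not the code in this PR: the function name `stable_temp_path`, the `rag-` prefix, and the choice of a SHA-256 digest of the source URL are all assumptions. The point is that the same source always maps to the same path, so references stored elsewhere can still be matched after a restart.

```python
import hashlib
import tempfile
from pathlib import Path


def stable_temp_path(source_url: str, suffix: str = ".md") -> Path:
    """Return a temp-file path that is identical for identical source URLs.

    Hypothetical helper: the name is derived from a hash of the URL instead
    of a random token, so it is reproducible across process restarts.
    """
    digest = hashlib.sha256(source_url.encode("utf-8")).hexdigest()[:16]
    return Path(tempfile.gettempdir()) / f"rag-{digest}{suffix}"
```

Unlike `tempfile.NamedTemporaryFile`, this does not create the file or guard against collisions; it only computes a predictable name.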
gabemontero
left a comment
I have no additional comments. I'll let @maysunfaisal and @thepetk sort out the review thread from @thepetk and approve / merge as they see fit
another interesting permutation from running on openshift vs. local ... good catch / recovery @maysunfaisal
Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
Force-pushed from b180485 to 524c3e1
@gabemontero I actually had to update the Ollama deployment to pull the model, and I also updated run.yaml for one other OpenAI-related config.
I currently have the images at quay.io/redhat-ai-dev/agentic-llama-stack:dev and quay.io/redhat-ai-dev/agentic-sample:dev, built from my local branch. Once I merge this PR, I will update the latest tag. We also need CI that updates the quay images.
- provider_id: openai
- model_id: text-embedding-3-small
+ provider_id: ollama
+ model_id: all-minilm:l6-v2
my bad - thanks for catching this @maysunfaisal
* Update RAG metadata regeneration logic (#13)

  * Update logic for RAG Metadata caching
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
  * Remove .git suffix from URL
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
  * Add tests for RAG metadata regen
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
  * Fix ruff linting err
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
  * Address PR review and ty linter
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
  * Update Llama Stack run config for openai to ollama
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
  * Update Ollama Deployment to pull embedding model
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

  ---------
  Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Create pull request template based on ai-lab-template (#16)

* Add CI/CD for App, Llama Stack (#17)

  * Add CI/CD for App, Llama Stack
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
  * Update .github/workflows/image.yaml
    Co-authored-by: Theofanis Petkos <thepetk@gmail.com>
  * Update .github/workflows/image.yaml
    Co-authored-by: Theofanis Petkos <thepetk@gmail.com>
  * Update .github/workflows/image.yaml
    Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

  ---------
  Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
  Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

* Fix workflow for space and image inspect (#19)
  Assisted-by: Claude Opus 4.5
  Generated-by: Cursor
  Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* adjust github mcp tools calls for payload size, timeout (#20)

  discovered during testing of the performance agent that the github add comment tool call was failing for the k8s top output. investigation confirmed that the output exceeded the max length for a github comment, so adding some truncating logic to adhere to the size limit

  also broke out failure processing for creating an issue from failure processing for adding comments; we should at least note the github issue creation, with an indicator if any of the comments failed

  finally, added a timeout to the comment tool calls, as failure/retry at the openai/LLS level led to huge delays from the UI perspective (one instance of 1200 seconds).

* Consolidate GITHUB TOKENS - 1 (#18)
  Assisted-by: Claude Opus 4.5
  Generated-by: Cursor
  Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Introduce gitops sync waves (#22)

* Enforce Secrets Creation Acknowledgment (#24)

  * Fix Secret ACK checkbox
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
  * Default Checkbox to False, and enforce using enum
    Assisted-by: Claude Opus 4.5
    Generated-by: Cursor
    Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

  ---------
  Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* UX & Ingestion Optimizations (#23)

  * Remove polling
  * Avoid concurrent ingestion
  * Update streamlit app
  * Add manual refresh
  * Add more comments
  * Update comment

* Prepare for RHDH 1.9

---------
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
Co-authored-by: Maysun Faisal <31771087+maysunfaisal@users.noreply.github.com>
Co-authored-by: Gabe Montero <gmontero@redhat.com>
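The truncation and timeout handling described for #20 could look roughly like the sketch below. It is not the code from that change: the names `truncate_comment` and `call_with_timeout`, the truncation marker text, and the 65536-character limit are assumptions made for illustration, and the real tool calls go through the GitHub MCP tools rather than a bare coroutine.

```python
import asyncio

# Assumed maximum length of a GitHub comment body; verify against the API in use.
MAX_COMMENT_CHARS = 65536
TRUNCATION_NOTE = "\n\n[output truncated]"


def truncate_comment(body: str, limit: int = MAX_COMMENT_CHARS) -> str:
    """Return body unchanged if it fits, else cut it and append a marker."""
    if len(body) <= limit:
        return body
    keep = max(limit - len(TRUNCATION_NOTE), 0)
    # The final slice guards against limits shorter than the marker itself.
    return (body[:keep] + TRUNCATION_NOTE)[:limit]


async def call_with_timeout(coro, seconds: float):
    """Await a (hypothetical) tool-call coroutine, returning None on timeout."""
    try:
        return await asyncio.wait_for(coro, timeout=seconds)
    except asyncio.TimeoutError:
        return None
```

Returning `None` on timeout lets the caller still report that the issue itself was created while flagging that a comment failed, which matches the intent stated in the commit message.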


Addresses JIRA https://issues.redhat.com/browse/RHIDP-11474, where RAG metadata became invalid when an app pod restarted because of the emptyDir volume.
After a restart, the RAG source link works fine:
