Update RAG metadata regeneration logic by maysunfaisal · Pull Request #13 · redhat-ai-dev/llama-stack-agentic-sample

maysunfaisal · 2026-01-20T23:21:06Z

Addresses JIRA https://issues.redhat.com/browse/RHIDP-11474 where RAG metadata would be invalid when an app pod restarted because of emptyDir volume.

After restart, RAG source link works fine:

thepetk

lgtm in general. I feel is much better now with the non-random naming on the temp files.

I have only a tiny comment :D

There's also an issue with ty, once resolved I'll approve.

src/responses.py

gabemontero

I have no additional comments. I'll let @maysunfaisal and @thepetk sort out the review thread from @thepetk and approve / merge as they see fit

another interesting permutation from running on openshift vs. local ... good catch / recovery @maysunfaisal

Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

maysunfaisal · 2026-01-22T00:23:17Z

Confirmed works with the latest changes/updates:

First time Pod startup:

After Tekton PLR build and Pod restart

maysunfaisal · 2026-01-22T00:24:11Z

@gabemontero I actually had to update the ollama deployment to pull the model and also updated the run.yaml for one other config related to openai

maysunfaisal · 2026-01-22T00:26:42Z

I currently have the images on quay.io/redhat-ai-dev/agentic-llama-stack:dev and quay.io/redhat-ai-dev/agentic-sample:dev from my local branch..

Once I merge this PR, I will update the latest tag. We need to have a CI that updates the quay images too.

thepetk

lgtm

gabemontero · 2026-01-22T14:37:12Z

run.yaml

-    provider_id: openai
-    model_id: text-embedding-3-small
+    provider_id: ollama
+    model_id: all-minilm:l6-v2


my bad - thanks for catching this @maysunfaisal

* Update RAG metadata regeneration logic (#13) * Update logic for RAG Metadata caching Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Remove .git suffix from URL Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Add tests for RAG metadata regen Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Fix ruff linting err Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Address PR review and ty linter Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Update Llama Stack run config for openai to ollama Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Update Ollama Deployment to pull embedding model Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> --------- Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Create pull request template based on ai-lab-template (#16) * Add CI/CD for App, Llama Stack (#17) * Add CI/CD for App, Llama Stack Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Update .github/workflows/image.yaml Co-authored-by: Theofanis Petkos <thepetk@gmail.com> * Update .github/workflows/image.yaml Co-authored-by: Theofanis Petkos <thepetk@gmail.com> * Update .github/workflows/image.yaml Co-authored-by: Theofanis Petkos <thepetk@gmail.com> --------- Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> Co-authored-by: Theofanis Petkos <thepetk@gmail.com> * Fix workflow for space and image inspect (#19) Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * adjust github mcp tools calls for payload size, timeout (#20) discovered during testing of the performance agent that the github add comment tool call was failing for the k8s top outpu. investigation confirmed that the output exceeded the max length for a github comment, so adding some truncating logic to adhere to the size limit also broke out failure processing for creating and issue from failure processing for adding comments; we should at least note the github issue creation, with an indicator if any of the comment failed finally, added a timeout to the comment tool calls, as failure/retry at the openai/LLS level lead to huge delays from the UI perspective (one instance or 1200 seconds). * Consolidate GITHUB TOKENS - 1 (#18) Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Prepare for RHDH 1.9 --------- Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> Co-authored-by: Maysun Faisal <31771087+maysunfaisal@users.noreply.github.com> Co-authored-by: Gabe Montero <gmontero@redhat.com>

* Update RAG metadata regeneration logic (#13) * Update logic for RAG Metadata caching Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Remove .git suffix from URL Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Add tests for RAG metadata regen Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Fix ruff linting err Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Address PR review and ty linter Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Update Llama Stack run config for openai to ollama Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Update Ollama Deployment to pull embedding model Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> --------- Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Create pull request template based on ai-lab-template (#16) * Add CI/CD for App, Llama Stack (#17) * Add CI/CD for App, Llama Stack Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Update .github/workflows/image.yaml Co-authored-by: Theofanis Petkos <thepetk@gmail.com> * Update .github/workflows/image.yaml Co-authored-by: Theofanis Petkos <thepetk@gmail.com> * Update .github/workflows/image.yaml Co-authored-by: Theofanis Petkos <thepetk@gmail.com> --------- Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> Co-authored-by: Theofanis Petkos <thepetk@gmail.com> * Fix workflow for space and image inspect (#19) Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * adjust github mcp tools calls for payload size, timeout (#20) discovered during testing of the performance agent that the github add comment tool call was failing for the k8s top outpu. investigation confirmed that the output exceeded the max length for a github comment, so adding some truncating logic to adhere to the size limit also broke out failure processing for creating and issue from failure processing for adding comments; we should at least note the github issue creation, with an indicator if any of the comment failed finally, added a timeout to the comment tool calls, as failure/retry at the openai/LLS level lead to huge delays from the UI perspective (one instance or 1200 seconds). * Consolidate GITHUB TOKENS - 1 (#18) Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Introduce gitops sync waves (#22) * Enforce Secrets Creation Acknowledgment (#24) * Fix Secret ACK checkbox Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * Default Checkbox to False, and enforce using enum Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> --------- Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> * UX & Ingestion Optimizations (#23) * Remove polling * Avoid concurrent ingestion * Update streamlit app * Add manual refresh * Add more comments * Update comment * Prepare for RHDH 1.9 --------- Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com> Co-authored-by: Maysun Faisal <31771087+maysunfaisal@users.noreply.github.com> Co-authored-by: Gabe Montero <gmontero@redhat.com>

maysunfaisal requested review from gabemontero and thepetk as code owners January 20, 2026 23:21

thepetk reviewed Jan 21, 2026

View reviewed changes

src/responses.py Outdated Show resolved Hide resolved

gabemontero reviewed Jan 21, 2026

View reviewed changes

maysunfaisal added 5 commits January 21, 2026 15:48

Update logic for RAG Metadata caching

fa7c244

Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

Remove .git suffix from URL

f4316ab

Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

Add tests for RAG metadata regen

b011601

Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

Fix ruff linting err

1b4647d

Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

Address PR review and ty linter

524c3e1

Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

maysunfaisal force-pushed the RHIDP-11474-1 branch from b180485 to 524c3e1 Compare January 21, 2026 22:26

maysunfaisal added 2 commits January 21, 2026 18:37

Update Llama Stack run config for openai to ollama

10bfd95

Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

Update Ollama Deployment to pull embedding model

a20ca30

Assisted-by: Claude Opus 4.5 Generated-by: Cursor Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

thepetk approved these changes Jan 22, 2026

View reviewed changes

gabemontero approved these changes Jan 22, 2026

View reviewed changes

gabemontero merged commit 2f41af5 into redhat-ai-dev:main Jan 22, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update RAG metadata regeneration logic#13

Update RAG metadata regeneration logic#13
gabemontero merged 7 commits intoredhat-ai-dev:mainfrom
maysunfaisal:RHIDP-11474-1

maysunfaisal commented Jan 20, 2026 •

edited

Loading

Uh oh!

thepetk left a comment

Uh oh!

Uh oh!

gabemontero left a comment

Uh oh!

maysunfaisal commented Jan 22, 2026

Uh oh!

maysunfaisal commented Jan 22, 2026

Uh oh!

maysunfaisal commented Jan 22, 2026

Uh oh!

thepetk left a comment

Uh oh!

gabemontero Jan 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

maysunfaisal commented Jan 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thepetk left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gabemontero left a comment

Choose a reason for hiding this comment

Uh oh!

maysunfaisal commented Jan 22, 2026

First time Pod startup:

After Tekton PLR build and Pod restart

Uh oh!

maysunfaisal commented Jan 22, 2026

Uh oh!

maysunfaisal commented Jan 22, 2026

Uh oh!

thepetk left a comment

Choose a reason for hiding this comment

Uh oh!

gabemontero Jan 22, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

maysunfaisal commented Jan 20, 2026 •

edited

Loading