Update RAG metadata regeneration logic #13

Merged
gabemontero merged 7 commits into redhat-ai-dev:main from maysunfaisal:RHIDP-11474-1 on Jan 22, 2026

@maysunfaisal maysunfaisal commented Jan 20, 2026

Addresses JIRA https://issues.redhat.com/browse/RHIDP-11474, where RAG metadata would become invalid after an app pod restart because of the emptyDir volume.

After restart, RAG source link works fine:
Screenshot 2026-01-20 at 6 20 36 PM

@thepetk thepetk left a comment

lgtm in general. I feel it is much better now with the non-random naming on the temp files.

I have only a tiny comment :D

There's also an issue with ty, once resolved I'll approve.
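
For readers following the thread, here is a minimal sketch of the idea discussed above: derive the metadata file name from the source URL (with any `.git` suffix removed) rather than from a random temp name, and rebuild the file when it is missing after a restart. This is not the PR's actual code. The function names, the `rag-metadata-` prefix, and the placeholder payload are all hypothetical.

```python
import hashlib
import json
from pathlib import Path


def normalize_repo_url(url: str) -> str:
    """Strip whitespace, a trailing slash, and a trailing '.git' suffix."""
    cleaned = url.strip().rstrip("/")
    return cleaned.removesuffix(".git")


def metadata_path(url: str, base_dir: Path) -> Path:
    """Derive a stable file name from the URL instead of a random temp name."""
    normalized = normalize_repo_url(url)
    digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()[:16]
    # Hypothetical naming scheme, chosen only for illustration.
    return base_dir / f"rag-metadata-{digest}.json"


def load_or_regenerate(url: str, base_dir: Path) -> dict:
    """Return cached metadata, rebuilding it if the file no longer exists."""
    path = metadata_path(url, base_dir)
    if path.exists():
        return json.loads(path.read_text(encoding="utf-8"))
    # The file is missing, e.g. because an emptyDir volume was recreated.
    base_dir.mkdir(parents=True, exist_ok=True)
    metadata = {"source_url": normalize_repo_url(url)}  # placeholder payload
    path.write_text(json.dumps(metadata), encoding="utf-8")
    return metadata
```

Because the path depends only on the normalized URL, a restarted pod computes the same location and can tell whether the metadata needs to be regenerated.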

@gabemontero gabemontero left a comment

I have no additional comments. I'll let @maysunfaisal and @thepetk sort out the review thread from @thepetk and approve / merge as they see fit

another interesting permutation from running on openshift vs. local ... good catch / recovery @maysunfaisal

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
@maysunfaisal
Confirmed works with the latest changes/updates:

First time Pod startup:

Screenshot 2026-01-21 at 7 11 01 PM

After Tekton PLR build and Pod restart

Screenshot 2026-01-21 at 7 21 30 PM

@maysunfaisal

@gabemontero I also had to update the Ollama deployment to pull the embedding model, and updated run.yaml for one other config entry that still referenced openai.

@maysunfaisal

I currently have the images on quay.io/redhat-ai-dev/agentic-llama-stack:dev and quay.io/redhat-ai-dev/agentic-sample:dev, built from my local branch.

Once I merge this PR, I will update the latest tag. We need to have a CI that updates the quay images too.


@thepetk thepetk left a comment

lgtm

- provider_id: openai
- model_id: text-embedding-3-small
+ provider_id: ollama
+ model_id: all-minilm:l6-v2
my bad - thanks for catching this @maysunfaisal

@gabemontero gabemontero merged commit 2f41af5 into redhat-ai-dev:main Jan 22, 2026
1 check passed
thepetk added a commit that referenced this pull request Jan 29, 2026
* Update RAG metadata regeneration logic (#13)

* Update logic for RAG Metadata caching

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Remove .git suffix from URL

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Add tests for RAG metadata regen

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Fix ruff linting err

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Address PR review and ty linter

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Update Llama Stack run config for openai to ollama

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Update Ollama Deployment to pull embedding model

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

---------

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Create pull request template based on ai-lab-template (#16)

* Add CI/CD for App, Llama Stack (#17)

* Add CI/CD for App, Llama Stack

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Update .github/workflows/image.yaml

Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

* Update .github/workflows/image.yaml

Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

* Update .github/workflows/image.yaml

Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

---------

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

* Fix workflow for space and image inspect (#19)

Assisted-by: Claude Opus 4.5

Generated-by: Cursor

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* adjust github mcp tools calls for payload size, timeout (#20)

discovered during testing of the performance agent that the github add comment tool call
was failing for the k8s top output.

investigation confirmed that the output exceeded the max length for a github comment, so
adding some truncating logic to adhere to the size limit

also broke out failure processing for creating an issue from failure processing for adding comments;
we should at least note the github issue creation, with an indicator if any of the comments failed

finally, added a timeout to the comment tool calls, as failure/retry at the openai/LLS level led to
huge delays from the UI perspective (one instance of 1200 seconds).

* Consolidate GITHUB TOKENS - 1 (#18)

Assisted-by: Claude Opus 4.5

Generated-by: Cursor

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Prepare for RHDH 1.9

---------

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
Co-authored-by: Maysun Faisal <31771087+maysunfaisal@users.noreply.github.com>
Co-authored-by: Gabe Montero <gmontero@redhat.com>
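
The payload size and timeout handling described in the #20 commit message above can be sketched roughly as follows. This is an illustrative outline, not the repository's implementation. The 65536-character limit is an assumption about GitHub's comment body limit, and `create_issue` and `post_comment` are hypothetical async callables standing in for the MCP tool calls.

```python
import asyncio
from typing import Awaitable, Callable

MAX_COMMENT_CHARS = 65536  # assumed GitHub comment body limit
TRUNCATION_NOTE = "\n\n[output truncated]"


def truncate_comment(body: str, limit: int = MAX_COMMENT_CHARS) -> str:
    """Shorten body so it fits within limit, marking that it was cut."""
    if len(body) <= limit:
        return body
    keep = max(limit - len(TRUNCATION_NOTE), 0)
    return (body[:keep] + TRUNCATION_NOTE)[:limit]


async def add_comment_with_timeout(
    post_comment: Callable[[str], Awaitable[None]],
    body: str,
    timeout_s: float = 60.0,
) -> bool:
    """Post one comment; return False on timeout or any other failure."""
    try:
        await asyncio.wait_for(post_comment(truncate_comment(body)), timeout=timeout_s)
    except Exception:  # includes asyncio.TimeoutError
        return False
    return True


async def file_report(
    create_issue: Callable[[str], Awaitable[str]],
    post_comment: Callable[[str, str], Awaitable[None]],
    title: str,
    comments: list[str],
    timeout_s: float = 60.0,
) -> dict:
    """Create the issue first, then count comment failures separately."""
    issue_ref = await create_issue(title)  # an error here is reported on its own
    failed = 0
    for body in comments:
        ok = await add_comment_with_timeout(
            lambda text: post_comment(issue_ref, text), body, timeout_s
        )
        if not ok:
            failed += 1
    return {"issue": issue_ref, "failed_comments": failed}
```

Keeping issue creation outside the per-comment error handling means the caller can still report the created issue even when some comments fail or time out.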
gabemontero added a commit that referenced this pull request Jan 30, 2026
* Update RAG metadata regeneration logic (#13)

* Update logic for RAG Metadata caching

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Remove .git suffix from URL

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Add tests for RAG metadata regen

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Fix ruff linting err

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Address PR review and ty linter

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Update Llama Stack run config for openai to ollama

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Update Ollama Deployment to pull embedding model

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

---------

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Create pull request template based on ai-lab-template (#16)

* Add CI/CD for App, Llama Stack (#17)

* Add CI/CD for App, Llama Stack

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Update .github/workflows/image.yaml

Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

* Update .github/workflows/image.yaml

Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

* Update .github/workflows/image.yaml

Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

---------

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
Co-authored-by: Theofanis Petkos <thepetk@gmail.com>

* Fix workflow for space and image inspect (#19)

Assisted-by: Claude Opus 4.5

Generated-by: Cursor

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* adjust github mcp tools calls for payload size, timeout (#20)

discovered during testing of the performance agent that the github add comment tool call
was failing for the k8s top output.

investigation confirmed that the output exceeded the max length for a github comment, so
adding some truncating logic to adhere to the size limit

also broke out failure processing for creating an issue from failure processing for adding comments;
we should at least note the github issue creation, with an indicator if any of the comments failed

finally, added a timeout to the comment tool calls, as failure/retry at the openai/LLS level led to
huge delays from the UI perspective (one instance of 1200 seconds).

* Consolidate GITHUB TOKENS - 1 (#18)

Assisted-by: Claude Opus 4.5

Generated-by: Cursor

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Introduce gitops sync waves (#22)

* Enforce Secrets Creation Acknowledgment (#24)

* Fix Secret ACK checkbox

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* Default Checkbox to False, and enforce using enum

Assisted-by: Claude Opus 4.5

Generated-by: Cursor
Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

---------

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>

* UX & Ingestion Optimizations (#23)

* Remove polling

* Avoid concurrent ingestion

* Update streamlit app

* Add manual refresh

* Add more comments

* Update comment

* Prepare for RHDH 1.9

---------

Signed-off-by: Maysun J Faisal <maysunaneek@gmail.com>
Co-authored-by: Maysun Faisal <31771087+maysunfaisal@users.noreply.github.com>
Co-authored-by: Gabe Montero <gmontero@redhat.com>