[Obs AI Assistant] Fix re-deploy model timeout and status polling #220445

viduni94 · 2025-05-07T21:51:51Z

Closes https://github.com/elastic/obs-ai-assistant-team/issues/247
Closes #217912

Summary

Problems

The /warmup_model endpoint doesn't return immediately and waits for the KB to be ready. If there is no ML nodes or sufficient capacity in the ML node, the API can timeout.
Since the endpoint doesn't return immediately, we don't poll for status continuously.
Knowledge base tab doesn't show Inspect if no ML nodes are available.

Solutions

Show Inspect information in the knowledge base
Return /warmup_model immediately (we don't need to wait for the model to be ready since we are polling), and start polling
If the user refreshes the browser and if the kbState is in DEPLOYING_MODEL keep polling for status

Checklist

The PR description includes the appropriate Release Notes section, and the correct release_note:* label is applied per the guidelines

elasticmachine · 2025-05-07T21:51:55Z

Pinging @elastic/obs-ai-assistant (Team:Obs AI Assistant)

github-actions · 2025-05-07T21:52:07Z

🤖 GitHub comments

Expand to view the GitHub comments

Just comment with:

/oblt-deploy : Deploy a Kibana instance using the Observability test environments.
run docs-build : Re-trigger the docs validation. (use unformatted text in the comment!)

x-pack/platform/plugins/shared/observability_ai_assistant/server/service/client/index.ts

sorenlouv · 2025-05-08T13:23:34Z

x-pack/platform/plugins/shared/observability_ai_assistant/server/routes/knowledge_base/route.ts

@@ -84,7 +84,7 @@ const warmupModelKnowledgeBase = createObservabilityAIAssistantServerRoute({
      requiredPrivileges: ['ai_assistant'],
    },
  },
-  handler: async (resources): Promise<void> => {
+  handler: async (resources): Promise<{ currentInferenceId: string }> => {


currentInferenceId will always be the same as inferenceId, right? What's the use case for returning this to the client?

@sorenlouv yes, we don't need to return it back. I updated it to just return instead of returning the inferenceId
6ed092f

sorenlouv

lgtm. Just one question about a return value

elasticmachine · 2025-05-08T21:03:52Z

⏳ Build in-progress

Buildkite Build
Commit: a81ece5
Kibana Serverless Image: docker.elastic.co/kibana-ci/kibana-serverless:pr-220445-a81ece5b2295

Failed CI Steps

FTR Configs #16

History

💔 Build #299859 failed 6ed092f
💛 Build #299811 was flaky 082fbcb
💚 Build #299566 succeeded bdf985b

cc @viduni94

kibanamachine · 2025-05-08T21:49:37Z

Starting backport for target branches: 8.19

https://github.com/elastic/kibana/actions/runs/14916807930

…astic#220445) Closes elastic/obs-ai-assistant-team#247 Closes elastic#217912 ## Summary ### Problems - The `/warmup_model` endpoint doesn't return immediately and waits for the KB to be ready. If there is no ML nodes or sufficient capacity in the ML node, the API can timeout. - Since the endpoint doesn't return immediately, we don't poll for status continuously. - Knowledge base tab doesn't show `Inspect` if no ML nodes are available. ### Solutions - Show `Inspect` information in the knowledge base - Return `/warmup_model` immediately (we don't need to wait for the model to be ready since we are polling), and start polling - If the user refreshes the browser and if the `kbState` is in `DEPLOYING_MODEL` keep polling for status ### Checklist - [x] The PR description includes the appropriate Release Notes section, and the correct `release_note:*` label is applied per the [guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process) (cherry picked from commit ff3822d)

kibanamachine · 2025-05-08T22:02:48Z

💚 All backports created successfully

Status	Branch	Result
✅	8.19

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

…ing (#220445) (#220591) # Backport This will backport the following commits from `main` to `8.19`: - [[Obs AI Assistant] Fix re-deploy model timeout and status polling (#220445)](#220445)  ### Questions ? Please refer to the [Backport tool documentation](https://github.com/sorenlouv/backport)  Co-authored-by: Viduni Wickramarachchi <viduni.wickramarachchi@elastic.co>

…astic#220445) Closes elastic/obs-ai-assistant-team#247 Closes elastic#217912 ## Summary ### Problems - The `/warmup_model` endpoint doesn't return immediately and waits for the KB to be ready. If there is no ML nodes or sufficient capacity in the ML node, the API can timeout. - Since the endpoint doesn't return immediately, we don't poll for status continuously. - Knowledge base tab doesn't show `Inspect` if no ML nodes are available. ### Solutions - Show `Inspect` information in the knowledge base - Return `/warmup_model` immediately (we don't need to wait for the model to be ready since we are polling), and start polling - If the user refreshes the browser and if the `kbState` is in `DEPLOYING_MODEL` keep polling for status ### Checklist - [x] The PR description includes the appropriate Release Notes section, and the correct `release_note:*` label is applied per the [guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)

viduni94 self-assigned this May 7, 2025

viduni94 requested review from a team as code owners May 7, 2025 21:51

viduni94 added release_note:skip Skip the PR/issue when compiling release notes Team:Obs AI Assistant Observability AI Assistant backport:version Backport to applied version labels v9.1.0 v8.19.0 labels May 7, 2025

botelastic bot added the ci:project-deploy-observability Create an Observability project label May 7, 2025

viduni94 mentioned this pull request May 7, 2025

[AI Assistant] Assistant stuck in "setting up the knowledge base" phase if ML nodes are undersized #217912

Closed

sorenlouv reviewed May 8, 2025

View reviewed changes

x-pack/platform/plugins/shared/observability_ai_assistant/server/service/client/index.ts Outdated Show resolved Hide resolved

sorenlouv reviewed May 8, 2025

View reviewed changes

sorenlouv approved these changes May 8, 2025

View reviewed changes

Fix Redeploy model timeout and status polling

082fbcb

viduni94 force-pushed the fix-reploy-model-timeout-and-status-polling branch from bdf985b to 082fbcb Compare May 8, 2025 16:02

viduni94 added 2 commits May 8, 2025 14:21

Empty return

6ed092f

Fix type

a81ece5

viduni94 merged commit ff3822d into elastic:main May 8, 2025
9 checks passed

kibanamachine mentioned this pull request May 8, 2025

[8.19] [Obs AI Assistant] Fix re-deploy model timeout and status polling (#220445) #220591

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Obs AI Assistant] Fix re-deploy model timeout and status polling #220445

[Obs AI Assistant] Fix re-deploy model timeout and status polling #220445

Uh oh!

viduni94 commented May 7, 2025 •

edited by kibanamachine

Loading

Uh oh!

elasticmachine commented May 7, 2025

Uh oh!

github-actions bot commented May 7, 2025

Uh oh!

Uh oh!

sorenlouv May 8, 2025

Uh oh!

viduni94 May 8, 2025

Uh oh!

sorenlouv left a comment

Uh oh!

elasticmachine commented May 8, 2025

Uh oh!

Uh oh!

kibanamachine commented May 8, 2025

Uh oh!

kibanamachine commented May 8, 2025

Uh oh!

Uh oh!

[Obs AI Assistant] Fix re-deploy model timeout and status polling #220445

[Obs AI Assistant] Fix re-deploy model timeout and status polling #220445

Uh oh!

Conversation

viduni94 commented May 7, 2025 • edited by kibanamachine Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problems

Solutions

Checklist

Uh oh!

elasticmachine commented May 7, 2025

Uh oh!

github-actions bot commented May 7, 2025

🤖 GitHub comments

Uh oh!

Uh oh!

sorenlouv May 8, 2025

Choose a reason for hiding this comment

Uh oh!

viduni94 May 8, 2025

Choose a reason for hiding this comment

Uh oh!

sorenlouv left a comment

Choose a reason for hiding this comment

Uh oh!

elasticmachine commented May 8, 2025

⏳ Build in-progress

Failed CI Steps

History

Uh oh!

Uh oh!

kibanamachine commented May 8, 2025

Uh oh!

kibanamachine commented May 8, 2025

💚 All backports created successfully

Questions ?

Uh oh!

Uh oh!

viduni94 commented May 7, 2025 •

edited by kibanamachine

Loading