Skip to content

[8.19] [Obs AI Assistant] Fix re-deploy model timeout and status polling (#220445) #220591

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
May 8, 2025

Conversation

kibanamachine
Copy link
Contributor

Backport

This will backport the following commits from main to 8.19:

Questions ?

Please refer to the Backport tool documentation

…astic#220445)

Closes elastic/obs-ai-assistant-team#247
Closes elastic#217912

## Summary

### Problems
- The `/warmup_model` endpoint doesn't return immediately and waits for
the KB to be ready. If there is no ML nodes or sufficient capacity in
the ML node, the API can timeout.
- Since the endpoint doesn't return immediately, we don't poll for
status continuously.
- Knowledge base tab doesn't show `Inspect` if no ML nodes are
available.

### Solutions

- Show `Inspect` information in the knowledge base
- Return `/warmup_model` immediately (we don't need to wait for the
model to be ready since we are polling), and start polling
- If the user refreshes the browser and if the `kbState` is in
`DEPLOYING_MODEL` keep polling for status

### Checklist

- [x] The PR description includes the appropriate Release Notes section,
and the correct `release_note:*` label is applied per the
[guidelines](https://www.elastic.co/guide/en/kibana/master/contributing.html#kibana-release-notes-process)

(cherry picked from commit ff3822d)
@elasticmachine
Copy link
Contributor

Pinging @elastic/obs-ai-assistant (Team:Obs AI Assistant)

@kibanamachine kibanamachine merged commit 910bb06 into elastic:8.19 May 8, 2025
13 checks passed
@elasticmachine
Copy link
Contributor

💛 Build succeeded, but was flaky

Failed CI Steps

Test Failures

  • [job] [logs] FTR Configs #36 / alerting api integration security and spaces enabled - Group 2 Alerts alerts update space_1_all at space1 should handle updates for a long running alert type without failing the underlying tasks due to invalidated ApiKey
  • [job] [logs] FTR Configs #59 / Alerts APIs - Trial License/Complete Tier @ess @skipInServerless find alert with/without doc level security "before all" hook for "should return alerts with user who has access to security solution privileges"
  • [job] [logs] FTR Configs #68 / InfraOps App Metrics UI Metrics UI Anomaly Flyout anomalies table in flyout with anomalies present should take users to hosts list when 'Show affected Hosts' is clicked

Metrics [docs]

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id before after diff
observabilityAIAssistantApp 256.3KB 256.4KB +92.0B
observabilityAiAssistantManagement 96.5KB 96.7KB +150.0B
searchAssistant 172.4KB 172.5KB +92.0B
total +334.0B

cc @viduni94

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport Team:Obs AI Assistant Observability AI Assistant
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants