[ML] Fix issues in dynamically reading the number of allocations #115095

davidkyle · 2024-10-18T11:58:12Z

Relates to problems in the GET inference API, which should dynamically update the num_allocations field with the actual number of allocations from the deployed model. This is required for adaptive allocations, where the value changes over time.

The num_allocations field in service_settings is updated with the current value.

GET _inference/elser_on_ml

{
  "endpoints": [
    {
      "inference_id": "elser_on_ml",
      "task_type": "sparse_embedding",
      "service": "elasticsearch",
      "service_settings": {
        "num_allocations": 3,            <-- this field is dynamic
        "num_threads": 1,
        "model_id": ".elser_model_2",
        "deployment_id": ".elser_model_2_for_me"
      },
      "chunking_settings": {
        "strategy": "sentence",
        "max_chunk_size": 250,
        "sentence_overlap": 1
      }
    }
  ]
}

The first issue is that GroupedActionListener throws if called with size == 0. This is now protected against by skipping the model update if the list is empty.
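A minimal sketch of the first issue and its guard, not the actual Elasticsearch code. `CountingGroupListener` is a hypothetical stand-in for a grouped listener that rejects a group size of zero, and the simplified `parseModels` signature and the choice to answer with an empty list are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

final class EmptyGroupGuard {

    // Hypothetical stand-in for a grouped listener: it collects groupSize
    // results and then hands them to the delegate in one call. Like the
    // listener described above, it refuses a group size of zero.
    static final class CountingGroupListener<T> {
        private final List<T> results = new ArrayList<>();
        private final int groupSize;
        private final Consumer<List<T>> delegate;

        CountingGroupListener(int groupSize, Consumer<List<T>> delegate) {
            if (groupSize <= 0) {
                throw new IllegalArgumentException("groupSize must be greater than 0 but was " + groupSize);
            }
            this.groupSize = groupSize;
            this.delegate = delegate;
        }

        synchronized void onResponse(T result) {
            results.add(result);
            if (results.size() == groupSize) {
                delegate.accept(List.copyOf(results));
            }
        }
    }

    // Simplified shape of the fix: check for an empty list before a grouped
    // listener is ever constructed. Responding with an empty list here is an
    // assumption; the point is only that the constructor is never reached.
    static void parseModels(List<String> unparsedModels, Consumer<List<String>> listener) {
        if (unparsedModels.isEmpty()) {
            listener.accept(List.of());
            return;
        }
        CountingGroupListener<String> grouped = new CountingGroupListener<>(unparsedModels.size(), listener);
        for (String model : unparsedModels) {
            grouped.onResponse("parsed:" + model);
        }
    }

    public static void main(String[] args) {
        parseModels(List.of(), parsed -> System.out.println("empty -> " + parsed));
        parseModels(List.of("a", "b"), parsed -> System.out.println("two -> " + parsed));
    }
}
```

Without the `isEmpty()` check, the first call in `main` would reach the constructor with a size of 0 and fail with an `IllegalArgumentException` instead of completing the listener.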

The second issue is that the wrong field was being updated, so the update was not visible in the API response. Tests are added to cover both cases.

Non-issue, as the code is not live.

elasticsearchmachine · 2024-10-18T11:58:35Z

Pinging @elastic/ml-core (Team:ML)

davidkyle · 2024-10-18T11:58:59Z

...src/main/java/org/elasticsearch/xpack/inference/action/TransportGetInferenceModelAction.java

    }

    private void parseModels(List<UnparsedModel> unparsedModels, ActionListener<GetInferenceModelAction.Response> listener) {
+        if (unparsedModels.isEmpty()) {


Without this check, the GroupedActionListener was called with 0 requests, which throws an exception.

…ing inference endpoints (#196577)

When listing the inference endpoints available for the semantic text field, we should only list `sparse_embedding` and `text_embedding` types.

This PR adds a check to the `data_visualizer/inference_endpoints` endpoint to ensure only `sparse_embedding` and `text_embedding` types are used and they have at least one allocation. NOTE: the allocation check is currently commented out, waiting on an ES change: elastic/elasticsearch#115095

Also renames the endpoint from `data_visualizer/inference_services` to `data_visualizer/inference_endpoints`, and renames variables which were incorrectly named "service" rather than "endpoint".
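The filtering rule described in that commit message can be sketched as follows. This is an illustration in Java, not the Kibana implementation: the `Endpoint` class is a hypothetical, trimmed-down view of one entry in the GET _inference response, and the endpoint names `my_rerank` and `idle_e5` are made up.

```java
import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

final class EndpointFilter {

    // Hypothetical, trimmed-down view of one endpoint entry.
    static final class Endpoint {
        final String inferenceId;
        final String taskType;
        final int numAllocations;

        Endpoint(String inferenceId, String taskType, int numAllocations) {
            this.inferenceId = inferenceId;
            this.taskType = taskType;
            this.numAllocations = numAllocations;
        }
    }

    private static final Set<String> ALLOWED_TASK_TYPES = Set.of("sparse_embedding", "text_embedding");

    // Keep only embedding endpoints that report at least one allocation.
    static List<String> usableIds(List<Endpoint> endpoints) {
        return endpoints.stream()
            .filter(e -> ALLOWED_TASK_TYPES.contains(e.taskType))
            .filter(e -> e.numAllocations >= 1)
            .map(e -> e.inferenceId)
            .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<Endpoint> endpoints = List.of(
            new Endpoint("elser_on_ml", "sparse_embedding", 3),
            new Endpoint("my_rerank", "rerank", 1),
            new Endpoint("idle_e5", "text_embedding", 0)
        );
        System.out.println(usableIds(endpoints));
    }
}
```

A strict `>= 1` test like this one drops any endpoint whose reported count is 0, so the result depends on `num_allocations` in the API response reflecting the deployed model.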

Enables the previously commented-out check for `num_allocations` when listing the inference endpoints. The adaptive allocation count can drop to 0, but the endpoint is still valid for use. Uploading a file will cause it to be re-deployed.

Related to ES PRs elastic/elasticsearch#115233 and elastic/elasticsearch#115095

Follow-on from #196577

davidkyle added 2 commits October 18, 2024 12:48

handle zero models

ee55146

Add test

2f72637

davidkyle added >non-issue :ml Machine learning v8.16.0 labels Oct 18, 2024

elasticsearchmachine added Team:ML Meta label for the ML team v8.16.1 labels Oct 18, 2024

davidkyle commented Oct 18, 2024

jgowdyelastic mentioned this pull request Oct 18, 2024

[ML] File data visualizer: only list sparse_embedding and text_embedding inference endpoints elastic/kibana#196577

Merged

davidkyle mentioned this pull request Oct 21, 2024

[ML] Dynamically get of num allocations for ml node models #115233

Merged

prwhelan approved these changes Oct 21, 2024

davidkyle removed the v8.16.1 label Oct 22, 2024

davidkyle merged commit 8ba4b14 into elastic:8.16 Oct 22, 2024
15 checks passed

This was referenced Oct 22, 2024

[ML] Avoid use of GroupedActionListener when the group size is zero #115363

Closed

_xpack/usage fails in 8.16.0 #115362

Closed

consulthys mentioned this pull request Oct 23, 2024

Missing alerts modal when opening Stack Monitoring elastic/kibana#197103

Closed

jgowdyelastic mentioned this pull request Oct 23, 2024

[ML] File upload: enabling check for model allocations elastic/kibana#197395

Merged
