Model & user level throttling #1800

b4sjoo · 2023-12-21T21:15:41Z

Description

This PR enables model-level and user-level rate limiting for TextEmbedding models and all remote models.
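
For readers unfamiliar with rate limiting, here is a minimal token-bucket sketch of the general technique. It is not the code in this PR: the class name `TokenBucketSketch`, its constructor parameters, and the injectable nanosecond clock are all hypothetical and chosen only for illustration.

```java
import java.util.function.LongSupplier;

/** Hypothetical token-bucket limiter for illustration; not the class used in ml-commons. */
final class TokenBucketSketch {
    private final double capacity;          // maximum number of tokens (burst size)
    private final double tokensPerSecond;   // steady refill rate
    private final LongSupplier nanoClock;   // nanosecond clock, injectable so tests are deterministic
    private double tokens;                  // tokens currently available
    private long lastRefillNanos;           // clock reading at the last refill

    TokenBucketSketch(double capacity, double tokensPerSecond, LongSupplier nanoClock) {
        this.capacity = capacity;
        this.tokensPerSecond = tokensPerSecond;
        this.nanoClock = nanoClock;
        this.tokens = capacity;             // start with a full bucket
        this.lastRefillNanos = nanoClock.getAsLong();
    }

    /** Consumes one token and returns true if the request is allowed; returns false otherwise. */
    synchronized boolean tryAcquire() {
        long now = nanoClock.getAsLong();
        // Add the tokens earned since the last call, capped at the bucket capacity.
        double refill = (now - lastRefillNanos) * tokensPerSecond / 1_000_000_000.0;
        tokens = Math.min(capacity, tokens + refill);
        lastRefillNanos = now;
        if (tokens >= 1.0) {
            tokens -= 1.0;
            return true;
        }
        return false;
    }
}
```

In real use the clock would typically be `System::nanoTime`, for example `new TokenBucketSketch(10, 5.0, System::nanoTime)` for a burst of 10 requests refilled at 5 per second.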

Check List

- New functionality includes testing.
  - All tests pass
- New functionality has been documented.
  - New functionality has javadoc added
- Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

dhrubo-os · 2023-12-21T22:41:41Z

Is this PR ready for review? The Spotless check is missing. Are there really 111 files changed, or were other commits also added here?

b4sjoo · 2023-12-21T22:45:29Z

> Is this PR ready for review? The Spotless check is missing. Are there really 111 files changed, or were other commits also added here?

The main code is ready for review; only some UTs are missing. This PR also includes some refactoring of UT names, but overall the changes are only about the throttling feature itself.

b4sjoo · 2023-12-21T22:45:43Z

Yeah it's a super large PR...

Zhangxunmt · 2023-12-26T20:36:22Z

Added a few comments about the user experience and the data model in the OS index. Do you have a Quip document to track all the API usages and examples that will be added in this PR?

b4sjoo · 2023-12-26T20:56:00Z

> Added a few comments about the user experience and the data model in the OS index. Do you have a Quip document to track all the API usages and examples that will be added in this PR?

Addressed. The API usage will be added later today to the same Quip we used for the demo.

Zhangxunmt

This can be merged, with 2 major concerns from my view:

- There will be data redundancy in the ML model because both user-level and model-level throttling are bound within the ML model. It's not obvious that we can decouple them easily in the future.
- The APIs added in this PR should all be optional for users running ml-commons. There shouldn't be any required setup for throttling unless someone needs to use throttling.

b4sjoo · 2023-12-26T23:00:50Z

> This can be merged, with 2 major concerns from my view:
>
> - There will be data redundancy in the ML model because both user-level and model-level throttling are bound within the ML model. It's not obvious that we can decouple them easily in the future.
> - The APIs added in this PR should all be optional for users running ml-commons. There shouldn't be any required setup for throttling unless someone needs to use throttling.

We can discuss the first one further. As for the second, all throttling-related APIs are currently optional for our users. By default, all requests pass, the same as the previous behavior.
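
The opt-in behavior described here can be sketched roughly as follows. This is an illustration only, not the implementation in this PR: `ThrottleGateSketch`, its methods, and the simple request-budget model (a fixed number of remaining requests rather than a time-based rate) are assumptions made for the example. The point is that a model or user with no configured limit is never throttled.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical gate: requests pass unless a limit was explicitly configured. */
final class ThrottleGateSketch {
    // Remaining request budget per key; a key with no entry is not throttled at all.
    private final Map<String, Integer> remaining = new HashMap<>();

    synchronized void setModelLimit(String modelId, int maxRequests) {
        remaining.put(modelKey(modelId), maxRequests);
    }

    synchronized void setUserLimit(String modelId, String user, int maxRequests) {
        remaining.put(userKey(modelId, user), maxRequests);
    }

    /** Returns true if the request may proceed, consuming budget only where a limit exists. */
    synchronized boolean allow(String modelId, String user) {
        String mk = modelKey(modelId);
        String uk = userKey(modelId, user);
        // Check both limits first so a rejected request consumes nothing.
        if (exhausted(mk) || exhausted(uk)) {
            return false;
        }
        remaining.computeIfPresent(mk, (key, left) -> left - 1);
        remaining.computeIfPresent(uk, (key, left) -> left - 1);
        return true;
    }

    private boolean exhausted(String key) {
        Integer left = remaining.get(key);
        return left != null && left <= 0;   // no entry means no limit configured
    }

    private static String modelKey(String modelId) {
        return "model:" + modelId;
    }

    private static String userKey(String modelId, String user) {
        return "user:" + modelId + ":" + user;
    }
}
```

Keying the user limit by model ID mirrors the point raised in the review that user-level throttling is tied to a specific model; whether the real data model looks like this is not something the sketch claims.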

opensearch-trigger-bot · 2023-12-26T23:07:25Z

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.x 2.x
# Navigate to the new working tree
cd .worktrees/backport-2.x
# Create a new branch
git switch --create backport/backport-1800-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 50788de0bcf7a9509da6d2dabb23aab59d5a843d
# Push it to GitHub
git push --set-upstream origin backport/backport-1800-to-2.x
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-1800-to-2.x.

b4sjoo added 10 commits December 21, 2023 01:12

- f1eb8cb: Enable in-place update model
- 1aa71f9: Refactor inplace update api as well as adding more doc/comments/annotations for clarification
- 428046b: Enabling Model Level Throttling & Quota
- 1f77167: AOS sanity test
- 03f1f4f: fine tune model level throttling object
- d64f03f: Enable user level throttling
- 5a2d455: revert AOS code
- 130d2fa: Fine tuning
- 0a51632: Style fix
- 06243c9: Add UT coverage

Each commit is signed off: Signed-off-by: Sicheng Song <sicheng.song@outlook.com>

b4sjoo requested review from dhrubo-os, jngz-es, model-collapse, rbhavna, ylwu-amzn, zane-neo, Zhangxunmt, austintlee and HenryL27 as code owners December 21, 2023 21:15

b4sjoo had a problem deploying to ml-commons-cicd-env December 21, 2023 21:15 — with GitHub Actions Error

b4sjoo had a problem deploying to ml-commons-cicd-env December 21, 2023 21:15 — with GitHub Actions Failure

b4sjoo had a problem deploying to ml-commons-cicd-env December 21, 2023 21:16 — with GitHub Actions Error

b4sjoo had a problem deploying to ml-commons-cicd-env December 21, 2023 21:16 — with GitHub Actions Failure

b4sjoo added 2 commits December 21, 2023 21:23

- bfe3663: Revert "revert AOS code" (This reverts commit 5a2d455.)
- f6f6c99: AOS utility (Signed-off-by: Sicheng Song <sicheng.song@outlook.com>)

b4sjoo had a problem deploying to ml-commons-cicd-env December 26, 2023 04:08 — with GitHub Actions Error

b4sjoo had a problem deploying to ml-commons-cicd-env December 26, 2023 04:08 — with GitHub Actions Failure

Commit 1e3f9d7: Add java docs (Signed-off-by: Sicheng Song <sicheng.song@outlook.com>)

b4sjoo had a problem deploying to ml-commons-cicd-env December 26, 2023 12:46 — with GitHub Actions Failure

b4sjoo had a problem deploying to ml-commons-cicd-env December 26, 2023 12:46 — with GitHub Actions Error

b4sjoo had a problem deploying to ml-commons-cicd-env December 26, 2023 12:47 — with GitHub Actions Error

b4sjoo had a problem deploying to ml-commons-cicd-env December 26, 2023 12:47 — with GitHub Actions Failure

Zhangxunmt reviewed Dec 26, 2023

common/src/main/java/org/opensearch/ml/common/controller/MLModelController.java (resolved)

Zhangxunmt reviewed Dec 26, 2023

plugin/src/main/java/org/opensearch/ml/action/update_cache/UpdateModelCacheTransportAction.java (resolved)

common/src/main/java/org/opensearch/ml/common/controller/MLModelController.java (resolved)

Zhangxunmt approved these changes Dec 26, 2023

rbhavna approved these changes Dec 26, 2023

b4sjoo merged commit 50788de into opensearch-project:main Dec 26, 2023
4 of 10 checks passed

b4sjoo added the backport 2.x label Dec 26, 2023

arjunkumargiri mentioned this pull request Dec 27, 2023

Add ML agent tools #1812

Closed

5 tasks

b4sjoo added a commit to b4sjoo/ml-commons that referenced this pull request Dec 28, 2023

Model & user level throttling (opensearch-project#1800)

adadb87

* Enable in-place update model --------- Signed-off-by: Sicheng Song <sicheng.song@outlook.com>

jackiehanyang pushed a commit to jackiehanyang/ml-commons that referenced this pull request Dec 28, 2023

Model & user level throttling (opensearch-project#1800)

c4453b8

* Enable in-place update model --------- Signed-off-by: Sicheng Song <sicheng.song@outlook.com>

Zhangxunmt mentioned this pull request Dec 29, 2023

add auto expand replica settings to memories #1819

Merged

5 tasks

jackiehanyang mentioned this pull request Dec 29, 2023

Add GetTool API and ListTools API #1818

Merged

5 tasks

b4sjoo deleted the main_throttling branch January 3, 2024 01:02

austintlee pushed a commit to austintlee/ml-commons that referenced this pull request Mar 19, 2024

Model & user level throttling (opensearch-project#1800)

2dedd61

* Enable in-place update model --------- Signed-off-by: Sicheng Song <sicheng.song@outlook.com>
