Enable customizing HTTP client parameters #1558

Conversation

@zane-neo (Collaborator) commented Oct 27, 2023

Description

Remote inference currently uses CloseableHttpClient with default configurations for both the AWS protocol and the HTTP protocol. The drawback is that customers cannot configure the client for their own use cases, and exceptions are sometimes thrown from the HTTP client. This feature lets customers configure their own parameters for the HTTP client.
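
To illustrate what configuring the client could look like, here is a minimal sketch assuming Apache HttpClient 4.x (the library behind CloseableHttpClient); the parameter names below are placeholders for illustration, not the exact setting names introduced by this PR:

```java
import org.apache.http.client.config.RequestConfig;
import org.apache.http.impl.client.CloseableHttpClient;
import org.apache.http.impl.client.HttpClients;

public final class ConfigurableHttpClientFactory {

    private ConfigurableHttpClientFactory() {}

    // Build a CloseableHttpClient from user-supplied values instead of the library defaults.
    public static CloseableHttpClient create(int connectionTimeoutMillis, int readTimeoutMillis, int maxConnections) {
        RequestConfig requestConfig = RequestConfig
            .custom()
            .setConnectTimeout(connectionTimeoutMillis) // time allowed to establish the connection
            .setSocketTimeout(readTimeoutMillis)        // time allowed to wait for response data
            .build();
        return HttpClients
            .custom()
            .setDefaultRequestConfig(requestConfig)
            .setMaxConnTotal(maxConnections)            // upper bound on pooled connections
            .build();
    }
}
```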

Issues Resolved

#1470
#1537

Check List

  • New functionality includes testing.
    • All tests pass
  • New functionality has been documented.
    • New functionality has javadoc added
  • Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

codecov bot commented Oct 27, 2023

Codecov Report

Attention: 17 lines in your changes are missing coverage. Please review.

Comparison is base (693a53c) 80.28% compared to head (347fbfc) 80.17%.
Report is 17 commits behind head on 2.x.

❗ Current head 347fbfc differs from pull request most recent head b5f2ed4. Consider uploading reports for the commit b5f2ed4 to get more accurate results

Files Patch % Lines
...engine/algorithms/remote/AwsConnectorExecutor.java 0.00% 13 Missing ⚠️
...ine/algorithms/remote/RemoteConnectorExecutor.java 0.00% 4 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff              @@
##                2.x    #1558      +/-   ##
============================================
- Coverage     80.28%   80.17%   -0.12%     
- Complexity     2526     2531       +5     
============================================
  Files           199      200       +1     
  Lines         10001    10056      +55     
  Branches       1002     1004       +2     
============================================
+ Hits           8029     8062      +33     
- Misses         1499     1522      +23     
+ Partials        473      472       -1     
Flag Coverage Δ
ml-commons 80.17% <70.68%> (-0.12%) ⬇️

Flags with carried forward coverage won't be shown.


@@ -177,4 +177,25 @@ private MLCommonsSettings() {}
// Feature flag for enabling search processors for Retrieval Augmented Generation using OpenSearch and Remote Inference.
public static final Setting<Boolean> ML_COMMONS_RAG_PIPELINE_FEATURE_ENABLED =
GenerativeQAProcessorConstants.RAG_PIPELINE_FEATURE_ENABLED;

public static final Setting<Integer> ML_COMMONS_HTTP_CLIENT_CONNECTION_TIMEOUT_IN_MILLI_SECOND = Setting
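
For context, a fully spelled-out declaration of a cluster setting like this typically looks like the sketch below; the key string, default value, and properties here are assumptions for illustration rather than what this PR ultimately merges:

```java
import org.opensearch.common.settings.Setting;

public final class ExampleHttpClientSettings {

    private ExampleHttpClientSettings() {}

    // Connection timeout for the remote-inference http client, in milliseconds.
    // The setting key and default value below are hypothetical.
    public static final Setting<Integer> ML_COMMONS_HTTP_CLIENT_CONNECTION_TIMEOUT_IN_MILLI_SECOND = Setting
        .intSetting(
            "plugins.ml_commons.http_client_connection_timeout_in_millis",
            10_000,
            0,
            Setting.Property.NodeScope,
            Setting.Property.Dynamic
        );
}
```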
@dhrubo-os (Collaborator) commented Oct 27, 2023
What if a customer wants timeout = 60 ms for remote model A and timeout = 80 ms for remote model B?

@austintlee do you think this could be a valid use case?

Maybe the customer can provide these parameters when setting up the connector? We'd have some default values, but customers could provide their own values if they want.

@zane-neo (Collaborator, PR author) replied:

60 ms and 80 ms may not be a perfect example, but I think this is a valid use case: different LLMs have different latency characteristics. I agree that we can add these httpclient configurations to the connector. @austintlee please share your thoughts as well.

@Zhangxunmt (Collaborator) commented:

Why not include these three params in the request body of the "Predict" call to each LLM? If they are absent in the request, we can use the default values like these settings.

@zane-neo (Collaborator, PR author) replied:

This can be implemented, but we'd better make that change in a different PR; the target of this PR is to enable customers to set these parameters to avoid exceptions.

@ylwu-amzn (Collaborator) commented Dec 7, 2023

@zane-neo So we have three options now:

  1. Add a cluster setting like what this PR does
  2. Add these parameters to connector configuration, from @dhrubo-os
  3. User can set these parameters in request body, from @Zhangxunmt

From my understanding, we already have a lot of cluster settings, so maybe let's not add more. Adding these at the connector level seems more flexible, and if we add them as connector parameters, we can also support option 3 by default.

A collaborator replied:

+1 on this.

@zane-neo (Collaborator, PR author) replied Dec 12, 2023

Makes sense. This configuration only applies to the remote inference case, so we can put these in the connector configuration. The only drawback is that the user has to configure this for each connector, which is somewhat verbose.
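
If the connector-level approach is adopted, one way it could work is for each connector to optionally carry these values and fall back to a cluster-wide default when they are absent. A hypothetical sketch (the parameter name "connection_timeout_ms" and the helper class are illustrative only, not part of this PR):

```java
import java.util.Map;

public final class HttpClientParamResolver {

    private HttpClientParamResolver() {}

    // Return the connector-level timeout when present and valid, otherwise the cluster default.
    public static int resolveConnectionTimeoutMillis(Map<String, String> connectorParameters, int clusterDefaultMillis) {
        String raw = connectorParameters == null ? null : connectorParameters.get("connection_timeout_ms");
        if (raw == null || raw.isEmpty()) {
            return clusterDefaultMillis;
        }
        try {
            return Integer.parseInt(raw);
        } catch (NumberFormatException e) {
            return clusterDefaultMillis; // malformed value: fall back rather than fail the request
        }
    }
}
```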
