Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Request] aws bedrock cross-region inference compatibility #4659

Open
gemnioo opened this issue Nov 10, 2024 · 3 comments
Open

[Request] aws bedrock cross-region inference compatibility #4659

gemnioo opened this issue Nov 10, 2024 · 3 comments
Labels
🌠 Feature Request New feature or request | 特性与建议

Comments

@gemnioo
Copy link

gemnioo commented Nov 10, 2024

🥰 需求描述

aws bedrock rolling out cross-region inference for Claude 3.5 and some Llama models. it will have compatibility issues with current lobe.env.

🧐 解决方案

designated the region like "us-west-2" Claude 3.5 Sonnet v2 "anthropic.claude-3-5-sonnet-20241022-v2:0"

or with cross-region inference with "us.anthropic.claude-3-5-sonnet-20241022-v2:0"

the difference in Inference profile ID aka 'model identifier' lead to

"Invocation of model ID anthropic.claude-3-5-sonnet-20241022-v2:0 with on-demand throughput isn’t supported"
error in lobechat

"The provided model identifier is invalid" error in liteLLM Proxy

📝 补充信息

aws cross-region inference page

https://docs.aws.amazon.com/bedrock/latest/userguide/inference-profiles-support.html

https://us-east-1.console.aws.amazon.com/bedrock/home?region=us-east-1#/inference-profiles

@gemnioo gemnioo added the 🌠 Feature Request New feature or request | 特性与建议 label Nov 10, 2024
@lobehubbot
Copy link
Member

👀 @gemnioo

Thank you for raising an issue. We will investigate into the matter and get back to you as soon as possible.
Please make sure you have given us as much context as possible.
非常感谢您提交 issue。我们会尽快调查此事,并尽快回复您。 请确保您已经提供了尽可能多的背景信息。

@gemnioo
Copy link
Author

gemnioo commented Nov 10, 2024

reference:

#4458

BerriAI/litellm#6589

@gemnioo
Copy link
Author

gemnioo commented Nov 10, 2024

IMO lobechat or others needs find a way to compatible with cross-region inference; rather than add another models.

region us-west-2 with "anthropic.claude-3-5-sonnet-20241022-v2:0" ✅

region us-west-2/us-east-1/us-east-2 with "us.anthropic.claude-3-5-sonnet-20241022-v2:0" ✅

region us-east-1/us-east-2 with "anthropic.claude-3-5-sonnet-20241022-v2:0" ❌

same with Claude last update 3.5 Haiku model.

for the Old Claude 3 Haiku Sonnet Opus models. cross-region inference only in us-east-1 / us-west-2

Screenshot Amazon Bedrock

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🌠 Feature Request New feature or request | 特性与建议
Projects
None yet
Development

No branches or pull requests

2 participants