grpc request performance issue #7187

Open
Yuyaying-winnie opened this issue May 7, 2024 · 1 comment
Labels
grpc (Related to the GRPC server), performance (A possible performance tune-up)

Comments

@Yuyaying-winnie

Why is the first request sent over gRPC still slow even after adding a model warm-up configuration on the server side?
Also, when a client sends requests to the model service in succession, the first request is always close to 10x slower than the subsequent ones.
How can I speed up the first request sent over gRPC?

@oandreeva-nv
Contributor

Hi @Yuyaying-winnie, could you please share a minimal reproducer for your issue and tell us which version of the container you are using?
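
A reproducer for this kind of report could start from a small timing harness like the sketch below. It only measures how much slower the first call is than the following ones. The names `measure_latencies`, `first_vs_rest`, and the `send_request` callable are hypothetical and not part of any library. In a real reproducer, `send_request` would wrap the actual gRPC inference call to the model service.

```python
import statistics
import time
from typing import Callable, List


def measure_latencies(
    send_request: Callable[[], object],
    n: int,
    clock: Callable[[], float] = time.perf_counter,
) -> List[float]:
    """Call send_request n times in succession; return per-call latency in seconds."""
    latencies = []
    for _ in range(n):
        start = clock()
        send_request()  # assumption: blocks until the response has arrived
        latencies.append(clock() - start)
    return latencies


def first_vs_rest(latencies: List[float]) -> float:
    """Ratio of the first latency to the median of the remaining latencies."""
    if len(latencies) < 2:
        raise ValueError("need at least two measurements")
    return latencies[0] / statistics.median(latencies[1:])
```

Calling `first_vs_rest(measure_latencies(send_request, 20))` once with a freshly created client, and again while reusing an existing client, would help show whether the slowdown comes from connection setup on the client side or from the server.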

@rmccorm4 added the performance and grpc labels on Jan 9, 2025