About Concurrency and Stability #637
gdw439 announced in Announcements
Replies: 1 comment
https://www.baseten.co/blog/how-we-built-bei-high-throughput-embedding-inference/#performance-benefits-from-baseten-infrastructure Answer: It depends. It has many more features around embeddings than vLLM.
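Since the question is about concurrency under load, here is a minimal sketch of how one might drive an OpenAI-compatible `POST /v1/embeddings` endpoint with many requests in flight at once. The `embed` function below is a hypothetical stand-in for the HTTP call (a real client would use e.g. `aiohttp` or the `openai` SDK); only the payload shape follows the OpenAI embeddings API.

```python
import asyncio

# Hypothetical stand-in for an HTTP call to an OpenAI-compatible
# POST /v1/embeddings endpoint. It echoes back one fixed-size vector
# per input string, mimicking the response's "data" list.
async def embed(payload: dict) -> dict:
    await asyncio.sleep(0.01)  # simulate network + inference latency
    return {
        "data": [
            {"embedding": [0.0] * 8, "index": i}
            for i, _ in enumerate(payload["input"])
        ]
    }

async def main() -> list[dict]:
    batches = [["hello world"], ["concurrency", "stability"]]
    # "model" name is a placeholder; "input" accepts a list of strings.
    payloads = [{"model": "my-embedding-model", "input": b} for b in batches]
    # Issue all requests concurrently; sustained throughput under this
    # kind of load is what embedding-focused servers are tuned for.
    return await asyncio.gather(*(embed(p) for p in payloads))

results = asyncio.run(main())
print(len(results))  # 2
```

Swapping the stub for a real HTTP call keeps the same structure: `asyncio.gather` fans the requests out, and per-request error handling can be added around each `embed` call to probe stability.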
Feature request
Hello! I recently came across this popular OpenAI-compatible inference framework and found it very interesting. I'd like to know more about its concurrency and stability—specifically, how it compares to vLLM.
Motivation
Project feature.
Your contribution
No.