Batch inference tool #13863
someone13574 started this conversation in Ideas
A tool for efficiently processing very large datasets would be nice. You would give it a file of items to process; it would reorder them to take advantage of things like common prefixes, and then run as many as possible in parallel, as the batch size allows. The server sort of works for this use case, but taking full advantage of parallelism still requires non-trivial client-side work (e.g. issuing multiple async requests, ordering inputs to benefit from prefix caching).
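For concreteness, here is a minimal client-side sketch of the idea, assuming an OpenAI-compatible `/v1/completions` endpoint like the one `llama-server` exposes. The server URL, concurrency limit, and payload fields are illustrative, not a proposed interface: it sorts prompts so shared prefixes end up adjacent (which makes consecutive requests more likely to hit the prefix cache) and fans them out with bounded concurrency.

```python
# Hypothetical sketch of the batch-client idea; endpoint, model settings,
# and concurrency limit are assumptions, not part of the proposal.
import asyncio
import json

import aiohttp

SERVER = "http://localhost:8080/v1/completions"  # assumed llama-server address
CONCURRENCY = 8  # assumed; ideally matched to the server's parallel slots

async def complete(session: aiohttp.ClientSession,
                   sem: asyncio.Semaphore, prompt: str) -> str:
    # Bound the number of in-flight requests with a semaphore.
    async with sem:
        payload = {"prompt": prompt, "max_tokens": 128}
        async with session.post(SERVER, json=payload) as resp:
            data = await resp.json()
            return data["choices"][0]["text"]

async def run(prompts: list[str]) -> list[str]:
    # A plain lexicographic sort clusters prompts that share a common
    # prefix, so requests dispatched back-to-back can reuse cached prefixes.
    ordered = sorted(prompts)
    sem = asyncio.Semaphore(CONCURRENCY)
    async with aiohttp.ClientSession() as session:
        return await asyncio.gather(
            *(complete(session, sem, p) for p in ordered))

if __name__ == "__main__":
    with open("inputs.txt") as f:
        prompts = [line.rstrip("\n") for line in f]
    for out in asyncio.run(run(prompts)):
        print(json.dumps({"completion": out}))
```

Even this small sketch shows why a built-in tool would help: the ordering heuristic, the concurrency bound, and error/retry handling all have to be reimplemented by every user today.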