Skip to content

feat(server): auto max_batch_total_tokens for flash att models #688

feat(server): auto max_batch_total_tokens for flash att models

feat(server): auto max_batch_total_tokens for flash att models #688