Skip to content

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #4396

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models

Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models #4396

Re-run triggered February 20, 2025 13:18
Status Success
Total duration 15m 44s
Artifacts

cpu-torch-latest.yml

on: pull_request
Fit to window
Zoom out
Zoom in