pipeline-parallelism

Here is 1 public repository matching this topic...

capetron / ptg-vllm-deploy

Production vLLM deployment configs for multi-GPU setups. Docker Compose, pipeline parallelism configs for 2/4/6/8 GPU RTX 6000 Pro, H100, and H200 systems. By Petronella Technology Group.

docker-compose nvidia multi-gpu gpu-cluster pipeline-parallelism tensor-parallelism vllm llm-inference h100 ai-infrastructure rtx-6000

Updated Apr 14, 2026
Shell

Improve this page

Add a description, image, and links to the pipeline-parallelism topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pipeline-parallelism topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pipeline-parallelism

Here is 1 public repository matching this topic...

capetron / ptg-vllm-deploy

Improve this page

Add this topic to your repo