Skip to content

Latest commit

 

History

History
7 lines (4 loc) · 459 Bytes

README.md

File metadata and controls

7 lines (4 loc) · 459 Bytes

path-kubernetes

This is the k8s configuration for deploying NVIDIA Triton on the PATh facility.

General Notes:

Because PATh runs OSPool jobs on the same hardware. Replicas need to be incremented slowly, i.e. one needs to change replica acount from 1->2->3->..., to allow for the OSPool jobs to be evicted. If you don't do that the cluster will claim it doesn't have the resources. Incrementing the replica count will give k8s to evict the OSPool jobs.