gpu-inference

Here is 1 public repository matching this topic...

FurkanAtass / Scalable-ML-Inference-Eks

End-to-end scalable ML inference on EKS: KEDA-driven pod autoscaling with Prometheus custom metrics, Cluster Autoscaler for GPU node scaling, and NVIDIA GPU time-slicing to run multiple pods per GPU.

kubernetes machine-learning terraform scalability prometheus aws-eks mlops keda gpu-inference

Updated Aug 29, 2025
HCL

Improve this page

Add a description, image, and links to the gpu-inference topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gpu-inference topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gpu-inference

Here is 1 public repository matching this topic...

FurkanAtass / Scalable-ML-Inference-Eks

Improve this page

Add this topic to your repo