Event Driven Scaling Architecture with Kubernetes KEDA on AWS
Updated Jul 31, 2021 - Python
Sample Python Kubernetes Jobs with event-driven autoscaling using the KEDA Azure Service Bus scaler
A simple KEDA tutorial that uses the Killercoda platform for convenience. The repository is open and under a GNU licence; all well-documented suggestions and improvements are very welcome. :)
Production-grade vLLM serving with an OpenAI-compatible API, per-request LoRA routing, KEDA autoscaling on Prometheus metrics, Grafana/OTel observability, and a benchmark comparing AWQ vs GPTQ vs GGUF.