Open
Description
Description
Change to appropriate resource requests and/or limits for Neuron k8s device plugin. The AWS team has said their device plugin uses few CPU/Mem resources, but that's still not a number. To be done in manager/manifests/inferentia.yaml
.
Currently, we've associated 100m
CPU time & 100Mi
memory for the device plugin. Change that to an appropriate value once more information is released by them.
Additional context
As discussed in aws-neuron/aws-neuron-sdk#103.