Closed
Description
When an ML job hits the hard_limit, an audit message is generated. This message contains details about where the limit was hit. Currently, however, the reported number is not very helpful, because it only reflects the latest model_bytes of the job, which does not include the memory required by the next stage of the analysis.
Example:
- Model memory limit setting: 6mb
- Reported model_bytes: 3.8mb
- Job audit message: Job memory status changed to hard_limit at 1.4mb [...]
This can be confusing for the user, because the value in the audit message is well below the configured limit.
Feature request: Adjust the audit message to better explain why the hard_limit is hit.
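To illustrate the request, here is a minimal sketch of what a more explanatory message could look like. It is not the actual implementation: the function names `format_mb` and `hard_limit_message`, the message wording, and the choice to report both the configured limit and the last model_bytes are all assumptions made for this example.

```python
BYTES_PER_MB = 1024 * 1024


def format_mb(num_bytes: int) -> str:
    """Render a byte count as megabytes with at most one decimal, e.g. 6mb or 3.8mb."""
    value = f"{num_bytes / BYTES_PER_MB:.1f}"
    # Drop a trailing ".0" so whole numbers print without decimals.
    return value.rstrip("0").rstrip(".") + "mb"


def hard_limit_message(limit_bytes: int, model_bytes: int) -> str:
    """Build a hypothetical audit message that states both the configured
    limit and the last reported model_bytes, and notes that model_bytes
    excludes the memory needed by the next stage of the analysis."""
    return (
        "Job memory status changed to hard_limit; "
        f"model memory limit is {format_mb(limit_bytes)}, "
        f"last reported model_bytes is {format_mb(model_bytes)}. "
        "The next stage of the analysis required more memory than was left "
        "(this memory is not included in model_bytes)."
    )


if __name__ == "__main__":
    # Values taken from the example in this issue: 6mb limit, 3.8mb model_bytes.
    print(hard_limit_message(6 * BYTES_PER_MB, 3984588))
```

Stating the limit and the last model_bytes side by side would make it clear that the job stopped because of the upcoming allocation, not because the reported usage itself crossed the limit.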