Skip to content

v5.0 Inference post-mortem item - Enforcing rules on late submissions #314

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 5 commits into
base: master
Choose a base branch
from

Conversation

nvashutoshd
Copy link
Contributor

This PR addresses issues that arose in the v5.0 and prior inference reviews and attempts to clarify the rules surrounding late submissions.

This PR:

  1. Reinforces the mlcommon rules and asks that the 60 min grace period be strictly adhered to.
  2. Explicitly lists out the only circumstances in which late submissions will be accepted.
  3. Explicitly prohibits any new logs that will change performance/power/accuracy.

The goal is:

  1. Help streamline the review process - we spend far too much time on late submissions.
  2. Enforce fairness and ensure all submitters get equal time.
  3. Addresses the consideration that unforced errors can happen during the submission process.

@nvashutoshd nvashutoshd requested a review from a team as a code owner July 8, 2025 12:47
Copy link

github-actions bot commented Jul 8, 2025

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅


**No modifications** to hyperparameters, batch size, loadgen configurations and flags, dataset sorting, or any other performance-sensitive parameters are allowed after the submission deadline. This restriction applies even if such changes are necessary to comply with the rules.

Under no circumstances will submitters be allowed to upload new logs that alter the accuracy, performance, or power results, regardless of the extent of the change.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we add a 1-2% delta here to account for noise - if the change is asked by the review committee?

@@ -129,6 +129,21 @@ MLCommons shall retain a library of past audit reports and send copies to MLComm

An audit is expected to be completed within a 90 day period. Audits failing to meet this timeline can be requested to be invalidated by the auditee. The final decision to accept such a request will be taken by the Working Group.

=== Late Submissions and Late Upload of Logs

To ensure fairness to all submitters and the integrity of the process, MLPerf Inference will strictly adhere to the https://github.com/mlcommons/policies/blob/master/submission_rules.adoc#late-submissions[MLCommons general submission rules]. The 60 minute grace period will be strictly enforced, and any late submissions will not be considered as valid submissions. Exceptions will be made only in extraordinary circumstances as listed in the https://github.com/mlcommons/policies/blob/master/submission_rules.adoc#post-submission-extension-for-extraordinary-circumstances-submission-deadline-72-hours[mlcommons guidelines].
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This PR is much appreciated considering the amount of time which is spent during the review meetings on such requests. The one concern though is even though this PR is making the rules strict it is not preventing any such late submission requests from coming to the review committee. Also we know that the rules are a bit relaxed for first time submitters - but "how relaxed" is not defined. It would be good if the rules are clear so that the review committee chair can show that to the submitters asking for allowing any late submissions instead of those requests coming to the review committee for discussions.


**Exceptions** to this rule may be granted solely at the discretion of the Review Committee, and only to ensure compliance with the established rules. For instance, if logs are corrupted during the submission process or if the submission checker fails to detect missing logs, the Review Committee may authorize a re-upload of the affected logs.

**No modifications** to hyperparameters, batch size, loadgen configurations and flags, dataset sorting, or any other performance-sensitive parameters are allowed after the submission deadline. This restriction applies even if such changes are necessary to comply with the rules.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we explicitly state that modifications are indeed allowed in case of valid hyperparameter borrowing scenarios which is actually allowed in Training during the review period?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we explicitly state that modifications are indeed allowed in case of valid hyperparameter borrowing scenarios which is actually allowed in Training during the review period?

This repo has the policy applies to Inference only, and we don’t have any HP borrowing in Inference.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh shoot, my bad.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants