tag:github.com,2008:https://github.com/mlcommons/algorithmic-efficiency/releasesRelease notes from algorithmic-efficiency2026-02-10T04:44:25Ztag:github.com,2008:Repository/339507851/v1.0.12026-02-10T04:50:12ZV 1.0.1<h2>Summary</h2>
<p>Small fixes.</p>
<h3>Fixes</h3>
<ul>
<li>Small fixes in documentation</li>
<li>Fix in dataset setup for fineweb edu 10B</li>
</ul>priyakasimbegtag:github.com,2008:Repository/339507851/v1.0.02026-02-07T05:46:26ZV 1.0.0<h2>Summary</h2>
<ul>
<li>Stable release for AlgoPerf benchmark</li>
<li>Added language model workload.</li>
<li>Migrated from 8xV100 to 4xA100 setup.</li>
</ul>
<h2>What's Changed</h2>
<p>Improved and streamlined version of the benchmark, which includes a new workload and hardware migration.</p>
<h3>Added</h3>
<ul>
<li>[Code, Rules] Migrated from 8xV100 to 4xA100 (40GB) and calibrated the workload runtime budgets for this new hardware (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/892" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/892/hovercard">PR</a>).</li>
<li>[Code, Rules] Added LM <code>finewebedu_lm</code> workload with decoder-only language model with Fineweb-Edu 10B dataset (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/902" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/902/hovercard">PR</a>).</li>
<li>[Code] Support for changing dropout with the <code>model_fn</code> (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/884" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/884/hovercard">PR</a>).</li>
</ul>
<p><strong>Full Changelog</strong>: <a class="commit-link" href="https://github.com/mlcommons/algorithmic-efficiency/compare/algoperf-benchmark-0.5.0...algoperf-benchmark-1.0.0"><tt>algoperf-benchmark-0.5.0...algoperf-benchmark-1.0.0</tt></a></p>priyakasimbegtag:github.com,2008:Repository/339507851/v0.6.02025-09-08T13:26:12Zv0.6.0<h2>Summary</h2>
<ul>
<li>A rolling leaderboard to support continuous submissions.</li>
<li>Rule updates aimed at cost efficiency, e.g., removing held-out workloads, limiting to 3 repetition studies, and adjusting workload runtime budgets based on competition results.</li>
<li>JIT-sharding for JAX workloads.</li>
<li>Important bug fixes (e.g., batch norm behavior) and a more flexible API (e.g., a new prepare_for_eval function).</li>
</ul>
<h2>What's Changed</h2>
<p>Improved and streamlined version of the benchmark, which includes important bug fixes, API improvements, and benchmark protocol changes following the lessons learned from the first competition.</p>
<h3>Added</h3>
<ul>
<li>[Code, Rules] Updated API to allow for <code>prepare_for_eval</code> function (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/806" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/806/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/issues/789" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/789/hovercard">Issue</a>).</li>
<li>[Docs] Document default dropout values for each workload (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/806" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/806/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/issues/786" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/786/hovercard">Issue</a>).</li>
<li>[Docs] Unified versioning policy section (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/876" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/876/hovercard">PR</a>).</li>
<li>[Code] Add the ability to change dropout values during training (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/875" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/875/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/issues/753" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/753/hovercard">Issue</a>).</li>
</ul>
<h3>Changed/Removed</h3>
<ul>
<li>[Code, Docs] Rename package to <code>algoperf</code> (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/833" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/833/hovercard">PR</a>).</li>
<li>[Code, Docs] Switch to <code>ruff</code> for linting and formatting(<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/874" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/874/hovercard">PR</a>).</li>
<li>[Code, Rules] Pass <code>train_state</code> to <code>update_params</code> function (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/790" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/790/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/issues/785" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/785/hovercard">Issue</a>).</li>
<li>[Code, Rules] Reduced number of studies from 5 to 3 (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/876" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/876/hovercard">PR</a>). See also Section 5.1 in our <a href="https://arxiv.org/abs/2502.15015" rel="nofollow">results paper</a>.</li>
<li>[Code, Rules] Remove held-out workloads from the benchmark (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/876" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/876/hovercard">PR</a>). See also Section 5.1 in our <a href="https://arxiv.org/abs/2502.15015" rel="nofollow">results paper</a>.</li>
<li>[Code] Remove sacrebleu dependency (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/828" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/828/hovercard">PR</a>).</li>
<li>[Code] Switch to <code>pyproject.toml</code> for package management (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/830" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/830/hovercard">PR</a>).</li>
<li>[Code] Update Python version to 3.11 and dependencies accordingly (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/811" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/811/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/issues/805" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/805/hovercard">Issue</a>).</li>
<li>[Rules] Modify the runtime budgets and step hints for each workload (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/838" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/838/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/issues/836" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/836/hovercard">Issue</a>). See also Section 5.1 in our <a href="https://arxiv.org/abs/2502.15015" rel="nofollow">results paper</a>.</li>
<li>[Code] Automatically determine the package version via the latest GitHub tag (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/831" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/831/hovercard">PR</a>).</li>
<li>[Code, Docs] Move all algorithms into a dedicated <code>algorithms</code> directory (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/876" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/876/hovercard">PR</a>).</li>
<li>[Code] Migrate from <code>pmap</code> to <code>jit</code> in JAX for better performance and scalability (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/848" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/848/hovercard">PR</a>).</li>
</ul>
<h3>Fixed</h3>
<ul>
<li>[Code] Batch norm bug (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/783" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/783/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/798" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/798/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/issues/767" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/767/hovercard">Issue</a>).</li>
<li>[Code] Fix bug of potentially giving a free evaluation to a submission that goes out of <code>max_runtime</code> (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/789" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/789/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/issues/719#issuecomment-2328797610" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/719/hovercard">Issue</a>).</li>
<li>[Code] Fix that models in the self-tuning ruleset will always be initialized with default dropout (<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/873" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/873/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/pull/875" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/875/hovercard">PR</a>/<a href="https://github.com/mlcommons/algorithmic-efficiency/issues/753" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/753/hovercard">Issue</a>).</li>
</ul>
<p><strong>Full Changelog</strong>: <a class="commit-link" href="https://github.com/mlcommons/algorithmic-efficiency/compare/algoperf-benchmark-0.5.0...algoperf-benchmark-0.6.0"><tt>algoperf-benchmark-0.5.0...algoperf-benchmark-0.6.0</tt></a></p>fsschneidertag:github.com,2008:Repository/339507851/v0.5.02025-06-24T12:40:12Zv0.5.0<h2>Summary</h2>
<ul>
<li>Finalized variant workload targets.</li>
<li>Fix in random_utils helper function.</li>
<li>For conformer PyTorch Dropout layers set <code>inplace=True</code>.</li>
<li>Clear CUDA cache at begining of each trial for PyTorch.</li>
</ul>
<h2>What's Changed</h2>
<ul>
<li>update speech variants target setting points by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/priyakasimbeg/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/priyakasimbeg">@priyakasimbeg</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2209492296" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/727" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/727/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/pull/727">#727</a></li>
<li>set num_workers for librispeech back to 4 by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/priyakasimbeg/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/priyakasimbeg">@priyakasimbeg</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2209602808" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/736" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/736/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/pull/736">#736</a></li>
<li>[fix] random_utils.py to <code>_signed_to_unsigned</code> by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/tfaod/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/tfaod">@tfaod</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2214006365" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/739" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/739/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/pull/739">#739</a></li>
<li>Fix path in helper config for running experiments in bulk. by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/priyakasimbeg/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/priyakasimbeg">@priyakasimbeg</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2214397340" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/740" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/740/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/pull/740">#740</a></li>
<li>Finalize variants targets by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/priyakasimbeg/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/priyakasimbeg">@priyakasimbeg</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2212184851" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/738" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/738/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/pull/738">#738</a></li>
<li>Aiming to Fix Conformer OOM by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/pomonam/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/pomonam">@pomonam</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2195587207" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/710" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/710/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/pull/710">#710</a></li>
<li>Lint fixes by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/priyakasimbeg/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/priyakasimbeg">@priyakasimbeg</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2216019123" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/742" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/742/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/pull/742">#742</a></li>
<li>Add warning for PyTorch data loader num_workers flag. by <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/users/priyakasimbeg/hovercard" data-octo-click="hovercard-link-click" data-octo-dimensions="link_type:self" href="https://github.com/priyakasimbeg">@priyakasimbeg</a> in <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2208877519" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/726" data-hovercard-type="pull_request" data-hovercard-url="/mlcommons/algorithmic-efficiency/pull/726/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/pull/726">#726</a></li>
</ul>
<p><strong>Full Changelog</strong>: <a class="commit-link" href="https://github.com/mlcommons/algorithmic-efficiency/compare/algoperf-benchmark-0.1.4...algoperf-benchmark-0.1.5"><tt>algoperf-benchmark-0.1.4...algoperf-benchmark-0.1.5</tt></a></p>fsschneidertag:github.com,2008:Repository/339507851/v0.0.42025-06-24T12:39:11Zv0.0.4<p>Upgrade CUDA version to CUDA 12.1:</p>
<ul>
<li>Upgrade CUDA version in Dockerfiles that will be used for scoring.</li>
<li>Update Jax and PyTorch package version tags to use local CUDA installation.</li>
</ul>
<p>Add flag for completely disabling checkpointing.</p>
<ul>
<li>Note that we will run with checkpointing off at scoring time.</li>
</ul>
<p>Update Deepspeech and Conformer variant target setting configurations.</p>
<ul>
<li>Note that variant targets are not final.</li>
</ul>
<p>Fixed bug in scoring code to take best trial in a study for external-tuning ruleset.</p>
<p>Added instructions for submission.</p>
<p>Changed default number of workers for PyTorch data loaders to 0. Running imagenet workloads with >0 may lead to incorrect eval results see <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2209554541" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/732" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/732/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/issues/732">#732</a>.<br>
Update: for speech workloads the <code>pytorch_eval_num_workers</code> flag to submission_runner.py has to be set to >0, to prevent data loader crash in jax code.</p>fsschneidertag:github.com,2008:Repository/339507851/v0.0.32025-06-24T12:38:35Zv0.0.3<p>Update technical documentation.</p>
<p>Bug fixes:</p>
<ul>
<li>Fix workload variant names in Dockerfile.</li>
<li>Fix VIT GLU OOM by reducing batch size.</li>
<li>Fix submission_runner stopping condition.</li>
<li>Fix dropout rng in ViT and WMT.</li>
</ul>fsschneidertag:github.com,2008:Repository/339507851/v0.0.22025-06-24T12:37:59Zv0.0.2<p>Add workload variants.</p>
<p>Add prize qualification logs for external tuning ruleset.<br>
Note: FastMRI trials with dropout are not yet added due to <a class="issue-link js-issue-link" data-error-text="Failed to load title" data-id="2162206939" data-permission-text="Title is private" data-url="https://github.com/mlcommons/algorithmic-efficiency/issues/664" data-hovercard-type="issue" data-hovercard-url="/mlcommons/algorithmic-efficiency/issues/664/hovercard" href="https://github.com/mlcommons/algorithmic-efficiency/issues/664">#664</a>.</p>
<p>Add functionality to Docker startup script for self_tuning ruleset.<br>
Add self_tuning ruleset option to script that runs all workloads for scoring.</p>
<p>Data setup fixes.</p>
<p>Fix tests that check training differences in PyTorch and JAX on GPU.</p>fsschneidertag:github.com,2008:Repository/339507851/v0.0.12025-06-24T12:37:29Zv0.0.1<p>First release of the AlgoPerf: Training algorithms benchmarking code.</p>fsschneider