[ModelSuite] Add model loading infrastructure #182

PaliC · 2025-10-02T08:21:54Z

Model Registration

This PR creates a way of adding models to the suite and automatically validates them through CI. It also loads the models. The way these models are added is detailed in this readme. The tl;dr is that we use a format similar to kernelbench and SakanaAI/robust-kbench, where we pair model code with a config. Importantly, the configs contain initialization code, forward pass arguments (both in a format similar to torchbench), and a list of the ops in the forward and backward passes. These ops matter because they are what we want to point out to the researcher when they are optimizing a model. There is a README.md to help folks set up proper model code / configs.
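To make the model-plus-config pairing concrete, here is a minimal sketch of what one config could look like and how the op list might be pulled out for reporting. Every key name (`model_name`, `init_args`, `forward_args`, `ops`), the op name string, and the helper `ops_to_report` are illustrative assumptions; the actual schema is whatever the README in this PR defines.

```python
import json

# Hypothetical config; the key names and values are assumptions made for
# illustration and are not the schema defined in this PR.
EXAMPLE_CONFIG = {
    "model_name": "SmokeTestModel",
    "init_args": {"input_dim": 128, "hidden_dim": 64},
    "forward_args": [{"shape": [4, 128], "dtype": "float32"}],
    "ops": {
        "forward": ["aten.mm.default"],
        "backward": ["aten.mm.default"],
    },
}


def ops_to_report(config):
    """Return the sorted, de-duplicated op names across both passes."""
    ops = config["ops"]
    return sorted(set(ops["forward"]) | set(ops["backward"]))


# The config is plain data, so it survives a JSON round trip unchanged.
config = json.loads(json.dumps(EXAMPLE_CONFIG))
print(ops_to_report(config))
```

Keeping the config as plain data means it can sit next to the model code as a JSON file and be checked without importing the model itself.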

We also further verify these registrations are correct through CI. Specifically we run test/test_model_ops_configs.py to ensure the configs are formatted correctly.
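As a rough idea of the kind of format check such a test could perform, the sketch below validates a config dictionary and returns a list of problems. It is not the contents of test/test_model_ops_configs.py; the required keys and the function name `validate_config` are assumptions chosen for illustration.

```python
# Assumed set of required top-level keys; the real schema may differ.
REQUIRED_KEYS = {"init_args", "forward_args", "ops"}


def validate_config(config):
    """Return a list of error strings; an empty list means the config passed."""
    errors = [f"missing key: {key}" for key in sorted(REQUIRED_KEYS - set(config))]
    ops = config.get("ops", {})
    if not isinstance(ops, dict):
        errors.append("ops must be a mapping")
        return errors
    for phase in ("forward", "backward"):
        value = ops.get(phase)
        is_name_list = isinstance(value, list) and all(isinstance(o, str) for o in value)
        if not is_name_list:
            errors.append(f"ops.{phase} must be a list of op names")
    return errors


GOOD = {
    "init_args": {},
    "forward_args": [],
    "ops": {"forward": ["aten.mm.default"], "backward": ["aten.mm.default"]},
}
# Same config with the backward op list left out.
BAD = {"init_args": {}, "forward_args": [], "ops": {"forward": ["aten.mm.default"]}}

print(validate_config(GOOD))
print(validate_config(BAD))
```

Returning every problem at once, instead of raising on the first one, tends to give more useful CI output when a config has several mistakes.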

Small Things

Added a --model-filter option to the CLI. Model suite needs it because it chooses what to test based on the model rather than on a set of ops.
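A minimal sketch of how filtering by model name could work is shown below. Only the flag name --model-filter comes from this PR; the comma-separated value format, the use of argparse as a stand-in for the project's CLI, and the helpers `parse_model_filter` and `select_models` are assumptions.

```python
import argparse


def parse_model_filter(argv):
    """Parse --model-filter into a list of names, or None when it is absent."""
    parser = argparse.ArgumentParser()
    # Assumed value format: comma-separated model names.
    parser.add_argument("--model-filter", default=None)
    args = parser.parse_args(argv)
    if args.model_filter is None:
        return None
    return [name.strip() for name in args.model_filter.split(",") if name.strip()]


def select_models(available, wanted):
    """Keep only the requested models, preserving the order of `available`."""
    if wanted is None:
        return list(available)
    unknown = [name for name in wanted if name not in available]
    if unknown:
        raise ValueError(f"unknown models: {unknown}")
    return [name for name in available if name in wanted]


wanted = parse_model_filter(["--model-filter", "ToyCoreOpsModel"])
print(select_models(["SmokeTestModel", "ToyCoreOpsModel"], wanted))
```

Failing loudly on an unknown model name avoids a silent run that tests nothing because of a typo in the filter.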

Testing

New tests are added, so running pytest covers the changes here.

Future work with Model Suite

#181

Stack created with Sapling. Best reviewed with ReviewStack.

Summary: Here we introduce model suite (model.py). The idea is to start codifying the ideas from jiannanWang/BackendBenchExamples. Specifically, this PR adds some example models / configs to be loaded, plus a README. (It may be useful to look at the PR above this as well, since it contains the model loading logic.)

This PR adds two toy models to model suite:

- SmokeTestModel: a simple model that uses aten.ops.mm, as we can implement a correct version of this op.
- ToyCoreOpsModel: a model which explicitly calls the backward passes that are in both torchbench + core.

Test Plan: the test infra is in the PR above, so tests passing on the PR above should be sufficient here.

Future work with Model Suite: #181

msaroufim · 2025-10-03T17:38:22Z

This stack-based view is weird: the line count seems to increase monotonically, making each PR harder to review than the last, whereas ghstack tends to show only the specific diff.

PaliC · 2025-10-12T14:36:30Z

@msaroufim They are both buggy in their own ways. My understanding is that sapling is cleaner for landing things; I'm not really sure how well ghstack land works, as I've never seen anyone use it. Though if you have bot support / are using pytorch/pytorch, just using ghstack is the move. (Or pay for graphite, which is apparently nice :P)

meta-cla · 2025-10-17T00:26:06Z

Hi @PaliC!

Thank you for your pull request.

We require contributors to sign our Contributor License Agreement, and yours needs attention.

You currently have a record in our system, but the CLA is no longer valid, and will need to be resubmitted.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at cla@meta.com. Thanks!

This was referenced Oct 2, 2025

[ModelSuite] Add Toy Models #183

Open

[Model Suite] Add model correctness testing #185

Open

[ModelSuite] Refactor TorchBench for ModelSuite inheritance #180

Open

[ModelSuite] Add model ops coverage validation test #184

Open

meta-cla bot added the CLA Signed label Oct 2, 2025

PaliC added 2 commits October 2, 2025 08:29

PaliC force-pushed the pr182 branch from 5297f23 to 6e6334b Compare October 2, 2025 08:29

PaliC marked this pull request as ready for review October 2, 2025 08:33

PaliC requested review from jiannanWang and msaroufim October 2, 2025 09:54

PaliC mentioned this pull request Oct 2, 2025

[WIP] [ModelSuite] Add Performace Testing #186

Draft
