Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add high-level operator interface #708

Merged
merged 1 commit into from
Aug 20, 2024

Conversation

metascroy
Copy link
Contributor

Summary:
Here is a draft of the high-level kernel operator. This is something I quickly put together to get early feedback, and it will be iterated. It closely follows what I had in the design doc, but I think it's easier to see how things fit together in the code.

The operator uses parallel_for and get_num_threads. At build time, we need to link these to ET/PT implementations. In this diff, they are single-threaded or OMP-backed implementations that I threw together.

The operator (inside channelwise_8bit_activation_groupwise_lowbit_weight.h) depends on a ukernel_config and tiling configs. But in the wrapper, these can be hidden if desired (see examples/ for how).

Reviewed By: digantdesai

Differential Revision: D60321449

Copy link

pytorch-bot bot commented Aug 19, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/708

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 72cfb21 with merge base ac8ce4c (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 19, 2024
@facebook-github-bot
Copy link

This pull request was exported from Phabricator. Differential Revision: D60321449

@facebook-github-bot
Copy link

This pull request was exported from Phabricator. Differential Revision: D60321449

metascroy added a commit to metascroy/ao that referenced this pull request Aug 19, 2024
Summary:
Pull Request resolved: pytorch#708

Here is a draft of the high-level kernel operator.  This is something I quickly put together to get early feedback, and it will be iterated.  It closely follows what I had in the design doc, but I think it's easier to see how things fit together in the code.

The operator uses parallel_for and get_num_threads.  At build time, we need to link these to ET/PT implementations.  In this diff, they are single-threaded or OMP-backed implementations that I threw together.

The operator (inside channelwise_8bit_activation_groupwise_lowbit_weight.h) depends on a ukernel_config and tiling configs.  But in the wrapper, these can be hidden if desired (see examples/ for how).

Reviewed By: digantdesai

Differential Revision: D60321449
Summary:
Pull Request resolved: pytorch#708

Here is a draft of the high-level kernel operator.  This is something I quickly put together to get early feedback, and it will be iterated.  It closely follows what I had in the design doc, but I think it's easier to see how things fit together in the code.

The operator uses parallel_for and get_num_threads.  At build time, we need to link these to ET/PT implementations.  In this diff, they are single-threaded or OMP-backed implementations that I threw together.

The operator (inside channelwise_8bit_activation_groupwise_lowbit_weight.h) depends on a ukernel_config and tiling configs.  But in the wrapper, these can be hidden if desired (see examples/ for how).

Reviewed By: digantdesai

Differential Revision: D60321449
@facebook-github-bot
Copy link

This pull request was exported from Phabricator. Differential Revision: D60321449

Copy link
Contributor

@jerryzh168 jerryzh168 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

stamping for experimental folder

@facebook-github-bot facebook-github-bot merged commit 1909171 into pytorch:main Aug 20, 2024
18 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants