Skip to content

✨[Feature] Reducing Overhead with C++ Torchbind operation getting called up to Python #3942

@narendasan

Description

@narendasan

Is your feature request related to a problem? Please describe.

We are seeing that Torchbind operators from the C++ runtime getting called into Python in order to dispatch.

Describe the solution you'd like

We want to run in C++ without going back to python.

Potential solutions would be registering as a CUDA op or can we reexport so that we dont need to be lifted into python and we run more like what happens in AOTInductor or we can switch to an executorch style integration rather than torchbind

Describe alternatives you've considered

Additional context

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions