JIT compilation is currently slow because we rely on a huge header,
`<torch/extension.h>`, which is too heavy for our use case.
This PR refactors the codebase to include only the headers necessary for
pybind, and moves most torch runtime API calls from C++ to Python.
The compilation time was reduced from 48 seconds to 18 seconds for
lightweight operators such as norm.
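
As a rough illustration of the split described above (not the code from
this PR), the sketch below keeps argument checks and output allocation in
Python and hands only flat buffers to the kernel. `rmsnorm_kernel` is a
hypothetical pure-Python stand-in for what would be a compiled function
exposed through pybind; the function names, the RMS-norm formula, and the
`eps` default are all assumptions made for the example.

```python
import math
from array import array


def rmsnorm_kernel(x, weight, out, rows, cols, eps):
    """Hypothetical stand-in for a compiled kernel: it only sees flat buffers."""
    for r in range(rows):
        row = x[r * cols:(r + 1) * cols]
        # Reciprocal root-mean-square of the row (assumed norm definition).
        inv_rms = 1.0 / math.sqrt(sum(v * v for v in row) / cols + eps)
        for c in range(cols):
            out[r * cols + c] = row[c] * inv_rms * weight[c]


def rmsnorm(x, weight, rows, cols, eps=1e-6):
    """Python-side wrapper: validation and allocation happen here, not in C++."""
    if len(x) != rows * cols:
        raise ValueError("x must hold rows * cols elements")
    if len(weight) != cols:
        raise ValueError("weight must hold cols elements")
    out = array("d", [0.0] * (rows * cols))  # output buffer allocated in Python
    rmsnorm_kernel(x, weight, out, rows, cols, eps)
    return out
```

With this shape, the compiled side needs no tensor-level headers for checks
or allocation, which is the kind of dependency the refactor aims to drop.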