Description
When I use torch.compile with certain models, compilation fails with the following error:
BackendCompilerFailed: backend='inductor' raised:
AssertionError: torch.compile is currently being used with tensor subclass inputs:
<class 'torch._subclasses.fake_tensor.FakeTensor'>,<class 'torch._subclasses.fake_tensor.FakeTensor'>,<class 'torchao.quantization.linear_activation_quantized_tensor.LinearActivationQuantizedTensor'>,<class 'torch._subclasses.fake_tensor.FakeTensor'>,<class 'torchao.quantization.linear_activation_quantized_tensor.LinearActivationQuantizedTensor'>,<class 'torch._subclasses.fake_tensor.FakeTensor'>,<class 'torch._subclasses.fake_tensor.FakeTensor'>,<class 'torch._subclasses.fake_tensor.FakeTensor'>,<class 'torch._subclasses.fake_tensor.FakeTensor'>,<class 'torch._subclasses.fake_tensor.FakeTensor'>,<class 'torch._subclasses.fake_tensor.FakeTensor'>. We are attempting to a compile a graph with two graph outputs
that alias one another, which is currently unsupported in the subclass use case. If you run into this,
please file a github issue
Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information
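For anyone reproducing this, the two debug variables named in the last line of the error can also be set from Python instead of the shell. This is only a sketch: the helper name `enable_dynamo_debug_logging` is made up for illustration, and it assumes the variables are set before `import torch` runs, which is the safe order because torch reads them during import.

```python
import os


def enable_dynamo_debug_logging(env=os.environ):
    """Set the two variables named in the error message (hypothetical helper)."""
    env["TORCH_LOGS"] = "+dynamo"        # verbose Dynamo logging
    env["TORCHDYNAMO_VERBOSE"] = "1"     # fuller tracebacks from Dynamo
    return env


enable_dynamo_debug_logging()
# import torch  # import torch only after the variables above are set
```

Passing a plain dict instead of `os.environ` leaves the real environment untouched, which is handy for checking the values without side effects.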
The inputs include LinearActivationQuantizedTensor and FakeTensor, which are both tensor subclasses. torch.compile does not appear to support them in this case (two graph outputs that alias one another), which raises the error.
The model compiles correctly if the errors are suppressed with the following:
import torch._dynamo
torch._dynamo.config.suppress_errors = True