
NaN issue using FP16 to load the model #93

Open

zitgit opened this issue Oct 25, 2024 · 0 comments


zitgit commented Oct 25, 2024

When I change the torch_dtype of the loading call from torch.bfloat16 to torch.float16, i.e.

model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True, device_map="sequential", torch_dtype=torch.float16)

inference no longer works: the activations contain NaN values. Is this a known issue?

Environment: 8× A100; transformers version 4.44.0
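
For reference, a minimal self-contained sketch of the failing path, with model_name as a placeholder for the actual checkpoint; the tokenizer call and test prompt are assumptions added for illustration:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "model_name"  # placeholder for the checkpoint used above
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    trust_remote_code=True,
    device_map="sequential",
    torch_dtype=torch.float16,  # loading with torch.bfloat16 instead does not NaN
)

# Run one forward pass and check every layer's hidden states for NaNs.
inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model(**inputs, output_hidden_states=True)
print(any(torch.isnan(h).any().item() for h in out.hidden_states))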
