set all fusedrope inputs to bf16 #140
Conversation
@skaulintel, what is the reason for hard-casting the data type to bf16? What happens when --bf16 is not used in the model's command?
+1
@mandy-li In order to calculate RoPE in bf16, we have to cast all inputs to bf16. From my analysis, when we pass --bf16 True to the training script while running llama-7b (examples/language-modeling/run_lora_clm.py), I see the following dtypes passed to apply_customized_rope: q: torch.float32, k: torch.bfloat16, cos: torch.float32, sin: torch.float32. Since the query is passed in as float32, it would not be computed in bf16, while the key states would be. @kbinias please review.
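For illustration, here is a minimal sketch of the uniform cast being discussed. It uses the standard (unfused) RoPE formulation rather than the repository's actual apply_customized_rope helper, and the function name apply_rope_bf16 is hypothetical:

```python
import torch

def rotate_half(x):
    # Standard RoPE helper: swap and negate the two halves of the last dim.
    x1, x2 = x.chunk(2, dim=-1)
    return torch.cat((-x2, x1), dim=-1)

def apply_rope_bf16(q, k, cos, sin):
    # Cast every input to bf16 up front so the rotation is computed
    # uniformly in bf16, regardless of the mixed dtypes the caller passes
    # (e.g. q in fp32 but k in bf16, as observed above).
    q, k = q.to(torch.bfloat16), k.to(torch.bfloat16)
    cos, sin = cos.to(torch.bfloat16), sin.to(torch.bfloat16)
    q_rot = (q * cos) + (rotate_half(q) * sin)
    k_rot = (k * cos) + (rotate_half(k) * sin)
    return q_rot, k_rot
```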
When passing --bf16 to the train/inference command, ensure all input tensors are cast to bf16.
Change the fused RoPE input type to bf16 for Llama models.
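As a quick sanity check, one could feed the apply_rope_bf16 sketch above tensors mirroring the mixed dtypes reported in the conversation and confirm that both outputs come back as bf16 (the shapes here are illustrative only, not taken from the model):

```python
import torch

# Illustrative shapes: (batch, heads, seq_len, head_dim) for q/k,
# with cos/sin broadcasting over the batch and head dimensions.
q = torch.randn(1, 8, 16, 64, dtype=torch.float32)
k = torch.randn(1, 8, 16, 64, dtype=torch.bfloat16)
cos = torch.randn(16, 64, dtype=torch.float32)
sin = torch.randn(16, 64, dtype=torch.float32)

q_rot, k_rot = apply_rope_bf16(q, k, cos, sin)
assert q_rot.dtype == k_rot.dtype == torch.bfloat16
```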