Skip to content

Commit

Permalink
fix: return the out tensor rather then the functions return value (#2361
Browse files Browse the repository at this point in the history
)
  • Loading branch information
drbh authored Aug 6, 2024
1 parent dd47a3d commit 29b8d19
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions server/text_generation_server/layers/attention/cuda.py
Original file line number Diff line number Diff line change
Expand Up @@ -292,8 +292,7 @@ def attention(
)

out = torch.empty_like(q)

return flash_attn_cuda.fwd(
flash_attn_cuda.fwd(
q,
k,
v,
Expand All @@ -309,4 +308,5 @@ def attention(
False,
0,
None,
)[0]
)
return out

0 comments on commit 29b8d19

Please sign in to comment.