You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
target_positions used to be time_step, but after PR apple#995, it now represents the
actual target positions with shape [batch, step_len].
apple#995
Updating the GPU decoding code to align with this change.
CI did not cover GPU unit tests.
TEST=test_extend_step10 of axlearn/common/flash_attention/layer_test.py in GPU
0 commit comments