
Conversation


@linsj20 linsj20 commented Jan 26, 2024

No description provided.

@KKZ20 KKZ20 merged commit a727ba9 into KKZ20:feature/sequence_parallel_optimization Jan 26, 2024
KKZ20 pushed a commit that referenced this pull request Feb 7, 2024
* [shardformer] add megatron sp to llama

* support llama7B 128k with distributed attention

* [shardformer] robustness enhancement

* add block attn

* sp mode 1: keep input as a complete sequence

* fix sp compatibility

* refactor ring implementation

* support mode 2 sp in gpt2
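
The commit message above mentions Megatron-style sequence parallelism (SP) and distributed attention over very long sequences. As illustration only, and not the actual shardformer code merged here, the sketch below shows an all-gather variant of sequence-parallel attention: each rank holds a shard of the token dimension and gathers the other ranks' K/V before attending. The function name `sp_attention`, the tensor layout, and the choice of the all-gather variant (rather than the PR's specific mode 1 / mode 2 or ring implementation) are assumptions made for the example.

```python
# Illustrative sketch of sequence-parallel attention via K/V all-gather.
# NOT the shardformer implementation from this PR; names and layout are assumed.
import torch
import torch.distributed as dist
import torch.nn.functional as F

def sp_attention(q_local, k_local, v_local, sp_group):
    """q/k/v_local: [batch, local_seq, heads, dim] shards of the full sequence."""
    world_size = dist.get_world_size(sp_group)

    # Each rank keeps only its own query shard, but needs every rank's K/V
    # to attend over the complete sequence.
    k_list = [torch.empty_like(k_local) for _ in range(world_size)]
    v_list = [torch.empty_like(v_local) for _ in range(world_size)]
    dist.all_gather(k_list, k_local, group=sp_group)
    dist.all_gather(v_list, v_local, group=sp_group)
    k_full = torch.cat(k_list, dim=1)
    v_full = torch.cat(v_list, dim=1)

    # Move to [batch, heads, seq, dim], the layout scaled_dot_product_attention expects.
    q, k, v = (t.transpose(1, 2) for t in (q_local, k_full, v_full))
    out = F.scaled_dot_product_attention(q, k, v)
    return out.transpose(1, 2)  # back to [batch, local_seq, heads, dim]
```

A ring-style implementation, as referenced by "refactor ring implementation", avoids materializing the full K/V on every rank by passing K/V shards around the process group and accumulating partial attention results instead.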
KKZ20 pushed a commit that referenced this pull request Feb 14, 2024
KKZ20 pushed a commit that referenced this pull request Mar 27, 2024

Both commits carry the same message as the Feb 7, 2024 commit above.