Readme Update for FSDP #980
Conversation
Updating the README to indicate flash attention support for Llama2 70B with FSDP. This improves both performance and memory footprint.
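For readers landing on this thread, below is a minimal, hardware-agnostic sketch of the attention pattern the README change documents. It uses PyTorch's generic `scaled_dot_product_attention` as a stand-in for the fused/flash SDPA kernel on Gaudi; the module name (`ToyCausalAttention`) and sizes are illustrative assumptions, and the FSDP sharding used in the real example (applied on top of blocks like this via the trainer's FSDP settings) is omitted here.

```python
# Minimal sketch only -- not the optimum-habana implementation.
# A toy causal self-attention block using PyTorch's fused
# scaled_dot_product_attention as a stand-in for the flash/fused
# SDPA kernel used on Gaudi.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ToyCausalAttention(nn.Module):
    def __init__(self, dim: int = 512, heads: int = 8):
        super().__init__()
        self.dim, self.heads = dim, heads
        self.qkv = nn.Linear(dim, 3 * dim)
        self.proj = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Reshape to (batch, heads, seq, head_dim).
        q, k, v = (y.view(b, t, self.heads, -1).transpose(1, 2) for y in (q, k, v))
        # A fused/flash kernel computes attention without materializing the
        # full (seq x seq) score matrix, which is where the memory-footprint
        # reduction comes from; larger models and sequences benefit the most.
        out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
        return self.proj(out.transpose(1, 2).reshape(b, t, self.dim))


if __name__ == "__main__":
    x = torch.randn(2, 128, 512)
    print(ToyCausalAttention()(x).shape)  # torch.Size([2, 128, 512])
```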
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
LGTM.
@regisss Please review this README change and merge it when we transition to Synapse 1.16.0. We have tested this change on G2 with Synapse 1.16.0, and it gives a small boost in performance.
Does this work with 1.15?
@regisss This is for 1.16; it does not work properly for 1.15 |
Fused SDPA leads to a small drop in performance for the 7B model, so I am updating the test case accordingly. For the 70B model we see a performance improvement.
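The size-dependent tradeoff mentioned above can be explored with a rough timing sketch like the one below. It is only an illustration of how such a crossover might be measured: it uses PyTorch's generic SDPA rather than the Gaudi kernels, and the helper names (`naive_attention`, `avg_time`) and shapes are assumptions, not the benchmark behind this PR.

```python
# Rough, illustrative timing sketch (not the benchmark used for this PR).
# Compares naive attention, which materializes the full score matrix, with
# PyTorch's fused scaled_dot_product_attention.
import time
import torch
import torch.nn.functional as F


def naive_attention(q, k, v):
    scores = (q @ k.transpose(-2, -1)) / (q.shape[-1] ** 0.5)
    return torch.softmax(scores, dim=-1) @ v


def avg_time(fn, *args, iters: int = 5) -> float:
    fn(*args)  # warm-up
    start = time.perf_counter()
    for _ in range(iters):
        fn(*args)
    return (time.perf_counter() - start) / iters


if __name__ == "__main__":
    for seq in (256, 1024, 2048):
        q, k, v = (torch.randn(1, 8, seq, 128) for _ in range(3))
        print(
            f"seq={seq:5d}  naive={avg_time(naive_attention, q, k, v):.4f}s"
            f"  fused={avg_time(F.scaled_dot_product_attention, q, k, v):.4f}s"
        )
    # Whether the fused path wins depends on shapes and hardware, which is
    # consistent with the 7B regression vs. 70B improvement reported above.
```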
@regisss Can we merge this now?