llama 70b AutoTP inference #202

sywangyi · 2023-08-31T01:50:36Z

Type of Change

feature or bug fix or documentation or others
API changed or not

Description

detail description
JIRA ticket: xxx

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

sywangyi · 2023-08-31T06:22:29Z

@jiafuzha

sywangyi · 2023-08-31T06:47:06Z

@lkk12014402

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

sywangyi marked this pull request as draft August 31, 2023 01:50

sywangyi force-pushed the llama_70b_hpu_infer branch 2 times, most recently from ff234d2 to 730efb1 Compare August 31, 2023 06:18

sywangyi added 2 commits August 31, 2023 11:50

llama 70b AutoTP inference

730efb1

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

Merge branch 'main' into llama_70b_hpu_infer

aa16f38

sywangyi marked this pull request as ready for review August 31, 2023 06:22

sywangyi requested a review from lvliang-intel August 31, 2023 06:22

lvliang-intel approved these changes Aug 31, 2023

View reviewed changes

lkk12014402 approved these changes Aug 31, 2023

View reviewed changes

hshen14 merged commit 1c6458a into main Aug 31, 2023

hshen14 deleted the llama_70b_hpu_infer branch August 31, 2023 14:19

lvliang-intel pushed a commit that referenced this pull request Aug 31, 2023

llama 70b AutoTP inference (#202)

6261493

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

lvliang-intel pushed a commit that referenced this pull request Sep 3, 2023

llama 70b AutoTP inference (#202)

9e71158

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama 70b AutoTP inference #202

llama 70b AutoTP inference #202

sywangyi commented Aug 31, 2023

sywangyi commented Aug 31, 2023

sywangyi commented Aug 31, 2023

llama 70b AutoTP inference #202

llama 70b AutoTP inference #202

Conversation

sywangyi commented Aug 31, 2023

Type of Change

Description

Expected Behavior & Potential Risk

How has this PR been tested?

Dependency Change?

sywangyi commented Aug 31, 2023

sywangyi commented Aug 31, 2023