This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

llama 70b AutoTP inference #202

Merged 2 commits on Aug 31, 2023

Conversation

sywangyi (Contributor)

Type of Change

feature or bug fix or documentation or others
API changed or not

Description

detailed description
JIRA ticket: xxx

Expected Behavior & Potential Risk

the expected behavior triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

sywangyi marked this pull request as draft on August 31, 2023 at 01:50
sywangyi force-pushed the llama_70b_hpu_infer branch 2 times, most recently from ff234d2 to 730efb1, on August 31, 2023 at 06:18
sywangyi marked this pull request as ready for review on August 31, 2023 at 06:22
sywangyi (Contributor, Author)

@jiafuzha

sywangyi requested a review from lvliang-intel on August 31, 2023 at 06:22
sywangyi (Contributor, Author)

@lkk12014402

hshen14 merged commit 1c6458a into main on Aug 31, 2023
hshen14 deleted the llama_70b_hpu_infer branch on August 31, 2023 at 14:19
lvliang-intel pushed a commit that referenced this pull request Aug 31, 2023
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
lvliang-intel pushed a commit that referenced this pull request Sep 3, 2023
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

4 participants