[CPP Graph] Baichuan & Baichuan2 Enabling #376
Conversation
intel_extension_for_transformers/llm/runtime/graph/application/main_run.cpp
Please update the KERNEL_DEBUG settings as zhenwei raised during the meeting, and prepare a README.
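The actual KERNEL_DEBUG change requested here is not shown in this thread, but the usual pattern for this kind of setting is to gate per-kernel debug prints behind a compile-time flag so they vanish in release builds. A minimal sketch, with illustrative flag and macro names (not the project's real ones):

```cpp
#include <cstdio>

// Hypothetical sketch: gate debug prints behind a build flag.
// KDEBUG_LOG and kernel_debug_enabled are illustrative names only.
#ifndef KERNEL_DEBUG
#define KERNEL_DEBUG 0  // off by default; enable with -DKERNEL_DEBUG=1
#endif

#if KERNEL_DEBUG
#define KDEBUG_LOG(...) std::fprintf(stderr, __VA_ARGS__)
#else
#define KDEBUG_LOG(...) ((void)0)  // expands to nothing when disabled
#endif

// Small helper so callers can also branch on the flag at compile time.
constexpr bool kernel_debug_enabled() { return KERNEL_DEBUG != 0; }
```

With this shape, debug logging costs nothing in release builds because the disabled macro expands to a no-op expression.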
LGTM except for some comments.
intel_extension_for_transformers/llm/runtime/graph/models/baichuan/baichuan.h
intel_extension_for_transformers/llm/runtime/graph/models/baichuan/baichuan_utils.cpp
intel_extension_for_transformers/llm/runtime/graph/models/baichuan/baichuan_utils.cpp
It would be better to finish this in this PR.
I've removed this feature from this PR, because printing data info through a unified API is not something that can be solved with a few simple lines of code; it requires more effort.
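To illustrate what a unified data-printing API might eventually look like, here is a minimal sketch. The function name (`debug_summary`) and signature are hypothetical, not part of the project's actual API; a real implementation would need to handle the runtime's own tensor types, dtypes, and strides, which is why it was deferred:

```cpp
#include <sstream>
#include <string>
#include <vector>

// Hypothetical sketch of a unified tensor-summary printer.
// Takes a name, a shape, and a preview of the first n_preview values.
std::string debug_summary(const std::string& name,
                          const std::vector<int>& shape,
                          const float* data, size_t n_preview) {
  std::ostringstream os;
  os << name << " shape=[";
  for (size_t i = 0; i < shape.size(); ++i) {
    os << shape[i] << (i + 1 < shape.size() ? "," : "");
  }
  os << "] data=";
  for (size_t i = 0; i < n_preview; ++i) {
    os << data[i] << ' ';
  }
  return os.str();
}
```

Generalizing this across every dtype and layout the graph runtime supports is the part that takes real effort, which matches the decision to move it out of this PR.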
intel_extension_for_transformers/llm/runtime/graph/models/baichuan/baichuan_utils.cpp
Do we enable FFN fusion / MHA fusion in this PR?
No. MHA & FFN fusion for chatglm & baichuan will be applied in the next PR. I plan to add the models first this week.
LGTM
Type of Change
Feature addition and issue fixes
Description
Add extension test.
Expected Behavior & Potential Risk
N/A.
How has this PR been tested?
Manual and CI tests.
Dependency Change?
N/A.