Support qlora in CPU #9233
Conversation
@hzjane @Uxito-Ada please refer to this PR
model = prepare_model_for_kbit_training(model, use_gradient_checkpointing=False)
model.enable_input_require_grads()
Somehow both setting use_gradient_checkpointing=False and calling model.enable_input_require_grads() are necessary. I still have not figured out why.
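For reference, a minimal sketch of how the two calls fit together in a QLoRA fine-tuning setup, assuming the standard Hugging Face transformers/peft API rather than this repo's own QLoRA utilities; the model name and LoRA hyperparameters below are placeholders, not values taken from this PR:

# Sketch only: standard peft/transformers QLoRA preparation on CPU.
# Model name and LoRA settings are placeholders, not from this PR.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")

# Gradient checkpointing is disabled here (use_gradient_checkpointing=False)...
model = prepare_model_for_kbit_training(model, use_gradient_checkpointing=False)

# ...and the inputs must still be marked as requiring grads so that the frozen
# base model propagates gradients back to the LoRA adapters.
model.enable_input_require_grads()

lora_config = LoraConfig(
    r=8,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)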
@@ -71,8 +73,8 @@
     max_steps=200,
     learning_rate=2e-4,
     save_steps=100,
-    fp16=True,
     logging_steps=20,
+    bf16=True,
must use "bf16" in for cpu
LGTM
The ARC test is unstable. Merging it for now.
* support qlora in CPU
* revert example
* fix style
Description
Support qlora in CPU