Skip to content

Conversation

@Kaihui-intel
Copy link
Contributor

@Kaihui-intel Kaihui-intel commented Oct 17, 2023

Type of Change

feature

Description

added

  • WoqTuningParams(Enum)
  • WeightOnlyQuantSampler
  • Woq tuning will be at the beginning of the basic strategy or auto strategy

Usage

  1. Currently, it's only available for the ORT backend with approach is weight_only.
  2. It will be triggered when a) quant_level is auto or b) quant_level is 1 && strategy is basic.
  3. algorithm is not set in op_type_dict
# config example
conf = PostTrainingQuantConfig(
    approach="weight_only",
    quant_level="auto",
)
  • Strategy support woq tuning
  • ORT backend ready
  • Docs, The document will be updated in another PR

Expected Behavior & Potential Risk

How has this PR been tested?

Pre-CI

Dependency Change?

None

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>
Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>
@Kaihui-intel Kaihui-intel changed the title Support the tuning of WOQ algo in strategy layer Support the tuning of WOQ algorithm in strategy layer Oct 17, 2023
@chensuyue chensuyue added this to the v2.4 milestone Oct 19, 2023
yuwenzho and others added 4 commits October 19, 2023 10:41
Signed-off-by: yuwenzho <yuwen.zhou@intel.com>
Signed-off-by: yuwenzho <yuwen.zhou@intel.com>
@yuwenzho yuwenzho requested a review from mengniwang95 October 19, 2023 02:49
@yiliu30 yiliu30 changed the title Support the tuning of WOQ algorithm in strategy layer Enable the tuning of WOQ algorithm Oct 19, 2023
yuwenzho and others added 2 commits October 20, 2023 10:06
@yiliu30 yiliu30 merged commit 6d4ea5b into master Oct 20, 2023
@yiliu30 yiliu30 deleted the kaihui/wo_tuning branch October 20, 2023 08:57
bmyrcha pushed a commit that referenced this pull request Oct 24, 2023
* support WOQ algos tuning
---------

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>
Signed-off-by: yuwenzho <yuwen.zhou@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: yuwenzho <yuwen.zhou@intel.com>
Signed-off-by: bmyrcha <bartosz.myrcha@intel.com>
mengniwang95 pushed a commit that referenced this pull request Nov 20, 2023
* support WOQ algos tuning
---------

Signed-off-by: Kaihui-intel <kaihui.tang@intel.com>
Signed-off-by: yuwenzho <yuwen.zhou@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: yuwenzho <yuwen.zhou@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants