Skip to content

Conversation

@wenhuach21
Copy link
Contributor

No description provided.

@wenhuach21 wenhuach21 changed the title [WIP]Refine inference code [High Risk]Refine inference code Sep 22, 2025
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This is a high-risk refactoring of the inference code that modernizes the backend selection system and improves code maintainability. The PR removes legacy features and refactors the quantization layer replacement logic.

  • Removes deprecated clip parameter from quantization schemes
  • Refactors backend selection to use structured packing formats and simplifies the layer replacement logic
  • Updates backend configuration with more consistent naming and format specifications

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.

Show a summary per file
File Description
auto_round/schemes.py Removes deprecated clip parameter from QuantizationScheme
auto_round/inference/convert_model.py Major refactoring of model conversion logic with simplified backend selection and updated function signatures
auto_round/inference/backend.py Updates backend configurations with new packing format structure and modernized type hints
auto_round/compressors/base.py Removes warning about unsupported auto_round format loading
auto_round/main.py Updates condition for model evaluation to check format type

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

wenhuach21 and others added 3 commits September 23, 2025 14:32
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@wenhuach21 wenhuach21 merged commit 162424d into main Sep 23, 2025
8 checks passed
@wenhuach21 wenhuach21 deleted the refine_inference branch September 23, 2025 12:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants