Conversation

@xin3he xin3he (Contributor) commented Nov 14, 2025

User description

Type of Change

Feature

Description

Feature: support AutoRound target_bits and autotune bits.

Expected Behavior & Potential Risk

  • Automatically generate a mixed-precision model
  • Autotune target_bits based on the evaluation result.
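The autotune behavior described above can be pictured as a search over candidate target_bits values, keeping the one with the best evaluation score. A minimal toy sketch with a stand-in evaluation function (not the actual neural-compressor API):

```python
# Toy sketch of autotuning target_bits: quantize-and-evaluate each
# candidate (evaluation is a stand-in metric here) and keep the best.

def evaluate(target_bits: float) -> float:
    # Stand-in metric: pretend accuracy degrades as bits shrink.
    return 1.0 - 0.05 * (8.0 - target_bits)

def autotune_target_bits(candidates):
    best_bits, best_score = None, float("-inf")
    for bits in candidates:
        score = evaluate(bits)
        if score > best_score:
            best_bits, best_score = bits, score
    return best_bits, best_score

best_bits, best_score = autotune_target_bits([4.0, 5.0, 6.0])
print(best_bits)  # 6.0 scores highest under this toy metric
```

In the real flow, each candidate would trigger an AutoRound quantization run followed by the user-supplied evaluation.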

How has this PR been tested?

UT pass

Dependency Change?

AutoRound >= 0.9.0


PR Type

Enhancement


Description

  • Add support for tuning target_bits

  • Introduce new parameters for auto scheme configuration

  • Update preprocessing in autotune function
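A mixed-precision target_bits can be read as a budget on the average bit-width across quantized layers. A hypothetical helper (names are illustrative, not the AutoRound implementation) that checks a per-layer assignment against such a budget:

```python
# Hypothetical check that a per-layer bit assignment stays within an
# average-bits budget, weighted by parameter count per layer.

def average_bits(layer_bits, layer_params):
    total_bits = sum(b * n for b, n in zip(layer_bits, layer_params))
    return total_bits / sum(layer_params)

bits = [8, 4, 4]          # per-layer bit-widths
params = [100, 300, 600]  # per-layer parameter counts

avg = average_bits(bits, params)
print(avg)  # (800 + 1200 + 2400) / 1000 = 4.4
```

An auto scheme would then search for the assignment that meets the budget while minimizing accuracy loss.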


Diagram Walkthrough

flowchart LR
  A["Add target_bits"] -- "New parameter" --> B["Update __init__"]
  B -- "Include new parameters" --> C["Modify autotune"]
  C -- "Preprocess model and config" --> D["Enhance AutoRoundConfig"]

File Walkthrough

Relevant files

Enhancement (3 files)
  config.py (+41/−4): Add target_bits and auto scheme parameters
  autotune.py (+13/−1): Preprocess model and quant config
  base_config.py (+3/−3): Update parameter handling

Additional files (4 files)
  tuning_param.py (+1/−1)
  autoround.py (+57/−1)
  algorithm_entry.py (+16/−1)
  test_autoround.py (+75/−4)

@PRAgent4INC (Collaborator)

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 4 🔵🔵🔵🔵⚪
🧪 PR contains tests
🔒 No security concerns identified
⚡ Recommended focus areas for review

Default Value

The target_bits parameter is set to None by default, which might lead to unexpected behavior if not explicitly set by the user.

options: Union[str, list[Union[str]], tuple[Union[str], ...]] = ("MXFP4", "MXFP8"),
Import Statement

The import statement for AutoScheme is inside the convert method, which can lead to increased load times and potential circular import issues.

if self.target_bits is not None:
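A common compromise when a module-level import would be circular is to cache the import on first use, so the cost is paid once rather than on every call. A generic sketch of that pattern (the 'json' module stands in for the real dependency such as auto_round):

```python
# Generic lazy-import pattern: defer a heavy or circular dependency to
# first use, but cache it so repeated calls don't re-run import machinery.
import importlib

_autoscheme_mod = None

def get_auto_scheme_mod():
    """Import the dependency once, on first use, and cache it."""
    global _autoscheme_mod
    if _autoscheme_mod is None:
        # "json" is a stand-in for the real module (e.g. auto_round).
        _autoscheme_mod = importlib.import_module("json")
    return _autoscheme_mod

mod = get_auto_scheme_mod()
print(mod is get_auto_scheme_mod())  # cached: same module object
```

This keeps the dependency explicit in one place while still avoiding import-time cycles.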
Hardcoded Op Type

The dump_model_op_stats function currently only collects statistics for the "Linear" op type. This might need to be extended to support other types of operations.

"""Dump quantizable ops stats of model to user."""
# TODO: collect more ops besides Linear
res = {}
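Extending the stats collection beyond Linear could be as simple as tallying op types against a configurable set. A toy sketch over (name, op_type) pairs standing in for real torch modules (the quantizable set here is illustrative):

```python
# Toy op-stats collector: tally quantizable ops by type instead of
# hardcoding "Linear". The pair list stands in for model.named_modules().
from collections import Counter

QUANTIZABLE_TYPES = {"Linear", "Conv2d", "Embedding"}  # illustrative set

def dump_op_stats(named_modules):
    counts = Counter(
        op_type for _name, op_type in named_modules
        if op_type in QUANTIZABLE_TYPES
    )
    return dict(counts)

modules = [
    ("model.embed", "Embedding"),
    ("model.fc1", "Linear"),
    ("model.fc2", "Linear"),
    ("model.norm", "LayerNorm"),  # not in the quantizable set
]
print(dump_op_stats(modules))  # {'Embedding': 1, 'Linear': 2}
```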

@PRAgent4INC (Collaborator)

PR Code Suggestions ✨

@xin3he xin3he requested review from Kaihui-intel, Copilot and yiliu30 and removed request for Copilot and yiliu30 November 14, 2025 06:28

Copilot AI left a comment


Pull Request Overview

This PR adds support for tuning target_bits in the AutoRound quantization configuration, enabling automatic mixed-precision model generation and autotuning of target bits based on evaluation results.

Key changes:

  • Added new target_bits parameter and auto scheme configuration options to AutoRoundConfig
  • Implemented preprocessing of model and quantization config in the autotune function to handle tokenizer attributes
  • Updated parameter handling in base_config.py to use type annotations instead of default values
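The idea of driving parameter expansion from type annotations rather than default values can be sketched with typing.get_type_hints; this is a generic illustration of the technique, not the actual base_config.py code, and DemoConfig is a made-up class:

```python
# Sketch: inspect a config class via type hints rather than instance
# defaults, so a None default doesn't hide the intended (list) type.
from typing import List, Optional, get_origin, get_type_hints

class DemoConfig:
    target_bits: Optional[List[float]] = None
    iters: int = 200

def expandable_params(cls):
    """Mark parameters whose annotation is list-like as expandable."""
    hints = get_type_hints(cls)
    out = {}
    for name, hint in hints.items():
        # Optional[List[float]] is Union[List[float], None]; unwrap it.
        args = getattr(hint, "__args__", ())
        is_listy = get_origin(hint) is list or any(
            get_origin(a) is list for a in args
        )
        out[name] = is_listy
    return out

print(expandable_params(DemoConfig))  # {'target_bits': True, 'iters': False}
```

With defaults alone, `target_bits = None` gives no hint that a list of candidates is expected; the annotation does.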

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 3 comments.

Show a summary per file
config.py: Added target_bits and related auto scheme parameters to AutoRoundConfig for mixed-precision support
autotune.py: Introduced preprocessing function to handle tokenizer attributes before quantization
base_config.py: Modified parameter expansion logic to use type annotations
tuning_param.py: Moved model creation inside the try/except block for better error handling
autoround.py: Implemented auto scheme creation when target_bits is set and added statistics dumping
algorithm_entry.py: Passed new auto scheme parameters to the quantizer
test_autoround.py: Added tests for target_bits functionality and moved imports to module level
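Detaching a non-module attribute such as a tokenizer before quantization (and restoring it afterward) keeps it out of model copying and traversal. A toy version of that preprocessing idea, with hypothetical names rather than the PR's actual helper:

```python
# Toy preprocessing: temporarily detach a `tokenizer` attribute so it
# doesn't travel through model copying/quantization, then restore it.
from contextlib import contextmanager

class ToyModel:
    def __init__(self):
        self.tokenizer = "fake-tokenizer"

@contextmanager
def detached_tokenizer(model):
    tok = getattr(model, "tokenizer", None)
    if tok is not None:
        delattr(model, "tokenizer")
    try:
        yield tok
    finally:
        if tok is not None:
            model.tokenizer = tok  # restore even if quantization raised

m = ToyModel()
with detached_tokenizer(m) as tok:
    inside = hasattr(m, "tokenizer")  # False while "quantizing"
print(inside, m.tokenizer)
```

The context manager guarantees restoration even when the quantization step raises.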


@xin3he xin3he added this to the 3.7 milestone Nov 14, 2025

@yiliu30 yiliu30 left a comment


LGTM

@chensuyue chensuyue merged commit a03e6d0 into master Nov 19, 2025
20 of 25 checks passed
@chensuyue chensuyue deleted the xinhe/target_bits branch November 19, 2025 08:22

6 participants