Aanuf/fix for asym by andreyanufr · Pull Request #4074 · openvinotoolkit/nncf

andreyanufr · 2026-05-15T14:29:58Z

Changes

Fixed compression range for asymmetric compression if all values are positive or negative.

Reason for changes

For vector [-22. -21. -20. -19. -18. -17. -16. -15.] current implementation gives decompressed values after integer_quantize_dequantize_weight(..) equal to [-7. -7. -7. -7. -7. -7. -7. -7. ] bacause zero_point before clamp equal to
-22 / scale = -22 * 255/(-15 + 22) = 804 and after clamp is 0, but min value is -22/scale = -804 and max value is -15/scale = -548, and after clamp all values equal to zero.

But if add 0 to range of values: [-22. -21. -20. -19. -18. -17. -16. -15. 0.] then scale = 22/256, zero_point = -255, min_value=-255, max_value=0 and we have correct range.

Related tickets

CVS-186919

Tests

Test examples - success

Copilot

Pull request overview

Fixes a quantization range bug in asymmetric weight compression where input weights whose [min, max] range does not include zero (all-positive or all-negative values) produced degenerate decompressed outputs. The fix forces the quantization range to always span zero by clamping min_values <= 0 and max_values >= 0 before computing the scale and zero point. The change is mirrored in both the reference NumPy/Tensor path and the optimized OpenVINO graph builder.

Changes:

In the reference asymmetric path, clamp min_values and max_values so the range always includes zero before calling calculate_scale_zero_point.
In the optimized OpenVINO model builder, perform the equivalent opset.minimum/opset.maximum against a 0.0 constant when computing min/max.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
src/nncf/quantization/algorithms/weight_compression/weight_lowering.py	Adds zero-inclusive clamping to min/max inside the asymmetric branch of `calculate_integer_quantization_params`.
src/nncf/openvino/optimized_functions/models.py	Adds equivalent zero-inclusive clamping to min/max in `_build_integer_quantization_model`, but unconditionally rather than only for asymmetric mode.

+        zero = fns.zeros_like(min_values)
+        min_values = fns.minimum(zero, min_values)
+        max_values = fns.maximum(zero, max_values)


Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 2 comments.

    if config.is_asym_mode:
        level_low = 0
        level_high = 2**num_bits - 1
        min_values = fns.min(weight, axis=reduction_axes, keepdims=True)  # [a1, r, a2] -> [a1, 1, a2]
        max_values = fns.max(weight, axis=reduction_axes, keepdims=True)  # [a1, r, a2] -> [a1, 1, a2]
+
+        zero = fns.zeros_like(min_values)
+        min_values = fns.minimum(zero, min_values)
+        max_values = fns.maximum(zero, max_values)


        example_inputs_numpy = example_input.detach().cpu().numpy()
        stripped_ov_output = torch.tensor(model(example_inputs_numpy)[0], device=example_input.device)

+        # TODO(aanuf): fix input_low, input_range computation for AsymmetricQuantizer
        assert torch.allclose(tuned_output, stripped_output, atol=1e-1)


alexsu52 and others added 30 commits September 2, 2024 13:22

Support scale estimation inside GPTQ

488cacc

fix for INT4_ASYM

ee64877

Merge remote-tracking branch 'upstream/develop' into develop

f22e411

Merge remote-tracking branch 'upstream/develop' into develop

51b4d7b

Merge remote-tracking branch 'upstream/develop' into develop

f66cd1e

Merge remote-tracking branch 'upstream/develop' into develop

7ce5a53

Merge remote-tracking branch 'upstream/develop' into develop

f74d156

Merge remote-tracking branch 'upstream/develop' into develop

5288c79

Merge remote-tracking branch 'upstream/develop' into develop

1becf15

Merge remote-tracking branch 'upstream/develop' into develop

047d7d9

Merge remote-tracking branch 'upstream/develop' into develop

c0c7e57

Merge remote-tracking branch 'upstream/develop' into develop

b74dea1

Merge remote-tracking branch 'upstream/develop' into develop

26a9a77

Merge remote-tracking branch 'upstream/develop' into develop

25fcc2c

Merge remote-tracking branch 'upstream/develop' into develop

26d4887

Merge remote-tracking branch 'upstream/develop' into develop

7748233

Merge remote-tracking branch 'upstream/develop' into develop

df251b3

Merge remote-tracking branch 'upstream/develop' into develop

4c134c4

Merge remote-tracking branch 'upstream/develop' into develop

6147097

Merge remote-tracking branch 'upstream/develop' into develop

2b94d28

Merge remote-tracking branch 'upstream/develop' into develop

5e312a5

Merge remote-tracking branch 'upstream/develop' into develop

2c5e983

Merge remote-tracking branch 'upstream/develop' into develop

1d8db1e

Merge remote-tracking branch 'upstream/develop' into develop

7244f18

Merge remote-tracking branch 'upstream/develop' into develop

443048c

Merge remote-tracking branch 'upstream/develop' into develop

80d2d8a

Merge remote-tracking branch 'upstream/develop' into develop

06bb19b

Merge remote-tracking branch 'upstream/develop' into develop

5d97d87

Merge remote-tracking branch 'upstream/develop' into develop

ae7cece

Merge remote-tracking branch 'upstream/develop' into develop

04ca66c

andreyanufr added 8 commits January 30, 2026 15:05

Merge remote-tracking branch 'upstream/develop' into develop

b62b7b9

Merge remote-tracking branch 'upstream/develop' into develop

c5715de

Merge remote-tracking branch 'upstream/develop' into develop

0ecd8fe

Merge remote-tracking branch 'upstream/develop' into develop

33c21e8

Merge remote-tracking branch 'upstream/develop' into develop

d7622a5

Merge remote-tracking branch 'upstream/develop' into develop

6f10fff

Merge remote-tracking branch 'upstream/develop' into develop

529b8aa

Merge remote-tracking branch 'upstream/develop' into develop

7f2c5cf

Copilot AI review requested due to automatic review settings May 15, 2026 14:29

andreyanufr requested a review from a team as a code owner May 15, 2026 14:29

Copilot started reviewing on behalf of andreyanufr May 15, 2026 14:30 View session

Copilot AI reviewed May 15, 2026

View reviewed changes

andreyanufr added 2 commits May 15, 2026 16:35

Fixed asym compression for case then all values positive or negative.

65aed7c

Fixed OV optimization.

6364003

andreyanufr marked this pull request as draft May 15, 2026 14:55

Merge remote-tracking branch 'upstream/develop' into aanuf/fix_for_asym

33ecbab

github-actions Bot added the NNCF OpenVINO Pull requests that updates NNCF OpenVINO label May 21, 2026

andreyanufr added 4 commits May 21, 2026 16:03

Updated OV test references.

5d39420

Updated OV test references.

2aa48d5

Updated references for OV test_scale_estimation

2004096

Updated refernces for torch scale estimation test.

275b9dc

github-actions Bot added NNCF PT Pull requests that updates NNCF PyTorch NNCF ONNX Pull requests that updates NNCF ONNX labels May 22, 2026

andreyanufr added 3 commits May 22, 2026 12:52

Updated reference values for test_scale_estimation ONNX backend.

18feba3

Fixed test_fq_lora_export.

02239da

Aligned weight values between OV and Torch.

daace3b

andreyanufr marked this pull request as ready for review May 22, 2026 13:55

andreyanufr requested a review from Copilot May 26, 2026 09:15

Copilot started reviewing on behalf of andreyanufr May 26, 2026 09:16 View session

Copilot AI reviewed May 26, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aanuf/fix for asym#4074

Aanuf/fix for asym#4074
andreyanufr wants to merge 55 commits into
openvinotoolkit:developfrom
andreyanufr:aanuf/fix_for_asym

andreyanufr commented May 15, 2026 •

edited by github-actions Bot

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

andreyanufr commented May 15, 2026 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Reason for changes

Related tickets

Tests

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

andreyanufr commented May 15, 2026 •

edited by github-actions Bot

Loading