Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ReleaseNotes update #2500

Merged
Prev Previous commit
Next Next commit
pt
  • Loading branch information
AlexanderDokuchaev authored and nikita-malininn committed Feb 29, 2024
commit 811161052e7155f4eae54d03cab6c5e7f5cdcbfc
6 changes: 6 additions & 0 deletions ReleaseNotes.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,12 +11,18 @@ Post-training Quantization:
- Features:
- (ONNX) Introduced support for the ONNX backend in the `nncf.quantize_with_accuracy_control()` method. Users can now perform quantization with accuracy control for `onnx.ModelProto`. By leveraging this feature, users can enhance the accuracy of quantized models while minimizing performance impact.
- (ONNX) Added an example based on the YOLOv8n-seg model for demonstrating the usage of quantization with accuracy control for the ONNX backend.
- (PT) Added SmoothQuant algorithm for PyTorch backend in `nncf.quantize`.
- ...
- Fixes:
- (PyTorch) Fixed incorrect set `is_shared` attribute in case of wraping model with `trace_parameters=True`.
- (PyTorch) Fixed dtype handling for traced `torch.nn.Parameter`.
- (PyTorch) Fixed zero eps for fake quantizer module.
- ...
- Improvements:
- ...
- Deprecations/Removals:
- (PyTorch) Deprecated `binarization` algorithm
- Removed Dockerfiles
- ...
- Tutorials:
- ...
Expand Down