Quantize NafNet deblurring: ORT pipeline + validation script + robust handling #299
Summary
Adds ONNX Runtime (ORT) static quantization support for deblurring_nafnet, plus a validation script that compares FP32 and INT8 outputs with PSNR/SSIM and timing.
Improves quantization robustness and memory handling across the tools.
Changes
tools/quantize/quantize-ort.py
Adds deblurring_nafnet entry to the central models dict.
Uses QuantFormat.QOperator + op_types_to_quantize=['Conv','MatMul'] for better CPU EP performance, with an automatic fallback to QDQ Conv-only when QOperator quantization fails.
Optional pre-processing (skips if onnxruntime-extensions is missing).
Lazy DataReader creation to avoid importing all datasets at module import.
MinMax calibration; input resize to 512x512; max_samples=1 to reduce memory.
Auto-excludes Conv nodes without bias initializers to avoid bias=None errors; the overall flow is sketched below.
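A condensed sketch of that flow, assuming hypothetical filenames, an input tensor named "input", and a random stand-in calibration sample (the real script derives all of these from its central models dict and dataset-backed DataReader):

```python
import numpy as np
import onnx
from onnxruntime.quantization import (CalibrationDataReader, CalibrationMethod,
                                      QuantFormat, QuantType, quantize_static)

FP32 = "deblurring_nafnet_2025may.onnx"        # assumed paths
INT8 = "deblurring_nafnet_2025may_int8.onnx"

class OneSampleReader(CalibrationDataReader):
    """Single 512x512 sample (max_samples=1) to keep calibration memory low.
    Random data stands in for the real dataset-backed DataReader."""
    def __init__(self, input_name):
        self._it = iter([{input_name:
                          np.random.rand(1, 3, 512, 512).astype(np.float32)}])
    def get_next(self):
        return next(self._it, None)

# Exclude Conv nodes with no bias initializer (Conv inputs are X, W[, B]);
# quantizing them can otherwise fail with bias=None errors.
graph = onnx.load(FP32).graph
no_bias_convs = [n.name for n in graph.node
                 if n.op_type == "Conv" and len(n.input) < 3]

def run(quant_format, op_types):
    quantize_static(FP32, INT8, OneSampleReader("input"),  # input name assumed
                    calibrate_method=CalibrationMethod.MinMax,
                    quant_format=quant_format,
                    op_types_to_quantize=op_types,
                    nodes_to_exclude=no_bias_convs,
                    activation_type=QuantType.QInt8,
                    weight_type=QuantType.QInt8)

try:
    run(QuantFormat.QOperator, ["Conv", "MatMul"])  # preferred: QOperator Conv+MatMul
except Exception:
    run(QuantFormat.QDQ, ["Conv"])                  # fallback: QDQ, Conv only
```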
tools/quantize/transform.py
Makes HandAlign lazy-load its palm detector so unrelated quant runs don't load the Mediapipe ONNX at import time.
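The pattern, in a minimal illustrative form (the palm-detector filename and the `__call__` body are placeholders; only the lazy-load mechanics matter here):

```python
# Illustrative lazy-load pattern; the real HandAlign lives in
# tools/quantize/transform.py and wraps the Mediapipe palm detector ONNX.
class HandAlign:
    def __init__(self, model_path="palm_detection_mediapipe.onnx"):  # assumed name
        self._model_path = model_path
        self._detector = None              # nothing loaded at import/construct time

    @property
    def detector(self):
        # Load the palm detector only on first use, so quantizing
        # unrelated models never pays the ONNX load cost.
        if self._detector is None:
            import onnxruntime as ort      # deferred import
            self._detector = ort.InferenceSession(
                self._model_path, providers=["CPUExecutionProvider"])
        return self._detector

    def __call__(self, image):
        _ = self.detector                  # triggers the one-time load
        ...                                # alignment logic unchanged
```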
models/deblurring_nafnet/validate_quantization.py
New: runs the FP32 and INT8 models via ORT, reports timing, PSNR, and SSIM, and shows side-by-side results; the core comparison is sketched below.
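A minimal sketch of that comparison (argument parsing and the side-by-side display are omitted; the 1/255 input scaling and [0,1] output range are assumptions about NafNet's pre/post-processing):

```python
import time
import cv2
import numpy as np
import onnxruntime as ort
from skimage.metrics import structural_similarity

def infer(model_path, blob):
    sess = ort.InferenceSession(model_path, providers=["CPUExecutionProvider"])
    name = sess.get_inputs()[0].name
    t0 = time.perf_counter()
    out = sess.run(None, {name: blob})[0]
    return out, time.perf_counter() - t0

img = cv2.imread("example_outputs/licenseplate_motion.jpg")
blob = cv2.dnn.blobFromImage(img, scalefactor=1 / 255.0)   # NCHW float32

fp32_out, t_fp32 = infer("deblurring_nafnet_2025may.onnx", blob)
int8_out, t_int8 = infer("deblurring_nafnet_2025may_int8.onnx", blob)

def to_image(out):
    # Assumes a [0,1] NCHW output; convert back to an HWC uint8 image.
    return (np.clip(out[0], 0, 1).transpose(1, 2, 0) * 255).astype(np.uint8)

a, b = to_image(fp32_out), to_image(int8_out)
print(f"FP32 {t_fp32:.3f}s, INT8 {t_int8:.3f}s")
print(f"PSNR {cv2.PSNR(a, b):.2f} dB, "
      f"SSIM {structural_similarity(a, b, channel_axis=2):.4f}")
```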
Usage
Quantize NafNet:
cd tools/quantize
python quantize-ort.py deblurring_nafnet
Output: models/deblurring_nafnet/deblurring_nafnet_2025may_int8.onnx
Validate:
cd models/deblurring_nafnet
python validate_quantization.py --input example_outputs/licenseplate_motion.jpg --model_fp32 deblurring_nafnet_2025may.onnx --model_int8 deblurring_nafnet_2025may_int8.onnx
Notes on performance
On CPUExecutionProvider, INT8 speed-ups are not guaranteed unless the provider fuses/accelerates int8 kernels.
For acceleration, consider provider-specific EPs: OpenVINOExecutionProvider (CPU) or DmlExecutionProvider (GPU) if available; a selection snippet follows these notes.
The script quantizes to QOperator Conv/MatMul by default; falls back to QDQ Conv-only for robustness.
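Selecting an accelerated EP when one is installed is a one-line change at session creation; filtering against the available providers lets the snippet degrade gracefully to CPU:

```python
import onnxruntime as ort

preferred = ["OpenVINOExecutionProvider",   # onnxruntime-openvino build
             "DmlExecutionProvider",        # onnxruntime-directml build
             "CPUExecutionProvider"]        # always available
available = [p for p in preferred if p in ort.get_available_providers()]
sess = ort.InferenceSession("deblurring_nafnet_2025may_int8.onnx",
                            providers=available)
print(sess.get_providers())                 # EPs actually in use
```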
Known limitations
Quantization depends on ORT support for the model’s ops and shapes; the script includes automatic mitigations (resize, sample limit, excludes).
We do not commit generated ONNX or output images.
Screenshot: (image attachment omitted)