Abstract: Deep Neural Networks (DNNs) are pivotal in modern applications, yet their inference phase remains computationally demanding. Quantization, the process of reducing the precision of numerical ...