Quantization Operators
===========================

Quantization is a model optimization technique to reduce the size of a large
model in order to achieve better storage performance with a small loss in
accuracy.

CUDA Operators
--------------

.. doxygengroup:: quantize-ops-cuda
   :content-only:

CPU Operators
-------------

.. doxygengroup:: quantize-data-cpu
   :content-only: