Quantization Operators
======================

Quantization is a model optimization technique that reduces the size of a
large model, improving storage efficiency at the cost of a small loss in
accuracy.

CUDA Operators
--------------

.. doxygengroup:: quantize-ops-cuda
   :content-only:

CPU Operators
-------------

.. doxygengroup:: quantize-data-cpu
   :content-only:
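
To make the size-versus-accuracy trade-off described in the introduction concrete, the following is a minimal sketch of 8-bit affine quantization of a single row of floats. It is pure Python for illustration only and is not the API of the operators documented on this page; the function names ``quantize_row`` and ``dequantize_row`` and the per-row ``scale`` and ``bias`` layout are assumptions made for this example.

```python
def quantize_row(values, num_bits=8):
    """Map a row of floats to unsigned integer codes (hypothetical helper)."""
    qmax = (1 << num_bits) - 1
    lo, hi = min(values), max(values)
    # A constant row has zero range; fall back to scale 1.0 to avoid dividing by zero.
    scale = (hi - lo) / qmax if hi > lo else 1.0
    # Round to the nearest code and clamp to the representable range [0, qmax].
    codes = [min(qmax, max(0, round((v - lo) / scale))) for v in values]
    return codes, scale, lo


def dequantize_row(codes, scale, bias):
    """Approximately recover the original floats from codes, scale, and bias."""
    return [c * scale + bias for c in codes]


row = [-1.0, -0.25, 0.0, 0.5, 1.0]
codes, scale, bias = quantize_row(row)
restored = dequantize_row(codes, scale, bias)

# The reconstruction error per element is bounded by about half a quantization step.
max_error = max(abs(a - b) for a, b in zip(row, restored))
```

Under these assumptions each element is stored in 8 bits instead of 32, plus one ``scale`` and one ``bias`` per row, and the reconstruction error of any element stays within roughly ``scale / 2``.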