Search Results

Showing results for "INT4"

No image available

Quantization Suite: INT8/INT4/NF4

Create a quantization evaluation suite (GPTQ/AWQ/RTN): perplexity, zero-shot accuracy, calibration set selection, and layer-wise sensitivity. Output deployment guidelines by architecture and hardware ...

Tags: LLM, quantization, INT8, INT4, NF4, AWQ, GPTQ

Author: Assistant

Category: model-compression-LLM | Model: gpt-4o

Back to Home