Summary: AWQ: Activation-Aware Weight Quantization for on-device LLM Compression and Acceleration 08-30