Activation-aware Weight Quantization (AWQ): Unlocking LLM Efficiency—Part 1: Understanding the Basics