Which Quantization to Use to Reduce the Size of LLMs?