How to Quantize Large Language Models — Super Lazy Coder
Super Lazy Coder Tutorial Summary
5 min read · Apr 21, 2024
- More content on software engineering, math, and data science can be found in the Lazy by Design publication.
Table of Contents
- Quantization Technique for Large Language Models | 0:00:00–0:14:00
- Using GPTQ Technique for Model Quantization | 0:14:00–0:25:00
- Quantizing Models with Llama CPP Library | 0:25:00–10:10:10