How to Quantize Large Language Models — Super Lazy Coder

Super Lazy Coder Tutorial Summary

Rohan Kotwani
Lazy by Design
Published in
5 min readApr 21, 2024

--

  • More content on software engineering, math, and data science can be found in the Lazy by Design publication.

Table of Contents

  1. Quantization Technique for Large Language Models | 0:00:00–0:14:00
  2. Using GPTQ Technique for Model Quantization | 0:14:00–0:25:00
  3. Quantizing Models with Llama CPP Library | 0:25:00–10:10:10

Quantization Technique…

--

--