692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU
Join Jon as he navigates listeners through the innovative SpQR approach—a cutting-edge, lossless LLM weight compression technique that harnesses the power of quantization. Tune in as Jon delves into the four steps behind this groundbreaking method in this week's episode.Additional materials: www.superdatascience.com/692Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
An episode of the Super Data Science: ML & AI Podcast with Jon Krohn podcast, hosted by Jon Krohn, titled "692: Lossless LLM Weight Compression: Run Huge Models on a Single GPU" was published on June 30, 2023 and runs 7 minutes.
June 30, 2023 ·7m · Super Data Science: ML & AI Podcast with Jon Krohn
Summary
Join Jon as he navigates listeners through the innovative SpQR approach—a cutting-edge, lossless LLM weight compression technique that harnesses the power of quantization. Tune in as Jon delves into the four steps behind this groundbreaking method in this week's episode.Additional materials: www.superdatascience.com/692Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Episode Description
Similar Episodes
Apr 9, 2026 ·37m
Mar 12, 2026 ·43m
Feb 26, 2026 ·35m