EPISODE · Nov 3, 2024 · 4 MIN

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

from Daily Paper Cast (Test) · host Jingwen Liang, Gengyu Wang

🤗 Daily Paper Upvotes: 56

Authors: Viacheslav Surkov, Chris Wendler, Mikhail Terekhov, Justin Deschenaux, Robert West, Caglar Gulcehre

Categories: cs.LG, cs.AI, cs.CV

Arxiv: http://arxiv.org/abs/2410.22366v1

Title: Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Abstract: Sparse autoencoders (SAEs) have become a core ingredient in the reverse engineering of large language models (LLMs). For LLMs, they have been shown to decompose intermediate representations that often are not interpretable directly into sparse sums of interpretable features, facilitating better control and subsequent analysis. However, similar analyses and approaches have been lacking for text-to-image models. We investigated the possibility of using SAEs to learn interpretable features for few-step text-to-image diffusion models, such as SDXL Turbo. To this end, we train SAEs on the updates performed by transformer blocks within SDXL Turbo's denoising U-net. We find that their learned features are interpretable, causally influence the generation process, and reveal specialization among the blocks. In particular, we find one block that deals mainly with image composition, one that is mainly responsible for adding local details, and one for color, illumination, and style. Therefore, our work is an important first step towards better understanding the internals of generative text-to-image models like SDXL Turbo and showcases the potential of features learned by SAEs for the visual domain.

Code is available at https://github.com/surkovv/sdxl-unbox
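To make the abstract's central idea concrete, here is a minimal sketch of a sparse autoencoder applied to a block "update". It is an illustration only, not the authors' code: the top-k sparsity rule, the tied initialization, the random stand-in data, and all names (`topk_sae_forward`, `W_enc`, `W_dec`, `d_model`, `n_features`) are assumptions, and the update is taken here to mean block output minus block input. The weights are untrained, so the reconstruction is not expected to be accurate; the point is the shape of the computation.

```python
import numpy as np


def topk_sae_forward(x, W_enc, b_enc, W_dec, b_dec, k):
    """Encode x into at most k active features per row, then reconstruct it."""
    # Encoder: affine map followed by ReLU, giving non-negative activations.
    pre = np.maximum((x - b_dec) @ W_enc + b_enc, 0.0)
    # Sparsity (assumed top-k variant): keep the k largest activations per row.
    idx = np.argpartition(pre, -k, axis=1)[:, -k:]
    codes = np.zeros_like(pre)
    np.put_along_axis(codes, idx, np.take_along_axis(pre, idx, axis=1), axis=1)
    # Decoder: the reconstruction is a sparse sum of rows of W_dec (the features).
    recon = codes @ W_dec + b_dec
    return codes, recon


rng = np.random.default_rng(0)
d_model, n_features, k = 16, 64, 4  # hypothetical sizes, chosen for illustration

# Random stand-in for activations around one transformer block.
block_in = rng.normal(size=(8, d_model))
block_out = block_in + 0.1 * rng.normal(size=(8, d_model))
update = block_out - block_in  # assumed meaning of "update performed by a block"

# Untrained weights; a real SAE would be fit to minimize reconstruction error.
W_enc = rng.normal(scale=0.1, size=(d_model, n_features))
W_dec = W_enc.T.copy()
b_enc = np.zeros(n_features)
b_dec = np.zeros(d_model)

codes, recon = topk_sae_forward(update, W_enc, b_enc, W_dec, b_dec, k)
print(codes.shape, recon.shape)
```

Each row of `codes` has at most `k` nonzero entries, so each update is expressed as a sum of a few decoder directions. In the setting the abstract describes, those directions would be the candidate interpretable features.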

Frequently Asked Questions

How long is this episode of Daily Paper Cast (Test)?

This episode is 4 minutes long.

When was this Daily Paper Cast (Test) episode published?

This episode was published on November 3, 2024.

What is this episode about?

This episode covers the paper "Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders" (http://arxiv.org/abs/2410.22366v1), which trains sparse autoencoders on the updates performed by transformer blocks within SDXL Turbo's denoising U-net to learn interpretable features.

Can I download this Daily Paper Cast (Test) episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.