PodParley
308 - How Image Diffusion Models Work - the 20 minute explainer

EPISODE · Mar 24, 2026 · 24 MIN

from Fragmented - AI Developer Podcast · host Kaushik Gopal, Iury Souza

You already know how LLMs work from our popular 20-minute explainer. Now we take it to images. What does Michelangelo have to do with Stable Diffusion? More than you'd think. Walk away knowing how image generation actually works, and what it has in common with the text models you already understand. Full show notes at fragmentedpodcast.com.

Show Notes

- Episode 303 - How LLMs work in 20 minutes - text generation
- VAE - Variational Autoencoder
- RGB color model - Wikipedia
- Word2Vec technique - Wikipedia
- Efficient Estimation of Word Representations in Vector Space - original Word2Vec paper by Mikolov et al.
- High-Resolution Image Synthesis with Latent Diffusion Models - Rombach et al. (2022), the paper behind Stable Diffusion
- Image training data:
  - LAION-5B - 5 billion image-text pairs scraped from the web, used to train many image generation models
  - WebLI - Google's internal image-text dataset
- Michelangelo

Get in touch

We'd love to hear from you. Email is the best way to reach us, or you can check our contact page for other ways. We want to hear all the feedback: what's working, what's not, and topics you'd like to hear more about.

Contact us · Newsletter · YouTube · Website

Co-hosts: Kaushik Gopal, Iury Souza

[!fyi] We transitioned from Android development to AI starting with Ep. #300. Listen to that episode for the full story behind our new direction.
