PodParley PodParley

759: Full Encoder-Decoder Transformers Fully Explained, with Kirill Eremenko

Encoders, cross attention and masking for LLMs: SuperDataScience Founder Kirill Eremenko returns to the SuperDataScience podcast, where he speaks with Jon Krohn about transformer architectures and why they are a new frontier for generative AI. If you’re interested in applying LLMs to your business portfolio, you’ll want to pay close attention to this episode!This episode is brought to you by Ready Tensor, where innovation meets reproducibility, by Oracle NetSuite business software, and by Int...

An episode of the Super Data Science: ML & AI Podcast with Jon Krohn podcast, hosted by Jon Krohn, titled "759: Full Encoder-Decoder Transformers Fully Explained, with Kirill Eremenko" was published on February 20, 2024 and runs 103 minutes.

February 20, 2024 ·103m · Super Data Science: ML & AI Podcast with Jon Krohn

0:00 / 0:00

Encoders, cross attention and masking for LLMs: SuperDataScience Founder Kirill Eremenko returns to the SuperDataScience podcast, where he speaks with Jon Krohn about transformer architectures and why they are a new frontier for generative AI. If you’re interested in applying LLMs to your business portfolio, you’ll want to pay close attention to this episode!This episode is brought to you by Ready Tensor, where innovation meets reproducibility, by Oracle NetSuite business software, and by Intel and HPE Ezmeral Software Solutions. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• How decoder-only transformers work [15:51]• How cross-attention works in transformers [41:05]• How encoders and decoders work together (an example) [52:46]• How encoder-only architectures excel at understanding natural language [1:20:34]• The importance of masking during self-attention [1:27:08]Additional materials: www.superdatascience.com/759

Encoders, cross attention and masking for LLMs: SuperDataScience Founder Kirill Eremenko returns to the SuperDataScience podcast, where he speaks with Jon Krohn about transformer architectures and why they are a new frontier for generative AI. If you’re interested in applying LLMs to your business portfolio, you’ll want to pay close attention to this episode!This episode is brought to you by Ready Tensor, where innovation meets reproducibility, by Oracle NetSuite business software, and by Intel and HPE Ezmeral Software Solutions. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.In this episode you will learn:• How decoder-only transformers work [15:51]• How cross-attention works in transformers [41:05]• How encoders and decoders work together (an example) [52:46]• How encoder-only architectures excel at understanding natural language [1:20:34]• The importance of masking during self-attention [1:27:08]Additional materials: www.superdatascience.com/759
Partially Derivative Partially Derivative The everyday data of the world around us, hosted by data science super geeks. For the nerdy and nerd curious. Plumbers of Data Science Andreas Kretz Data Engineering is the plumbing of data science. Almost invisible, but super important and a big mess when done wrong.We talk about interesting Data Engineering trends and topics. I also train Data Engineering in my Data Engineering Academy at LearnDataEngineering.com PurpleCar Christine Cavalier Woah there, Speedy! Get off that highway and pull in to PurpleCar Park, a podcast where you can settle in to author interviews, book reviews, and discussion about the act of reading and writing in our super-digital, data-driven world.Unlike most book reviewers and author interviewers in traditional media and on the internet, Christine Cavalier takes the time to read and study the book. Listen in and you’ll notice the difference. Welcome to PurpleCar Park! kill switch Kaleidoscope Were we sleeping when everything changed? Seems like the technologically driven future is already here. On killswitch, we explain the right NOW of our super charged technological lives. New host Dexter Thomas answers questions big and small – like who’s behind Shrimp Jesus, and could you get arrested by a computer?kill switch also brings the DIY back to tech – “How to Now” on everything from how to run your own LLM to tips to keep your data safe. Because the more “user-friendly” our devices get, the less we understand how they work, and the less control we have. We’re here to help you take back control. And if we can’t… Well, maybe we need to look for the kill switch.
URL copied to clipboard!