1

Efficient Streaming Language Models with Attention Sinks (Paper Explained)

Oct 17, 2023

32:26

2

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)

Oct 17, 2023

46:44

3

Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained)

Oct 5, 2023

28:25

4

Reinforced Self-Training (ReST) for Language Modeling (Paper Explained)

Oct 5, 2023

53:06

5

[ML News] LLaMA2 Released | LLMs for Robots | Multimodality on the Rise

Aug 28, 2023

44:10

6

How Cyber Criminals Are Using ChatGPT (w/ Sergey Shykevich)

Aug 28, 2023

29:08

7

Recipe AI suggests FATAL CHLORINE GAS Recipe

Aug 28, 2023

7:05

8

DeepFloyd IF - Pixel-Based Text-to-Image Diffusion (w/ Authors)

Aug 28, 2023

53:31

9

[ML News] GPT-4 solves MIT Exam with 100% ACCURACY | OpenLLaMA 13B released

Aug 28, 2023

31:04

10

Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust (Explained)

Aug 28, 2023

35:44

11

RWKV: Reinventing RNNs for the Transformer Era (Paper Explained)

Aug 28, 2023

62:16

12

Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Full Paper Review)

Aug 28, 2023

29:28

13

OpenAI suggests AI licenses (US Senate hearing on AI regulation w/ Sam Altman)

Aug 28, 2023

16:12

14

[ML News] Geoff Hinton leaves Google | Google has NO MOAT | OpenAI down half a billion

Aug 28, 2023

39:06

15

Scaling Transformer to 1M tokens and beyond with RMT (Paper Explained)

Aug 28, 2023

24:33

16

OpenAssistant RELEASED! The world's best open-source Chat AI!

Aug 28, 2023

21:05

17

OpenAssistant First Models are here! (Open-Source ChatGPT)

Aug 28, 2023

16:52

18

The biggest week in AI (GPT-4, Office Copilot, Google PaLM, Anthropic Claude & more)

Aug 28, 2023

41:01

19

GPT-4 is here! What we know so far (Full Analysis)

Aug 28, 2023

34:09

20

This ChatGPT Skill will earn you $10B (also, AI reads your mind!)

Aug 28, 2023

43:27

21

LLaMA: Open and Efficient Foundation Language Models (Paper Explained)

Aug 28, 2023

41:06

22

Open Assistant Inference Backend Development (Hands-On Coding)

Aug 28, 2023

81:23

23

OpenAssistant - ChatGPT's Open Alternative (We need your help!)

Aug 28, 2023

35:47

24

ChatGPT: This AI has a JAILBREAK?! (Unbelievable AI Progress)

Jan 2, 2023

31:54

25

[ML News] GPT-4 Rumors | AI Mind Reading | Neuron Interaction Solved | AI Theorem Proving

Nov 30, 2022

41:55

26

CICERO: An AI agent that negotiates, persuades, and cooperates with people

Nov 30, 2022

61:02

27

[ML News] Multiplayer Stable Diffusion | OpenAI needs more funding | Text-to-Video models incoming

Nov 23, 2022

22:52

28

The New AI Model Licenses have a Legal Loophole (OpenRAIL-M of BLOOM, Stable Diffusion, etc.)

Nov 23, 2022

27:50

29

ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)

Nov 23, 2022

64:58

30

Neural Networks are Decision Trees (w/ Alexander Mattick)

Oct 23, 2022

31:50

31

This is a game changer! (AlphaTensor by DeepMind explained)

Oct 23, 2022

55:06

32

[ML News] Stable Diffusion Takes Over! (Open Source AI Art)

Oct 23, 2022

27:27

33

How to make your CPU as fast as a GPU - Advances in Sparsity w/ Nir Shavit

Oct 23, 2022

50:19

34

More Is Different for AI - Scaling Up, Emergence, and Paperclip Maximizers (w/ Jacob Steinhardt)

Sep 15, 2022

66:36

35

The hidden dangers of loading open-source AI models (ARBITRARY CODE EXPLOIT!)

Sep 7, 2022

19:42

36

The Future of AI is Self-Organizing and Self-Assembling (w/ Prof. Sebastian Risi)

Aug 29, 2022

61:48

37

The Man behind Stable Diffusion

Aug 29, 2022

25:41

38

[ML News] BLOOM: 176B Open-Source | Chinese Brain-Scale Computer | Meta AI: No Language Left Behind

Aug 3, 2022

14:02

39

JEPA - A Path Towards Autonomous Machine Intelligence (Paper Explained)

Jul 10, 2022

59:37

40

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos (Paper Explained)

Jun 28, 2022

32:33

41

Parti - Scaling Autoregressive Models for Content-Rich Text-to-Image Generation (Paper Explained)

Jun 28, 2022

34:57

42

Did Google's LaMDA chatbot just become sentient?

Jun 20, 2022

22:22

43

[ML News] DeepMind's Flamingo Image-Text model | Locked-Image Tuning | Jurassic X & MRKL

May 16, 2022

24:18

44

[ML News] Meta's OPT 175B language model | DALL-E Mega is training | TorToiSe TTS fakes my voice

May 12, 2022

19:23

45

This A.I. creates infinite NFTs

May 12, 2022

18:47

46

Author Interview: SayCan - Do As I Can, Not As I Say: Grounding Language in Robotic Affordances

May 12, 2022

58:31

47

Do As I Can, Not As I Say: Grounding Language in Robotic Affordances (SayCan - Paper Explained)

May 2, 2022

28:46

48

Author Interview - ACCEL: Evolving Curricula with Regret-Based Environment Design

May 2, 2022

57:45

49

ACCEL: Evolving Curricula with Regret-Based Environment Design (Paper Review)

May 2, 2022

44:05

50

LAION-5B: 5 billion image-text-pairs dataset (with the authors)

Apr 25, 2022

58:01

51

Sparse Expert Models (Switch Transformers, GLAM, and more... w/ the Authors)

Apr 25, 2022

58:22

52

Author Interview - Transformer Memory as a Differentiable Search Index

Apr 21, 2022

43:03

53

Transformer Memory as a Differentiable Search Index (Machine Learning Research Paper Explained)

Apr 21, 2022

51:51

54

[ML News] Google's 540B PaLM Language Model & OpenAI's DALL-E 2 Text-to-Image Revolution

Apr 12, 2022

14:20

55

The Weird and Wonderful World of AI Art (w/ Author Jack Morris)

Apr 6, 2022

59:29

56

Author Interview - Improving Intrinsic Exploration with Language Abstractions

Apr 6, 2022

49:25

57

Improving Intrinsic Exploration with Language Abstractions (Machine Learning Paper Explained)

Apr 6, 2022

42:25

58

[ML News] GPT-3 learns to edit | Google Pathways | Make-A-Scene | CLIP meets GamePhysics | DouBlind

Apr 6, 2022

18:02

59

Author Interview - Memory-assisted prompt editing to improve GPT-3 after deployment

Mar 30, 2022

40:37

60

Memory-assisted prompt editing to improve GPT-3 after deployment (Machine Learning Paper Explained)

Mar 30, 2022

36:41

61

Author Interview - Typical Decoding for Natural Language Generation

Mar 28, 2022

48:55

62

Typical Decoding for Natural Language Generation (Get more human-like outputs from language models!)

Mar 28, 2022

48:55

63

One Model For All The Tasks - BLIP (Author Interview)

Mar 25, 2022

48:33

64

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding&Generation

Mar 25, 2022

46:40

65

[ML News] AI Threatens Biological Arms Race

Mar 22, 2022

33:19

66

Active Dendrites avoid catastrophic forgetting - Interview with the Authors

Mar 21, 2022

56:32

67

Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments (Review)

Mar 21, 2022

65:20

68

Author Interview - VOS: Learning What You Don't Know by Virtual Outlier Synthesis

Mar 17, 2022

35:58

69

VOS: Learning What You Don't Know by Virtual Outlier Synthesis (Paper Explained)

Mar 14, 2022

35:57

70

Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents

Mar 10, 2022

96:39

71

First Author Interview: AI & formal math (Formal Mathematics Statement Curriculum Learning)

Mar 8, 2022

58:07

72

OpenAI tackles Math - Formal Mathematics Statement Curriculum Learning (Paper Explained)

Mar 8, 2022

50:40

73

[ML News] DeepMind controls fusion | Yann LeCun's JEPA architecture | US: AI can't copyright its art

Mar 8, 2022

28:20

74

AlphaCode - with the authors!

Mar 8, 2022

53:45

75

Competition-Level Code Generation with AlphaCode (Paper Review)

Mar 2, 2022

45:25

76

Can Wikipedia Help Offline Reinforcement Learning? (Author Interview)

Mar 2, 2022

44:46

77

Can Wikipedia Help Offline Reinforcement Learning? (Paper Explained)

Mar 2, 2022

38:34

78

[ML Olds] Meta Research Supercluster | OpenAI GPT-Instruct | Google LaMDA | Drones fight Pigeons

Mar 2, 2022

12:38

79

[ML News] Uber: Deep Learning for ETA | MuZero Video Compression | Block-NeRF | EfficientNet-X

Feb 24, 2022

26:06

80

Listening to You! - Channel Update (Author Interviews)

Feb 22, 2022

4:30

81

All about AI Accelerators: GPU, TPU, Dataflow, Near-Memory, Optical, Neuromorphic & more (w/ Author)

Feb 21, 2022

62:34

82

CM3: A Causal Masked Multimodal Model of the Internet (Paper Explained w/ Author Interview)

Feb 21, 2022

84:19

83

AI against Censorship: Genetic Algorithms, The Geneva Project, ML in Security, and more!

Feb 17, 2022

54:57

84

HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning (w/ Author)

Feb 16, 2022

78:16

85

[ML News] DeepMind AlphaCode | OpenAI math prover | Meta battles harmful content with AI

Feb 16, 2022

26:38

86

Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents (+Author)

Feb 16, 2022

77:04

87

OpenAI Embeddings (and Controversy?!)

Feb 16, 2022

15:56

88

Unsupervised Brain Models - How does Deep Learning inform Neuroscience? (w/ Patrick Mineault)

Feb 16, 2022

81:27

89

GPT-NeoX-20B - Open-Source huge language model by EleutherAI (Interview w/ co-founder Connor Leahy)

Feb 16, 2022

20:05

90

Predicting the rules behind - Deep Symbolic Regression for Recurrent Sequences (w/ author interview)

Feb 2, 2022

71:09

91

IT ARRIVED! YouTube sent me a package. (also: Limited Time Merch Deal)

Jan 28, 2022

12:57

92

[ML News] ConvNeXt: Convolutions return | China regulates algorithms | Saliency cropping examined

Jan 28, 2022

18:36

93

Dynamic Inference with Neural Interpreters (w/ author interview)

Jan 24, 2022

82:36

94

Noether Networks: Meta-Learning Useful Conserved Quantities (w/ the authors)

Jan 21, 2022

69:04

95

This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors)

Jan 20, 2022

83:50

96

This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors)

Jan 16, 2022

83:50

97

Full Self-Driving is HARD! Analyzing Elon Musk re: Tesla Autopilot on Lex Fridman's Podcast

Jan 7, 2022

41:50

98

Player of Games: All the games, one algorithm! (w/ author Martin Schmid)

Jan 5, 2022

54:10

99

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

Jan 5, 2022

42:16

100

[ML News] DeepMind builds Gopher | Google builds GLaM | Suicide capsule uses AI to check access

Jan 5, 2022

25:39

101

Resolution-robust Large Mask Inpainting with Fourier Convolutions (w/ Author Interview)

Jan 5, 2022

54:01

102

[ML News] DeepMind tackles Math | Microsoft does more with less | Timnit Gebru launches DAIR

Dec 14, 2021

25:39

103

NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion (ML Research Paper Explained)

Dec 10, 2021

52:44

104

[ML News] OpenAI removes GPT-3 waitlist | GauGAN2 is amazing | NYC regulates AI hiring tools

Dec 3, 2021

29:08

105

Sparse is Enough in Scaling Transformers (aka Terraformer) | ML Research Paper Explained

Dec 2, 2021

57:06

106

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning (Paper Explained)

Dec 1, 2021

40:42

107

Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions (Paper Explained)

Dec 1, 2021

59:18

108

Peer Review is still BROKEN! The NeurIPS 2021 Review Experiment (results are in)

Nov 26, 2021

11:09

109

Parameter Prediction for Unseen Deep Architectures (w/ First Author Boris Knyazev)

Nov 25, 2021

48:06

110

Learning Rate Grafting: Transferability of Optimizer Tuning (Machine Learning Research Paper Review)

Nov 22, 2021

39:14

111

[ML News] Cedille French Language Model | YOU Search Engine | AI Finds Profitable MEME TOKENS

Nov 22, 2021

36:05

112

Gradients are Not All You Need (Machine Learning Research Paper Explained)

Nov 22, 2021

48:29

113

[ML News] Microsoft combines Images & Text | Meta makes artificial skin | Russians replicate DALL-E

Nov 22, 2021

37:52

114

Autoregressive Diffusion Models (Machine Learning Research Paper Explained)

Nov 11, 2021

34:23

115

[ML News] Google introduces Pathways | OpenAI solves Math Problems | Meta goes First Person

Nov 11, 2021

36:45

116

EfficientZero: Mastering Atari Games with Limited Data (Machine Learning Research Paper Explained)

Nov 5, 2021

29:25

117

[YTalks] Siraj Raval - Stories about YouTube, Plagiarism, and the Dangers of Fame (Interview)

Nov 1, 2021

66:44

118

[ML News GERMAN] NVIDIA GTC'21 | DeepMind kauft MuJoCo | Google Lernt Spreadsheet Formeln

Nov 1, 2021

26:56

119

[ML News] NVIDIA GTC'21 | DeepMind buys MuJoCo | Google predicts spreadsheet formulas

Nov 1, 2021

21:23

120

I went to an AI Art Festival in Geneva (AiiA Festival Trip Report)

Oct 29, 2021

18:52

121

Symbolic Knowledge Distillation: from General Language Models to Commonsense Models (Explained)

Oct 25, 2021

45:21

122

I took a Swiss train and it was awesome! Train Seat Review - SBB InterCity 1 - Geneva to St. Gallen

Oct 25, 2021

4:15

123

[ML News] Microsoft trains 530B model | ConvMixer model fits into single tweet | DeepMind profitable

Oct 21, 2021

27:51

124

[ML News] DeepMind does Nowcasting | The Guardian's shady reporting | AI finishes Beethoven's 10th

Oct 11, 2021

27:40

125

Grokking: Generalization beyond Overfitting on small algorithmic datasets (Paper Explained)

Oct 11, 2021

29:46

126

How far can we scale up? Deep Learning's Diminishing Returns (Article Review)

Oct 4, 2021

20:26

127

[ML News] Plagiarism Case w/ Plot Twist | CLIP for video surveillance | OpenAI summarizes books

Sep 30, 2021

30:51

128

Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment (Paper Explained)

Sep 30, 2021

25:59

129

[ML News] New ImageNet SOTA | Uber's H3 hexagonal coordinate system | New text-image-pair dataset

Sep 28, 2021

14:13

130

Does GPT-3 lie? - Misinformation and fear-mongering around the TruthfulQA dataset

Sep 24, 2021

13:18

131

Topographic VAEs learn Equivariant Capsules (Machine Learning Research Paper Explained)

Sep 21, 2021

32:03

132

[ML News] Roomba Avoids Poop | Textless NLP | TikTok Algorithm Secrets | New Schmidhuber Blog

Sep 16, 2021

25:39

133

Celebrating 100k Subscribers! (w/ Channel Statistics)

Sep 16, 2021

9:38

134

[ML News] AI predicts race from X-Ray | Google kills HealthStreams | Boosting Search with MuZero

Sep 13, 2021

27:33

135

∞-former: Infinite Memory Transformer (aka Infty-Former / Infinity-Former, Research Paper Explained)

Sep 6, 2021

36:36

136

[ML News] Blind Chess AI Competition | Graph NNs for traffic | AI gift suggestions

Sep 5, 2021

17:06

137

ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation

Sep 5, 2021

31:21

138

[ML News] Stanford HAI coins Foundation Models & High-profile case of plagiarism uncovered

Aug 30, 2021

32:35

139

Fastformer: Additive Attention Can Be All You Need (Machine Learning Research Paper Explained)

Aug 27, 2021

35:29

140

PonderNet: Learning to Ponder (Machine Learning Research Paper Explained)

Aug 23, 2021

44:18

141

NeuralHash is BROKEN | How to evade Apple's detection and forge hash collisions (w/ Code)

Aug 19, 2021

8:15

142

[ML News] Nvidia renders CEO | Jurassic-1 larger than GPT-3 | Tortured Phrases reveal Plagiarism

Aug 19, 2021

26:38

143

How Apple scans your phone (and how to evade it) - NeuralHash CSAM Detection Algorithm Explained

Aug 16, 2021

50:37

144

[ML NEWS] Apple scans your phone | Master Faces beat face recognition | WALL-E is real

Aug 16, 2021

30:28

145

[ML News] AI-generated patent approved | Germany gets an analog to OpenAI | ML cheats video games

Aug 9, 2021

27:30

146

[ML News] MMO Game destroys GPUs | OpenAI quits Robotics | Today w/ guest host Sanyam Bhutani

Aug 9, 2021

13:20

147

[ML News] Facebook AI adapting robots | Baidu autonomous excavators | Happy Birthday EleutherAI

Jul 18, 2021

23:38

148

[ML News] GitHub Copilot - Copyright, GPL, Patents & more | Brickit LEGO app | Distill goes on break

Jul 13, 2021

27:00

149

Self-driving from VISION ONLY - Tesla's self-driving progress by Andrej Karpathy (Talk Analysis)

Jul 5, 2021

23:46

150

[ML News] CVPR bans social media paper promotion | AI restores Rembrandt | GPU prices down

Jul 5, 2021

18:26

151

The Dimpled Manifold Model of Adversarial Examples in Machine Learning (Research Paper Explained)

Jun 28, 2021

74:21

152

[ML News] Hugging Face course | GAN Theft Auto | AI Programming Puzzles | PyTorch 1.9 Released

Jun 25, 2021

15:51

153

XCiT: Cross-Covariance Image Transformers (Facebook AI Machine Learning Research Paper Explained)

Jun 25, 2021

35:39

154

AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control (Paper Explained)

Jun 22, 2021

34:44

155

[ML News] De-Biasing GPT-3 | RL cracks chip design | NetHack challenge | Open-Source GPT-J

Jun 22, 2021

17:01

156

Efficient and Modular Implicit Differentiation (Machine Learning Research Paper Explained)

Jun 15, 2021

32:50

157

[ML News] EU regulates AI, China trains 1.75T model, Google's oopsie, Everybody cheers for fraud.

Jun 10, 2021

16:54

158

Decision Transformer: Reinforcement Learning via Sequence Modeling (Research Paper Explained)

Jun 7, 2021

56:48

159

[ML News] Anthropic raises $124M, ML execs clueless, collusion rings, ELIZA source discovered & more

Jun 7, 2021

11:56

160

Reward Is Enough (Machine Learning Research Paper Explained)

Jun 2, 2021

35:48

161

Expire-Span: Not All Memories are Created Equal: Learning to Forget by Expiring (Paper Explained)

May 26, 2021

41:44

162

FNet: Mixing Tokens with Fourier Transforms (Machine Learning Research Paper Explained)

May 24, 2021

34:22

163

AI made this music video | What happens when OpenAI's CLIP meets BigGAN?

May 21, 2021

13:52

164

DDPM - Diffusion Models Beat GANs on Image Synthesis (Machine Learning Research Paper Explained)

May 15, 2021

54:33

165

Involution: Inverting the Inherence of Convolution for Visual Recognition (Research Paper Explained)

May 10, 2021

30:53

166

MLP-Mixer: An all-MLP Architecture for Vision (Machine Learning Research Paper Explained)

May 10, 2021

28:11

167

Is Google Translate Sexist? Gender Stereotypes in Statistical Machine Translation

May 3, 2021

12:02

168

Perceiver: General Perception with Iterative Attention (Google DeepMind Research Paper Explained)

May 3, 2021

29:35

169

Pretrained Transformers as Universal Computation Engines (Machine Learning Research Paper Explained)

May 3, 2021

34:01

170

Yann LeCun - Self-Supervised Learning: The Dark Matter of Intelligence (FAIR Blog Post Explained)

May 3, 2021

58:36

171

Multimodal Neurons in Artificial Neural Networks (w/ OpenAI Microscope, Research Paper Explained)

May 3, 2021

51:29

172

Machine Learning PhD Survival Guide 2021 | Advice on Topic Selection, Papers, Conferences & more!

May 3, 2021

16:26

173

PAIR AI Explorables | Is the problem in the data? Examples on Fairness, Diversity, and Bias.

May 3, 2021

23:32

174

DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning

May 3, 2021

48:17

175

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis (ML Research Paper Explained)

May 2, 2021

33:55

176

DINO: Emerging Properties in Self-Supervised Vision Transformers (Facebook AI Research Explained)

May 2, 2021

39:12

177

Why AI is Harder Than We Think (Machine Learning Research Paper Explained)

May 2, 2021

36:40
