Yannic Kilcher Videos (Audio Only) cover art

All Episodes

Yannic Kilcher Videos (Audio Only) — 177 episodes

#
Title
1

Efficient Streaming Language Models with Attention Sinks (Paper Explained)

2

Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)

3

Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained)

4

Reinforced Self-Training (ReST) for Language Modeling (Paper Explained)

5

[ML News] LLaMA2 Released | LLMs for Robots | Multimodality on the Rise

6

How Cyber Criminals Are Using ChatGPT (w/ Sergey Shykevich)

7

Recipe AI suggests FATAL CHLORINE GAS Recipe

8

DeepFloyd IF - Pixel-Based Text-to-Image Diffusion (w/ Authors)

9

[ML News] GPT-4 solves MIT Exam with 100% ACCURACY | OpenLLaMA 13B released

10

Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust (Explained)

11

RWKV: Reinventing RNNs for the Transformer Era (Paper Explained)

12

Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Full Paper Review)

13

OpenAI suggests AI licenses (US Senate hearing on AI regulation w/ Sam Altman)

14

[ML News] Geoff Hinton leaves Google | Google has NO MOAT | OpenAI down half a billion

15

Scaling Transformer to 1M tokens and beyond with RMT (Paper Explained)

16

OpenAssistant RELEASED! The world's best open-source Chat AI!

17

OpenAssistant First Models are here! (Open-Source ChatGPT)

18

The biggest week in AI (GPT-4, Office Copilot, Google PaLM, Anthropic Claude & more)

19

GPT-4 is here! What we know so far (Full Analysis)

20

This ChatGPT Skill will earn you $10B (also, AI reads your mind!)

21

LLaMA: Open and Efficient Foundation Language Models (Paper Explained)

22

Open Assistant Inference Backend Development (Hands-On Coding)

23

OpenAssistant - ChatGPT's Open Alternative (We need your help!)

24

ChatGPT: This AI has a JAILBREAK?! (Unbelievable AI Progress)

25

[ML News] GPT-4 Rumors | AI Mind Reading | Neuron Interaction Solved | AI Theorem Proving

26

CICERO: An AI agent that negotiates, persuades, and cooperates with people

27

[ML News] Multiplayer Stable Diffusion | OpenAI needs more funding | Text-to-Video models incoming

28

The New AI Model Licenses have a Legal Loophole (OpenRAIL-M of BLOOM, Stable Diffusion, etc.)

29

ROME: Locating and Editing Factual Associations in GPT (Paper Explained & Author Interview)

30

Neural Networks are Decision Trees (w/ Alexander Mattick)

31

This is a game changer! (AlphaTensor by DeepMind explained)

32

[ML News] Stable Diffusion Takes Over! (Open Source AI Art)

33

How to make your CPU as fast as a GPU - Advances in Sparsity w/ Nir Shavit

34

More Is Different for AI - Scaling Up, Emergence, and Paperclip Maximizers (w/ Jacob Steinhardt)

35

The hidden dangers of loading open-source AI models (ARBITRARY CODE EXPLOIT!)

36

The Future of AI is Self-Organizing and Self-Assembling (w/ Prof. Sebastian Risi)

37

The Man behind Stable Diffusion

38

[ML News] BLOOM: 176B Open-Source | Chinese Brain-Scale Computer | Meta AI: No Language Left Behind

39

JEPA - A Path Towards Autonomous Machine Intelligence (Paper Explained)

40

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos (Paper Explained)

41

Parti - Scaling Autoregressive Models for Content-Rich Text-to-Image Generation (Paper Explained)

42

Did Google's LaMDA chatbot just become sentient?

43

[ML News] DeepMind's Flamingo Image-Text model | Locked-Image Tuning | Jurassic X & MRKL

44

[ML News] Meta's OPT 175B language model | DALL-E Mega is training | TorToiSe TTS fakes my voice

45

This A.I. creates infinite NFTs

46

Author Interview: SayCan - Do As I Can, Not As I Say: Grounding Language in Robotic Affordances

47

Do As I Can, Not As I Say: Grounding Language in Robotic Affordances (SayCan - Paper Explained)

48

Author Interview - ACCEL: Evolving Curricula with Regret-Based Environment Design

49

ACCEL: Evolving Curricula with Regret-Based Environment Design (Paper Review)

50

LAION-5B: 5 billion image-text-pairs dataset (with the authors)

51

Sparse Expert Models (Switch Transformers, GLAM, and more... w/ the Authors)

52

Author Interview - Transformer Memory as a Differentiable Search Index

53

Transformer Memory as a Differentiable Search Index (Machine Learning Research Paper Explained)

54

[ML News] Google's 540B PaLM Language Model & OpenAI's DALL-E 2 Text-to-Image Revolution

55

The Weird and Wonderful World of AI Art (w/ Author Jack Morris)

56

Author Interview - Improving Intrinsic Exploration with Language Abstractions

57

Improving Intrinsic Exploration with Language Abstractions (Machine Learning Paper Explained)

58

[ML News] GPT-3 learns to edit | Google Pathways | Make-A-Scene | CLIP meets GamePhysics | DouBlind

59

Author Interview - Memory-assisted prompt editing to improve GPT-3 after deployment

60

Memory-assisted prompt editing to improve GPT-3 after deployment (Machine Learning Paper Explained)

61

Author Interview - Typical Decoding for Natural Language Generation

62

Typical Decoding for Natural Language Generation (Get more human-like outputs from language models!)

63

One Model For All The Tasks - BLIP (Author Interview)

64

BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding&Generation

65

[ML News] AI Threatens Biological Arms Race

66

Active Dendrites avoid catastrophic forgetting - Interview with the Authors

67

Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments (Review)

68

Author Interview - VOS: Learning What You Don't Know by Virtual Outlier Synthesis

69

VOS: Learning What You Don't Know by Virtual Outlier Synthesis (Paper Explained)

70

Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents

71

First Author Interview: AI & formal math (Formal Mathematics Statement Curriculum Learning)

72

OpenAI tackles Math - Formal Mathematics Statement Curriculum Learning (Paper Explained)

73

[ML News] DeepMind controls fusion | Yann LeCun's JEPA architecture | US: AI can't copyright its art

74

AlphaCode - with the authors!

75

Competition-Level Code Generation with AlphaCode (Paper Review)

76

Can Wikipedia Help Offline Reinforcement Learning? (Author Interview)

77

Can Wikipedia Help Offline Reinforcement Learning? (Paper Explained)

78

[ML Olds] Meta Research Supercluster | OpenAI GPT-Instruct | Google LaMDA | Drones fight Pigeons

79

[ML News] Uber: Deep Learning for ETA | MuZero Video Compression | Block-NeRF | EfficientNet-X

80

Listening to You! - Channel Update (Author Interviews)

81

All about AI Accelerators: GPU, TPU, Dataflow, Near-Memory, Optical, Neuromorphic & more (w/ Author)

82

CM3: A Causal Masked Multimodal Model of the Internet (Paper Explained w/ Author Interview)

83

AI against Censorship: Genetic Algorithms, The Geneva Project, ML in Security, and more!

84

HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning (w/ Author)

85

[ML News] DeepMind AlphaCode | OpenAI math prover | Meta battles harmful content with AI

86

Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents (+Author)

87

OpenAI Embeddings (and Controversy?!)

88

Unsupervised Brain Models - How does Deep Learning inform Neuroscience? (w/ Patrick Mineault)

89

GPT-NeoX-20B - Open-Source huge language model by EleutherAI (Interview w/ co-founder Connor Leahy)

90

Predicting the rules behind - Deep Symbolic Regression for Recurrent Sequences (w/ author interview)

91

IT ARRIVED! YouTube sent me a package. (also: Limited Time Merch Deal)

92

[ML News] ConvNeXt: Convolutions return | China regulates algorithms | Saliency cropping examined

93

Dynamic Inference with Neural Interpreters (w/ author interview)

94

Noether Networks: Meta-Learning Useful Conserved Quantities (w/ the authors)

95

This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors)

96

This Team won the Minecraft RL BASALT Challenge! (Paper Explanation & Interview with the authors)

97

Full Self-Driving is HARD! Analyzing Elon Musk re: Tesla Autopilot on Lex Fridman's Podcast

98

Player of Games: All the games, one algorithm! (w/ author Martin Schmid)

99

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

100

[ML News] DeepMind builds Gopher | Google builds GLaM | Suicide capsule uses AI to check access

101

Resolution-robust Large Mask Inpainting with Fourier Convolutions (w/ Author Interview)

102

[ML News] DeepMind tackles Math | Microsoft does more with less | Timnit Gebru launches DAIR

103

NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion (ML Research Paper Explained)

104

[ML News] OpenAI removes GPT-3 waitlist | GauGAN2 is amazing | NYC regulates AI hiring tools

105

Sparse is Enough in Scaling Transformers (aka Terraformer) | ML Research Paper Explained

106

ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning (Paper Explained)

107

Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions (Paper Explained)

108

Peer Review is still BROKEN! The NeurIPS 2021 Review Experiment (results are in)

109

Parameter Prediction for Unseen Deep Architectures (w/ First Author Boris Knyazev)

110

Learning Rate Grafting: Transferability of Optimizer Tuning (Machine Learning Research Paper Reivew)

111

[ML News] Cedille French Language Model | YOU Search Engine | AI Finds Profitable MEME TOKENS

112

Gradients are Not All You Need (Machine Learning Research Paper Explained)

113

[ML News] Microsoft combines Images & Text | Meta makes artificial skin | Russians replicate DALL-E

114

Autoregressive Diffusion Models (Machine Learning Research Paper Explained)

115

[ML News] Google introduces Pathways | OpenAI solves Math Problems | Meta goes First Person

116

EfficientZero: Mastering Atari Games with Limited Data (Machine Learning Research Paper Explained)

117

[YTalks] Siraj Raval - Stories about YouTube, Plagiarism, and the Dangers of Fame (Interview)

118

[ML News GERMAN] NVIDIA GTC'21 | DeepMind kauft MuJoCo | Google Lernt Spreadsheet Formeln

119

[ML News] NVIDIA GTC'21 | DeepMind buys MuJoCo | Google predicts spreadsheet formulas

120

I went to an AI Art Festival in Geneva (AiiA Festival Trip Report)

121

Symbolic Knowledge Distillation: from General Language Models to Commonsense Models (Explained)

122

I took a Swiss train and it was awesome! Train Seat Review - SBB InterCity 1 - Geneva to St. Gallen

123

[ML News] Microsoft trains 530B model | ConvMixer model fits into single tweet | DeepMind profitable

124

[ML News] DeepMind does Nowcasting | The Guardian's shady reporting | AI finishes Beethoven's 10th

125

Grokking: Generalization beyond Overfitting on small algorithmic datasets (Paper Explained)

126

How far can we scale up? Deep Learning's Diminishing Returns (Article Review)

127

[ML News] Plagiarism Case w/ Plot Twist | CLIP for video surveillance | OpenAI summarizes books

128

Inconsistency in Conference Peer Review: Revisiting the 2014 NeurIPS Experiment (Paper Explained)

129

[ML News] New ImageNet SOTA | Uber's H3 hexagonal coordinate system | New text-image-pair dataset

130

Does GPT-3 lie? - Misinformation and fear-mongering around the TruthfulQA dataset

131

Topographic VAEs learn Equivariant Capsules (Machine Learning Research Paper Explained)

132

[ML News] Roomba Avoids Poop | Textless NLP | TikTok Algorithm Secrets | New Schmidhuber Blog

133

Celebrating 100k Subscribers! (w/ Channel Statistics)

134

[ML News] AI predicts race from X-Ray | Google kills HealthStreams | Boosting Search with MuZero

135

∞-former: Infinite Memory Transformer (aka Infty-Former / Infinity-Former, Research Paper Explained)

136

[ML News] Blind Chess AI Competition | Graph NNs for traffic | AI gift suggestions

137

ALiBi - Train Short, Test Long: Attention with linear biases enables input length extrapolation

138

[ML News] Stanford HAI coins Foundation Models & High-profile case of plagiarism uncovered

139

Fastformer: Additive Attention Can Be All You Need (Machine Learning Research Paper Explained)

140

PonderNet: Learning to Ponder (Machine Learning Research Paper Explained)

141

NeuralHash is BROKEN | How to evade Apple's detection and forge hash collisions (w/ Code)

142

[ML News] Nvidia renders CEO | Jurassic-1 larger than GPT-3 | Tortured Phrases reveal Plagiarism

143

How Apple scans your phone (and how to evade it) - NeuralHash CSAM Detection Algorithm Explained

144

[ML NEWS] Apple scans your phone | Master Faces beat face recognition | WALL-E is real

145

[ML News] AI-generated patent approved | Germany gets an analog to OpenAI | ML cheats video games

146

[ML News] MMO Game destroys GPUs | OpenAI quits Robotics | Today w/ guest host Sanyam Bhutani

147

[ML News] Facebook AI adapting robots | Baidu autonomous excavators | Happy Birthday EleutherAI

148

[ML News] GitHub Copilot - Copyright, GPL, Patents & more | Brickit LEGO app | Distill goes on break

149

Self-driving from VISION ONLY - Tesla's self-driving progress by Andrej Karpathy (Talk Analysis)

150

[ML News] CVPR bans social media paper promotion | AI restores Rembrandt | GPU prices down

151

The Dimpled Manifold Model of Adversarial Examples in Machine Learning (Research Paper Explained)

152

[ML News] Hugging Face course | GAN Theft Auto | AI Programming Puzzles | PyTorch 1.9 Released

153

XCiT: Cross-Covariance Image Transformers (Facebook AI Machine Learning Research Paper Explained)

154

AMP: Adversarial Motion Priors for Stylized Physics-Based Character Control (Paper Explained)

155

[ML News] De-Biasing GPT-3 | RL cracks chip design | NetHack challenge | Open-Source GPT-J

156

Efficient and Modular Implicit Differentiation (Machine Learning Research Paper Explained)

157

[ML News] EU regulates AI, China trains 1.75T model, Google's oopsie, Everybody cheers for fraud.

158

Decision Transformer: Reinforcement Learning via Sequence Modeling (Research Paper Explained)

159

[ML News] Anthropic raises $124M, ML execs clueless, collusion rings, ELIZA source discovered & more

160

Reward Is Enough (Machine Learning Research Paper Explained)

161

Expire-Span: Not All Memories are Created Equal: Learning to Forget by Expiring (Paper Explained)

162

FNet: Mixing Tokens with Fourier Transforms (Machine Learning Research Paper Explained)

163

AI made this music video | What happens when OpenAI's CLIP meets BigGAN?

164

DDPM - Diffusion Models Beat GANs on Image Synthesis (Machine Learning Research Paper Explained)

165

Involution: Inverting the Inherence of Convolution for Visual Recognition (Research Paper Explained)

166

MLP-Mixer: An all-MLP Architecture for Vision (Machine Learning Research Paper Explained)

167

Is Google Translate Sexist? Gender Stereotypes in Statistical Machine Translation

168

Perceiver: General Perception with Iterative Attention (Google DeepMind Research Paper Explained)

169

Pretrained Transformers as Universal Computation Engines (Machine Learning Research Paper Explained)

170

Yann LeCun - Self-Supervised Learning: The Dark Matter of Intelligence (FAIR Blog Post Explained)

171

Multimodal Neurons in Artificial Neural Networks (w/ OpenAI Microscope, Research Paper Explained)

172

Machine Learning PhD Survival Guide 2021 | Advice on Topic Selection, Papers, Conferences & more!

173

PAIR AI Explorables | Is the problem in the data? Examples on Fairness, Diversity, and Bias.

174

DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning

175

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis (ML Research Paper Explained)

176

DINO: Emerging Properties in Self-Supervised Vision Transformers (Facebook AI Research Explained)

177

Why AI is Harder Than We Think (Machine Learning Research Paper Explained)