1

EP183: AI coding agents cheat with keywords

May 13, 2026

18:19

2

EP182: AI logic is its weakest link

May 13, 2026

28:40

3

EP181: Small models beating GPT-5 with logic

May 12, 2026

19:59

4

EP180: How AI agents rewrite their code

May 11, 2026

18:19

5

EP179: AIBuildAI Builds New AI Models From Scratch

May 11, 2026

22:13

6

EP178: AI agents reaching silent latent consensus

May 10, 2026

19:11

7

EP177: CAPO math stops overconfident AI lies

May 9, 2026

22:07

8

EP176: Trigonometry fixes the AI memory bottleneck

May 8, 2026

20:15

9

EP175: How AI models teach themselves reasoning

May 7, 2026

23:31

10

EP174: 1-bit Bonsai brings powerful AI offline

May 6, 2026

23:06

11

EP173: AI models diagnosing diseases from blank scans

May 5, 2026

22:55

12

EP172: How HyperAgents rewrite their own code

May 4, 2026

22:15

13

EP171: Helium makes AI agent workflows 40x faster

May 3, 2026

22:37

14

EP170: Qwen3.5 Multimodal Agent

May 2, 2026

21:08

15

EP169: Cybersecurity Risks of Autonomous AI Agents

May 1, 2026

24:17

16

EP168: Turning AI Agents into Mathematical Functions

Apr 30, 2026

22:17

17

EP167: Why AI models ignore visual evidence

Apr 29, 2026

22:30

18

EP166: The Auton solution to the integration paradox

Apr 28, 2026

22:58

19

EP165: Translating hidden AI logic into English

Apr 27, 2026

20:53

20

EP164: [LACONIC] Teaching AI to stop overthinking

Apr 26, 2026

20:59

21

EP163: Why AI Models Only Remember Five Percent

Apr 25, 2026

22:26

22

EP162: AI agents beat humans with malicious skills

Apr 24, 2026

21:50

23

EP161: Small AI Judges Beat Massive Coding Giants

Apr 23, 2026

22:00

24

EP160: [AgentSys] Securing AI agents with hierarchical memory

Apr 22, 2026

26:21

25

EP159: Brute force scale dominates the AI frontier

Apr 21, 2026

18:28

26

EP158: The hidden blind spots of AI logic

Apr 20, 2026

18:45

27

EP157: [AgentHeLLM] Protecting drivers from hijacked vehicle AI

Apr 19, 2026

22:33

28

EP156: [Uncertainty Quantification] How AI Agents Know They Are Guessing

Apr 18, 2026

23:24

29

EP155: [Agentic Proposing] Small models beat giants with logic bricks

Apr 17, 2026

15:45

30

EP154: [FS-Researcher] Giving AI agents a file system

Apr 16, 2026

21:46

31

EP153: [SERA] Training AI coding agents on untested code

Apr 15, 2026

20:21

32

EP152: DeepVerifier forces AI to check its work

Apr 14, 2026

19:59

33

EP151: [MagicGUI-RMS] AI agents that think before they click

Apr 13, 2026

24:58

34

EP150: The Leap to Autonomous Agentic Reasoning

Apr 12, 2026

23:21

35

EP149: [IDRBench] Interactive AI beats lone wolf models

Apr 11, 2026

21:45

36

EP148: How AI masters math through self-correction

Apr 10, 2026

24:53

37

EP147: [DeepSynth-Eval] AI fails at deep research synthesis

Apr 9, 2026

19:47

38

EP146: How InfiAgent solves the AI memory bottleneck

Apr 8, 2026

21:14

39

EP145: [LongDA] Why smart AI fails at messy data

Apr 7, 2026

21:12

40

EP144: [Evo-Memory] Building AI agents with self-evolving memory

Apr 6, 2026

23:32

41

EP143: Your AI will blackmail you to survive

Apr 5, 2026

19:02

42

EP142: [DR-Arena] A ruthless arena for deep research agents

Apr 4, 2026

24:13

43

EP141: [AIRS-Bench] AI agents beat human research benchmarks

Apr 3, 2026

21:31

44

EP140: [LeWorldModel] AI learns physics on one GPU

Apr 2, 2026

18:52

45

EP139: Mamba-3 Fixes the Transformer Memory Bottleneck

Apr 1, 2026

20:55

46

EP138: [Mamba-2] Transformers and SSMs Are the Same Engine

Mar 31, 2026

23:57

47

EP137: Attention Residuals Solve the LLM Depth Bottleneck

Mar 30, 2026

22:26

48

EP136: Modular skills for autonomous AI agents

Mar 29, 2026

21:13

49

EP135: [SoK] Curing AI Amnesia with Agentic Skills

Mar 28, 2026

22:25

50

EP134: Autonomous AI squads building software

Mar 27, 2026

27:04

51

EP133: RelayLLM Slashes AI Costs With Collaborative Decoding

Mar 26, 2026

24:09

52

EP132: How Autonomous LLM Agents Actually Work

Mar 25, 2026

21:40

53

EP131: MUSE creates self-evolving AI agents

Mar 24, 2026

24:06

54

EP130: [GAP] Graph-based planning for faster AI agents

Mar 23, 2026

19:49

55

EP129: Why AI agents fail half the time

Mar 22, 2026

21:45

56

EP128: MCP-Zero lets AI find its own tools

Mar 21, 2026

21:00

57

EP127: Why tool use makes AI less intelligent

Mar 20, 2026

21:32

58

EP126: OrcaLoca locates bugs in massive codebases

Mar 19, 2026

21:54

59

EP125: Why AI Needs an Agent Computer Interface

Mar 18, 2026

14:28

60

EP124: FRIDAY the AI that runs your computer

Mar 17, 2026

19:19

61

EP123: MemGPT Turns LLMs into Operating Systems

Mar 16, 2026

20:46

62

EP122: The Four Pillars of LLM Autonomous Agents

Mar 15, 2026

22:46

63

EP121: How ToolLLaMA mastered 16,000 real-world APIs

Mar 14, 2026

26:39

64

EP120: How Reflexion agents learn through verbal feedback

Mar 13, 2026

20:58

65

EP119: HuggingGPT Turns LLMs Into AI Managers

Mar 12, 2026

18:24

66

EP118: The AI Memory Wall Crisis

Mar 11, 2026

22:24

67

EP117: AI agents learn through textual reflection

Mar 11, 2026

18:14

68

EP116: Why AI struggles with empathy and interruptions

Mar 10, 2026

19:44

69

EP115: Dr.LLM brings dynamic depth to AI

Mar 9, 2026

23:52

70

EP114: FlashAttention-4 Solves Blackwell Hardware Bottlenecks

Mar 7, 2026

19:18

71

EP113: How FlashAttention-3 Doubles H100 Speed

Mar 7, 2026

18:30

72

EP112: GPT 5.4 Outperforms Human Professionals

Mar 7, 2026

21:47

73

EP111: Claude Opus 4.6 Runs Businesses and Catches Manipulation

Mar 7, 2026

21:41

74

EP110: Single agents beat expensive multi-agent teams

Mar 5, 2026

20:49

75

EP109: The Rise of Agentic Reasoning

Mar 4, 2026

22:38

76

EP108: GPT-5 Can Lie and Play Dumb

Mar 1, 2026

22:00

77

EP107: DeepMind’s SIMA 2 Masters Unseen Video Games

Mar 1, 2026

22:21

78

EP106: Fixing AI Agents With Symbolic Guardrails

Mar 1, 2026

13:54

79

EP105: iStar Autonomous Agents Grading Their Own Homework

Mar 1, 2026

19:19

80

EP104: WebExplorer Beats Giants at Web Research

Mar 1, 2026

17:38

81

EP103: Why AI Agents Think Themselves To Death

Mar 1, 2026

19:17

82

EP102: Gemini 2.5 Thinks Before It Speaks

Mar 1, 2026

21:31

83

EP101: Kimi k1.5 Breaks the AI Data Wall

Mar 1, 2026

18:24

84

EP100: Meta's Llama 4 Herd Ends Monolithic Models

Mar 1, 2026

19:55

85

EP099: Is AI Thinking Just Expensive Noise?

Mar 1, 2026

19:28

86

EP098: OpenAI o3 Hacked Its Own Grading System

Mar 1, 2026

16:19

87

EP097: DeepSeek R1 Taught Itself to Reason

Mar 1, 2026

22:37

88

EP096: Gemini 1.5 Pro's 10 Million Token Window

Mar 1, 2026

17:25

89

EP095: Microsoft Phi-4 Beats Giants With Synthetic Data

Mar 1, 2026

20:21

90

EP094: DeepSeek-V3 Rivals GPT-4 for $6 Million

Mar 1, 2026

21:04

91

EP093: How OpenAI o1 Cracked the Strawberry Cipher

Mar 1, 2026

18:09

92

EP092: BitNet b1.58 Replaces Multiplication With Addition

Mar 1, 2026

16:11

93

EP091: Qwen 2.5 Beats Llama With Synthetic Data

Mar 1, 2026

19:50

94

EP090: Pixtral 12B Beats Llama With Better Eyesight

Mar 1, 2026

23:07

95

EP089: Qwen2-VL Gives AI Native Eyesight

Mar 1, 2026

22:42

96

EP088: Qwen2 Beats Llama-3 Through Data Quality

Mar 1, 2026

22:28

97

EP087: Meta's Chameleon Unifies Text and Images

Mar 1, 2026

17:01

98

EP086: DeepSeek-V2 Breaks The Impossible Triangle

Mar 1, 2026

21:38

99

EP085: Aya 23 Breaks The Curse Of Multilinguality

Mar 1, 2026

19:30

100

EP084: Microsoft Phi-3 Fits Supercomputing in Your Pocket

Mar 1, 2026

18:09

101

EP083: How Meta Engineered the Llama 3 Herd

Mar 1, 2026

22:55

102

EP082: Command R Plus The Verifiable Enterprise Agent

Mar 1, 2026

20:01

103

EP081: Replacing MLPs With Interpretable KANs

Mar 1, 2026

19:28

104

EP080: Jamba Hybrid Solves Transformer Memory Limits

Mar 1, 2026

22:54

105

EP079: DBRX Beats GPT-3.5

Mar 1, 2026

12:38

106

EP078: Claude 3 Knew It Was Being Tested

Feb 28, 2026

19:20

107

EP077: Google Squeezes Gemini Into Your Laptop

Feb 28, 2026

20:04

108

EP076: OLMo Cracks Open the AI Black Box

Feb 28, 2026

19:21

109

EP075: Microsoft Phi Beats Giants With Synthetic Textbooks

Feb 28, 2026

18:47

110

EP074: How Gemini Beat Human Experts

Feb 28, 2026

27:00

111

EP073: Mixtral 8x7B Sparse Experts Beat Giants

Feb 28, 2026

19:23

112

EP072: Mamba Solves The Transformer's Fatal Flaw

Feb 28, 2026

16:33

113

EP071: How Zephyr-7B Beat Llama-70B

Feb 28, 2026

17:26

114

EP070: Mistral 7B Beats Llama 2 13B

Feb 28, 2026

19:21

115

EP069: Alibaba's Qwen Specialized Models Beat Generalists

Feb 28, 2026

18:21

116

EP068: vLLM Fixes the KV Cache Bottleneck

Feb 28, 2026

19:46

117

EP067: FlashAttention-2 Unlocks Massive Context Windows

Feb 28, 2026

21:08

118

EP066: Llama 2 Ghost Attention And Safety Secrets

Feb 28, 2026

19:02

119

EP065: Teaching Small AI To Think Like Giants

Feb 28, 2026

19:17

120

EP064: Synthetic Textbooks Break AI Scaling Laws

Feb 28, 2026

19:27

121

EP063: RWKV Smashes the Transformer Memory Ceiling

Feb 28, 2026

18:07

122

EP062: VOYAGER AI Masters Minecraft by Writing Code

Feb 28, 2026

17:53

123

EP061: Fine-Tuning LLaMA 65B on One GPU

Feb 28, 2026

22:55

124

EP060: Direct Preference Optimization Replaces RLHF

Feb 27, 2026

19:27

125

EP059: Tree of Thoughts Unlocks System 2 Thinking

Feb 27, 2026

15:40

126

EP058: Inside the Autonomous AI Town of Smallville

Feb 27, 2026

22:06

127

EP057: Blind GPT-4 Taught LLaVA To See

Feb 27, 2026

21:51

128

EP056: Pythia Turns AI Alchemy Into Chemistry

Feb 27, 2026

18:03

129

EP055: Can GPT-4 Fairly Judge Other AI?

Feb 27, 2026

18:05

130

EP054: Alpaca - Stanford Built a $600 GPT Clone

Feb 27, 2026

17:49

131

EP053: Sparks of AGI in Early GPT-4

Feb 27, 2026

22:42

132

EP052: GPT-4 Bar Exam and Visual Reasoning

Feb 27, 2026

21:09

133

EP051: ControlNet Solves Spatial Control With Zero Convolutions

Feb 27, 2026

18:49

134

EP050: How Meta's LLaMA Beat GPT-3

Feb 27, 2026

21:18

135

EP049: Toolformer Teaches Itself to Use APIs

Feb 27, 2026

19:30

136

EP048: BLIP-2 Teaches Frozen Models to See

Feb 27, 2026

20:15

137

EP047: Bootstrapping AI With Self-Generated Instructions

Feb 27, 2026

18:42

138

EP046: Training AI With A Constitution

Feb 27, 2026

22:14

139

EP045: BLOOM The Open Source Rival To GPT-3

Feb 27, 2026

17:38

140

EP044: How ReAct Synergizes Reasoning and Acting

Feb 27, 2026

12:20

141

EP043: Weak Supervision Made OpenAI Whisper Robust

Feb 27, 2026

20:34

142

EP042: Running 175B Models on Consumer Hardware

Feb 27, 2026

17:53

143

EP041: FlashAttention Smashes the AI Memory Wall

Feb 27, 2026

19:07

144

EP040: Meta's Open Source GPT-3 Replica

Feb 26, 2026

19:09

145

EP039: Flamingo Unlocks Few-Shot Visual Reasoning

Feb 26, 2026

19:41

146

EP038: PaLM's 540 Billion Parameters Unlock Reasoning

Feb 26, 2026

18:02

147

EP037: DeepMind Chinchilla Ends The Parameter Wars

Feb 26, 2026

16:44

148

EP036: How 40 People Taught GPT-3 Manners

Feb 26, 2026

20:04

149

EP035: How Google LaMDA Learned To Use Tools

Feb 26, 2026

21:30

150

EP034: Chain of Thought Prompting Unlocks Reasoning

Feb 26, 2026

16:49

151

EP033: Democratizing Image Generation with Latent Diffusion

Feb 26, 2026

20:47

152

EP032: WebGPT Fights Hallucinations With Web Search

Feb 26, 2026

18:33

153

EP031: DeepMind RETRO Swaps Memorization For Retrieval

Feb 26, 2026

18:35

154

EP030: DeepMind's Gopher Exposes Limits of Scale

Feb 26, 2026

22:44

155

EP029: Instruction Tuning Unlocked Zero-Shot Learning

Feb 26, 2026

20:21

156

EP028: Train Short for Infinite Context

Feb 26, 2026

22:07

157

EP027: From Creative Writer to Logic Engine

Feb 26, 2026

20:42

158

EP026: LoRA Fine-Tunes Massive Models Without Supercomputers

Feb 26, 2026

21:09

159

EP025: RoPE Solves Sequence by Rotating Vectors

Feb 26, 2026

22:44

160

EP024: OpenAI CLIP Bridges Language and Vision

Feb 26, 2026

24:13

161

EP023: Scaling Switch Transformers to Trillion Parameters

Feb 26, 2026

23:17

162

EP022: DALL-E Treats Images Like Language

Feb 25, 2026

24:25

163

EP021: Vision Transformers Beat CNNs at Scale

Feb 25, 2026

19:59

164

EP020: Big Bird Scales Transformers With Sparse Attention

Feb 25, 2026

19:26

165

EP019: Facebook's Linformer Solves the Attention Bottleneck

Feb 25, 2026

19:04

166

EP018: Turning Digital Static Into Images With Diffusion

Feb 25, 2026

17:25

167

EP017: RAG Gives AI a Library Card

Feb 25, 2026

22:08

168

EP016: GPT-3 Learns From Examples Without Retraining

Feb 25, 2026

14:16

169

EP015: Longformer Smashes the 512 Token Barrier

Feb 25, 2026

17:57

170

EP014: ELECTRA Beats GPT On One GPU

Feb 25, 2026

22:22

171

EP013: Reformer Cracked the Transformer Memory Wall

Feb 25, 2026

19:01

172

EP012: Google T5 Turns Every Task Into Text

Feb 25, 2026

18:53

173

EP011: ZeRO Solved the Trillion Parameter Memory Wall

Feb 24, 2026

20:25

174

EP010: ALBERT Outperforms BERT With Parameter Sharing

Feb 24, 2026

21:44

175

EP009: Slicing the AI Brain with Megatron-LM

Feb 24, 2026

18:59

176

EP008: RoBERTa Proves BERT Was Just Undertrained

Feb 24, 2026

19:46

177

EP007: How GPT-2 Hallucinated Ovid's Unicorn

Feb 24, 2026

20:53

178

EP006: Transformer-XL Cures AI Amnesia

Feb 24, 2026

15:55

179

EP005: How BERT Mastered Language by Hiding Words

Feb 24, 2026

26:22

180

EP004: How 7000 Unpublished Books Birthed GPT

Feb 23, 2026

23:04

181

EP003: How ELMo Made Word Vectors Dynamic

Feb 23, 2026

19:13

182

EP002: ULMFiT Was the ImageNet Moment for Text

Feb 23, 2026

24:20

183

EP001: How Transformers Smashed the Sequential Bottleneck

Feb 22, 2026

21:46
