PodParley - Discover, Search, and Explore Podcasts

1

Inside Inkling’s 1T MoE Architecture and 1M Token Context

Jul 16, 2026

50:06

2

NVIDIA Nemotron Labs: Why Open Models are Dominating Enterprise AI

Jul 15, 2026

37:17

3

OpenAI GPT-Live Explained: Full-Duplex Voice Meets AI Agents

Jul 12, 2026

25:12

4

GPT-5.6 Technical Deep Dive: Multi-Agent Parallelism, "Iris-Alpha" Architecture, and the Notice-Act Gap

Jul 9, 2026

41:13

5

Grok 4.5, the $60B Cursor Acquisition, and the Fight for the AI Moat

Jul 9, 2026

28:46

6

Hotwiring Apple's Neural Engine

Jul 7, 2026

40:29

7

2026 LLM Inference Deep Dive: Solving the Memory Bandwidth & Interconnect Bottleneck | Neural Intel

Jun 26, 2026

37:19

8

Engineering Persistence: How MLX-Engine v1.8.5 Solves the KV Cache Rewind Problem

Jun 22, 2026

43:03

9

Claude Fable 5 Isn’t Just a Better Model: It’s a New AI Runtime

Jun 10, 2026

42:45

10

The EML Operator: One Primitive to Rule All Mathematics

May 13, 2026

33:17

11

OpenAI MRC, SRv6, and the Architecture of Frontier AI Supercomputers

May 8, 2026

44:45

12

Inside the Machine: Training GPT-5, the Memory Wall, and the Math of MoE

May 1, 2026

45:18

13

DeepSeek-V4: The Million-Token Efficiency Leap | Open Source SOTA

Apr 27, 2026

8:14

14

Breaking the Quadratic Bottleneck with DeepSeek-V4’s Hybrid Attention

Apr 27, 2026

56:41

15

Claude Desktop’s Silent Sandbox Bypass: The Undocumented Browser Bridge

Apr 24, 2026

7:54

16

Forensic Audit of Anthropic’s Native Messaging Backdoor

Apr 24, 2026

37:14

17

The $60 Billion Synergy: Architecting the SpaceX + Cursor AI "Colossus" | Neural Intel Podcast

Apr 24, 2026

40:48

18

The Jackrong Playbook: Mastering Claude 4.6 Opus Distillation with Unsloth and LoRA

Apr 20, 2026

23:24

19

Inside the Claude Opus 4.7 Orchestration Layer - Deferred Tools & Agentic Code

Apr 17, 2026

29:38

20

Electrons to Tokens: The Technical Architecture of Nvidia’s AI Monopoly

Apr 16, 2026

37:45

21

Hermes Agent’s Memory Architecture and the Future of Agentic RL

Apr 14, 2026

23:54

22

200 Gigawatts or Bust: Dylan Patel on the Engineering Reality of AGI Scaling

Apr 12, 2026

52:50

23

The Muse Spark Revolution: Dissecting Meta's 2026 Architectural Pivot & The Triad of Truth | Neural Intel Podcast

Apr 9, 2026

33:04

24

Synaptic Persistence and Mushroom Body Neurogenesis: The Architecture of Metamorphic Memory

Apr 9, 2026

37:24

25

Engineering Sovereign Knowledge Bases with Andrej Karpathy’s Automated Architect

Apr 7, 2026

34:45

26

The Mercor AI Breach: National Security Crisis or a Wake-Up Call for the AI Industry?

Apr 3, 2026

18:52

27

BREAKING: Massive Mercor AI Data Breach - SOTA Training Data Leaked from Meta, Apple, & Amazon

Apr 3, 2026

6:12

28

Did Anthropic Just Hand the Keys to AI Coding to Everyone? The Huge Claude Code Leak Explained

Apr 2, 2026

7:03

29

The Claude Code Leak: Decoding Anthropic’s Self-Healing Memory and Secret "KAIROS" Agent

Apr 2, 2026

33:10

30

Is AI Censorship Over? The G0DM0D3 "Liberated Chat" Breakthrough

Mar 29, 2026

7:09

31

Is Traditional Computing Dead? NVIDIA's Jensen Huang on the "iPhone of Tokens"

Mar 26, 2026

7:20

32

The Bio-Computer Architecture: Declassified CIA Mechanics for Synthetic Consciousness

Mar 25, 2026

25:44

33

The End of the Human Bottleneck: Andrej Karpathy on Auto-Research and Recursive AI

Mar 24, 2026

38:21

34

Is Open Source Dead? Inside the Cursor Composer 2 vs. Kimi License Controversy

Mar 22, 2026

18:16

35

Is Residual Scaling Obsolete? Introducing Attention Residuals

Mar 17, 2026

9:43

36

The Sequence-Depth Breakthrough: Inside Kimi Team's Attention Residuals

Mar 16, 2026

53:44

37

Beyond the Prompt: Architecture of the Qwen-Agent Ecosystem and Qwen3.5

Mar 12, 2026

42:59

38

Beyond the Chatbot: Engineering "Forever-Agents" with Hermes Agent and OpenClaw

Mar 10, 2026

44:03

39

Nanochat: How Karpathy Automated AI Evolution with NVIDIA ClimbMix

Mar 8, 2026

32:48

40

1 Million Tokens: Breakthrough or Marketing Stunt? The GPT-5.4 Technical Deep Dive

Mar 6, 2026

43:20

41

Qwen 3.5: Exodus, Restructuring, Betrayal, and the Future of Chinese AI

Mar 4, 2026

33:30

42

The Mac mini Guide to OpenClaw and Local AI

Mar 2, 2026

30:56

43

The Neural Intel Op Ed: Engineering a Post-Natural Language for the AI Era

Mar 1, 2026

34:42

44

Andrej Karpathy on the "Claw" Revolution: Are AI Agents Obsolete?

Feb 28, 2026

31:52

45

10 Million Tokens and Beyond: Why Recursive AI is the Next Scaling Frontier

Feb 21, 2026

38:16

46

The Grok 4.20 Manifesto: Multi-Agent Logic and the Quest for Unfiltered Truth

Feb 18, 2026

15:17

47

The End of Memory Bottlenecks: How Fiber Optics and Ganged Flash Power Trillion-Parameter Models

Feb 16, 2026

16:18

48

Interview with Dario Amodei from Anthropic: Inside the $100B "Big Blob of Compute" & The 2030 AGI Certainty

Feb 15, 2026

36:31

49

The OpenClaw Saga: Peter Steinberger on Self-Modifying AI and the Age of the Lobster

Feb 15, 2026

34:46

50

Inside the 180 Billion HKD Breakthrough: How MiniMax M2.5 Scaled Agentic RL

Feb 14, 2026

35:46

51

The 744B Parameter Giant: How GLM-5 and Domestic Chips Redefine the Global AI Order

Feb 12, 2026

12:53

52

The OpenClaw Security Crisis: Can We Control Autonomous AI Swarms?

Feb 4, 2026

29:59

53

Is Consciousness Only in Your Head?

Jan 29, 2026

13:10

54

Methods and Applications of Parametric Sensitivity Analysis

Jan 22, 2026

26:56

55

The Architecture of Choice: Scaling MIT’s Decision Algorithms

Jan 19, 2026

50:28

56

The Logographic Advantage: How China’s Ancient Language is Powering Next-Gen AI | Neural Intel Deep Dive

Jan 9, 2026

29:53

57

Deep Learning Deep Dive: From Neural Networks to Differentiable Programming

Jan 7, 2026

30:15

58

The Hidden Evolution: Implicit Reinforcement Learning and the Future of Iterative AI

Jan 5, 2026

34:40

59

The Math of Stability: DeepSeek-AI’s mHC and the Evolution of Macro-Architecture

Jan 1, 2026

28:44

60

MoE Giants: Decoding the 670 Billion Parameter Showdown Between DeepSeek V3 and Mistral Large

Dec 25, 2025

30:18

61

GLM-4.7 Deep Dive: 358B Parameters, Agentic Reasoning, and the Future of Open Weights

Dec 24, 2025

33:32

62

Beyond the Exam Room: Stress-Testing Clinical AI with Medmarks v0.1

Dec 23, 2025

27:12

63

ANDREJ KARPATHY 2025 LLM Review: RLVR, Jagged Intelligence, & The Vibe Coding Revolution

Dec 21, 2025

35:23

64

The Automated Karpathy Recipe: Master Neural Network Debugging with neural_net_checklist

Dec 18, 2025

13:05

65

Nemotron 3 Nano: The Hybrid Mamba-MoE Model Driving Efficient, 1M-Token Agentic AI

Dec 16, 2025

40:38

66

Olmo 3: Unpacking the Fully Open LLM Flow (Dolma 3, OlmoRL, & State-of-the-Art Reasoning)

Dec 14, 2025

13:14

67

The Code Red Gambit: GPT-5.2's Mega-Agent Architecture

Dec 13, 2025

34:51

68

Fara-7B: The 7B Agentic SLM Redefining On-Device CUA Performance

Dec 10, 2025

16:29

69

The AGI Frontier: DeepMind’s Decade of Breakthroughs, from DQN and AlphaZero to Solving Protein Folding

Dec 7, 2025

33:03

70

INTELLECT-3: Scaling Agentic RL and MoE to SOTA Performance with prime-rl and 512 H200s

Dec 4, 2025

16:45

71

Kimi Founder Yang Zhilin on K2, Agentic LLMs, & AGI: The Beginning of Infinity | Scaling & Innovation Strategy

Nov 30, 2025

20:00

72

Ilya Sutskever on AI: Transitioning from Scaling to Research, Generalization, and the Future of Superintelligence

Nov 26, 2025

34:59

73

Neuromorphic Computing: Principles and Architecture

Nov 23, 2025

11:57

74

Gemini 3 Pro Release Review: Benchmarks, Generative UI, Deep Think Mode, and Google Antigravity

Nov 20, 2025

17:10

75

DeepSeek-OCR: Contexts Optical Compression

Nov 16, 2025

14:00

76

LLM Gambling Addiction: Behavioral and Neural Mechanisms

Nov 10, 2025

16:32

77

Glyph: Visual-Text Compression for Scaling Context Windows

Nov 2, 2025

15:58

78

Continual Learning via Sparse Memory Finetuning

Oct 26, 2025

14:07

79

Andrej Karpathy on AI, Intelligence, and Education

Oct 21, 2025

36:19

80

Untangling the xAI-OpenAI Legal War: Trade Secrets and Antitrust

Oct 4, 2025

18:09

81

IBM Granite 4.0: Hybrid Mamba/Transformer Breakthrough for Enterprise LLMs?

Oct 3, 2025

14:03

82

Anthropic's Claude Sonnet 4.5: The New Coding Standard?

Sep 30, 2025

16:08

83

GPT-5-Codex: Agentic Coding and OpenAI's Evolution

Sep 22, 2025

13:40

84

Grok 4 Fast: Speed, Efficiency, and Application Review

Sep 22, 2025

14:52

85

How to Read a Research Paper

Sep 14, 2025

7:15

86

The Science of Sampling

Sep 14, 2025

6:58

87

GPT-5 Revisited: Progress, Performance, and User Experience

Sep 12, 2025

13:49

88

Thyme: Autonomous AI that Sees, Codes and Solves Problems

Sep 11, 2025

41:04

89

YaRN: Extending LLM Context Windows Efficiently

Sep 10, 2025

6:27

90

Ilya Sutskever's AI Vision: From Deep Learning Dogmas to Safe Superintelligence

Sep 9, 2025

49:45

91

Thyme: Think Beyond Images with Code-Executing MLLMs

Sep 7, 2025

7:50

92

What did Ilya see?

Sep 6, 2025

49:45

93

Meta's AI Ambitions: Turbulence in Superintelligence Labs

Sep 5, 2025

15:20

94

Hierarchical Reasoning: Bigger Isn't Always Better

Sep 4, 2025

7:35

95

Prime Collective Communications Library: A Technical Report

Sep 3, 2025

76:03

96

Prime Collective Communications Library: A Technical Report

Sep 3, 2025

7:24

97

MetaStone-S1: Reflective Generative AI for Test-Time Scaling

Sep 2, 2025

6:52

98

MetaStone-S1: Reflective Generative AI for Test-Time Scaling

Sep 2, 2025

45:03

99

ToonComposer: AI-Assisted Cartoon Production and Post-Keyframing

Sep 1, 2025

48:54

100

ToonComposer: AI-Assisted Cartoon Production and Post-Keyframing

Sep 1, 2025

7:03

101

Triton: Language, Compiler, and Optimization for AI Workloads

Aug 31, 2025

8:40

102

Triton: Language, Compiler, and Optimization for AI Workloads

Aug 30, 2025

78:43

103

Dynamic Fine-Tuning: Elevating LLM Generalization

Aug 29, 2025

48:57

104

Lessons from a Chimp: AI Scheming and Ape Language

Aug 28, 2025

7:15

105

Deciphering Reinforcement Learning for Language Models

Aug 28, 2025

38:03

106

STREAM3R: Scalable Streaming 3D Reconstruction with Causal Transformer

Aug 28, 2025

7:10

107

Yan: Interactive Video Generation Framework

Aug 27, 2025

59:48

108

Lessons from a Chimp: AI Scheming and Ape Language

Aug 26, 2025

78:00

109

NextStep-1: Unified Multi-modal Generation

Aug 26, 2025

37:12

110

STREAM3R: Scalable Streaming 3D Reconstruction with Causal Transformer

Aug 24, 2025

42:28

111

GLM-V: Advancing Multimodal Reasoning with RLCS

Aug 23, 2025

35:25

112

DINOv3: Self-Supervised Vision Foundation Models

Aug 22, 2025

48:48

113

GPT-5 and Grok 4: Altman vs Musk

Aug 21, 2025

33:44

114

Hugging Face Hub Storage: Xet vs. Git LFS

Aug 20, 2025

13:39

115

Channel-Wise MLPs Boost RCN Generalization

Aug 19, 2025

46:27

116

Fine-Tuning Custom Embedding Models for Enhanced Retrieval Performance

Aug 18, 2025

37:58

117

AdLlama: Boosting Ad CTR with Reinforcement Learning

Aug 17, 2025

45:00

118

Machine Learning: Models, Algorithms, and Reinforcement Learning

Aug 17, 2025

58:37

119

Mixture-of-Recursions: Adaptive Computation for Language Models

Aug 16, 2025

39:47

120

Operator-Based Machine Intelligence: A Hilbert Space Framework

Aug 15, 2025

75:17

121

Meta CLIP 2: A Worldwide Scaling Recipe

Aug 13, 2025

53:26

122

In-Context Learning: Implicit Weight Dynamics

Aug 12, 2025

41:53

123

GLM-4.5: Open Agentic, Reasoning, and Coding Foundation Models

Aug 11, 2025

67:17

124

RLVMR: Verifiable Meta-Reasoning for Long-Horizon Agents

Aug 10, 2025

32:58

125

CoT-Self-Instruct: High-Quality Synthetic Prompt Generation

Aug 9, 2025

39:44

126

GPT-5: Hype, Reality and the Future of AI

Aug 9, 2025

48:10

127

Seed-Prover: Advancing Automated Mathematical Reasoning with Formal Verification

Aug 8, 2025

40:48

128

Self-Evolving Agents: A Comprehensive Survey

Aug 7, 2025

63:13

129

High-Precision W and Z Boson Mass Measurement at CMS

Aug 7, 2025

63:08

130

Falcon-H1: Hybrid-Head LLMs for Efficiency and Performance

Aug 6, 2025

53:45

131

ASI-ARCH: AI-Driven Scientific Discovery for Neural Architecture

Aug 4, 2025

40:33

132

In-Context Learning: Implicit Weight Dynamics

Aug 3, 2025

41:53

133

Qwen3: Unifying Reasoning and Efficiency in LLMs

Aug 2, 2025

60:26

134

Group Sequence Policy Optimization for LLMs

Aug 1, 2025

32:58

135

Reinforcement Learning: Advancements, Applications, and Challenges

Jul 31, 2025

56:50

136

SPIRAL: Self-Play for Reasoning in Games

Jul 29, 2025

38:46

137

Qwen3-Coder: Agentic Coding and Model Capabilities

Jul 28, 2025

53:20

138

Hierarchical Reasoning Model: Brain-Inspired AI for Complex Tasks

Jul 27, 2025

39:15

139

Local LLM Solutions for Mac Silicon: Llama.cpp and LM Studio

Jul 26, 2025

33:43

140

Kimi K2: Open Agentic Intelligence and Applications

Jul 25, 2025

14:42

141

CARTRIDGES: Efficient Context for LLMs

Jul 24, 2025

15:55

142

Prompt Baking: Embedding LLM Behavior in Weights

Jul 23, 2025

16:00

143

Massistant: Chinese Mobile Forensic Tooling Revealed

Jul 22, 2025

25:37

144

Unexpected Military Roots of Digital Computing and Research

Jul 17, 2025

46:32

145

The 2025 AI Landscape: Progress and Outlook

Jul 16, 2025

58:47

146

The Dynamics of Neural Attention

Jul 15, 2025

26:17

147

Consciousness and Reality according to the CIA: Gateway

Jul 14, 2025

69:38

148

Military Roots of Digital Computing and Research

Jul 13, 2025

51:19

149

Accelerating Mobile AI with ExecuTorch and KleidiAI: Revisited

Jul 12, 2025

29:42

150

State-Adaptive Regularization for Offline Reinforcement Learning

Jul 11, 2025

39:32

151

Nash Learning from Human Feedback via Mirror Prox

Jul 10, 2025

31:21

152

MiniMax-M1: Scaling Test-Time Compute with Lightning Attention

Jul 9, 2025

37:27

153

Direct Reasoning Optimization for LLMs

Jul 8, 2025

40:36

154

AI's Impact on the US Workforce

Jul 7, 2025

36:40

155

LLaMA Factory: Easy LLM Fine-Tuning

Jul 6, 2025

55:38

156

Project Vend: Can Claude Run a Small Shop?

Jul 5, 2025

58:12

157

Self-Adapting Language Models (SEAL)

Jul 4, 2025

50:31

158

The Illusion of the Illusion of Thinking

Jul 3, 2025

34:01

159

The Illusion of Thinking in Reasoning Models

Jul 2, 2025

37:55

160

Meta-Reinforcement Learning with Minimum Attention

Jul 1, 2025

34:20

161

AI Persuasion Through Reinforcement Learning and Rhetoric

Jun 30, 2025

37:15

162

Reinforcement Learning for Assembly Code Optimization with LLMs

Jun 30, 2025

59:06

163

FileFix: Browser to PowerShell Social Engineering

Jun 29, 2025

26:07

164

Reinforcement Learning Under Unmeasured Confounding

Jun 28, 2025

64:20

165

Reinforcement Learning for Urban Air Quality Management

Jun 27, 2025

61:19

166

Reinforcement Learning in Non-Stationary Environments

Jun 26, 2025

31:26

167

Personalized Policy Learning from Heterogeneous Data

Jun 25, 2025

38:42

168

Boosting Reinforcement Learning with Human Feedback via SeRA

Jun 23, 2025

34:05

169

AXIOM: Active Inference Object-Centric World Models

Jun 22, 2025

36:09

170

Entropy and Reinforcement Learning for LLMs

Jun 21, 2025

31:10

171

FLEX Robot-Agnostic Force-Based Manipulation Learning

Jun 19, 2025

56:34

172

Agent RL Scaling for Mathematical Problem Solving

Jun 18, 2025

51:16

173

Beyond Reward: Limits of RL in LLM Reasoning

Jun 17, 2025

39:57

174

Reward Model Variance in RLHF

Jun 15, 2025

50:58

175

Power Grid Topological Control with Graph Reinforcement Learning

Jun 14, 2025

57:47

176

Decentralized RL for Multi-Resource Allocation via Dynamic Cluster Agreements

Jun 13, 2025

52:32

177

Reinforcement Learning for Humanoid Dexterous Manipulation

Jun 12, 2025

42:03

178

µCODE: Code Generation with Single-Step Rewards

Jun 11, 2025

50:32

179

Confidence-Reward Preference Optimization for Machine Translation

Jun 10, 2025

55:38

180

Personalized Preference Learning with MiCRo

Jun 9, 2025

47:37

181

ProRL Expands LLM Reasoning Boundaries

Jun 8, 2025

41:43

182

ProxyThinker: Guiding Large Models with Small Reasoners

Jun 7, 2025

44:31

183

Open CaptchaWorld: Benchmarking MLLM Agents

Jun 7, 2025

12:43

184

DexMachina: Functional Dexterous Bimanual Manipulation

Jun 6, 2025

16:28

185

3DMEM-BENCH: Long-Term Memory for Embodied AI

Jun 5, 2025

13:58

186

Fine-Tuning Large Language Models: A Comprehensive Guide

Jun 4, 2025

27:47

187

Maximizing Confidence Alone Improves Reasoning

Jun 2, 2025

11:42

188

Critical Points of Random Neural Networks

Jun 1, 2025

11:06

189

BAGEL: Vision-Language Model for Visual Generation

May 31, 2025

18:29

190

Incentivizing Knowledge Acquisition in LLMs via RL

May 31, 2025

14:35

191

RL for Image Generation: DPO vs GRPO

May 30, 2025

13:21

192

Let Androids Dream Framework

May 29, 2025

13:35

193

SmolVLM: Compact and Efficient Vision-Language Models

May 27, 2025

19:47

194

Federated Learning: Privacy-Preserving Collaborative Intelligence Survey

May 26, 2025

30:42

195

Compressed Federated Learning of Tiny Language Models

May 25, 2025

11:33

196

Mobile Intelligence Language Understanding Benchmark

May 24, 2025

16:03

197

AI-RAN: Converging Communications and Computing

May 23, 2025

24:24

198

Ollama LLM Fine-Tuning Methods

May 22, 2025

15:12

199

Customizing LLMs for High-Performance VHDL Design

May 21, 2025

15:00

200

Adaptively Weighted Nearest Neighbors for Matrix Completion

May 20, 2025

15:56

201

SAD Neural Networks, Divergent Gradient Flows, and Optimality

May 19, 2025

12:38

202

WavReward: Evaluating Spoken Dialogue Models

May 18, 2025

10:36

203

BLIP3-o Unified Multimodal Models

May 17, 2025

18:29

204

CodePDE: LLM-Driven PDE Solver Generation

May 16, 2025

14:01

205

Online Learning Neural Networks: Bounds and Characterization

May 15, 2025

13:09

206

UAV Visual Object Search in City Space

May 15, 2025

16:53

207

Benchmark for Auto-bidding Task

May 14, 2025

14:54

208

Reinforcement Learning with Human Feedback Improvements

May 6, 2025

10:37

209

T2I-R1: Reinforcing Image Generation with Bi-level CoT

May 5, 2025

14:47

210

Pretraining for Heterogeneous Treatment Effects

May 4, 2025

22:00

211

AI Jekyll-Hyde Tipping Point Formula

May 4, 2025

13:26

212

Personalizing Multimodal Models with Yo'Chameleon

May 3, 2025

15:43

213

Current Advances and Applications of AI, April 2025 Overview

May 1, 2025

17:21

214

Min-Form Credit Assignment for Process Reward Model Reasoning

May 1, 2025

15:14

215

Language Models for Automated Patient Record Linkage

Apr 30, 2025

17:06

216

Parameter-Efficient Continual Learning: A Survey

Apr 29, 2025

19:14

217

Building an Agent: LLM, Loop, and Tokens

Apr 28, 2025

8:28

218

Uncertainty-Guided Lung Tumor Segmentation via Coarse-to-Fine Refinement

Apr 27, 2025

11:33

219

Complex Instruction-Based Image Editing Benchmark

Apr 26, 2025

12:14

220

Sleep-Time Compute: Pre-computation for Efficient LLM Inference

Apr 25, 2025

11:45

221

Miras: A Framework for Designing Deep Learning Architectures

Apr 24, 2025

14:07

222

RUKA: A Compact and Affordable Humanoid Robotic Hand

Apr 23, 2025

19:32

223

GenEAva: Expressive Cartoon Avatar Generation via Diffusion

Apr 22, 2025

17:55

224

VCR-Bench: Video Chain-of-Thought Reasoning Evaluation

Apr 21, 2025

15:29

225

Automating LLM Hallucination Detection with Reasoning

Apr 20, 2025

10:14

226

Llama 4: Natively Multimodal AI Innovation

Apr 19, 2025

17:40

227

Self-Steering Language Models via Probabilistic Programs

Apr 18, 2025

16:06

228

Amazon Q Developer: AI for Data Science in SageMaker Canvas

Apr 17, 2025

14:25

229

Adaptive SVD for Continual Learning in Large Language Models

Apr 16, 2025

17:59

230

Llama 4: Natively Multimodal AI Innovation

Apr 15, 2025

17:40

231

UniOcc: Unified Occupancy Prediction and Forecasting Benchmark

Apr 13, 2025

21:55

232

Graph Counterfactual XAI via Latent Space Traversal

Apr 12, 2025

25:23

233

Continual Forgetting for Pre-trained Vision Models

Apr 11, 2025

13:20

234

Age of Updates for Adaptive OFDM in Autonomous Vehicles

Apr 10, 2025

30:43

235

Video Generation Improvement via Human Preference Alignment

Apr 9, 2025

24:42

236

AnimeGamer: Infinite Anime Life Simulation via MLLM

Apr 8, 2025

21:11

237

NoProp: Learning Neural Networks Without Backpropagation

Apr 7, 2025

17:06

238

ACPBench Hard: Generative Planning Reasoning Tasks

Apr 6, 2025

20:07

239

Efficient Training of Large Language Models

Apr 5, 2025

8:34

240

Uni4D Dynamic 4D Modeling from Casual Video

Apr 4, 2025

16:58

241

KDTalker: Audio-Driven Talking Portraits via Implicit Keypoint Diffusion

Apr 3, 2025

17:42

242

OLMo 2: Fully Open Language Model Advancements

Apr 2, 2025

15:43

243

Stable-SCore Stable 3D Shape Correspondence via Registration

Apr 1, 2025

16:59

244

ProjectEval: Benchmarking Project-Level Code Generation by LLM Agents

Mar 31, 2025

25:29

245

Embodied Agent Confidence Elicitation in Dynamic Multimodal Environments

Mar 30, 2025

17:53

246

VLMs Playing StarCraft II: A Multimodal Decision Benchmark

Mar 29, 2025

14:17

247

M-Attack: Simple Yet Effective Attacks Against Strong Vision-Language Models

Mar 28, 2025

18:18

248

Deep Learning for Inverse Design of Radio-Frequency Circuits

Mar 27, 2025

13:39

249

Coding with LLMs: A Developer's Guide by Simon Willison

Mar 26, 2025

11:08

250

Vision-R1 Reasoning in Multimodal Large Language Models via RL

Mar 25, 2025

12:56

251

OWL: Optimized Multi-Agent Assistance for Task Automation

Mar 24, 2025

16:27

252

Generalized Kullback-Leibler Divergence Loss for Enhanced Learning

Mar 23, 2025

20:35

253

Unsloth: A Practical Guide to LLM Fine-Tuning

Mar 22, 2025

21:41

254

Introducing the New PyTorch Landscape

Mar 21, 2025

11:10

255

Deep Learning for Inverse Design of Radio-Frequency Circuits

Mar 20, 2025

13:39

256

Distill Any Depth: Monocular Depth Estimation via Distillation

Mar 18, 2025

11:47

257

Economical Inference: DeepSeek's Multi-Head Latent Attention in LLMs

Mar 16, 2025

11:30

258

SWE-RL: Reinforcement Learning for LLMs on Software Evolution

Mar 15, 2025

14:47

259

Optimizing Quantum Circuit Mapping with SAT Solving at Amazon

Mar 14, 2025

11:26

260

LM Studio SDK: Python and TypeScript APIs for Local AI

Mar 13, 2025

17:42

261

GameFi AI Agents, DeFi, and Decentralized Virtual Ecosystems

Mar 12, 2025

11:52

262

LLMs Play Among Us

Mar 11, 2025

11:50

263

AN/UYK-1: Stored Logic Multiple-Purpose Digital Computer

Mar 10, 2025

18:33

264

Training Code Generation Models for Self-Debugging

Mar 9, 2025

11:18

265

LLMs in The Chameleon Game: Strategic Information Dynamics

Mar 9, 2025

11:50

266

GameFi: AI Agents, DeFi, and Decentralized Virtual Ecosystems

Mar 8, 2025

11:52

267

Training Code Generation Models for Self-Debugging

Mar 6, 2025

11:18

268

Accelerating Generative AI with PyTorch: Fast Inference with SAM2

Mar 4, 2025

17:24

269

V-HOP Visuo-Haptic 6D Object Pose Tracking

Mar 3, 2025

14:16

270

FACTR Force-Attending Curriculum Training for Contact-Rich Policy Learning

Mar 2, 2025

16:35

271

Language Model Training for Social Deduction in Among Us

Mar 1, 2025

21:06

272

Depth Pro Sharp Monocular Metric Depth Estimation

Feb 28, 2025

12:22

273

MME-CoT Benchmarking Chain-of-Thought in Large Multimodal Models

Feb 27, 2025

15:53

274

Unsloth Efficient GRPO for Long-Context Reasoning Models

Feb 26, 2025

12:49

275

CoT-Valve Tunable Length Control for Chain-of-Thought Reasoning

Feb 25, 2025

16:55

276

Implementing Transformers from Scratch

Feb 25, 2025

23:05

277

Reflection and Refraction

Feb 24, 2025

5:50

278

MixGCN Scalable Graph Convolutional Network Training

Feb 23, 2025

16:47

279

Open-Source AI: The Imperative for Transparency

Feb 22, 2025

21:15

280

Forge Reasoning API and Nous Chat Advancing LLM Inference

Feb 21, 2025

13:42

281

Gradient Equilibrium in Online Learning

Feb 20, 2025

15:12

282

Encoder-Free 3D Large Multimodal Models: An Investigation

Feb 19, 2025

15:27

283

Intel and PyTorch Empowering Generative AI

Feb 19, 2025

16:10

284

Iterative Prompting and LLM Code Optimization

Feb 18, 2025

15:07

285

Everything You Always Wanted To Know About Mathematics

Feb 17, 2025

15:09

286

The Instruct Monomyth: Why Base Models Matter

Feb 16, 2025

18:11

287

DSJJJJ Desideratic AI and Mischievous Instability

Feb 15, 2025

22:57

288

Simplified PyTorch MLOps Workflow with Arm and GitHub

Feb 14, 2025

13:53

289

UMed-LVLM: Unveiling Medical Abnormalities in Vision-Language Models

Feb 13, 2025

24:51

290

Ploppie: A LiteLLM Abstraction Layer

Feb 12, 2025

13:06

291

Heat's Demise of Quantum Entanglement

Feb 11, 2025

9:32

292

Provably Autonomous AI Agents on Twitter

Feb 10, 2025

15:52

293

Confidence-Reward Driven Preference Optimization for Machine Translation

Feb 9, 2025

20:33

294

Exotic Smooth Four-Manifolds

Feb 8, 2025

19:15

295

Neuro-Symbolic AI: A 2024 Systematic Review

Feb 7, 2025

19:49

296

YuLan-Mini: A Data-Efficient Language Model

Feb 6, 2025

18:26

297

Jasper and Stella: Distilling State-of-the-Art Embedding Models

Feb 5, 2025

14:14

298

Creating a unique agent with ElizaOS

Feb 4, 2025

24:43

299

DeepSeek-V3: A 671B Parameter Mixture-of-Experts Language Model

Feb 3, 2025

11:51

300

Alice's Adventures in Differentiable Wonderland

Feb 2, 2025

44:23

301

Cline Development Assistant

Feb 1, 2025

26:23

302

Hyperbolic Time Chambers and Brain Emulation

Jan 31, 2025

18:06

303

Genesis: A Universal Physics Engine for Robotics

Jan 30, 2025

11:34

304

Evolutionary & Market-Based Optimization

Jan 29, 2025

17:06

305

Benchmarking LLM Creativity and Diversity

Jan 28, 2025

10:01

306

Distilling GPT-4 for Wine Grape Variety Classification

Jan 27, 2025

6:58

307

Efficient Attention Mechanisms in Transformers

Jan 26, 2025

21:52

308

Byte Latent Transformer and Other AI Research at Meta

Jan 25, 2025

11:59

309

AI Agent Workflow and Deployment

Jan 24, 2025

11:44

310

Absolute Unit Neural Networks

Jan 23, 2025

20:26

311

LLMs and the Brain: A Converging Architecture

Jan 22, 2025

9:21

312

Neuroevolution: A Review

Jan 21, 2025

21:54

313

Building a High-Frequency Trading Exchange

Jan 20, 2025

18:02

314

The Unreasonable Effectiveness of Data and Scaling in AI

Jan 19, 2025

17:08

315

Patents and Interview: Inertial Mass Reduction in Craft

Jan 18, 2025

26:43

316

ChatGPT-4o in Financial Data Analysis

Jan 17, 2025

18:40

317

Exotic Smooth Four-Manifolds

Jan 16, 2025

19:15

318

Monolith: A Real-Time Recommendation System

Jan 15, 2025

25:30

319

Automating Artificial Life Discovery with Foundation Models

Jan 14, 2025

13:25

320

Building Effective Agents with LLMs

Jan 13, 2025

19:48

321

Latent Reasoning in Large Language Models

Jan 12, 2025

13:13

322

LLM Multi-Step Reasoning: Think-to-Talk or Talk-to-Think?

Jan 11, 2025

13:12

323

Neural Observation Field Guided Hybrid Camera Placement Optimization

Jan 10, 2025

15:16

324

Phi-4: A 14B Parameter Language Model

Jan 10, 2025

42:04

325

Post-Hoc MOTS: Time-Symmetric Multi-Object Tracking

Jan 9, 2025

19:19

326

Thompson Sampling Regret Bounds for Logistic Bandits

Jan 8, 2025

13:41

327

Bi-Level Optimization for Redundant Manipulator Trajectory Optimization

Jan 7, 2025

14:07

328

An end-to-end attention-based approach for learning on graphs

Jan 6, 2025

23:08

329

DMRA: Diffusion Model with Representation Alignment for Protein Inverse Folding

Jan 5, 2025

16:08

330

Training Jacobians of Neural Networks

Jan 4, 2025

17:55

331

xAI's Colossus: A Million-GPU Supercomputer

Jan 3, 2025

8:05

332

The Return of Pseudoscience in AI

Jan 2, 2025

23:47

333

Situational Awareness: The Coming Age of Superintelligence

Jan 2, 2025

33:31

334

Surpassing OpenAI's O1: Distillation and the Bitter Lesson

Jan 1, 2025

26:19

335

Rebooting the Arsenal of Democracy

Jan 1, 2025

4:31

336

QwQ: Exploring AI Reasoning Capabilities

Dec 31, 2024

16:16

337

Parametric PerceptNet for Image Quality Assessment

Dec 30, 2024

16:06

338

Optimizing Mixed-Input Matrix Multiplication on NVIDIA Ampere

Dec 29, 2024

9:35

339

OpenAI's o1: Reasoning with LLMs

Dec 28, 2024

13:46

340

O1 Replication: Distillation, Progress, and Lessons

Dec 27, 2024

11:49

341

Nonlinear Unitary Photonic Circuits for Deep Learning

Dec 26, 2024

14:17

342

Moto: A Latent Motion Token Language Model for Robot Manipulation

Dec 26, 2024

15:00

343

MAG-V: A Multi-Agent Framework for Synthetic Data Generation and Verification

Dec 26, 2024

10:48

344

Machines of Loving Grace: AI's Transformative Potential

Dec 25, 2024

14:46

345

LearnLM: A Google AI for Education

Dec 24, 2024

12:08

346

Hybrid-SQuAD: A Scholarly Question Answering Dataset

Dec 24, 2024

17:26

347

HunyuanVideo: A Large Open-Source Video Generation Model

Dec 23, 2024

13:47

348

Fine-Tuning Mosquito Larvae Locomotion via Reinforcement Learning

Dec 22, 2024

19:42

349

Fine-Tuning LLMs with Ollama

Dec 21, 2024

20:37

350

FedDW: Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning

Dec 20, 2024

20:22

351

Exphormer: Scaling Transformers for Graph-Structured Data

Dec 20, 2024

12:05

352

DHCP: Detecting Hallucinations in Large Vision-Language Models

Dec 19, 2024

10:48

353

Benchmarking 25 State-of-the-Art LLMs

Dec 18, 2024

14:59

354

Detecting AI-Generated Responses in Multiple-Choice Assessments

Dec 17, 2024

11:05

355

Avoiding Rookie Mistakes in Machine Learning

Dec 16, 2024

23:03

356

AI-Powered Ultrasound for Global Maternal Healthcare

Dec 16, 2024

14:53

357

DeMo: Decoupled Momentum Optimization for Large Neural Networks

Dec 15, 2024

19:33

358

CS Freshmen and ChatGPT: A Log Analysis

Dec 15, 2024

18:13

359

AI Compiler for Autonomous Vehicles

Dec 14, 2024

6:40

360

Competitive Programmer's Handbook

Dec 13, 2024

20:32

361

AI Coding Tool Showdown: Cursor, Bolt, Replit, and V0 Compared

Dec 12, 2024

12:15

362

Challenges in Human-Agent Communication

Dec 11, 2024

20:52

363

ASL Fingerspelling Recognition Competition

Dec 10, 2024

22:42

364

Accelerating Mobile AI with ExecuTorch and KleidiAI

Dec 10, 2024

14:58
