Neural intel Pod cover art

All Episodes

Neural intel Pod — 355 episodes

#
Title
1

The EML Operator: One Primitive to Rule All Mathematics

2

OpenAI MRC, SRv6, and the Architecture of Frontier AI Supercomputers

3

Inside the Machine: Training GPT-5, the Memory Wall, and the Math of MoE

4

DeepSeek-V4: The Million-Token Efficiency Leap | Open Source SOTA

5

Breaking the Quadratic Bottleneck with DeepSeek-V4’s Hybrid Attention

6

Claude Desktop’s Silent Sandbox Bypass: The Undocumented Browser Bridge

7

Forensic Audit of Anthropic’s Native Messaging Backdoor

8

The $60 Billion Synergy: Architecting the SpaceX + Cursor AI "Colossus" | Neural Intel Podcast

9

The Jackrong Playbook: Mastering Claude 4.6 Opus Distillation with Unsloth and LoRA

10

Inside the Claude Opus 4.7 Orchestration Layer - Deferred Tools & Agentic Code

11

Electrons to Tokens: The Technical Architecture of Nvidia’s AI Monopoly

12

Hermes Agent’s Memory Architecture and the Future of Agentic RL

13

200 Gigawatts or Bust: Dylan Patel on the Engineering Reality of AGI Scaling

14

The Muse Spark Revolution: Dissecting Meta's 2026 Architectural Pivot & The Triad of Truth | Neural Intel Podcast

15

Synaptic Persistence and Mushroom Body Neurogenesis: The Architecture of Metamorphic Memory

16

Engineering Sovereign Knowledge Bases with Andrej Karpathy’s Automated Architect

17

The Mercor AI Breach: National Security Crisis or a Wake-Up Call for the AI Industry?

18

BREAKING: Massive Mercor AI Data Breach - SOTA Training Data Leaked from Meta, Apple, & Amazon

19

Did Anthropic Just Hand the Keys to AI Coding to Everyone? The Huge Claude Code Leak Explained

20

The Claude Code Leak: Decoding Anthropic’s Self-Healing Memory and Secret "KAIROS" Agent

21

Is AI Censorship Over? The G0DM0D3 "Liberated Chat" Breakthrough

22

Is Traditional Computing Dead? NVIDIA's Jensen Huang on the "iPhone of Tokens"

23

The Bio-Computer Architecture: Declassified CIA Mechanics for Synthetic Consciousness

24

The End of the Human Bottleneck: Andrej Karpathy on Auto-Research and Recursive AI

25

Is Open Source Dead? Inside the Cursor Composer 2 vs. Kimi License Controversy

26

Is Residual Scaling Obsolete? Introducing Attention Residuals

27

The Sequence-Depth Breakthrough: Inside Kimi Team's Attention Residuals

28

Beyond the Prompt: Architecture of the Qwen-Agent Ecosystem and Qwen3.5

29

Beyond the Chatbot: Engineering "Forever-Agents" with Hermes Agent and OpenClaw

30

Nanochat: How Karpathy Automated AI Evolution with NVIDIA ClimbMix

31

1 Million Tokens: Breakthrough or Marketing Stunt? The GPT-5.4 Technical Deep Dive

32

Qwen 3.5: Exodus, Restructuring, Betrayal, and the Future of Chinese AI

33

The Mac mini Guide to OpenClaw and Local AI

34

The Neural Intel Op Ed: Engineering a Post-Natural Language for the AI Era

35

Andrej Karpathy on the "Claw" Revolution: Are AI Agents Obsolete?

36

10 Million Tokens and Beyond: Why Recursive AI is the Next Scaling Frontier

37

The Grok 4.20 Manifesto: Multi-Agent Logic and the Quest for Unfiltered Truth

38

The End of Memory Bottlenecks: How Fiber Optics and Ganged Flash Power Trillion-Parameter Models

39

Interview with Dario Amodei from Anthropic: Inside the $100B "Big Blob of Compute" & The 2030 AGI Certainty

40

The OpenClaw Saga: Peter Steinberger on Self-Modifying AI and the Age of the Lobster

41

Inside the 180 Billion HKD Breakthrough: How MiniMax M2.5 Scaled Agentic RL

42

The 744B Parameter Giant: How GLM-5 and Domestic Chips Redefine the Global AI Order

43

The OpenClaw Security Crisis: Can We Control Autonomous AI Swarms?

44

Is Consciousness Only in Your Head?

45

Methods and Applications of Parametric Sensitivity Analysis

46

The Architecture of Choice: Scaling MIT’s Decision Algorithms

47

The Logographic Advantage: How China’s Ancient Language is Powering Next-Gen AI | Neural Intel Deep Dive

48

Deep Learning Deep Dive: From Neural Networks to Differentiable Programming

49

The Hidden Evolution: Implicit Reinforcement Learning and the Future of Iterative AI

50

The Math of Stability: DeepSeek-AI’s mHC and the Evolution of Macro-Architecture

51

MoE Giants: Decoding the 670 Billion Parameter Showdown Between DeepSeek V3 and Mistral Large

52

GLM-4.7 Deep Dive: 358B Parameters, Agentic Reasoning, and the Future of Open Weights

53

Beyond the Exam Room: Stress-Testing Clinical AI with Medmarks v0.1

54

ANDREJ KARPATHY 2025 LLM Review: RLVR, Jagged Intelligence, & The Vibe Coding Revolution

55

The Automated Karpathy Recipe: Master Neural Network Debugging with neural_net_checklist

56

Nemotron 3 Nano: The Hybrid Mamba-MoE Model Driving Efficient, 1M-Token Agentic AI

57

Olmo 3: Unpacking the Fully Open LLM Flow (Dolma 3, OlmoRL, & State-of-the-Art Reasoning)

58

The Code Red Gambit: GPT-5.2's Mega-Agent Architecture

59

Fara-7B: The 7B Agentic SLM Redefining On-Device CUA Performance

60

The AGI Frontier: DeepMind’s Decade of Breakthroughs-From DQN and AlphaZero to Solving Protein Folding.

61

INTELLECT-3: Scaling Agentic RL and MoE to SOTA Performance with prime-rl and 512 H200s

62

Kimi Founder Yang Zhilin on K2, Agentic LLMs, & AGI: The Beginning of Infinity | Scaling & Innovation Strategy

63

Ilya Sutskever on AI: Transitioning from Scaling to Research, Generalization, and the Future of Superintelligence

64

Neuromorphic Computing: Principles and Architecture

65

Gemini 3 Pro Release Review: Benchmarks, Generative UI, Deep Think Mode, and Google Antigravity

66

DeepSeek-OCR: Contexts Optical Compression

67

LLM Gambling Addiction: Behavioral and Neural Mechanisms

68

Glyph: Visual-Text Compression for Scaling Context Windows

69

Continual Learning via Sparse Memory Finetuning

70

Andrej Karpathy on AI, Intelligence, and Education

71

Untangling the xAI-OpenAI Legal War: Trade Secrets and Antitrust

72

IBM Granite 4.0: Hybrid Mamba/Transformer Breakthrough for Enterprise LLMs?

73

Anthropic's Claude Sonnet 4.5: The New Coding Standard?

74

GPT-5-Codex: Agentic Coding and OpenAI's Evolution

75

Grok 4 Fast: Speed, Efficiency, and Application Review

76

How to Read a Research Paper

77

The Science of Sampling

78

GPT-5 Revisited: Progress, Performance, and User Experience

79

Thyme Autonomous AI that Sees, Codes and Solves Problems

80

YaRN: Extending LLM Context Windows Efficiently

81

Ilya Sutskever's AI Vision: From Deep Learning Dogmas to Safe Superintelligence

82

Thyme: Think Beyond Images with Code-Executing MLLMs

83

What did Ilya see?

84

Meta's AI Ambitions: Turbulence in Superintelligence Labs

85

Hierarchical Reasoning: Bigger Isn't Always Better

86

Prime Collective Communications Library: A Technical Report

87

Prime Collective Communications Library: A Technical Report

88

MetaStone-S1: Reflective Generative AI for Test-Time Scaling

89

MetaStone-S1: Reflective Generative AI for Test-Time Scaling

90

ToonComposer: AI-Assisted Cartoon Production and Post-Keyframing

91

ToonComposer: AI-Assisted Cartoon Production and Post-Keyframing

92

Triton: Language, Compiler, and Optimization for AI Workloads

93

Triton: Language, Compiler, and Optimization for AI Workloads

94

Dynamic Fine-Tuning: Elevating LLM Generalization

95

Lessons from a Chimp: AI Scheming and Ape Language

96

Deciphering Reinforcement Learning for Language Models

97

STREAM3R: Scalable Streaming 3D Reconstruction with Causal Transformer

98

Yan: Interactive Video Generation Framework

99

Lessons from a Chimp: AI Scheming and Ape Language

100

NextStep-1: Unified Multi-modal Generation

101

STREAM3R: Scalable Streaming 3D Reconstruction with Causal Transformer

102

GLM-V: Advancing Multimodal Reasoning with RLCS

103

DINOv3: Self-Supervised Vision Foundation Models

104

GPT-5 and Grok 4: Altman vs Musk

105

Hugging Face Hub Storage: Xet vs. Git LFS

106

Channel-Wise MLPs Boost RCN Generalization

107

Fine-Tuning Custom Embedding Models for Enhanced Retrieval Performance

108

AdLlama: Boosting Ad CTR with Reinforcement Learning

109

Machine Learning: Models, Algorithms, and Reinforcement Learning

110

Mixture-of-Recursions: Adaptive Computation for Language Models

111

Operator-Based Machine Intelligence: A Hilbert Space Framework

112

Meta CLIP 2: A Worldwide Scaling Recipe

113

In-Context Learning: Implicit Weight Dynamics

114

GLM-4.5: Open Agentic, Reasoning, and Coding Foundation Models

115

RLVMR: Verifiable Meta-Reasoning for Long-Horizon Agents

116

CoT-Self-Instruct: High-Quality Synthetic Prompt Generation

117

GPT-5: Hype, Reality and the Future of AI

118

Seed-Prover: Advancing Automated Mathematical Reasoning with Formal Verification

119

Self-Evolving Agents: A Comprehensive Survey

120

High-Precision W and Z Boson Mass Measurement at CMS

121

Falcon-H1: Hybrid-Head LLMs for Efficiency and Performance

122

ASI-ARCH: AI-Driven Scientific Discovery for Neural Architecture

123

In-Context Learning: Implicit Weight Dynamics

124

Qwen3: Unifying Reasoning and Efficiency in LLMs

125

Group Sequence Policy Optimization for LLMs

126

Reinforcement Learning: Advancements, Applications, and Challenges

127

SPIRAL: Self-Play for Reasoning in Games

128

Qwen3-Coder: Agentic Coding and Model Capabilities

129

Hierarchical Reasoning Model: Brain-Inspired AI for Complex Tasks

130

Local LLM Solutions for Mac Silicon: Llama.cpp and LM Studio

131

Kimi K2: Open Agentic Intelligence and Applications

132

CARTRIDGES: Efficient Context for LLMs

133

Prompt Baking: Embedding LLM Behavior in Weights

134

Massistant: Chinese Mobile Forensic Tooling Revealed

135

Unexpected Military Roots of Digital Computing and Research

136

The 2025 AI Landscape: Progress and Outlook

137

The Dynamics of Neural Attention

138

Consciousness and Reality according to the CIA:Gateway

139

Military Roots of Digital Computing and Research

140

Accelerating Mobile AI with ExecuTorch and KleidiAI: Revisited

141

State-Adaptive Regularization for Offline Reinforcement Learning

142

Nash Learning from Human Feedback via Mirror Prox

143

MiniMax-M1: Scaling Test-Time Compute with Lightning Attention

144

Direct Reasoning Optimization for LLMs

145

AI's Impact on the US Workforce

146

LLaMA Factory: Easy LLM Fine-Tuning

147

Project Vend: Can Claude Run a Small Shop?

148

Self-Adapting Language Models (SEAL)

149

The Illusion of the Illusion of Thinking

150

The Illusion of Thinking in Reasoning Models

151

Meta-Reinforcement Learning with Minimum Attention

152

AI Persuasion Through Reinforcement Learning and Rhetoric

153

Reinforcement Learning for Assembly Code Optimization with LLMs

154

FileFix: Browser to PowerShell Social Engineering

155

Reinforcement Learning Under Unmeasured Confounding

156

Reinforcement Learning for Urban Air Quality Management

157

Reinforcement Learning in Non-Stationary Environments

158

Personalized Policy Learning from Heterogeneous Data

159

Boosting Reinforcement Learning with Human Feedback via SeRA

160

AXIOM: Active Inference Object-Centric World Models

161

Entropy and Reinforcement Learning for LLMs

162

FLEX Robot-Agnostic Force-Based Manipulation Learning

163

Agent RL Scaling for Mathematical Problem Solving

164

Beyond Reward: Limits of RL in LLM Reasoning

165

Reward Model Variance in RLHF

166

Power Grid Topological Control with Graph Reinforcement Learning

167

Decentralized RL for Multi-Resource Allocation via Dynamic Cluster Agreements

168

Reinforcement Learning for Humanoid Dexterous Manipulation

169

µCODE: Code Generation with Single-Step Rewards

170

Confidence-Reward Preference Optimization for Machine Translation

171

Personalized Preference Learning with MiCRo

172

ProRL Expands LLM Reasoning Boundaries

173

ProxyThinker: Guiding Large Models with Small Reasoners

174

Open CaptchaWorld: Benchmarking MLLM Agents

175

DexMachina: Functional Dexterous Bimanual Manipulation

176

3DMEM-BENCH: Long-Term Memory for Embodied AI

177

Fine-Tuning Large Language Models: A Comprehensive Guide

178

Maximizing Confidence Alone Improves Reasoning

179

Critical Points of Random Neural Networks

180

BAGEL: Vision-Language Model for Visual Generation

181

Incentivizing Knowledge Acquisition in LLMs via RL

182

RL for Image Generation: DPO vs GRPO

183

Let Androids Dream Framework

184

SmolVLM: Compact and Efficient Vision-Language Models

185

Federated Learning: Privacy-Preserving Collaborative Intelligence Survey

186

Compressed Federated Learning of Tiny Language Models

187

Mobile Intelligence Language Understanding Benchmark

188

AI-RAN: Converging Communications and Computing

189

Ollama LLM Fine-Tuning Methods

190

Customizing LLMs for High-Performance VHDL Design

191

Adaptively Weighted Nearest Neighbors for Matrix Completion

192

SAD Neural Networks, Divergent Gradient Flows, and Optimality

193

WavReward: Evaluating Spoken Dialogue Models

194

BLIP3-o Unified Multimodal Models

195

CodePDE: LLM-Driven PDE Solver Generation

196

Online Learning Neural Networks: Bounds and Characterization

197

UAV Visual Object Search in City Space

198

Benchmark for Auto-bidding Task

199

Reinforcement Learning with Human Feedback Improvements

200

T2I-R1: Reinforcing Image Generation with Bi-level CoT

201

Pretraining for Heterogeneous Treatment Effects

202

AI Jekyll-Hyde Tipping Point Formula

203

Personalizing Multimodal Models with Yo'Chameleon

204

Current Advances and Applications of AI, April 2025 Overview

205

Min-Form Credit Assignment for Process Reward Model Reasoning

206

Language Models for Automated Patient Record Linkage

207

Parameter-Efficient Continual Learning: A Survey

208

Building an Agent: LLM, Loop, and Tokens

209

Uncertainty-Guided Lung Tumor Segmentation via Coarse-to-Fine Refinement

210

Complex Instruction-Based Image Editing Benchmark

211

Sleep-Time Compute: Pre-computation for Efficient LLM Inference

212

Miras: A Framework for Designing Deep Learning Architectures

213

RUKA: A Compact and Affordable Humanoid Robotic Hand

214

GenEAva: Expressive Cartoon Avatar Generation via Diffusion

215

VCR-Bench: Video Chain-of-Thought Reasoning Evaluation

216

Automating LLM Hallucination Detection with Reasoning

217

Llama 4: Natively Multimodal AI Innovation

218

Self-Steering Language Models via Probabilistic Programs

219

Amazon Q Developer: AI for Data Science in SageMaker Canvas

220

Adaptive SVD for Continual Learning in Large Language Models

221

Llama 4: Natively Multimodal AI Innovation

222

UniOcc: Unified Occupancy Prediction and Forecasting Benchmark

223

Graph Counterfactual XAI via Latent Space Traversal

224

Continual Forgetting for Pre-trained Vision Models

225

Age of Updates for Adaptive OFDM in Autonomous Vehicles

226

Video Generation Improvement via Human Preference Alignment

227

AnimeGamer: Infinite Anime Life Simulation via MLLM

228

NoProp: Learning Neural Networks Without Backpropagation

229

ACPBench Hard: Generative Planning Reasoning Tasks

230

Efficient Training of Large Language Models

231

Uni4D Dynamic 4D Modeling from Casual Video

232

KDTalker: Audio-Driven Talking Portraits via Implicit Keypoint Diffusion

233

OLMo 2: Fully Open Language Model Advancements

234

Stable-SCore Stable 3D Shape Correspondence via Registration

235

ProjectEval: Benchmarking Project-Level Code Generation by LLM Agents

236

Embodied Agent Confidence Elicitation in Dynamic Multimodal Environments

237

VLMs Playing StarCraft II: A Multimodal Decision Benchmark

238

M-Attack: Simple Yet Effective Attacks Against Strong Vision-Language Models

239

Deep Learning for Inverse Design of Radio-Frequency Circuits

240

Coding with LLMs A Developer's Guide by Simon Willison

241

Vision-R1 Reasoning in Multimodal Large Language Models via RL

242

OWL: Optimized Multi-Agent Assistance for Task Automation

243

Generalized Kullback-Leibler Divergence Loss for Enhanced Learning

244

Unsloth: A Practical Guide to LLM Fine-Tuning

245

Introducing the New PyTorch Landscape

246

Deep Learning for Inverse Design of Radio-Frequency Circuits

247

Distill Any Depth: Monocular Depth Estimation via Distillation

248

Economical Inference: DeepSeek's Multi-Head Latent Attention in LLMs

249

SWE-RL: Reinforcement Learning for LLMs on Software Evolution

250

Optimizing Quantum Circuit Mapping with SAT Solving at Amazon

251

LM Studio SDK: Python and TypeScript APIs for Local AI

252

GameFi AI Agents, DeFi, and Decentralized Virtual Ecosystems

253

LLMS Play Among Us

254

AN/UYK-1: Stored Logic Multiple-Purpose Digital Computer

255

Training Code Generation Models for Self-Debugging

256

LLMs in The Chameleon Game: Strategic Information Dynamics

257

GameFi: AI Agents, DeFi, and Decentralized Virtual Ecosystems

258

Training Code Generation Models for Self-Debugging

259

Accelerating Generative AI with PyTorch: Fast Inference with SAM2

260

V-HOP Visuo-Haptic 6D Object Pose Tracking

261

FACTR Force-Attending Curriculum Training for Contact-Rich Policy Learning

262

Language Model Training for Social Deduction in Among Us

263

Depth Pro Sharp Monocular Metric Depth Estimation

264

MME-CoT Benchmarking Chain-of-Thought in Large Multimodal Models

265

Unsloth Efficient GRPO for Long-Context Reasoning Models

266

CoT-Valve Tunable Length Control for Chain-of-Thought Reasoning

267

Implementing Transformers from Scratch

268

Reflection and Refraction

269

MixGCN Scalable Graph Convolutional Network Training

270

Open-Source AI The Imperative for Transparency

271

Forge Reasoning API and Nous Chat Advancing LLM Inference

272

Gradient Equilibrium in Online Learning

273

Encoder-Free 3D Large Multimodal Models An Investigation

274

Intel and PyTorch Empowering Generative AI

275

Iterative Prompting and LLM Code Optimization

276

Everything You Always Wanted To Know About Mathematics

277

The Instruct Monomyth_ Why Base Models Matter

278

DSJJJJ Desideratic AI and Mischievous Instability

279

Simplified PyTorch MLOps Workflow with Arm and GitHub

280

UMed-LVLM_ Unveiling Medical Abnormalities in Vision-Language Models

281

Ploppie_ A LiteLLM Abstraction Layer

282

Heat's Demise of Quantum Entanglement

283

Provably Autonomous AI Agents on Twitter

284

Confidence-Reward Driven Preference Optimization for Machine Translation

285

Exotic Smooth Four-Manifolds

286

Neuro-Symbolic AI A 2024 Systematic Review

287

YuLan-Mini A Data-Efficient Language Model

288

Jasper and Stella: Distilling State-of-the-Art Embedding Models

289

Creating a unique agent with ElizaOS

290

DeepSeek-V3 A 671B Parameter Mixture-of-Experts Language Model

291

Alice's Adventures in Differentiable Wonderland

292

Cline Development Assistant

293

Hyperbolic Time Chambers and Brain Emulation

294

Genesis A Universal Physics Engine for Robotics

295

Evolutionary & Market-Based Optimization

296

Benchmarking LLM Creativity and Diversity

297

Distilling GPT-4 for Wine Grape Variety Classification

298

Efficient Attention Mechanisms in Transformers

299

Byte Latent Transformer and Other AI Research at Meta

300

AI Agent Workflow and Deployment

301

Absolute Unit Neural Networks

302

LLMs and the Brain_ A Converging Architecture

303

Neuroevolution A Review

304

Building a High-Frequency Trading Exchange

305

The Unreasonable Effectiveness of Data and Scaling in AI

306

Patents and Interview: Inertial Mass Reduction in Craft

307

ChatGPT-4o in Financial Data Analysis

308

Exotic Smooth Four-Manifolds

309

Monolith_ A Real-Time Recommendation System

310

Automating Artificial Life Discovery with Foundation Models

311

Building Effective Agents with LLMs

312

Latent Reasoning in Large Language Models

313

LLM Multi-Step Reasoning_ Think-to-Talk or Talk-to-Think_

314

Neural Observation Field Guided Hybrid Camera Placement Optimization

315

Phi-4_ A 14B Parameter Language Model

316

Post-Hoc MOTS_ Time-Symmetric Multi-Object Tracking

317

Thompson Sampling Regret Bounds for Logistic Bandits

318

Bi-Level Optimization for Redundant Manipulator Trajectory Optimization

319

An end-to-end attention-based approach for learning on graphs

320

DMRA_ Diffusion Model with Representation Alignment for Protein Inverse Folding

321

Training Jacobians of Neural Networks

322

xAI's Colossus_ A Million-GPU Supercomputer

323

The Return of Pseudoscience in AI

324

Situational Awareness_ The Coming Age of Superintelligence

325

Surpassing OpenAI's O1_ Distillation and the Bitter Lesson

326

Rebooting the Arsenal of Democracy

327

QwQ_ Exploring AI Reasoning Capabilities

328

Parametric PerceptNet for Image Quality Assessment

329

Optimizing Mixed-Input Matrix Multiplication on NVIDIA Ampere

330

OpenAI's o1_ Reasoning with LLMs

331

O1 Replication_ Distillation, Progress, and Lessons

332

Nonlinear Unitary Photonic Circuits for Deep Learning

333

Moto_ A Latent Motion Token Language Model for Robot Manipulation

334

MAG-V_ A Multi-Agent Framework for Synthetic Data Generation and Verification

335

Machines of Loving Grace_ AI's Transformative Potential

336

LearnLM_ A Google AI for Education

337

Hybrid-SQuAD_ A Scholarly Question Answering Dataset

338

HunyuanVideo_ A Large Open-Source Video Generation Model

339

Fine-Tuning Mosquito Larvae Locomotion via Reinforcement Learning

340

Fine-Tuning LLMs with Ollama

341

FedDW_ Distilling Weights through Consistency Optimization in Heterogeneous Federated Learning

342

Exphormer_ Scaling Transformers for Graph-Structured Data

343

DHCP_ Detecting Hallucinations in Large Vision-Language Models

344

Benchmarking 25 State-of-the-Art LLMs

345

Detecting AI-Generated Responses in Multiple-Choice Assessments

346

Avoiding Rookie Mistakes in Machine Learning

347

AI-Powered Ultrasound for Global Maternal Healthcare

348

DeMo_ Decoupled Momentum Optimization for Large Neural Networks

349

CS Freshmen and ChatGPT_ A Log Analysis

350

AI Compiler for Autonomous Vehicles

351

Competitive Programmer's Handbook

352

AI Coding Tool Showdown_ Cursor, Bolt, Replit, and V0 Compared

353

Challenges in Human-Agent Communication

354

ASL Fingerspelling Recognition Competition

355

Accelerating Mobile AI with ExecuTorch and KleidiAI