Large Language Model (LLM) Talk cover art

All Episodes

Large Language Model (LLM) Talk — 68 episodes

#
Title
1

Context Engineering

2

Manus AI

3

Kimi K2

4

Mixture-of-Recursions (MoR)

5

MeanFlow

6

Mamba

7

LLM Alignment

8

Why We Think

9

Deep Research

10

vLLM

11

Qwen3: Thinking Deeper, Acting Faster

12

RAGEN: train and evaluate LLM agents using multi-turn RL

13

DeepSeek-Prover-V2

14

DeepSeek-Prover

15

Model Context Protocol (MCP)

16

LLM Post-Training: Reasoning

17

Agent AI Overview

18

FlashAttention-3

19

FlashAttention-2

20

FlashAttention

21

PPO (Proximal Policy Optimization)

22

"Deep Dive into LLMs like ChatGPT" - Andrej Karpathy's Tech Talk Learning

23

"Intro to Large Language Models" - Andrej Karpathy's Tech Talk Learning

24

DeepSeek-V2

25

Matrix Calculus in Deep Learning

26

S1: Simple Test-time Scaling

27

RLHF (Reinforcement Learning from Human Feedback)

28

GRPO (Group Relative Policy Optimization)

29

Model/Knowledge Distillation

30

Qwen-2.5

31

Qwen-2

32

Qwen-1

33

OpenAI-o1

34

GPT-4o

35

Kimi k1.5

36

DeepSeek-R1

37

Claude-3

38

GPT-4

39

LLM Training

40

MiniMax-01

41

DeepSeek v3

42

Tree-of-Thoughts

43

LLM Reasoning

44

LangChain

45

LlamaIndex

46

Chain of Thought (CoT)

47

Retrieval-Augmented Generation (RAG)

48

Fine-Tuning

49

Scaling Laws

50

LLaMA-3

51

LLaMA-2

52

LLaMA-1

53

Survey of Large Language Models

54

Mixture of Experts (MoE)

55

Multi-Task Learning

56

Gradient Descent Optimization Algorithms

57

GPT-1 (Generative Pre-trained Transformer)

58

Linear Transformers

59

BERT

60

Sora

61

Word2Vec

62

Stable Diffusion

63

Retrieval Transformer

64

GPT-2

65

GPT-3

66

Transformer

67

Prompt Engineering

68

Agentic AI