PODCAST · technology

AI Journey OpenBook Podcast

Implementation of different ML tasks on limited computing resources. aisuko.substack.com

  1. 7

    Training AI Assistants: From Pre-Training to Human-Guided Conversations

    Pretraining and post-training chapter of Deep Dive into LLMs like ChatGPT by Andrej Karpathy. Get full access to AI Journey OpenBook at aisuko.substack.com/subscribe

  2. 6

    Understanding Base Model Inference: How AI Models Generate Text

    In this section, the speaker explores base model inference: how large AI models are trained and released, and why they function as token simulators rather than full assistants. The key points discussed include:

    Base Models and Their Availability
    * Training large AI models is extremely costly, but big tech companies often release “base models” after training.
    * A base model is a token simulator that predicts text sequences but is not yet an AI assistant.

    Examples of Base Models
    * GPT-2 (1.5B parameters, trained on 100B tokens) was one of the first widely released base models.
    * Llama 3 (405B parameters, trained on 15T tokens by Meta) is a modern, larger base model.

    Components of a Model Release
    * A release requires two main parts:
      * Python code
      * Model parameters

    Base Model Behavior
    * It functions as an advanced autocomplete system, generating text based on statistical patterns from training data.
    * It does not inherently provide factual or structured responses like an assistant.

    Characteristics of Base Models
    * Stochastic Nature - Given the same input, different completions may be generated.
    * Knowledge Compression - Acts like a lossy “zip file” of internet text, storing probabilistic patterns rather than explicit facts.
    * Memorization & Regurgitation - Can recall high-frequency training data, sometimes verbatim (e.g., Wikipedia entries).

    Limitations of Base Models
    * Cannot provide factual updates beyond their training data cutoff.
    * Tend to “hallucinate” (generate plausible but false information).

    Practical Uses of Base Models
    * In-context Learning - Few-shot prompting enables them to recognise and follow simple patterns.
    * Simulated AI Assistants - Carefully structured prompts can trick a base model into behaving like an assistant by mimicking a conversation format.
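The "token simulator" and "stochastic nature" points can be illustrated with a toy example. The sketch below is only an illustration under stated assumptions: it uses a tiny bigram count table as a stand-in for a real base model (which would be a large neural network), and the corpus, the function names `train_bigram`, `complete`, and `few_shot_prompt`, and the few-shot example pairs are all hypothetical, not taken from the episode.

```python
import random
from collections import Counter, defaultdict

# Toy "training data" (an assumption for illustration); a real base model
# is trained on billions or trillions of tokens.
CORPUS = (
    "the model predicts the next token . "
    "the model samples the next token . "
    "the token is sampled from a distribution ."
).split()


def train_bigram(tokens):
    """Count how often each token follows each other token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def complete(counts, prompt, max_new_tokens, rng):
    """Extend the prompt by sampling one token at a time (autocomplete)."""
    out = list(prompt)
    for _ in range(max_new_tokens):
        followers = counts.get(out[-1])
        if not followers:
            break  # no continuation was ever observed for this token
        tokens = list(followers)
        weights = [followers[t] for t in tokens]
        # Sampling in proportion to observed frequency is what makes the
        # output stochastic: the same prompt can yield different completions.
        out.append(rng.choices(tokens, weights=weights, k=1)[0])
    return out


def few_shot_prompt(examples, query):
    """Lay out (input, output) pairs so a model can continue the pattern."""
    lines = [f"{src}: {dst}" for src, dst in examples]
    lines.append(f"{query}:")  # left open for the model to complete
    return "\n".join(lines)


if __name__ == "__main__":
    counts = train_bigram(CORPUS)
    for seed in range(3):
        rng = random.Random(seed)
        print(" ".join(complete(counts, ["the"], 6, rng)))
    print(few_shot_prompt([("sea", "mer"), ("sky", "ciel")], "tree"))
```

Running the script with different seeds may print different continuations of the same prompt, while a fixed seed reproduces the same one. The few-shot helper only builds the prompt text; a bigram table is far too weak to follow such a pattern, which is where a real base model's in-context learning comes in.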

  3. 5

    The Internal Working and Training of Neural Networks

    The Neural Network Internals and Training chapter of Andrej Karpathy's video.

  4. 4

    The Neural Network I/O

    A video clip from Deep Dive into LLMs like ChatGPT by Andrej Karpathy.

  5. 3

    Speech Recognition via Whisper

    This is chapter 3 from Andrej Karpathy's video.

  6. 2

    Understanding Reasoning Models

    Understanding reasoning models.

  7. 1

    Understanding Reasoning in Large Language Models

    The original article is https://www.linkedin.com/pulse/understanding-reasoning-large-language-models-bowen-li-kgadc/?trackingId=WtLIWBUiTFmVwCPL4HCwUw%3D%3D
    The audio was generated by Amazon Polly. It is not better than the OpenAI ChatGPT audio service.

ABOUT THIS SHOW

Implementation of different ML tasks on limited computing resources. aisuko.substack.com

HOSTED BY

Bowen

Frequently Asked Questions

How many episodes does AI Journey OpenBook Podcast have?

AI Journey OpenBook Podcast currently has 7 episodes available on PodParley. New episodes are automatically indexed when they're published to the podcast feed.

What is AI Journey OpenBook Podcast about?

Implementation of different ML tasks on limited computing resources. aisuko.substack.com

How often does AI Journey OpenBook Podcast release new episodes?

AI Journey OpenBook Podcast has 7 episodes. Check the episode list to see recent publication dates and frequency.

Where can I listen to AI Journey OpenBook Podcast?

You can listen to AI Journey OpenBook Podcast on PodParley by clicking any episode. We provide an embedded audio player for direct listening, and you can also subscribe via your preferred podcast app using the RSS feed.

Who hosts AI Journey OpenBook Podcast?

AI Journey OpenBook Podcast is created and hosted by Bowen.