#60 - How to input text into your model? Understanding tokenizers.
An episode of the Life with AI podcast, hosted by Filipe Lauar, titled "#60 - How to input text into your model? Understanding tokenizers." was published on December 1, 2022 and runs 14 minutes.
December 1, 2022 · 14m · Life with AI
Episode Description
Hello everyone, in this episode I explain how tokenizers work. They are what lets us feed text into an NLP model like BERT or GPT. In the episode I explain three types of tokenizers: word-based, character-based, and subword-based.
Instagram: https://www.instagram.com/podcast.lifewithai/
LinkedIn: https://www.linkedin.com/company/life-with-ai
Hugging Face documentation on tokenizers: https://huggingface.co/docs/transformers/tokenizer_summary
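As a rough illustration of the three tokenizer types mentioned above, here is a minimal Python sketch. It is not taken from the episode: the function names, the tiny `TOY_VOCAB`, and the `##` continuation prefix (borrowed from the WordPiece convention) are all assumptions made for the example, and real subword tokenizers learn their vocabulary from data rather than using a hand-written one.

```python
import re

def word_tokenize(text):
    # Word-based: each word or punctuation mark becomes one token.
    return re.findall(r"\w+|[^\w\s]", text.lower())

def char_tokenize(text):
    # Character-based: every character is its own token.
    return list(text)

# Hypothetical toy vocabulary, for illustration only.
TOY_VOCAB = {"token", "##izer", "##s", "are", "fun"}

def subword_tokenize(word, vocab=TOY_VOCAB):
    # Subword-based: greedy longest-match split of a single word.
    # Pieces that do not start the word carry a "##" prefix.
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        piece = None
        while end > start:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate
            if candidate in vocab:
                piece = candidate
                break
            end -= 1
        if piece is None:
            # No known piece covers this position: treat the word as unknown.
            return ["[UNK]"]
        pieces.append(piece)
        start = end
    return pieces

print(word_tokenize("Tokenizers are fun!"))
print(char_tokenize("fun"))
print(subword_tokenize("tokenizers"))
```

With this toy vocabulary, the word "tokenizers" is not stored as a single entry but is still representable as the pieces `token`, `##izer`, and `##s`. That is the trade-off subword tokenizers aim for: a smaller vocabulary than a word-based one, with longer and more meaningful units than single characters.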