EPISODE · Sep 14, 2020 · 2 MIN
Tokenization in Natural Language Processing
from Code Logic · host Sarvesh Bhatnagar
In this episode we discuss tokenization in Natural Language Processing. As discussed in the previous episode, tokenization is an important step in data cleaning: it entails dividing a large piece of text into smaller chunks. We look at some of the basic tokenizers available from nltk.tokenize in NLTK. If you liked this episode, do follow the show, connect with me on Twitter @sarvesh0829, and follow my blog at www.stacklearn.org. If you sell something locally, do it using the BagUp app, available on the Play Store. It would help a lot.
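As a rough illustration of what "dividing text into smaller chunks" looks like in practice, here is a minimal sketch using three tokenizers from nltk.tokenize. It assumes NLTK is installed (pip install nltk). The sample sentence is made up, and the choice of tokenizers is an assumption rather than a list of what the episode covers: these three are regex-based, so they should run without downloading any extra NLTK data.

```python
from nltk.tokenize import RegexpTokenizer, WhitespaceTokenizer, wordpunct_tokenize

# Hypothetical sample text, not taken from the episode.
text = "Tokenization isn't hard. It splits text!"

# Split on whitespace only; punctuation stays attached to the words.
whitespace_tokens = WhitespaceTokenizer().tokenize(text)

# Split into runs of word characters and runs of punctuation.
wordpunct_tokens = wordpunct_tokenize(text)

# Keep only runs of word characters; punctuation is dropped entirely.
word_only_tokens = RegexpTokenizer(r"\w+").tokenize(text)

print(whitespace_tokens)
print(wordpunct_tokens)
print(word_only_tokens)
```

Comparing the three outputs shows how much the result depends on the tokenizer: the contraction "isn't" stays whole under whitespace splitting but is broken apart by the other two, which matters for whatever cleaning step comes next.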