Skip to Main Content
Purdue University Purdue Logo Purdue Libraries



Introduction NLP part1 - text processing

In this workshop, we will cover the text processing techniques commonly used in NLP. We will learn about regex expressions, tokenization, lemmatization, stemming, stop words, and word vectorization using a bag of words model. We will also look at an example of text classification using the Naive Bayes method. This will set the context of fundamental NLP methods used before machine learning.