Purdue University, Purdue Libraries



Introduction to NLP, Part 1 - Text Processing

In this workshop, we will cover the text processing techniques commonly used in natural language processing (NLP). We will learn about regular expressions (regex), tokenization, lemmatization, stemming, stop words, and word vectorization using a bag-of-words model. We will also look at an example of text classification using the Naive Bayes method. This will provide context for the fundamental NLP methods used before machine learning.
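To give a rough sense of how these pieces fit together, here is a minimal sketch in plain Python. It is not the workshop's own code: the stop-word list, the function names, and the four-sentence training set are all assumptions made up for illustration. Stemming and lemmatization are left out because they are normally done with a dedicated NLP library, and the classifier is a bare-bones multinomial Naive Bayes with add-one smoothing rather than a production implementation.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "the", "is", "was", "and", "of", "to", "this", "it", "i"}


def tokenize(text):
    """Lowercase the text and pull out runs of letters with a regular expression."""
    return re.findall(r"[a-z]+", text.lower())


def preprocess(text):
    """Tokenize, then drop stop words."""
    return [tok for tok in tokenize(text) if tok not in STOP_WORDS]


def bag_of_words(tokens):
    """Represent a document as a mapping from token to occurrence count."""
    return Counter(tokens)


def train_naive_bayes(labeled_docs):
    """Collect per-class word counts, per-class document counts, and the vocabulary."""
    word_counts = {}
    doc_counts = Counter()
    vocab = set()
    for text, label in labeled_docs:
        bow = bag_of_words(preprocess(text))
        word_counts.setdefault(label, Counter()).update(bow)
        doc_counts[label] += 1
        vocab.update(bow)
    return word_counts, doc_counts, vocab


def predict(model, text):
    """Return the class with the highest log-probability under the model."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in doc_counts:
        total_words = sum(word_counts[label].values())
        # Start from the log prior of the class.
        score = math.log(doc_counts[label] / total_docs)
        for tok in preprocess(text):
            if tok not in vocab:
                continue  # ignore words never seen in training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            score += math.log(
                (word_counts[label][tok] + 1) / (total_words + len(vocab))
            )
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical toy training data, invented for this sketch.
TRAIN = [
    ("The movie was great and fun", "pos"),
    ("I loved this great film", "pos"),
    ("The plot was boring and slow", "neg"),
    ("This film was a boring mess", "neg"),
]
model = train_naive_bayes(TRAIN)
```

With this toy data, `predict(model, "a great fun movie")` returns `"pos"` and `predict(model, "boring and slow")` returns `"neg"`. The training set is far too small to mean anything statistically; the sketch only illustrates the order of the steps: clean and tokenize the text, count words, then classify from those counts.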