Purdue University, Purdue Libraries



Clustering Analysis

Clustering analysis is a form of exploratory data analysis in which observations are grouped according to a similarity measure. Grouping similar objects has extensive applications in data science and machine learning. We will introduce the concepts of hard and soft clustering and go through an application of k-means-based clustering.
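To make the distinction between hard and soft clustering concrete, here is a minimal sketch in plain Python. It is an illustration, not the guide's own code. It assumes squared Euclidean distance as the (dis)similarity measure and a tiny made-up 2-D data set. The `kmeans` function runs the standard assign-then-update loop and returns one label per point (hard clustering). The `soft_memberships` helper is a hypothetical illustration of soft clustering: it turns distances to the centroids into weights that sum to 1 using a softmax, which is an assumption made for this example rather than a specific named algorithm. Initial centroids are passed in explicitly so the run is deterministic; in practice random or k-means++ initialization is common.

```python
import math


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, initial_centroids, max_iterations=100):
    """Hard clustering: each point gets exactly one cluster label."""
    centroids = list(initial_centroids)
    k = len(centroids)
    labels = None
    for _ in range(max_iterations):
        # Assignment step: label each point with its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so we have converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # leave an empty cluster's centroid where it is
                centroids[j] = tuple(
                    sum(coord) / len(members) for coord in zip(*members)
                )
    return centroids, labels


def soft_memberships(point, centroids, temperature=1.0):
    """Soft clustering (illustrative): weights over clusters that sum to 1.

    Uses a softmax over negative squared distances; this is an assumption
    for demonstration, not a particular published method.
    """
    scores = [-squared_distance(point, c) / temperature for c in centroids]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


# Made-up data: two visually separate groups of three points each.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
centroids, labels = kmeans(points, initial_centroids=[points[0], points[3]])
memberships = soft_memberships(points[0], centroids)

print(labels)       # [0, 0, 0, 1, 1, 1]
print(memberships)  # two weights summing to 1, almost all on cluster 0
```

A hard assignment answers "which cluster does this point belong to?", while the soft weights answer "how strongly does this point belong to each cluster?". Points near a boundary between clusters would receive more evenly split weights.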