Clustering
Topics
Module Overview
Motivating Example
Data Handling
Exploratory Data Analysis
Exploratory Data Analysis2
Data Modelling
Regression1
Classification1
Regression2
Classification2
Clustering
Resources
Overview
Practical
Clustering
Given unlabeled data, look for subsets that help to improve understanding of the overall data set
Partitioning
Hierarchies
Overview
Introduction to clustering - distance measures, algorithms and success metrics
How to compute distances between instances
Algorithms that partition the data
Handout
Presentation
Practical
Practical session - using Python notebooks to cluster generated blob data and the NIST digits data, using K-means, GMM and (H)DBSCAN
Open