The notebook (right-click and download this file) introduces a data set we have not used before. It relates to predicting Diabetes status, for a group of Pima women, given measurements of BMI, blood glucose and other attributes that might be expected to predict Diabetes onset.
The data set has been provided here and
should be placed in a data subfolder of the folder where you placed the notebook.
The notebook itself takes you through an analysis of the data using the Naive Bayes and KNN algorithms that we met in class. These algorithms were chosen because they are very different in how they work.
Running the notebook cell-by-cell, studying the outputs and comparing your interpretation with the comments and guidance should help your understanding.