Machine Learning Projects


Email Spam Classifier

The task is to determine whether and email is spam or not. I use a bag of words model to create the vocabulary and implement a Naive Bayes classifier from scratch.

See Github repo:


Credit Card Fraud Detection

A classic class imbalance dataset - available at

The objective is to identify fraudulent credit card transactions. Since the class representations are highly unbalanced, confusion matrix accuracy won't do any good.

See Github repo:


Classification on the ILPD

The is a small dataset with slight class imbalance. The project contains some basic exploratory analysis and analyses the performance of various classification algorithms.

See Github repo: