Three Ways to Deal With Imbalance

Posted on Mon 02 March 2020 in Posts • Tagged with machine learning, logistic regression, python, scikit-learn, statistical learning

In this post, I put together an interesting example of what to do with imbalanced datasets and why precision and recall matter.

Introduction

The following is part of a Machine learning assignment I had to do while at CUNY. This particular example illustrates quite well the importance of understanding various …


Continue reading

Linear Regression as the Simplest Classifier

Posted on Mon 24 February 2020 in Posts • Tagged with machine learning, linear regression, python, scikit-learn, statistical learning

In this post I wanted to describe a simple application of a linear least squares method to a problem of data classification. It is a naive approach and is unlikely to beat more sophisticated techniques like Logistic Regression, for instance.

Imports

Some imports we are going to need for this …


Continue reading