Python

Speed up Linear Regression with Matrix Math

Linear Regression is an extremely popular and useful model. It's used by Excel Gurus and Data Scientists alike - but how can we fit lots of regression models quickly? This article walks through various ways to fit a linear regression and speed things up with some Linear Algebra.

Read
Python

Classification with Imbalanced Data

Building classification models on data that has largely imbalanced classes can be difficult. Using techniques such as oversampling, undersampling, resampling combinations, and custom filtering can improve accuracy.

Read
Python

A Straightforward Guide to A/B Testing

A/B Testing can be extremely useful during experimentation. Adding statistical rigor to situations where you compare one option against another. This is one step which can help guard against making faulty conclusions.

Read
Python

Preprocessing Text Data for Machine Learning

Unstructured text data requires unique steps to preprocess in order to prepare it for machine learning. This article walks through some of those steps including tokenization, stopwords, removing punctuation, lemmatization, stemming, and vectorization.

Read