Categories: NewsData

Why you should NEVER run a Logistic Regression (unless you have to) from Featured Blog Posts – Data Science Central

1 min read

Hello fellow Data Science-Centralists!

I wrote a post on my LinkedIn about why you should NEVER run a Logistic Regression. (Unless you really have to).

The main thrust is:

  • There is no theoretical reason why a least squares estimator can’t work on a 0/1.
  • There are very very narrow theoretical reasons that you want to run a logistic, and unless you fall into those categories it’s not worth the time.
  • The run time of a logistic can be up to 100x longer than an OLS model. If you are doing v-fold cross-validation save yourself some time.
  • The XB’s are exactly the same whether you use a Logistic or a linear regression. The model specification (features, feature engineering, feature selection, interaction terms) are identical — and this is what you should be focused on anyways.
  • Myth: Linear regression can only run linear models.
  • There is *one* practical reason to run a logistic: if the results are all very close to 0 or to 1, and you can’t hard code your prediction to 0 or 1 if the linear models falls outside a normal probability range, then use the logistic. So if you are pricing an insurance policy based on risk, you can’t have a hard-coded 0.000% prediction because you can’t price that correctly.

See video here and slides here.

I think it’d be nice to start a debate on this topic!

Matthew Emerick

Share
Published by
Matthew Emerick

Recent Posts

Top life hacks for prepping for your IT certification exam

I remember deciding to pursue my first IT certification, the CompTIA A+. I had signed…

3 years ago

Learn Transformers for Natural Language Processing with Denis Rothman

Key takeaways The transformer architecture has proved to be revolutionary in outperforming the classical RNN…

3 years ago

Learning Essential Linux Commands for Navigating the Shell Effectively

Once we learn how to deploy an Ubuntu server, how to manage users, and how…

3 years ago

Clean Coding in Python with Mariano Anaya

Key-takeaways:   Clean code isn’t just a nice thing to have or a luxury in software projects; it's a necessity. If we…

3 years ago

Exploring Forms in Angular – types, benefits and differences   

While developing a web application, or setting dynamic pages and meta tags we need to deal with…

3 years ago

Gain Practical Expertise with the Latest Edition of Software Architecture with C# 9 and .NET 5

Software architecture is one of the most discussed topics in the software industry today, and…

3 years ago