Difference between logistic regression and softmax regression
Asked Answered
T

5

27

I know that logistic regression is for binary classification and softmax regression for multi-class problem. Would it be any differences if I train several logistic regression models with the same data and normalize their results to get a multi-class classifier instead of using one softmax model. I assume the result is of the same. Can I say : "all the multi-class classifier is the cascading result of binary classifiers". (except neuron network)

Toadinthehole answered 17/3, 2016 at 4:8 Comment(3)
Be aware that there is a standard way of adapting logistic regression to multi-class problems, which is explained e.g. in en.wikipedia.org/wiki/Multinomial_logistic_regression with multiple interpretations, one of which mentions softmax.Immunochemistry
Thank you very much, it is really helpful.Toadinthehole
I have written an answer about that on stats.stackexchange.com. May that help.Prevalent
H
2

There are minor differences in multiple logistic regression models and a softmax output.

Essentially you can map an input of size d to a single output k times, or map an input of size d to k outputs a single time. However, multiple logistic regression models are confusing, and perform poorer in practice. This is because most libraries (TensorFlow, Caffe, Theano) are implemented in low level compiled languages and are highly optimized. Since managing the multiple logistic regression models is likely handled at a higher level, it should be avoided.

Herzig answered 25/5, 2016 at 17:45 Comment(0)
F
16

Echoing on what others have already conveyed.

  1. Softmax Regression is a generalization of Logistic Regression that summarizes a 'k' dimensional vector of arbitrary values to a 'k' dimensional vector of values bounded in the range (0, 1).
  2. In Logistic Regression we assume that the labels are binary (0 or 1). However, Softmax Regression allows one to handle classes.
  3. Hypothesis function:
    • LR:
    • Softmax Regression:

Reference: http://ufldl.stanford.edu/tutorial/supervised/SoftmaxRegression/

Foretop answered 2/4, 2017 at 23:48 Comment(0)
S
10

You can think of logistic regression as a binary classifier and softmax regression is one way(there are other ways) to implement an multi-class classifier. The number of output layers in softmax regression is equal to the number of class you want to predict.

Example: In the case of digit recognition, you have 10 class to predict[0-9] so you can think of this as a situation where you model outputs 10 probabilities for each class and in practice we choose the class that has the highest probability as our predicted class.

From the example above it can be seen that the output a softmax function is equal to the number of classes. These outputs are indeed equal to the probabilities of each class and so they sum up to one. For the algebraic explanation please take a look into Stanford University website which has a good and short explanation of this topic.

link:http://ufldl.stanford.edu/tutorial/supervised/SoftmaxRegression/

Difference: In the link above it is described in detail that a softmax with only 2 class is same as a logistic regression. Therefore it can be said the major difference is only the naming convention. we call it as logistic regression when we are dealing with a 2-class problem and softmax when we are dealing with a multinational(more than 2 classes) problem.

Note: It is worth to remember that softmax regression can also be used in other models like neural networks.

Hope this helps.

Sadick answered 6/12, 2016 at 23:44 Comment(0)
P
6

You can relate logistic regression to binary softmax regression when you transfer the latent model outputs pairs (z1, z2) to z = z1-z2 and apply the logistic function

softmax(z1, z2) = exp(z1)/(exp(z1) + exp(z2)) = exp(z1 - z2)/(exp(z1-z2) + exp(0)) = exp(z)/(exp(z) + 1) 
Photometer answered 19/10, 2019 at 7:36 Comment(1)
Presumably in the case that you are fitting a linear model as an argument for the softmax. This transformation can be acheived through the weights instead and you can write a forumla to translate one sets of weights into another such that the two methods of fitting are "equivalent" for binary classifiers.Pompei
H
2

There are minor differences in multiple logistic regression models and a softmax output.

Essentially you can map an input of size d to a single output k times, or map an input of size d to k outputs a single time. However, multiple logistic regression models are confusing, and perform poorer in practice. This is because most libraries (TensorFlow, Caffe, Theano) are implemented in low level compiled languages and are highly optimized. Since managing the multiple logistic regression models is likely handled at a higher level, it should be avoided.

Herzig answered 25/5, 2016 at 17:45 Comment(0)
H
0

Logistic regression is used for binary classification tasks, where the outcome variable has only two possible classes. Softmax regression, also known as multinomial logistic regression, is an extension of logistic regression used for multiclass classification tasks, where the outcome variable can have more than two classes. While logistic regression calculates the probability of belonging to one class, softmax regression calculates the probabilities for each class and normalizes them using the softmax function, ensuring they sum up to one.

Harrumph answered 1/5, 2024 at 7:48 Comment(0)

© 2022 - 2025 — McMap. All rights reserved.