TensorFlow calls each of the inputs to a softmax a "logit", and its documentation goes on to define the softmax's inputs/logits as "unscaled log probabilities."
Wikipedia and other sources say that a logit is the log of the odds, and the inverse of the sigmoid/logistic function. I.e., if sigmoid(x) = p(x), then logit( p(x) ) = log( p(x) / (1-p(x)) ) = x.
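A quick numeric check of that definition (a minimal sketch in plain NumPy; the names `sigmoid`, `logit`, `x`, and `p` are just illustrative):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def logit(p):
    # log-odds: the inverse of the sigmoid on (0, 1)
    return np.log(p / (1.0 - p))

x = np.array([-2.0, 0.0, 0.5, 3.0])
p = sigmoid(x)                    # maps the real line into (0, 1)
print(np.allclose(logit(p), x))   # True: logit(sigmoid(x)) == x
```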
Is there a mathematical or conventional reason for TensorFlow to call a softmax's inputs "logits"? Shouldn't they just be called "unscaled log probabilities"?
Perhaps TensorFlow just wanted to keep the same variable name for binary logistic regression (where it makes sense to use the term logit) and categorical logistic regression...
This question was covered a little bit here, but no one seemed bothered by the use of the word "logit" to mean "unscaled log probability".
"Logit" is nowadays used in the ML community for any vector of unnormalised scores — basically anything that gets mapped to a probability distribution by a parameter-less transformation, such as the sigmoid function for a binary variable or softmax for a multinomial one. It is not a strict mathematical term, but it has gained enough popularity to be included in the TF documentation.
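To illustrate this looser usage, here is a sketch (plain NumPy, illustrative names) of softmax turning an arbitrary score vector into a probability distribution. The shift invariance is why the inputs are only "unscaled" log probabilities: adding any constant to the logits leaves the resulting distribution unchanged.

```python
import numpy as np

def softmax(z):
    z = z - np.max(z)      # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.1])   # any real-valued scores
probs = softmax(logits)

print(probs, probs.sum())                         # non-negative, sums to 1
print(np.allclose(softmax(logits + 5.0), probs))  # True: shift-invariant
```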