Workign with small probabilities, via logs

Q: What does taking the log of a probability do?

Taking the log not only simplifies the subsequent mathematical analysis, but it also helps numerically because the product of a large number of small probabilities can easily underflow the numerical precision of the computer, and this is resolved by computing instead the sum of the log probabilities.

Q: How do you calculate log probability?

2. obtain the log-odds for a given probability by taking the natural logarithm of the odds, e.g., log(0.25) = -1.3862944 or using the qlogis function on the probability value, e.g., qlogis(0.2) = -1.3862944.

Q: What is Logprobs?

The logprob is the log of the probability that a token comes next.

Tags:

algorithm

precision

logarithm

Source: Google Code Jam. https://code.google.com/codejam/contest/10224486/dashboard#s=a&a=1

We're asked to calculate Prob(K successes from N trials) where each of the N trials has a known success probability of p_n.

Some Analysis and thoughts on the problem are given after the Code Jam.

They observe that evaluating all possible outcomes of your N trials would take you an exponential time in N, so instead they provide a nice "dynamic programming" style solution that's O(N^2).

Let P(p#q) = Prob(p Successes after the first q Trials) Then observe the fact that:

Prob(p#q) = Prob(qth trial succeeds)*P(p-1#q-1) + Prob(qth trial fails)*P(p#q-1)

Now we can build up a table of P(i#j) where i<=j, for i = 1...N

That's all lovely - I follow all of this and could implement it.

Then as the last comment, they say:

In practice, in problems like this, one should store the logarithms of
probabilities instead of the actual values, which can become small
enough for floating-point precision errors to matter.

I think I broadly understand the point they're trying to make, but I specifically can't figure out how to use this suggestion.

Taking the above equation, and substuting in some lettered variables:

P = A*B + C*D

If we want to work in Log Space, then we have:

Log(P) = Log(A*B + C*D),

where we have recursively pre-computed Log(B) and Log(D), and A & B are known, easily-handled decimals.

But I don't see any way to calculate Log(P) without just doing e^(Log(B)), etc. which feels like it would defeat to point of working in log space`?

Does anyone understand in better detail what I'm supposed to be doing?

774

asked May 13 '17 08:05

Brondahl

1 Answers

Starting from the initial relation:

P = A⋅B + C⋅D

Due to its symmetry we can assume that B is larger than D, without loss of generality. The following processing is useful:

log(P) = log(A⋅B + C⋅D) = log(A⋅e^log(B) + C⋅e^log(D)) = log(e^log(B)⋅(A + C⋅e^{log(D) - log(B)})

log(P) = log(B) + log(A + C⋅e^{log(D) - log(B)}).

This is useful because, in this case, log(B) and log(D) are both negative numbers (logarithms of some probabilities). It was assumed that B is larger than D, thus its log is closer to zero. Therefore log(D) - log(B) is still negative, but not as negative as log(D).

So now, instead of needing to perform exponentiation of log(B) and log(D) separately, we only need to perform exponentiation of log(D) - log(B), which is a mildly negative number. So the above processing leads to better numerical behavior than using logarithms and applying exponentiation in the trivial way, or, equivalently, than not using logarithms at all.

answered Sep 23 '22 05:09

qwertyman

Related questions
                            
                                Fisher Yates variation
                            
                                algorithm - Sort an array with LogLogN distinct elements
                            
                                Why does Big-O Notation use O(1) instead of O(k)?
                            
                                Removing minimum no of edges to disconnect two vertices in a graph
                            
                                Dictionary using Red-Black tree - deletion error
                            
                                Algorithm for merging spatially close paths / line segments
                            
                                What does `(i & (i + 1)) - 1` mean? (in Fenwick Trees)
                            
                                Fast calculation of floating 1/N if factorization of very large integer N is known
                            
                                looking for a tuple matching algorithm
                            
                                Partitioning big rectangle to small ones (2D Packing)
                            
                                generating an sequential five digit alphanumerical ID
                            
                                Algorithm to find a repeated number in a list that may contain any number of repeats
                            
                                algorithm to parse string with dictionary
                            
                                Binary GCD Algorithm vs. Euclid's Algorithm on modern computers
                            
                                minimum sum required to make xor of some integers to zero
                            
                                Find pairs in an array such that a%b = k , where k is a given integer
                            
                                Dijkstra's Algorithm: Why is it needed to find minimum-distance element in the queue
                            
                                How can I analyze or improve my niece's simple compression algorithm that is based on Morse code?
                            
                                Python and OpenCV - Improving my lane detection algorithm
                            
                                Quicksort - reason for equals checks

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With