Best way to calculate the fundamental matrix of an absorbing Markov Chain?

Tags:

I have a very large absorbing Markov chain (scales to problem size -- from 10 states to millions) that is very sparse (most states can react to only 4 or 5 other states).

I need to calculate one row of the fundamental matrix of this chain (the average frequency of each state given one starting state).

Normally, I'd do this by calculating (I - Q)^(-1), but I haven't been able to find a good library that implements a sparse matrix inverse algorithm! I've seen a few papers on it, most of them P.h.D. level work.

Most of my Google results point me to posts talking about how one shouldn't use a matrix inverse when solving linear (or non-linear) systems of equations... I don't find that particularly helpful. Is the calculation of the fundamental matrix similar to solving a system of equations, and I simply don't know how to express one in the form of the other?

So, I pose two specific questions:

What's the best way to calculate a row (or all the rows) of the inverse of a sparse matrix?

What's the best way to calculate a row of the fundamental matrix of a large absorbing Markov chain?

A Python solution would be wonderful (as my project is still currently a proof-of-concept), but if I have to get my hands dirty with some good ol' Fortran or C, that's not a problem.

Edit: I just realized that the inverse B of matrix A can be defined as AB=I, where I is the identity matrix. That may allow me to use some standard sparse matrix solvers to calculate the inverse... I've got to run off, so feel free to complete my train of thought, which I'm starting to think might only require a really elementary matrix property...

519

asked Jul 29 '12 00:07

Ryan Marcus

2 Answers

Assuming that what you're trying to do is work out is the expected number of steps before absorbtion, the equation from "Finite Markov Chains" (Kemeny and Snell), which is reproduced on Wikipedia is:

t=N1

Or expanding the fundamental matrix

t=(I-Q)^-1 1

Rearranging:

(I-Q) t = 1

Which is in the standard format for using functions for solving systems of linear equations

A x = b

Putting this into practice to demonstrate the difference in performance (even for much smaller systems than those you're describing).

import networkx as nx
import numpy

def example(n):
    """Generate a very simple transition matrix from a directed graph
    """
    g = nx.DiGraph()
    for i in xrange(n-1):
        g.add_edge(i+1, i)
        g.add_edge(i, i+1)
    g.add_edge(n-1, n)
    g.add_edge(n, n)
    m = nx.to_numpy_matrix(g)
    # normalize rows to ensure m is a valid right stochastic matrix
    m = m / numpy.sum(m, axis=1)
    return m

Presenting the two alternative approaches for calculating the number of expected steps.

def expected_steps_fundamental(Q):
    I = numpy.identity(Q.shape[0])
    N = numpy.linalg.inv(I - Q)
    o = numpy.ones(Q.shape[0])
    numpy.dot(N,o)

def expected_steps_fast(Q):
    I = numpy.identity(Q.shape[0])
    o = numpy.ones(Q.shape[0])
    numpy.linalg.solve(I-Q, o)

Picking an example that's big enough to demonstrate the types of problems that occur when calculating the fundamental matrix:

P = example(2000)
# drop the absorbing state
Q = P[:-1,:-1]

Produces the following timings:

%timeit expected_steps_fundamental(Q)
1 loops, best of 3: 7.27 s per loop

And:

%timeit expected_steps_fast(Q)
10 loops, best of 3: 83.6 ms per loop

Further experimentation is required to test the performance implications for sparse matrices, but it's clear that calculating the inverse is much much slower than what you might expect.

A similar approach to the one presented here can also be used for the variance of the number of steps

answered Sep 21 '22 11:09

Andrew Walker

The reason you're getting the advice not to use matrix inverses for solving equations is because of numerical stability. When you're matrix has eigenvalues that are zero or near zero, you have problems either from lack of an inverse (if zero) or numerical stability (if near zero). The way to approach the problem, then, is to use an algorithm that doesn't require that an inverse exist. The solution is to use Gaussian elimination. This doesn't provide a full inverse, but rather gets you to row-echelon form, a generalization of upper-triangular form. If the matrix is invertible, then the last row of the result matrix contains a row of the inverse. So just arrange that the last row you eliminate on is the row you want.

I'll leave it to you to understand why I-Q is always invertible.

answered Sep 21 '22 11:09

eh9

Related questions
                            
                                Python: Pickling highly-recursive objects without using `setrecursionlimit`
                            
                                How to display total record count against models in django admin
                            
                                Is it possible to scan for Wi-Fi using Python?
                            
                                use the system monospace font in gtk textview
                            
                                Python multiprocessing Pool.map is calling aquire?
                            
                                Why is self only a convention and not a real Python keyword?
                            
                                Python: Organization of user-defined exceptions in a complete project
                            
                                sending password to command line tools
                            
                                Edit with IDLE (Python GUI) context menu on Windows&nbsp;7
                            
                                Is it pythonic to use interfaces / abstract base classes?
                            
                                os.getcwd() for a different drive in Windows
                            
                                Designing an async API in Python
                            
                                Dividing decimals yields invalid results in Python 2.5 to 2.7
                            
                                Using python multiprocessing pipes
                            
                                Does Django cache related ForeignKey and ManyToManyField fields once they're accessed?
                            
                                markdown to html using a specified css
                            
                                Django JSON De-serialization Security
                            
                                Checking for NaN presence in a container
                            
                                What are the advantages of PyQt over PyGTK and vice-versa? [closed]
                            
                                Python 3 exception not printing new line

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Best way to calculate the fundamental matrix of an absorbing Markov Chain?

Tags:

python

algorithm

math

sparse-matrix

markov-chains

Ryan Marcus

People also ask

2 Answers

Andrew Walker

eh9

Recent Activity

Donate For Us