Efficient way to generate random contingency tables?

What is an efficient way to generate a random contingency table? A contingency table is defined as a rectangular matrix such that the sum of each row is fixed, and the sum of each column is fixed, but the individual elements may be anything as long as the sum of each row and column is correct.

Note that it's very easy to generate random contingency tables, but I'm looking for something more efficient than the naive algorithm.
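For reference, here is a minimal sketch of what a naive algorithm could look like. The question does not say which naive algorithm is meant, so this is an assumption: it takes non-negative integer row and column sums, expands them into one label per unit of count, shuffles the column labels, and tallies the pairs. The function name `naive_random_table` and its parameters are made up for illustration.

```python
import random

def naive_random_table(row_sums, col_sums, rng=random):
    """Sample a table with the given margins by shuffling unit labels.

    Assumes non-negative integer margins (an assumption, not stated in
    the question).
    """
    if sum(row_sums) != sum(col_sums):
        raise ValueError("row and column sums must have the same total")
    # One label per unit of count: row i appears row_sums[i] times,
    # column j appears col_sums[j] times.
    row_labels = [i for i, r in enumerate(row_sums) for _ in range(r)]
    col_labels = [j for j, c in enumerate(col_sums) for _ in range(c)]
    # A random pairing of row units with column units.
    rng.shuffle(col_labels)
    table = [[0] * len(col_sums) for _ in row_sums]
    for i, j in zip(row_labels, col_labels):
        table[i][j] += 1
    return table
```

The running time and memory grow with the grand total of the table rather than with the number of cells, which is one reason to look for something better when the counts are large. The tables it produces follow the hypergeometric distribution implied by independent rows and columns, not a uniform distribution over all valid tables.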


asked Jun 04 '09 02:06

dsimcha

2 Answers

Looking at the code of the networksis package for R might be helpful. I believe that efficient computation requires fancy Markov chain sequential importance resampling techniques, so you might want to avoid reimplementing this if you can.

Edit: The relevant paper is Chen, Diaconis, Holmes, and Liu (2005). In the words of the authors, "[o]ur method compares favorably with other existing Monte Carlo-based algorithms, and sometimes is a few orders of magnitude more efficient."

answered Sep 24 '22 03:09

othercriteria

This sounds like a constraint satisfaction problem (CSP) to me.

You would start at some cell and choose its value randomly from its set of allowed values. Then you update the sets of eligible values for all cells in the same row and column, pick the next cell (according to the CSP heuristic you are using), and randomly assign it a value from its own set of eligible values, again updating the cells in its row and column. If you encounter a cell whose set of eligible values is empty, you have to backtrack.

However, the notion of 'set of eligible values' might be hard to represent in a data structure, depending on the range of values you are allowing.
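As a rough sketch of this idea, assuming the entries are non-negative integers, the set of eligible values for a cell can be stored as an interval `[lo, hi]` instead of an explicit set. The code below fills cells in row-major order; the name `fill_cell_by_cell` and the fixed cell order are illustrative choices, not part of the answer above. With the bounds used here the interval is never empty, so this particular variant needs no backtracking.

```python
import random

def fill_cell_by_cell(row_sums, col_sums, rng=random):
    """Fill a table cell by cell, drawing each value from its eligible interval.

    Assumes non-negative integer margins with equal totals.
    """
    if sum(row_sums) != sum(col_sums):
        raise ValueError("row and column sums must have the same total")
    if any(s < 0 for s in list(row_sums) + list(col_sums)):
        raise ValueError("margins must be non-negative")
    n_rows, n_cols = len(row_sums), len(col_sums)
    col_rem = list(col_sums)  # count still to be placed in each column
    table = [[0] * n_cols for _ in range(n_rows)]
    for i in range(n_rows):
        row_rem = row_sums[i]  # count still to be placed in this row
        right_cap = sum(col_rem)
        for j in range(n_cols):
            # Capacity left in the columns strictly to the right of j.
            right_cap -= col_rem[j]
            # Whatever cannot fit to the right must go into this cell.
            lo = max(0, row_rem - right_cap)
            # The cell cannot exceed what its row or column still needs.
            hi = min(row_rem, col_rem[j])
            x = rng.randint(lo, hi)  # inclusive on both ends
            table[i][j] = x
            row_rem -= x
            col_rem[j] -= x
    return table
```

Note that drawing each cell uniformly from its interval does not make the resulting table uniform over all tables with these margins. If a particular target distribution matters, the draws would need to be reweighted or the sampler changed.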

answered Sep 23 '22 03:09

Roland Ewald

Tags: algorithm, optimization, statistics, montecarlo
