How large a system is it reasonable to attempt to do a linear regression on? Specifically: I have a system with ~300K sample points and ~1200 linear terms. Is this computationally feasible?

What is the BigO of linear regression?

2 Answers

The linear regression is computed as (X'X)^-1 X'Y.

If X is an (n x k) matrix:

1. (X' X) takes O(n*k^2) time and produces a (k x k) matrix
2. The matrix inversion of a (k x k) matrix takes O(k^3) time
3. (X' Y) takes O(n*k) time and produces a (k x 1) vector, assuming Y is a single (n x 1) column
4. The final multiplication of the (k x k) inverse by the (k x 1) vector takes O(k^2) time

So the Big-O running time is O(k^2*(n + k)).
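To put rough numbers on the case in the question (n = 300K, k = 1200), here is a back-of-the-envelope sketch. The function name `normal_equation_costs` and the 8 bytes per value (double precision) are assumptions made for illustration, and the counts are orders of magnitude, not exact flop counts for any particular library.

```python
def normal_equation_costs(n, k, bytes_per_value=8):
    """Rough multiply-add counts for computing (X'X)^-1 X'Y with an n x k matrix X."""
    return {
        "form_XtX": n * k * k,                # k*k entries, each a length-n dot product
        "form_XtY": n * k,                    # k entries, each a length-n dot product
        "solve_kxk": k ** 3,                  # inverting/factoring the k x k matrix
        "bytes_X": n * k * bytes_per_value,   # storage for X itself (assumed doubles)
        "bytes_XtX": k * k * bytes_per_value, # storage for the k x k product
    }


costs = normal_equation_costs(n=300_000, k=1_200)
for name, value in costs.items():
    print(f"{name:>10}: {value:.3e}")
```

Under these assumptions, forming X'X is about 4.3e11 multiply-adds and dwarfs the roughly 1.7e9 for the (k x k) step, while X itself needs about 2.9 GB at double precision. So the n*k^2 term is the one that decides feasibility here.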

See also: http://en.wikipedia.org/wiki/Computational_complexity_of_mathematical_operations#Matrix_algebra

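If you get fancy, it looks like you can get the (k x k) inversion down to O(k^2.376) with the Coppersmith–Winograd algorithm, for a total of O(k^2*(n + k^0.376)). That is a theoretical bound only: the constant factors are far too large for the algorithm to be used in practice.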

answered Oct 13 '22 21:10

Emiller

You can express this as a matrix equation:

X b = Y

where the matrix X is 300K rows and 1200 columns, the coefficient vector b is 1200x1, and the RHS vector Y is 300Kx1.

If you multiply both sides by the transpose of the matrix X, you have a system of equations for the unknowns that's 1200x1200: (X'X) b = X'Y. You can use LU decomposition or any other algorithm you like to solve for the coefficients. (This is what least squares is doing.)

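So the Big-O behavior is something like O(m*n*n), where m = 300K and n = 1200. You'd account for the transpose, the matrix multiplication, the LU decomposition, and the forward-back substitution to get the coefficients. The matrix multiplication dominates at O(m*n^2); the LU decomposition is O(n^3) and the substitution is O(n^2).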
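The steps above can be sketched in plain Python. This is a minimal illustration, not a production solver: the function name `fit_least_squares` and the toy data are made up for the example, and it uses Gaussian elimination with partial pivoting (the elimination that an LU decomposition records) followed by back substitution. It also accumulates the products one sample at a time, so the full 300K-row matrix never has to sit in memory at once.

```python
def fit_least_squares(rows, ys):
    """Solve the normal equations for one response column.

    rows: list of n samples, each a list of k term values.
    ys:   list of n observed values.
    """
    k = len(rows[0])
    xtx = [[0.0] * k for _ in range(k)]   # k x k product of the transpose and the matrix
    xty = [0.0] * k                        # k x 1 product of the transpose and the RHS

    # Accumulate both products sample by sample: O(n * k^2) overall.
    for row, y in zip(rows, ys):
        for i in range(k):
            xty[i] += row[i] * y
            for j in range(k):
                xtx[i][j] += row[i] * row[j]

    # Gaussian elimination with partial pivoting on the augmented system: O(k^3).
    a = [xtx[i] + [xty[i]] for i in range(k)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("normal-equation matrix is singular or nearly so")
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, k):
            factor = a[r][col] / a[col][col]
            for c in range(col, k + 1):
                a[r][c] -= factor * a[col][c]

    # Back substitution: O(k^2).
    coeffs = [0.0] * k
    for i in range(k - 1, -1, -1):
        known = sum(a[i][j] * coeffs[j] for j in range(i + 1, k))
        coeffs[i] = (a[i][k] - known) / a[i][i]
    return coeffs


# Toy data lying exactly on y = 1 + 2*x; the first column of ones is the intercept term.
rows = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]
ys = [1.0, 3.0, 5.0, 7.0]
coeffs = fit_least_squares(rows, ys)
print(coeffs)
```

At the size in the question you would hand these steps to an optimized BLAS/LAPACK-backed library instead of Python loops. Forming the normal equations also squares the condition number of the problem, so QR- or SVD-based least-squares routines are the safer choice when the terms are nearly collinear.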

answered Oct 13 '22 21:10

duffymo

Tags: big-o, linear-regression, blas, gsl, BCS
