Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Check if two scipy.sparse.csr_matrix are equal

I want to check if two csr_matrix are equal.

If I do:

x.__eq__(y) 

I get:

raise ValueError("The truth value of an array with more than one " ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all(). 

This, However, works well:

assert (z in x for z in y) 

Is there a better way to do it? maybe using some scipy optimized function instead?

Thanks so much

like image 852
omerbp Avatar asked Jun 06 '15 16:06

omerbp


People also ask

How do you compare two sparse matrices in Python?

If you have 2 CSR matrices (X and Y) of different shapes, you can simply use X!= Y and it will return True. If CSR matrices are of the same shape, X!= Y returns a matrix of the same shape with boolean elements.

What does SciPy sparse Csr_matrix do?

The function csr_matrix() is used to create a sparse matrix of compressed sparse row format whereas csc_matrix() is used to create a sparse matrix of compressed sparse column format.

Is sparse array and sparse matrix same?

In numerical analysis and scientific computing, a sparse matrix or sparse array is a matrix in which most of the elements are zero.

How do SciPy sparse matrices multiply?

We use the multiply() method provided in both csc_matrix and csr_matrix classes to multiply two sparse matrices. We can multiply two matrices of same format( both matrices are csc or csr format) and also of different formats ( one matrix is csc and other is csr format).


1 Answers

Can we assume they are the same shape?

In [202]: a=sparse.csr_matrix([[0,1],[1,0]]) In [203]: b=sparse.csr_matrix([[0,1],[1,1]]) In [204]: (a!=b).nnz==0    Out[204]: False 

This checks the sparsity of the inequality array.

It will give you an efficiency warning if you try a==b (at least the 1st time you use it). That's because it has to test all those zeros. It can't take much advantage of the sparsity.

You need a relatively recent version to use logical operators like this. Were you trying to use x.__eq__(y) in some if expression, or did you get error from just that expression?

In general you probably want to check several parameters first. Same shape, same nnz, same dtype. You need to be careful with floats.

For dense arrays np.allclose is a good way of testing equality. And if the sparse arrays aren't too large, that might be good as well

np.allclose(a.A, b.A) 

allclose uses all(less_equal(abs(x-y), atol + rtol * abs(y))). You can use a-b, but I suspect that this too will give an efficiecy warning.

like image 63
hpaulj Avatar answered Oct 10 '22 06:10

hpaulj