Is there an efficient way of concatenating scipy.sparse matrices?

Tags:

I'm working with some rather large sparse matrices (from 5000x5000 to 20000x20000) and need to find an efficient way to concatenate matrices in a flexible way in order to construct a stochastic matrix from separate parts.

Right now I'm using the following way to concatenate four matrices, but it's horribly inefficient. Is there any better way to do this that doesn't involve converting to a dense matrix?

rmat[0:m1.shape[0],0:m1.shape[1]] = m1 rmat[m1.shape[0]:rmat.shape[0],m1.shape[1]:rmat.shape[1]] = m2 rmat[0:m1.shape[0],m1.shape[1]:rmat.shape[1]] = bridge rmat[m1.shape[0]:rmat.shape[0],0:m1.shape[1]] = bridge.transpose()

793

asked Jul 27 '11 13:07

jones

2 Answers

The sparse library now has hstack and vstack for respectively concatenating matrices horizontally and vertically.

126

answered Sep 28 '22 16:09

Erik

Using hstack, vstack, or concatenate, is dramatically slower than concatenating the inner data objects themselves. The reason is that hstack/vstack converts the sparse matrix to coo format which can be very slow when the matrix is very large not and not in coo format. Here is the code for concatenating csc matrices, similar method can be used for csr matrices:

def concatenate_csc_matrices_by_columns(matrix1, matrix2):     new_data = np.concatenate((matrix1.data, matrix2.data))     new_indices = np.concatenate((matrix1.indices, matrix2.indices))     new_ind_ptr = matrix2.indptr + len(matrix1.data)     new_ind_ptr = new_ind_ptr[1:]     new_ind_ptr = np.concatenate((matrix1.indptr, new_ind_ptr))      return csc_matrix((new_data, new_indices, new_ind_ptr))

answered Sep 28 '22 15:09

Amos

Related questions
                            
                                Python insert numpy array into sqlite3 database
                            
                                Why are tuples constructed from differently initialized sets equal?
                            
                                Excluding a top-level directory from a setuptools package
                            
                                Force Python to forego native sqlite3 and use the (installed) latest sqlite3 version
                            
                                How to convert country names to ISO 3166-1 alpha-2 values, using python
                            
                                Dictionary keys and values to separate numpy arrays
                            
                                pandas - Merging on string columns not working (bug?)
                            
                                Matplotlib bar graph x axis won't plot string values
                            
                                django - getlist()
                            
                                Removing list of words from a string
                            
                                Selecting columns by list (and columns are subset of list)
                            
                                How can I render a ManyToManyField as checkboxes?
                            
                                How to compare plain text password to hashed password using bcrypt?
                            
                                AttributeError while using Django Rest Framework with serializers
                            
                                Table 'roles_users' is already defined for this MetaData instance
                            
                                Matplotlib y axis values are not ordered [duplicate]
                            
                                suds install error: no module named client
                            
                                Pandas - Replace values based on index
                            
                                Introspection to get decorator names on a method?
                            
                                Import from sibling directory

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is there an efficient way of concatenating scipy.sparse matrices?

Tags:

python

concatenation

scipy

sparse-matrix

jones

People also ask

2 Answers

Erik

Amos

Recent Activity

Donate For Us