I have a large <code>scipy.sparse.csc_matrix</code> and would like to normalize it, that is, subtract the column mean from each element and divide by the column standard deviation (std). <code>scipy.sparse.csc_matrix</code> has a <code>.mean()</code> method, but is there an efficient way to compute the variance or std?

How do I compute the variance of a column of a sparse matrix in Scipy?

1 Answer

You can calculate the variance yourself from the mean, using the following formula:

Var[X] = E[X^2] - (E[X])^2

Here E[X] stands for the column mean. To get E[X^2], square the csc_matrix element-wise (not as a matrix product) and then apply the mean function along the columns (axis=0). To get (E[X])^2, square the result of the mean function applied to the original matrix. The standard deviation is the square root of this variance.
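As a minimal sketch of the formula above (the helper name column_variance and the small example matrix are my own, not from the question), the per-column variance could be computed like this. It assumes the population variance (dividing by the number of rows) and uses multiply() for the element-wise square:

```python
import numpy as np
from scipy import sparse


def column_variance(X):
    """Population variance of each column via E[X^2] - (E[X])^2.

    Hypothetical helper for illustration; X is any scipy sparse matrix.
    """
    # E[X] per column; np.asarray(...).ravel() flattens the 1 x n result.
    mean = np.asarray(X.mean(axis=0)).ravel()
    # E[X^2] per column; multiply() is element-wise, so the result stays sparse.
    mean_sq = np.asarray(X.multiply(X).mean(axis=0)).ravel()
    # Rounding can push tiny variances slightly below zero, so clip at 0.
    return np.maximum(mean_sq - mean ** 2, 0.0)


# Small made-up matrix, only to show the call.
X = sparse.csc_matrix(np.array([[1.0, 0.0, 2.0],
                                [0.0, 0.0, 3.0],
                                [4.0, 5.0, 0.0]]))
var = column_variance(X)
std = np.sqrt(var)
```

Only the stored non-zero entries are touched, so this should scale with the number of non-zeros rather than the full matrix size. Keep in mind that subtracting the column mean from every element generally turns the zeros into non-zeros, so the fully centered matrix will usually no longer be sparse.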

127

answered Sep 21 '22 07:09

Sicco


Tags: python, numpy, scipy

nickponline
