
Why is using tanh definition of logistic sigmoid faster than scipy's expit?

I'm using a logistic sigmoid in an application. I compared the run time of scipy.special's expit function against an equivalent expression built from the hyperbolic tangent definition of the sigmoid.

I found the hyperbolic tangent version to be about 3 times as fast. What is going on here? I also timed both on a sorted array to see whether the result changed.
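For reference, the two expressions agree because of a standard identity relating tanh to the logistic function:

$$\frac{1}{2}\tanh\!\left(\frac{x}{2}\right) + \frac{1}{2}
= \frac{1}{2}\cdot\frac{1 - e^{-x}}{1 + e^{-x}} + \frac{1}{2}
= \frac{(1 - e^{-x}) + (1 + e^{-x})}{2\,(1 + e^{-x})}
= \frac{1}{1 + e^{-x}} = \operatorname{expit}(x),$$

using $\tanh(x/2) = (1 - e^{-x})/(1 + e^{-x})$, which follows from multiplying the numerator and denominator of tanh's definition by $e^{-x/2}$.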

Here is an example that was run in IPython:

In [1]: from scipy.special import expit

In [2]: myexpit = lambda x: 0.5*tanh(0.5*x) + 0.5

In [3]: x = randn(100000)

In [4]: allclose(expit(x), myexpit(x))
Out[4]: True

In [5]: timeit expit(x)
100 loops, best of 3: 15.2 ms per loop

In [6]: timeit myexpit(x)
100 loops, best of 3: 4.94 ms per loop

In [7]: y = sort(x)

In [8]: timeit expit(y)
100 loops, best of 3: 15.3 ms per loop

In [9]: timeit myexpit(y)
100 loops, best of 3: 4.37 ms per loop
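For anyone not running IPython with pylab-style imports (the session above relies on `tanh`, `randn`, etc. being in the namespace), here is a self-contained version of the same comparison. The helper name `expit_tanh` is mine; absolute timings will of course vary by machine:

```python
import timeit

import numpy as np
from scipy.special import expit

def expit_tanh(x):
    # Identity: 1 / (1 + exp(-x)) == 0.5 * tanh(0.5 * x) + 0.5
    return 0.5 * np.tanh(0.5 * x) + 0.5

x = np.random.randn(100000)

# Sanity check: both implementations agree to floating-point tolerance.
assert np.allclose(expit(x), expit_tanh(x))

for name, fn in [("expit", expit), ("tanh-based", expit_tanh)]:
    per_call = min(timeit.repeat(lambda: fn(x), number=100, repeat=3)) / 100
    print(f"{name}: {per_call * 1e3:.2f} ms per call")
```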

Edit:

Machine info:

  • Ubuntu 16.04
  • RAM: 7.4 GB
  • Intel Core i7-3517U CPU @ 1.90GHz × 4

Numpy/Scipy info:

In [1]: np.__version__
Out[1]: '1.12.0'

In [2]: np.__config__.show()
lapack_opt_info:
    libraries = ['openblas', 'openblas']
    library_dirs = ['/usr/local/lib']
    define_macros = [('HAVE_CBLAS', None)]
    language = c
blas_opt_info:
    libraries = ['openblas', 'openblas']
    library_dirs = ['/usr/local/lib']
    define_macros = [('HAVE_CBLAS', None)]
    language = c
openblas_info:
    libraries = ['openblas', 'openblas']
    library_dirs = ['/usr/local/lib']
    define_macros = [('HAVE_CBLAS', None)]
    language = c
blis_info:
  NOT AVAILABLE
openblas_lapack_info:
    libraries = ['openblas', 'openblas']
    library_dirs = ['/usr/local/lib']
    define_macros = [('HAVE_CBLAS', None)]
    language = c
lapack_mkl_info:
  NOT AVAILABLE
blas_mkl_info:
  NOT AVAILABLE

In [3]: import scipy

In [4]: scipy.__version__
Out[4]: '0.18.1'
asked Mar 26 '17 by Matt Hancock


1 Answer

Edit:

I'll refer future readers to this question.


To summarize results from helpful comments:

"Why is using tanh definition of logistic sigmoid faster than scipy's expit?"

Answer: It's not; there's some funny business going on with the tanh and exp C functions on my specific machine.

It turns out that on my machine, the C library's tanh is faster than exp. Why that is the case obviously belongs to a different question. When I run the C++ code listed below, I see

tanh: 5.22203
exp: 14.9393

which matches the roughly 3x speedup of the tanh-based version when called from Python. The strange thing is that when I run the identical code on a separate machine with the same OS, I get similar timing results for tanh and exp.

#include <iostream>
#include <cmath>
#include <ctime>

using namespace std;

int main() {
    double a = -5;
    double b =  5;
    int N =  10001;
    double x[10001];
    double y[10001];
    double h = (b-a) / (N-1);  // grid spacing over [a, b]

    clock_t begin, end;

    // Evenly spaced sample points in [-5, 5]
    for(int i=0; i < N; i++)
        x[i] = a + i*h;

    begin = clock();

    // The inner j-loop repeats each evaluation N times so the total
    // runtime is large enough for clock() to measure reliably.
    for(int i=0; i < N; i++)
        for(int j=0; j < N; j++)
            y[i] = tanh(x[i]);

    end = clock();

    cout << "tanh: " << double(end - begin) / CLOCKS_PER_SEC << "\n";

    begin = clock();

    for(int i=0; i < N; i++)
        for(int j=0; j < N; j++)
            y[i] = exp(x[i]);

    end = clock();

    cout << "exp: " << double(end - begin) / CLOCKS_PER_SEC << "\n";

    return 0;
}
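The same libm-level comparison can also be sketched from Python by timing the scalar math.tanh and math.exp functions, which call the C library directly. This is a quick sanity check rather than a rigorous benchmark; the numbers are machine-dependent:

```python
import math
import timeit

# Repeatedly call the scalar C-library functions on a fixed argument,
# bypassing NumPy's vectorized machinery entirely.
n = 1_000_000
t_tanh = timeit.timeit("math.tanh(0.5)", globals=globals(), number=n)
t_exp = timeit.timeit("math.exp(0.5)", globals=globals(), number=n)

print(f"math.tanh: {t_tanh:.3f} s for {n:,} calls")
print(f"math.exp:  {t_exp:.3f} s for {n:,} calls")
```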
answered Sep 30 '22 by Matt Hancock