I was trying to run a code snippet that looks like this:
import numpy as np
import time

def estimate_mutual_info(X, neurons, bins = 5):
    xy = np.histogram2d(X, neurons, bins)[0]
    x = np.histogram(X, bins)[0]
    y = np.histogram(neurons, bins)[0]
    ent_x = -1 * np.sum( x / np.sum(x) * np.log( x / np.sum(x)))
    ent_y = -1 * np.sum( y / np.sum(y) * np.log( y / np.sum(y)))
    ent_xy = -1 * np.sum( xy / np.sum(xy) * np.log( xy / np.sum(xy)))
    return (ent_x + ent_y - ent_xy)

tic = time.time()
X = np.random.rand(12000, 1200)
Y = np.random.rand(12000, 10)
for j in Y.T:
    mi = 0
    for i in range(X.shape[1]):
        mi += estimate_mutual_info(X.T[i], j, bins = 2)
    print(mi)
toc = time.time()
print(str(toc - tic) + " seconds")
To increase the speed, I used float16, hoping to see some improvement, but float16 was much slower than float32 and float64:
X = np.random.rand(12000, 1200).astype('float16')
Y = np.random.rand(12000, 10).astype('float16')
Changing them to float16 results in an execution time of 84.57 seconds, whereas float64 and float32 take 36.27 seconds and 33.25 seconds respectively. I am not sure what causes this poor performance for float16. My processor is 64-bit, and I am using Python 3.7 and numpy 1.16.2. I don't think a 64-bit processor treats 16-bit, 32-bit and 64-bit values the same way. Any correction and insight is much appreciated.
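A minimal sketch that reproduces the dtype gap in isolation (the array size and repeat count here are arbitrary illustrative choices, not the original 12000 x 1200 data): time the same histogram call on each dtype.

```python
import time

import numpy as np

# Hypothetical size for illustration; smaller than the original arrays.
base = np.random.rand(200_000)

for dtype in (np.float16, np.float32, np.float64):
    data = base.astype(dtype)
    tic = time.time()
    for _ in range(50):
        np.histogram(data, bins=2)  # same call the MI estimator makes
    toc = time.time()
    print(f"{np.dtype(dtype).name}: {toc - tic:.3f} seconds")
```

On hardware without native FP16 arithmetic, the float16 line should stand out, matching the timings reported above.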
Efficient training of modern neural networks often relies on lower-precision data types. On A100 GPUs, peak float16 matrix-multiplication and convolution throughput is 16x the peak float32 throughput.
NumPy's float64 scalar type can be roughly 10 times slower than Python's built-in float in scalar arithmetic. The gap is so pronounced that converting to float before the calculations and back afterwards can make such a program run about 3 times faster.
At least on Intel, float64 should be no slower than float32 for scalar arithmetic, since the x87 FPU does all its math in extended precision internally and narrower types have to be converted on the way in and out; the memory bus, however, also comes into play and favors the smaller type.
The most likely explanation is that your processor does not natively support FP16 arithmetic, so it is all being done in software, which is, of course, much slower.
In general, consumer Intel processors don't support FP16 operations (the F16C extension provides only conversion to and from float32, not arithmetic).
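A quick way to see that the slowdown is in the arithmetic itself rather than in the histogramming (array size and repeat count are arbitrary choices here): the same elementwise multiply is far slower on float16 than on float32 on CPUs without native FP16 arithmetic, even though the result dtype stays float16.

```python
import time

import numpy as np

a = np.random.rand(1_000_000)

for dtype in ("float16", "float32", "float64"):
    b = a.astype(dtype)
    tic = time.time()
    for _ in range(20):
        np.multiply(b, b)  # pure elementwise arithmetic, no histogramming
    toc = time.time()
    print(f"{dtype}: {toc - tic:.3f} seconds")
```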
It is happening because there is no equivalent of float16 in C.
Since NumPy's core is written in C, and C has no native 16-bit float type, NumPy has to emulate float16 arithmetic in software (typically by converting each value to float32, operating on it, and converting back).
(C's float is a 32-bit IEEE 754 single-precision floating-point number: 1 bit for the sign, 8 bits for the exponent, and 23 bits for the mantissa, giving about 7 decimal digits of precision.)
Because of this emulation, float16 is slower than float32 or float64.
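The layout of each type can be checked from NumPy itself with `np.finfo` (its `precision` field is a conservative decimal-digit count); a small sketch:

```python
import numpy as np

# IEEE 754 binary16: 1 sign bit, 5 exponent bits, 10 mantissa bits.
# IEEE 754 binary32: 1 sign bit, 8 exponent bits, 23 mantissa bits.
for t in (np.float16, np.float32, np.float64):
    info = np.finfo(t)
    print(f"{np.dtype(t).name}: {info.bits} bits, "
          f"~{info.precision} decimal digits, max {info.max}")
```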