Hash each value in a pandas data frame

Tags:

python

pandas

hash

In Python, I am trying to find the quickest way to hash each value in a pandas data frame.

I know any string can be hashed using:

hash('a string')

But how do I apply this function on each element of a pandas data frame?

This may be a very simple thing to do, but I have just started using Python.

asked May 09 '15 19:05

user3664020

1 Answer

Pandas also provides a function, pd.util.hash_array, that hashes an entire array or column at once:

import pandas as pd

df = pd.DataFrame({'a':['asds','asdds','asdsadsdas']})
df["hash"] = pd.util.hash_array(df["a"].to_numpy())
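
To hash every element of the data frame rather than a single column, something like the following should work. This is only a sketch: the two-column frame and the names builtin_hashes and pandas_hashes are made up for illustration. The first option applies Python's built-in hash to each cell, as asked in the question; the second uses pd.util.hash_pandas_object column by column. Note that the built-in hash of a string is salted per Python process unless PYTHONHASHSEED is set, so those values are generally not reproducible between runs, whereas the pandas hashes are deterministic.

```python
import pandas as pd

# Hypothetical example frame with two string columns
df = pd.DataFrame({
    "a": ["asds", "asdds", "asdsadsdas"],
    "b": ["x", "y", "x"],
})

# Option 1: Python's built-in hash() applied to every cell.
# Series.map calls hash once per element, column by column.
builtin_hashes = df.apply(lambda col: col.map(hash))

# Option 2: pandas' own hashing, one column at a time.
# index=False hashes only the values, not the row labels.
pandas_hashes = df.apply(
    lambda col: pd.util.hash_pandas_object(col, index=False)
)

print(builtin_hashes)
print(pandas_hashes)
```

Both results have the same shape as df. Equal input values give equal hashes within a column, so rows 0 and 2 of column b match in either result.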

answered Sep 20 '22 15:09

bert wassink
