NumPy - What is the difference between frombuffer and fromstring?

Tags:

numpy

They appear to give the same result to me:

In [32]: s Out[32]: '\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x15\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00'  In [27]: np.frombuffer(s, dtype="int8") Out[27]: array([ 0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0, 21,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0], dtype=int8)  In [28]: np.fromstring(s, dtype="int8") Out[28]: array([ 0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0, 21,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0], dtype=int8)  In [33]: b = buffer(s)  In [34]: b Out[34]: <read-only buffer for 0x035F8020, size -1, offset 0 at 0x036F13A0>  In [35]: np.fromstring(b, dtype="int8") Out[35]: array([ 0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0, 21,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0], dtype=int8)  In [36]: np.frombuffer(b, dtype="int8") Out[36]: array([ 0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0, 21,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,     0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0,  0], dtype=int8)

When should one be used vs. the other?

792

asked Mar 06 '14 21:03

1 Answers

From a practical standpoint, the difference is that:

x = np.fromstring(s, dtype='int8')

Will make a copy of the string in memory, while:

x = np.frombuffer(s, dtype='int8')

x = np.frombuffer(buffer(s), dtype='int8')

Will use the memory buffer of the string directly and won't use any* additional memory. Using frombuffer will also result in a read-only array if the input to buffer is a string, as strings are immutable in python.

(*Neglecting a few bytes of memory used for an additional python ndarray object -- The underlying memory for the data will be shared.)

If you're not familiar with buffer objects (memoryview in python3.x), they're essentially a way for C-level libraries to expose a block of memory for use in python. It's basically a python interface for managed access to raw memory.

If you were working with something that exposed the buffer interface, then you'd probably want to use frombuffer. (Python 2.x strings and python 3.x bytes expose the buffer interface, but you'll get a read-only array, as python strings are immutable.)

Otherwise, use fromstring to create a numpy array from a string. (Unless you know what you're doing, and want to tightly control memory use, etc.)

answered Sep 29 '22 10:09

Joe Kington

Related questions
                            
                                How to check if python module exists and can be imported [duplicate]
                            
                                How to handle "duck typing" in Python?
                            
                                Consuming a kinesis stream in python
                            
                                Google API: getting Credentials from refresh token with oauth2client.client
                            
                                How to set same color for markers and lines in a matplotlib plot loop?
                            
                                What does NN VBD IN DT NNS RB means in NLTK?
                            
                                Why are some variables and comments in my IPython notebook red?
                            
                                pandas rounding when converting float to integer
                            
                                How to apply LabelEncoder for a specific column in Pandas dataframe
                            
                                How to check similarity of two images that have different pixelization
                            
                                FFT for Spectrograms in Python
                            
                                How to implement a pythonic equivalent of tail -F?
                            
                                Can SQLAlchemy DateTime Objects Only Be Naive?
                            
                                Are there builtin functions for elementwise boolean operators over boolean lists?
                            
                                Recommended NoSQL Database for use with Python [closed]
                            
                                Overriding special methods on an instance
                            
                                Combine Python Dictionary Permutations into List of Dictionaries
                            
                                Python pandas: select columns with all zero entries in dataframe
                            
                                How to create HTTPS tornado server
                            
                                Using "and" and "or" operator with Python strings [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

NumPy - What is the difference between frombuffer and fromstring?

Tags:

python

numpy

user202987

People also ask

1 Answers

Joe Kington

Recent Activity

Donate For Us