join function of a numpy array composed of string

I'm trying to call join on a numpy array whose elements are all strings (each holding the raw bytes of one float) so that I can pass the joined string to numpy.fromstring, but join doesn't seem to work properly.

Any idea why? Which alternative function can I use to do that?

Here is a standalone example to show my problem:

import numpy as np

nb_el = 10

table = np.arange(nb_el, dtype='float64')
print table

binary = table.tostring()

binary_list = map(''.join, zip(*[iter(binary)] * table.dtype.itemsize))
print 'len binary list :', len(binary_list)
# len binary list : 10

join_binary_list = ''.join(binary_list)
print np.fromstring(join_binary_list, dtype='float64')
# [ 0.  1.  2.  3.  4.  5.  6.  7.  8.  9.]

binary_split_array = np.array(binary_list)
print 'nb el :', binary_split_array.shape
# nb el : (10,)
print 'nb_el * size :', binary_split_array.shape[0] * binary_split_array.dtype.itemsize
# nb_el * size : 80

join_binary_split_array = ''.join(binary_split_array)
print 'len binary array :', len(join_binary_split_array)
# len binary array : 72

table_fromstring = np.fromstring(join_binary_split_array, dtype='float64')
print table_fromstring
# [ 1.  2.  3.  4.  5.  6.  7.  8.  9.]

As you can see, join works properly on the list (binary_list), but not on the equivalent numpy array (binary_split_array): the returned string is only 72 characters long instead of 80.

413

asked May 20 '15 16:05

Thomas Leonard

1 Answer

The first element of your binary_split_array is an empty string:

print(repr(binary_split_array[0]))
''

The first element in your list is:

'\x00\x00\x00\x00\x00\x00\x00\x00'

An empty string has a length of 0:

print([len("".join(a)) for a in binary_split_array])
print([len("".join(a)) for a in binary_list])
[0, 8, 8, 8, 8, 8, 8, 8, 8, 8]
[8, 8, 8, 8, 8, 8, 8, 8, 8, 8]

The length of that 8-byte str is 8:

print(len('\x00\x00\x00\x00\x00\x00\x00\x00'))
8

Calling tobytes will give the same output length as the list:

print(len(binary_split_array.tobytes()))
80

table_fromstring = np.fromstring(binary_split_array.tobytes(), dtype='float64')

print table_fromstring
[ 0.  1.  2.  3.  4.  5.  6.  7.  8.  9.]

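A numpy string array handles null bytes differently from a Python str: trailing null bytes are stripped when an element is read back. The first element encodes 0.0, which is eight null bytes, so it comes back as an empty string and join loses those 8 characters. tobytes returns the array's raw buffer instead, so nothing is stripped.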
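
The same effect can be reproduced on Python 3. The following is only a sketch under a few assumptions that are not in the original post: it uses bytes instead of str, forces little-endian float64 with '<f8' so that only the 0.0 element ends in null bytes, and uses np.frombuffer as a stand-in for np.fromstring. The names raw, chunks and arr are illustrative.

```python
import numpy as np

# Assumption: little-endian float64, so only 0.0 is stored as all-null bytes.
table = np.arange(10, dtype='<f8')
raw = table.tobytes()
size = table.dtype.itemsize

# Split the raw buffer into one 8-byte chunk per float.
chunks = [raw[i:i + size] for i in range(0, len(raw), size)]
arr = np.array(chunks)  # fixed-width bytes array, 8 bytes per element

# Reading an element back strips its trailing null bytes,
# so the chunk for 0.0 (eight null bytes) becomes empty.
first = arr[0]

# Joining the elements therefore loses 8 bytes: 72 instead of 80.
joined = b''.join(arr)

# The raw buffer still holds all 80 bytes, so the floats can be recovered.
restored = np.frombuffer(arr.tobytes(), dtype='<f8')

print(repr(first), len(joined), len(arr.tobytes()))
print(restored)
```

Joining the plain Python list (b''.join(chunks)) keeps all 80 bytes, which matches the behaviour seen with binary_list in the question.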

195

answered Oct 26 '22 00:10

Padraic Cunningham

Tags: python, arrays, string, join, numpy
