I have read samples out of a wave file using the wave module, but it returns the samples as a string. Since the data comes from a WAV file, it is little-endian (for example, \x00).
What is the easiest way to convert this into a Python integer, or a numpy.int16 type? (It will eventually become a numpy.int16, so going directly there is fine.)
The code needs to work on both little-endian and big-endian processors.
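If the target really is numpy.int16, one shortcut is numpy.frombuffer with an explicit little-endian dtype, which stays correct on big-endian hosts as well. This is only a minimal sketch: "example.wav" is a placeholder path and 16-bit PCM data is assumed.
import numpy as np
import wave

# Placeholder file name; assumes the file contains 16-bit PCM samples
w = wave.open("example.wav", "rb")
frames = w.readframes(w.getnframes())
w.close()

# "<i2" means little-endian signed 16-bit, independent of the host's byte order;
# the resulting array already holds numpy.int16 values
samples = np.frombuffer(frames, dtype="<i2")
print(samples[:10])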
On little-endian platforms, the value 1 is stored in one byte as 01 (the same as big-endian), in two bytes as 01 00, and in four bytes as 01 00 00 00. If an integer is negative, the two's complement representation is used: the high-order bit of the most significant byte will be set.
Specifically, little-endian means the least significant bytes are stored before the more significant bytes, and big-endian means the most significant bytes are stored before the less significant bytes. When we write a number in hex, e.g. 0x12345678, we write it with the most significant byte first (the 12 part).
The SWAP_ENDIAN function reverses the byte ordering of arbitrary scalars, arrays or structures. It can turn a big-endian number into a little-endian one and vice versa.
Big-endian is an order in which the "big end" (most significant value in the sequence) is stored first, at the lowest storage address. Little-endian is an order in which the "little end" (least significant value in the sequence) is stored first.
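A quick way to see this on any machine is to pack a value with struct and inspect the raw bytes; sys.byteorder reports the host's native order. A small sketch (Python 3 output shown in the comments):
import struct
import sys

print(sys.byteorder)            # 'little' or 'big', depending on the host CPU
print(struct.pack("<i", 1))     # b'\x01\x00\x00\x00'  little-endian, four bytes
print(struct.pack(">i", 1))     # b'\x00\x00\x00\x01'  big-endian, four bytes
print(struct.pack("<h", -1))    # b'\xff\xff'          two's complement, high bit set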
Kevin Burke's answer to this question works great when your binary string represents a single short integer, but if your string holds binary data representing multiple integers, you will need to add an additional 'h' for each additional integer that the string represents.
For Python 2
Convert Little Endian String that represents 2 integers
import struct
iValues = struct.unpack("<hh", "\x00\x04\x01\x05")
print(iValues)
Output: (1024, 1281)
Convert Little Endian String that represents 3 integers
import struct
iValues = struct.unpack("<hhh", "\x00\x04\x01\x05\x03\x04")
print(iValues)
Output: (1024, 1281, 1027)
Obviously, it's not realistic to always guess how many "h" characters are needed, so:
import struct
# A string that holds some unknown quantity of integers in binary form
strBinary_Values = "\x00\x04\x01\x05\x03\x04"
# Calculate the number of integers that are represented by binary string data
iQty_of_Values = len(strBinary_Values)/2
# Produce the string of required "h" values
h = "h" * int(iQty_of_Values)
iValues = struct.unpack("<"+h, strBinary_Values)
print(iValues)
Output: (1024, 1281, 1027)
For Python 3
import struct
# A bytes object that holds some unknown quantity of integers in binary form
bytBinary_Values = b"\x00\x04\x01\x05\x03\x04"
# Calculate how many 16-bit integers the binary data represents
iQty_of_Values = len(bytBinary_Values) // 2
# Produce the string of required "h" values
h = "h" * iQty_of_Values
# struct.unpack needs a bytes object in Python 3; a bytes literal avoids the
# lossy detour through a text string and utf8 encoding
iValues = struct.unpack("<" + h, bytBinary_Values)
print(iValues)
Output: (1024, 1281, 1027)
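As an aside, struct format strings also accept an integer repeat count, so the loop of "h" characters can be sketched more compactly; this assumes a bytes literal, which works in both Python 2 and 3:
import struct

bytValues = b"\x00\x04\x01\x05\x03\x04"
# "<3h" means three little-endian 16-bit integers; the count comes from the length
fmt = "<%dh" % (len(bytValues) // 2)
print(struct.unpack(fmt, bytValues))
Output: (1024, 1281, 1027)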
The struct module converts packed data to Python values, and vice versa.
>>> import struct
>>> struct.unpack("<h", "\x00\x05")
(1280,)
>>> struct.unpack("<h", "\x00\x06")
(1536,)
>>> struct.unpack("<h", "\x01\x06")
(1537,)
"h" means a short int, or 16-bit int. "<" means use little-endian.