Meaning of '\0\0' in Python?

I'm looking at a 3rd party API and they have the following piece of code:

def array_u16 (n): return array('H', '\0\0'*n)

I understand that '\0' means NULL. Does '\0\0' have any special meaning, or does it just mean 2 NULLs?

The array class accepts a format character (called a typecode) followed by an initializer. 'H' means an unsigned short, with a minimum size of 2 bytes, so '\0\0' supplies exactly the bytes for one element. The * n part repeats that pair n times, so the entire array is initialized to NULL bytes.

It just ensures that two bytes are provided n times, so the size of the array will be equal to n. If '\0' were provided instead, the resulting array would have a size == n//2 (because the type-code 'H' requires 2 bytes per element, and an odd n would fail outright, since the byte count must be a multiple of the item size); that is obviously counterintuitive:

>>> array('H', '\0' * 10)    # 5 elements
array('H', [0, 0, 0, 0, 0])
>>> array('H', '\0\0' * 10)  # 10 elements
array('H', [0, 0, 0, 0, 0, 0, 0, 0, 0, 0])
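The relationship between byte count and element count can be checked against itemsize directly. This is a minimal sketch written for Python 3 (hence the b'' literals, which the Python 2 session above does not need); elements_from_zero_bytes is a hypothetical helper name, not part of the array module:

```python
from array import array

def elements_from_zero_bytes(typecode, byte_count):
    """Build an array from byte_count null bytes and return its length."""
    return len(array(typecode, b'\0' * byte_count))

# 2 on common platforms; the documentation only promises a minimum of 2.
itemsize = array('H').itemsize
ten_items = elements_from_zero_bytes('H', itemsize * 10)
print(itemsize, ten_items)

# A byte count that is not a multiple of itemsize is rejected outright.
try:
    elements_from_zero_bytes('H', itemsize * 10 + 1)
    odd_rejected = False
except ValueError:
    odd_rejected = True
print(odd_rejected)
```

Deriving the byte count from itemsize, rather than hard-coding 2, keeps the element count right even on a platform where an unsigned short is wider.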

Note that in Python 3, for the same snippet to work, you must provide a bytes object as the initializer argument to array:

>>> array('H', b'\0\0' * 10)   
array('H', [0, 0, 0, 0, 0, 0, 0, 0, 0, 0])

This mirrors Python 2, where you can't provide a u'' string as the initializer either. Other than that, the behavior stays exactly the same.
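A quick way to see the rule on Python 3, as a sketch: a bytes initializer is accepted for 'H', while a str initializer is rejected with a TypeError (str is only accepted for the text typecodes).

```python
from array import array

ok = array('H', b'\0\0' * 3)      # bytes initializer: zeroed elements

try:
    array('H', '\0\0' * 3)        # str initializer: rejected for typecode 'H'
    str_rejected = False
except TypeError:
    str_rejected = True

print(ok.tolist(), str_rejected)
```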

So '\0\0' is there for convenience, nothing more. No semantics are attached to '\0\0'.

No semantics are really attached to '\0' either (as there are in, for example, C); '\0' is just another string in Python.
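To illustrate that '\0' is an ordinary character rather than a terminator, here is a small sketch using only built-ins (the variable names are arbitrary):

```python
s = '\0\0'
print(len(s))                  # two characters, each with code point 0
print([ord(c) for c in s])

t = 'a\0b'
print(len(t))                  # the null character does not end the string
print(s == '\x00' * 2)         # '\0' and '\x00' spell the same character
```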

As a further example of this behavior, take the initialization of an array with a type-code of 'I' for unsigned ints, which have a minimum size of 2 bytes but are 4 bytes on 64-bit builds of Python (array('I').itemsize reports the value for your build).

In the spirit of the snippet you've provided, you'd initialize the array by doing something like this:

>>> array('I', b'\0\0\0\0' * 10)
array('I', [0, 0, 0, 0, 0, 0, 0, 0, 0, 0])

Yes, that is four b'\0' bytes per element to get 10 elements.
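Hard-coding four null bytes assumes an item size of 4. A sketch of a more portable variant follows; zeroed_array is a hypothetical helper (not something the array module provides) that asks the array for its itemsize and relies on bytes(k) producing k null bytes on Python 3:

```python
from array import array

def zeroed_array(typecode, n):
    """Return an array of n zero elements (hypothetical helper)."""
    itemsize = array(typecode).itemsize
    # bytes(k) is k null bytes, so this supplies exactly n elements' worth.
    return array(typecode, bytes(itemsize * n))

a = zeroed_array('I', 10)
print(len(a), a.itemsize)
```

The same helper covers 'H', 'I', or 'd' without changing the literal, at the cost of one extra empty-array construction to read itemsize.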

As a final note (the following timings were performed on Python 3, but Python 2 behaves the same), you might be wondering why the author used '\0\0' * n instead of the more intuitive-looking [0] * n to initialize the array. Well, it's quite a bit faster:

n = 10000
%timeit array('I', [0]*n)
1000 loops, best of 3: 212 µs per loop

%timeit array('I', b'\0\0\0\0'* n)
100000 loops, best of 3: 6.36 µs per loop

Of course, you can do better (for type-codes other than 'b') by feeding a bytearray to array. One way to initialize a bytearray is by providing an int, which is the number of null bytes to create (a byte count, not an element count, so the array timed here holds n // itemsize elements):

%timeit array('I', bytearray(n))
1000000 loops, best of 3: 1.72 µs per loop

That said, if I remember correctly, the bytearray(int) way of initializing a bytearray might get deprecated in 3.7+ :-).
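The %timeit lines above need IPython. As a sketch, the same comparison can be run with the standard timeit module; the candidates dictionary and its labels are my own naming, and both byte-based initializers are sized as itemsize * n so that all three build arrays of n elements. No timings are shown because they depend on the machine.

```python
from array import array
import timeit

n = 10000
itemsize = array('I').itemsize

candidates = {
    'list':      lambda: array('I', [0] * n),
    'bytes':     lambda: array('I', b'\0' * (itemsize * n)),
    'bytearray': lambda: array('I', bytearray(itemsize * n)),
}

# All three must build the same array before the timings mean anything.
results = [make() for make in candidates.values()]
assert all(r == results[0] for r in results)

runs = 200
for name, make in candidates.items():
    seconds = timeit.timeit(make, number=runs)
    print(f'{name:10s} {seconds / runs * 1e6:8.2f} µs per call')
```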

Tags: python, string, python-3.x, python-2.7

flashburn

2 Answers

AndyG

Dimitris Fasarakis Hilliard
