Assuming I have a byte b with the binary value of 11111111 How do I for example read a 3 bit integer value starting at the second bit or write a four bit integer value starting at the fifth bit?

Some 2+ years after I asked this question I'd like to explain it the way I'd want it explained back when I was still a complete newb and would be most beneficial to people who want to understand the process. First of all, forget the "11111111" example value, which is not really all that suited for the visual explanation of the process. So let the initial value be <code>10111011</code> (187 decimal) which will be a little more illustrative of the process. 1 - how to read a 3 bit value starting from the second bit: <pre class="prettyprint"><code> ___ <- those 3 bits 10111011 </code></pre> The value is 101, or 5 in decimal, there are 2 possible ways to get it: <ul> <li>mask and shift</li> </ul> In this approach, the needed bits are first masked with the value <code>00001110</code> (14 decimal) after which it is shifted in place: <pre class="prettyprint"><code> ___ 10111011 AND 00001110 = 00001010 >> 1 = ___ 00000101 </code></pre> The expression for this would be: <code>(value & 14) >> 1</code> <ul> <li>shift and mask</li> </ul> This approach is similar, but the order of operations is reversed, meaning the original value is shifted and then masked with <code>00000111</code> (7) to only leave the last 3 bits: <pre class="prettyprint"><code> ___ 10111011 >> 1 ___ 01011101 AND 00000111 00000101 </code></pre> The expression for this would be: <code>(value >> 1) & 7</code> Both approaches involve the same amount of complexity, and therefore will not differ in performance. 2 - how to write a 3 bit value starting from the second bit: In this case, the initial value is known, and when this is the case in code, you may be able to come up with a way to set the known value to another known value which uses less operations, but in reality this is rarely the case, most of the time the code will know neither the initial value, nor the one which is to be written. This means that in order for the new value to be successfully "spliced" into byte, the target bits must be set to zero, after which the shifted value is "spliced" in place, which is the first step: <pre class="prettyprint"><code> ___ 10111011 AND 11110001 (241) = 10110001 (masked original value) </code></pre> The second step is to shift the value we want to write in the 3 bits, say we want to change that from 101 (5) to 110 (6) <pre class="prettyprint"><code> ___ 00000110 << 1 = ___ 00001100 (shifted "splice" value) </code></pre> The third and final step is to splice the masked original value with the shifted "splice" value: <pre class="prettyprint"><code>10110001 OR 00001100 = ___ 10111101 </code></pre> The expression for the whole process would be: <code>(value & 241) | (6 << 1)</code> Bonus - how to generate the read and write masks: Naturally, using a binary to decimal converter is far from elegant, especially in the case of 32 and 64 bit containers - decimal values get crazy big. It is possible to easily generate the masks with expressions, which the compiler can efficiently resolve during compilation: <ul> <li>read mask for "mask and shift": <code>((1 << fieldLength) - 1) << (fieldIndex - 1)</code>, assuming that the index at the first bit is 1 (not zero)</li> <li>read mask for "shift and mask": <code>(1 << fieldLength) - 1</code> (index does not play a role here since it is always shifted to the first bit</li> <li>write mask : just invert the "mask and shift" mask expression with the <code>~</code> operator</li> </ul> How does it work (with the 3bit field beginning at the second bit from the examples above)? <pre class="prettyprint"><code>00000001 << 3 00001000 - 1 00000111 << 1 00001110 ~ (read mask) 11110001 (write mask) </code></pre> The same examples apply to wider integers and arbitrary bit width and position of the fields, with the shift and mask values varying accordingly. Also note that the examples assume unsigned integer, which is what you want to use in order to use integers as portable bit-field alternative (regular bit-fields are in no way guaranteed by the standard to be portable), both left and right shift insert a padding 0, which is not the case with right shifting a signed integer. Even easier: Using this set of macros (but only in C++ since it relies on the generation of member functions): <pre class="prettyprint"><code>#define GETMASK(index, size) ((((size_t)1 << (size)) - 1) << (index)) #define READFROM(data, index, size) (((data) & GETMASK((index), (size))) >> (index)) #define WRITETO(data, index, size, value) ((data) = (((data) & (~GETMASK((index), (size)))) | (((value) << (index)) & (GETMASK((index), (size)))))) #define FIELD(data, name, index, size) \ inline decltype(data) name() const { return READFROM(data, index, size); } \ inline void set_##name(decltype(data) value) { WRITETO(data, index, size, value); } </code></pre> You could go for something as simple as: <pre class="prettyprint"><code>struct A { uint bitData; FIELD(bitData, one, 0, 1) FIELD(bitData, two, 1, 2) }; </code></pre> And have the bit fields implemented as properties you can easily access: <pre class="prettyprint"><code>A a; a.set_two(3); cout << a.two(); </code></pre> Replace <code>decltype</code> with gcc's <code>typeof</code> pre-C++11.

How to read/write arbitrary bits in C/C++

2 Answers

Some 2+ years after I asked this question I'd like to explain it the way I'd want it explained back when I was still a complete newb and would be most beneficial to people who want to understand the process.

First of all, forget the "11111111" example value, which is not really all that suited for the visual explanation of the process. So let the initial value be 10111011 (187 decimal) which will be a little more illustrative of the process.

1 - how to read a 3 bit value starting from the second bit:

    ___  <- those 3 bits 10111011

The value is 101, or 5 in decimal, there are 2 possible ways to get it:

mask and shift

In this approach, the needed bits are first masked with the value 00001110 (14 decimal) after which it is shifted in place:

    ___ 10111011 AND 00001110 = 00001010 >> 1 =      ___ 00000101

The expression for this would be: (value & 14) >> 1

shift and mask

This approach is similar, but the order of operations is reversed, meaning the original value is shifted and then masked with 00000111 (7) to only leave the last 3 bits:

    ___ 10111011 >> 1      ___ 01011101 AND 00000111 00000101

The expression for this would be: (value >> 1) & 7

Both approaches involve the same amount of complexity, and therefore will not differ in performance.

2 - how to write a 3 bit value starting from the second bit:

In this case, the initial value is known, and when this is the case in code, you may be able to come up with a way to set the known value to another known value which uses less operations, but in reality this is rarely the case, most of the time the code will know neither the initial value, nor the one which is to be written.

This means that in order for the new value to be successfully "spliced" into byte, the target bits must be set to zero, after which the shifted value is "spliced" in place, which is the first step:

    ___  10111011 AND 11110001 (241) = 10110001 (masked original value)

The second step is to shift the value we want to write in the 3 bits, say we want to change that from 101 (5) to 110 (6)

     ___ 00000110 << 1 =     ___ 00001100 (shifted "splice" value)

The third and final step is to splice the masked original value with the shifted "splice" value:

10110001 OR 00001100 =     ___ 10111101

The expression for the whole process would be: (value & 241) | (6 << 1)

Bonus - how to generate the read and write masks:

Naturally, using a binary to decimal converter is far from elegant, especially in the case of 32 and 64 bit containers - decimal values get crazy big. It is possible to easily generate the masks with expressions, which the compiler can efficiently resolve during compilation:

read mask for "mask and shift": ((1 << fieldLength) - 1) << (fieldIndex - 1), assuming that the index at the first bit is 1 (not zero)
read mask for "shift and mask": (1 << fieldLength) - 1 (index does not play a role here since it is always shifted to the first bit
write mask : just invert the "mask and shift" mask expression with the ~ operator

How does it work (with the 3bit field beginning at the second bit from the examples above)?

00000001 << 3 00001000  - 1 00000111 << 1 00001110  ~ (read mask) 11110001    (write mask)

The same examples apply to wider integers and arbitrary bit width and position of the fields, with the shift and mask values varying accordingly.

Also note that the examples assume unsigned integer, which is what you want to use in order to use integers as portable bit-field alternative (regular bit-fields are in no way guaranteed by the standard to be portable), both left and right shift insert a padding 0, which is not the case with right shifting a signed integer.

Even easier:

Using this set of macros (but only in C++ since it relies on the generation of member functions):

#define GETMASK(index, size) ((((size_t)1 << (size)) - 1) << (index)) #define READFROM(data, index, size) (((data) & GETMASK((index), (size))) >> (index)) #define WRITETO(data, index, size, value) ((data) = (((data) & (~GETMASK((index), (size)))) | (((value) << (index)) & (GETMASK((index), (size)))))) #define FIELD(data, name, index, size) \   inline decltype(data) name() const { return READFROM(data, index, size); } \   inline void set_##name(decltype(data) value) { WRITETO(data, index, size, value); }

You could go for something as simple as:

struct A {   uint bitData;   FIELD(bitData, one, 0, 1)   FIELD(bitData, two, 1, 2) };

And have the bit fields implemented as properties you can easily access:

A a; a.set_two(3); cout << a.two();

Replace decltype with gcc's typeof pre-C++11.

142

answered Sep 28 '22 05:09

dtech

You need to shift and mask the value, so for example...

If you want to read the first two bits, you just need to mask them off like so:

int value = input & 0x3;

If you want to offset it you need to shift right N bits and then mask off the bits you want:

int value = (intput >> 1) & 0x3;

To read three bits like you asked in your question.

int value = (input >> 1) & 0x7;

answered Sep 28 '22 06:09

Geoffrey

Related questions
                            
                                bitwise not operator
                            
                                Declare variables at top of function or in separate scopes?
                            
                                C++ Parallelization Libraries: OpenMP vs. Thread Building Blocks [closed]
                            
                                How do I include the string header?
                            
                                Initialize integer literal to std::size_t
                            
                                In Clion's debugger, how do I show the entire contents of an int array
                            
                                How to use WinDbg to analyze the crash dump for VC++ application?
                            
                                C++ #include <atlbase.h> is not found
                            
                                Passing array to a function (and why it does not work in C++)
                            
                                Initializing Constant Static Array In Header File
                            
                                Overriding vs Virtual
                            
                                Use of observer_ptr
                            
                                What is the official name of C++'s arrow (->) operator?
                            
                                Getting array from std:vector
                            
                                Using boost thread and a non-static class function
                            
                                Why is the sum of an int and a float an int?
                            
                                Why doesn't a compiler optimize floating-point *2 into an exponent increment?
                            
                                Do global variables mean faster code?
                            
                                C++ string to double conversion
                            
                                #ifdef DEBUG with CMake independent from platform

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to read/write arbitrary bits in C/C++

Tags:

c++

c

memory

bit

read-write

dtech

People also ask

2 Answers

dtech

Geoffrey

Recent Activity

Donate For Us