Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

istream_iterator to iterate through bytes in a binary file

Given a file containing the following hex code: 0B 00 00 00 00 00 20 41

I'm trying to populate an std::vector <std::uint8_t> and then checking each byte manually.

Here's the code where I create my vector from two std::istream_iterators using the iterator constructor

using Bytes   = std::vector<std::uint8_t>;
using ByteItr = std::istream_iterator<std::uint8_t>;

Bytes getBytes()
{
    std::ifstream in;
    in.open("filepath");
    in.seekg(0, std::ios::beg);
    Bytes bytes;
    ByteItr start(in);
    ByteItr end;
    return Bytes(start, end);
}

Here's the unit test I'm trying to pass it through:

auto bytes = getBytes();

REQUIRE( bytes.size() == 8 );

CHECK( bytes[0] == 0x0B );
CHECK( bytes[1] == 0x00 );
CHECK( bytes[2] == 0x00 );
CHECK( bytes[3] == 0x00 );
CHECK( bytes[4] == 0x00 );
CHECK( bytes[5] == 0x00 );
CHECK( bytes[6] == 0x20 );
CHECK( bytes[7] == 0x41 );

Why is it that in this context, it skips two elements and implicitly converts my vector of std::uint8_t to unsigned chars?

like image 607
Burtonium Avatar asked Dec 15 '15 22:12

Burtonium


2 Answers

istream_iterator should not be used for reading binary files. It uses operator>>, which is also not suited for reading binary files (unless those files are of a very specific format which most binary files do not fit). You can use istreambuf_iterator instead. You also want to be sure to open your file in binary mode.

in.open("filepath", std::ios::in | std::ios::binary);
like image 53
Benjamin Lindley Avatar answered Nov 15 '22 23:11

Benjamin Lindley


Don't use std::istream_iterator<T>: that's intended for text formatted input. Most likely it'll skip spaces, for example (you can disable skipping spaces using std::noskipws, but that's still the wrong thing to do - use std::istreambuf_iterator<char> instead; the type char is the character type of the stream).

Also, when processing binary data make sure your stream is opened in binary mode to avoid line end conversions (in case you try that on a platform doing line end conversions). That is, you'd add std::ios_base::binary to the open mode.

like image 35
Dietmar Kühl Avatar answered Nov 16 '22 00:11

Dietmar Kühl