Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

C++ STL-based sending of binary serialized data for network transmission using sockets without the use of libraries

I need to send a number of complex objects over the network to a peer. I have written the code to serialize them using ostream and the operator<< for each class member in the objects that need to be serialized. The code I have written works successfully for serialization and network sending (htonl(), htons() etc. correctly used--I checked the aforementioned by writing to an ofstream (local file) in binary format (std::ios::bin). My next task, writing this binary data over the network socket, is where I am having issues.

I have Socket class which translates std::string objects into C-style strings before sending them over the socket like so:

int Socket::send (const std::string goodies) const
{
    status = ::send (socket, goodies.c_str(), goodies.size(), 0);
            return status;
}

the same Socket class, which I use in the receiver, uses the recv() to place the incoming message into a std::string before passing it to the deserializing application:

int Socket::recv (std::string& goodies)
{
    char buf [1024];
    goodies = "";
    memset (buf, 0, 1025);

    int status = ::recv (socket, buf, 1024, 0);

    if (status < 0)
    {
        return -1;
    }
    else if (status == 0)
    {
        return 0;
    }
    else
    {
        goodies = buf;
        return status;
    }
}

I do the sending using the following code:

ostringstream os (std::ios::binary);
GiantObject giantComplexObjectWithWholeLoadOfOtherObjects;
// Initialize and set up 
// giantComplexObjectWithWholeLoadOfOtherObjects.

// The following call works well--I tested it by writing to ofstream locally (file)
// and checked it out using a hex dump.
// Now, of course, my intent is to send it over the network, so I write to
// ostream&:
giantComplexObjectWithWholeLoadOfOtherObjects.serialize (os);

std::string someStr = os.str(); // Get me the stream in std::string format

mySocket.send(someStr); // Does not work--send sent correctly, but recv received 0 bytes

However, if I try:

std::string someStr ("Some silly string");
mySocket.send (someStr); // received by receiver (receiver is similar arch).

Thereby, something is not right about my call to send binary std::string to the socket. Any help is greatly appreciated. Once again, I do not want to use Boost, protobuf etc.

PS: I have spent a considerable amount of time looking over the old posts here, and the first response these types of questions receive is to use Boost. Please--there are other non-Boost, non-Protobuf ways, and I want to understand those other ways. I do appreciate what Boost and Protobuf bring to the table, I just want to do this in the manner I have set up the code.

like image 513
Sonny Avatar asked Oct 14 '11 14:10

Sonny


3 Answers

I believe one problem is the use of std::string as your container. Especially the line goodies = buf;. The std::string assignment operator from a char* assumes that the input ends at the first NUL. Since binary data can contain arbritrary NUL characters, it will likely terminate earlier than you expect.

For an application such as this, I recommend the use of std::vector<char> or std::vector<unsigned char>. Since the memory in the buffer is always contiguous, you can access the underlying buffer by taking the address of the first element.

For the receive side, you will want to use std::copy to copy the data. Or you can set it up to use a vector buffer, and swap once the receive is complete. This assumes that you want goodies unmodified if the status is <= 0. If you can accept goodies being changed even in those cases, you can resize it prior to calling ::recv(), and avoid creating the extra vector.

int Socket::send (const std::vector<char>& goodies) const {
    status = ::send (socket, &goodies[0], goodies.size(), 0);            
    return status; 

int Socket::recv (std::vector<char>& goodies)  {    
  std::vector<char> buffer(1024);
  int status = ::recv (socket, &buf[0], buf.size());
  if (status < 0) 
  {
     return -1;      
  }
  else if (status == 0)
  { 
     return 0;
  }
  else
  {
     buffer.resize(status);
     goodies.swap(buffer);
     return status;      
  } 
}
like image 139
Dave S Avatar answered Nov 17 '22 21:11

Dave S


One issue in the above is that you are assuming that 'send' will always send goodies.size() bytes in one call. It is not required to do so: you need to call send repeatedly until you have sent goodies.size() bytes. In addition, you probably want to take goodies by const ref, rather than by value to avoid copying your huge object.

On the recv side, again, you have no assurances that recv will return to you a complete object. You will need to repeatedly consume data into a buffer until you have determined that you have extracted a complete object. This means that when you send your object, you will either need to use a message oriented protocol like SCTP that delivers message boundaries at the protocol level, or use a streaming protocol, but include framing information in your message format. That will allow the consumer to extract a header describing the length of data to expect in the object.

This, BTW, is why you will get lots of replies to just use an existing framework, because they take care of these sorts of framing and message assembly issues for you.

like image 30
acm Avatar answered Nov 17 '22 20:11

acm


One thing that pops up immediately is that you are doing both input and output with the std::ostringstream, which I believe is only meant to do output. Are you using void ostringstream::str(string)? Or are you using the extraction operator? I'm not sure if either would work, but I would think the former would work. However, if you're going to do input and output on the stringstream, you should really just use stringstream (which sets the mode to ios_base::in | ios_base::out.

like image 1
gred Avatar answered Nov 17 '22 19:11

gred