What are the delimiters for protobuf messages?

2 Answers

For top level messages (i.e. separate calls to serialize): there literally isn't one. Unless you add your own framing, messages actively bleed into each-other, as the deserializer will (by default) just read to the end of a stream. So: if you have blindly concatenated multiple objects without your own framing protocol: you now have problems.

For the internals of messages, there are two ways of encoding sub-objects - length prefix and groups. Groups are largely deprecated, and the encoding of sub-objects is ambiguous in that it is also the same markers that describe strings, blobs (bytes), and "packed arrays". You probably don't want to try to handle that.

So: it sounds like you need to add your own framing protocol, in which case the answer will be : whatever your framing protocol defines. Just remember that protobuf is binary, so you cannot rely on any byte sequence as a sentinel / terminator. You should ideally use a length prefix approach instead.

167

answered Sep 24 '22 20:09

Marc Gravell

(In addition to existing answers 1, 2)

Common framing method for protocol buffers is to prepend a varint before actual protobuf message.

The implementation is already part of the protobuf library, e.g.:

for java: MessageLite.writeDelimitedTo(), Parser.parseDelimitedFrom()
for C: methods in header google/protobuf/util/delimited_message_util.h (e.g. SerializeDelimitedToFileDescriptor())

Good luck with your project!

EDIT> The official reference states that:

If you want to write multiple messages to a single file or stream, it is up to you to keep track of where one message ends and the next begins. The Protocol Buffer wire format is not self-delimiting, so protocol buffer parsers cannot determine where a message ends on their own. The easiest way to solve this problem is to write the size of each message before you write the message itself. When you read the messages back in, you read the size, then read the bytes into a separate buffer, then parse from that buffer. (If you want to avoid copying bytes to a separate buffer, check out the CodedInputStream class (in both C++ and Java) which can be told to limit reads to a certain number of bytes.)

answered Sep 21 '22 20:09

vlp

Related questions
                            
                                Unable to deserialize list directly inside rootelement using Jackson XML
                            
                                Can't deserialize object containing LocalDate received from AMQP messaging
                            
                                Serializing anonymous types
                            
                                JMS serializer yml datetime format
                            
                                Django rest_framework: child object count in serializer
                            
                                DRF create method in viewset or in serializer
                            
                                Groovy parsing JSON vs XML
                            
                                Serialize in a human readable text format
                            
                                Looking for a fast, compact, streamable, multi-language, strongly typed serialization format
                            
                                Java serialization, ObjectInputStream.readObject(), check if will block
                            
                                setAttribute: Non-serializable attribute (Java Object Serialization)
                            
                                How can I have more flexible serialization and deserialization in Java?
                            
                                De-serializing objects from a file in Java
                            
                                DataContractSerializer and immutable types. Deserialising to a known object instance
                            
                                how to stop serialization of subclass? [duplicate]
                            
                                No valid constructor during serialization
                            
                                C# Serialize an object with a list of objects in it
                            
                                How to write GSON custom deserializer for embedded object with two possible types
                            
                                In C#, how can I replace\u0026 with &?
                            
                                Serialize model fields into nested object/dict

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What are the delimiters for protobuf messages?

Tags:

serialization

protocol-buffers

Marko Bencik

People also ask

2 Answers

Marc Gravell

vlp

Recent Activity

Donate For Us