google protobuf maximum size

Tags:

protocol-buffers

I have some repeating elements in my protobuf message, so at runtime the length of the message could be anything. I have seen related questions already asked, such as "Maximum serialized Protobuf message size", but mine is slightly different:

1. If my JMS (Java Message Service) provider (in this case a WebLogic or TIBCO JMS server) doesn't impose any limit on the maximum message size, will the protocol buffer compiler complain at all about the maximum message size?
2. Does the performance of encoding/decoding suffer horribly at large sizes (around 10MB)?

725

asked Dec 07 '15 08:12

Robin Bajaj

2 Answers

10MB is pushing it but you'll probably be OK.

Protobuf has a hard limit of 2GB, because many implementations use 32-bit signed arithmetic. For security reasons, many implementations (especially the Google-provided ones) impose a size limit of 64MB by default, although you can increase this limit manually if you need to.
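To make those two numbers concrete, here is a minimal sketch of a size guard you could run on a serialized payload before handing it to a parser. The names `check_size` and `MessageTooLargeError` are hypothetical and not part of any Protobuf library; in the real Java library the limit is adjusted on the input stream (for example via `CodedInputStream.setSizeLimit`), not with a helper like this.

```python
# Hypothetical pre-parse guard; stdlib only, not part of the protobuf package.
INT32_MAX = 2**31 - 1              # ceiling implied by 32-bit signed lengths (~2GB)
DEFAULT_LIMIT = 64 * 1024 * 1024   # the 64MB default mentioned above


class MessageTooLargeError(ValueError):
    """Raised when a serialized payload exceeds the configured limit."""


def check_size(payload: bytes, limit: int = DEFAULT_LIMIT) -> int:
    """Return the payload size, or raise if it exceeds `limit`."""
    if not 0 < limit <= INT32_MAX:
        raise ValueError("limit must be between 1 and 2**31 - 1 bytes")
    size = len(payload)
    if size > limit:
        raise MessageTooLargeError(f"{size} bytes exceeds limit of {limit} bytes")
    return size
```

Raising the limit is then a deliberate choice at the call site, e.g. `check_size(payload, limit=256 * 1024 * 1024)`.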

The implementation will not "slow down" with large messages per se, but the problem is that you must always parse an entire message at once before you can start using any of the content. This means the entire message must fit into RAM (keeping in mind that, after parsing, the in-memory message objects are much larger than the original serialized message), and even if you only care about one field, you have to wait for the whole thing to parse.

Generally I recommend trying to limit yourself to 1MB as a rule of thumb. Beyond that, think about splitting the message up into multiple chunks that can be parsed independently. However, every application is different: for some, 10MB is no big deal; for others, 1MB is already way too large. You'll have to profile your own app to find out.
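One common way to split a payload into independently parseable chunks is length-prefixed framing: write each chunk's size as a varint, then the chunk itself. This is the same idea behind `writeDelimitedTo` / `parseDelimitedFrom` in the Java library. The sketch below uses only the standard library, and the chunks are opaque byte strings standing in for serialized sub-messages; `write_chunks` and `read_chunks` are illustrative names, not library functions.

```python
import io
from typing import BinaryIO, Iterable, Iterator, Optional


def encode_varint(value: int) -> bytes:
    """Encode a non-negative int as a base-128 varint."""
    if value < 0:
        raise ValueError("varint must be non-negative")
    out = bytearray()
    while True:
        byte = value & 0x7F
        value >>= 7
        if value:
            out.append(byte | 0x80)  # more bytes follow
        else:
            out.append(byte)
            return bytes(out)


def read_varint(stream: BinaryIO) -> Optional[int]:
    """Read one varint; return None on a clean end of stream."""
    result, shift = 0, 0
    while True:
        raw = stream.read(1)
        if not raw:
            if shift == 0:
                return None
            raise EOFError("truncated varint")
        result |= (raw[0] & 0x7F) << shift
        if not raw[0] & 0x80:
            return result
        shift += 7


def write_chunks(stream: BinaryIO, chunks: Iterable[bytes]) -> None:
    """Write each chunk prefixed with its length."""
    for chunk in chunks:
        stream.write(encode_varint(len(chunk)))
        stream.write(chunk)


def read_chunks(stream: BinaryIO) -> Iterator[bytes]:
    """Yield chunks one at a time, so only one is in memory at once."""
    while True:
        size = read_varint(stream)
        if size is None:
            return
        chunk = stream.read(size)
        if len(chunk) != size:
            raise EOFError("truncated chunk")
        yield chunk
```

The reader only ever holds one chunk at a time, so each chunk can be parsed and discarded before the next one is read.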

I've actually seen cases where people were happy sending messages larger than 1GB, so... it "works".

On a side note, Cap'n Proto has a very similar design to Protobuf but can support messages up to 2^64 bytes (2^32 segments of 4GB each), and it actually does allow you to read one field from the message without parsing the whole message (if it's in a file on disk, use mmap() to avoid reading the whole thing in).

(Disclosure: I'm the author of Cap'n Proto as well as most of Google's open source Protobuf code.)

178

answered Oct 07 '22 16:10

Kenton Varda

1. I don't think the protobuf compiler will ever complain about message sizes. At least not until you get to the 18 exabyte maximum of uint64_t.
2. For most implementations, performance starts to suffer at the point where the message cannot fit into RAM at once. So 10 MB should be fine, 10 GB not. Another possible issue arises if you don't need all of the data: protobuf does not support random access, so you need to decode the whole message even if you only need a part of it.
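The lack of random access follows from the wire format: a message is a flat sequence of tag/value records with no index, so finding a field means walking every record before it. Below is a rough stdlib-only sketch that hand-encodes a few fields and scans for one. It handles only wire types 0 (varint) and 2 (length-delimited), and `find_field` is an illustrative helper, not how the generated parsers work (those decode the whole message).

```python
from typing import Tuple, Union


def _varint(value: int) -> bytes:
    """Encode a non-negative int as a base-128 varint."""
    out = bytearray()
    while True:
        byte = value & 0x7F
        value >>= 7
        if value:
            out.append(byte | 0x80)
        else:
            out.append(byte)
            return bytes(out)


def _read_varint(data: bytes, pos: int) -> Tuple[int, int]:
    """Decode a varint at `pos`; return (value, position after it)."""
    result, shift = 0, 0
    while True:
        byte = data[pos]
        pos += 1
        result |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return result, pos
        shift += 7


def encode_varint_field(field_number: int, value: int) -> bytes:
    return _varint((field_number << 3) | 0) + _varint(value)


def encode_bytes_field(field_number: int, payload: bytes) -> bytes:
    return _varint((field_number << 3) | 2) + _varint(len(payload)) + payload


def find_field(data: bytes, wanted: int) -> Tuple[Union[int, bytes], int]:
    """Scan from byte 0; return (first matching value, records visited)."""
    pos, visited = 0, 0
    while pos < len(data):
        tag, pos = _read_varint(data, pos)
        field_number, wire_type = tag >> 3, tag & 7
        visited += 1
        if wire_type == 0:
            value, pos = _read_varint(data, pos)
        elif wire_type == 2:
            length, pos = _read_varint(data, pos)
            value = data[pos:pos + length]
            pos += length
        else:
            raise ValueError("wire type not handled in this sketch")
        if field_number == wanted:
            return value, visited
    raise KeyError(wanted)
```

For example, with two large `bytes` records in field 1 followed by a small integer in field 2, `find_field(message, 2)` has to step over both earlier records before it reaches the one it wants.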

answered Oct 07 '22 16:10

jpa
