If a C++ program receives a Protocol Buffers message that has a Protocol Buffers string field, which is represented by a std::string, what is the encoding of text in that field? Is it UTF-8?

Text encoding of Protocol Buffers string fields

1 Answer

Yes: by specification, a protobuf string field must contain valid UTF-8.

See the Language Guide:

"A string must always contain UTF-8 encoded or 7-bit ASCII text."

(And ASCII is always also valid UTF-8.)

Not all protobuf implementations enforce this, but if I recall correctly, at least the Python library refuses to decode strings that are not valid UTF-8.
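Because enforcement varies, and a std::string is just a sequence of bytes with no encoding of its own, a C++ receiver that does not trust the sender may want to check the field itself. Below is a minimal sketch of such a check, assuming plain standard C++ with no protobuf dependency. The name IsValidUtf8 is a hypothetical helper, not part of the protobuf API.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>

// Hypothetical helper (not part of the protobuf API): returns true if `s`
// is well-formed UTF-8, rejecting overlong encodings, UTF-16 surrogates,
// truncated sequences, and code points above U+10FFFF.
bool IsValidUtf8(const std::string& s) {
  const std::size_t n = s.size();
  std::size_t i = 0;
  while (i < n) {
    const unsigned char b0 = static_cast<unsigned char>(s[i]);
    std::size_t len = 0;        // total bytes in this sequence
    std::uint32_t cp = 0;       // code point decoded so far
    std::uint32_t min_cp = 0;   // smallest code point allowed for this length
    if (b0 < 0x80) {
      len = 1; cp = b0; min_cp = 0;              // 7-bit ASCII
    } else if ((b0 & 0xE0) == 0xC0) {
      len = 2; cp = b0 & 0x1F; min_cp = 0x80;
    } else if ((b0 & 0xF0) == 0xE0) {
      len = 3; cp = b0 & 0x0F; min_cp = 0x800;
    } else if ((b0 & 0xF8) == 0xF0) {
      len = 4; cp = b0 & 0x07; min_cp = 0x10000;
    } else {
      return false;  // stray continuation byte or invalid lead byte
    }
    if (len > n - i) return false;  // sequence runs past the end
    for (std::size_t k = 1; k < len; ++k) {
      const unsigned char b = static_cast<unsigned char>(s[i + k]);
      if ((b & 0xC0) != 0x80) return false;  // not a continuation byte
      cp = (cp << 6) | (b & 0x3F);
    }
    if (cp < min_cp) return false;                    // overlong encoding
    if (cp > 0x10FFFF) return false;                  // beyond Unicode range
    if (cp >= 0xD800 && cp <= 0xDFFF) return false;   // surrogate
    i += len;
  }
  return true;
}
```

With a generated message, the check would be applied to the accessor's return value, for example IsValidUtf8(msg.name()), where msg and the field name are placeholders for your own message type. Pure ASCII input always passes, which matches the remark above that ASCII is valid UTF-8.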

135

answered Sep 29 '22 05:09

jpa


Tags:

c++

character-encoding

protocol-buffers

Raedwald
