I'm conducting an investigation into the use of Protobuf for my organization and have a need to perform schema discovery. I've seen a number of articles related to Confluent Schema Registry with Avro. What are folks using for Protobuf?
AWS Glue Schema Registry now supports Protocol buffers (protobuf) schemas in addition to JSON and Avro schemas. This allows application teams to use protobuf schemas to govern the evolution of streaming data and centrally control data quality from data streams to data lake.
Schema Registry provides a serving layer for your metadata. It provides a RESTful interface for storing and retrieving Avro schemas. It stores a versioned history of all schemas, provides multiple compatibility settings and allows evolution of schemas according to the configured compatibility setting.
You can then use this Avro schema, for example, to serialize a Java object (POJO) into bytes, and deserialize these bytes back into the Java object. Avro not only requires a schema during data serialization, but also during data deserialization.
A majority of projects I've seen uses a git repository to store all their protocol buffer (.proto files). The goal is to share it with other teams. You can then import them as a git submodule in any of your project
Edit : There has been a project going on on this, but doesn't seem well maintained as of today. Here is the demo site
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With