Validating a yaml document in python

Tags:

One of the benefits of XML is being able to validate a document against an XSD. YAML doesn't have this feature, so how can I validate that the YAML document I open is in the format expected by my application?

624

asked Jul 16 '10 06:07

Jon

2 Answers

Given that JSON and YAML are pretty similar beasts, you could make use of JSON-Schema to validate a sizable subset of YAML. Here's a code snippet (you'll need PyYAML and jsonschema installed):

from jsonschema import validate import yaml  schema = """ type: object properties:   testing:     type: array     items:       enum:         - this         - is         - a         - test """  good_instance = """ testing: ['this', 'is', 'a', 'test'] """  validate(yaml.load(good_instance), yaml.load(schema)) # passes  # Now let's try a bad instance...  bad_instance = """ testing: ['this', 'is', 'a', 'bad', 'test'] """  validate(yaml.load(bad_instance), yaml.load(schema))  # Fails with: # ValidationError: 'bad' is not one of ['this', 'is', 'a', 'test'] # # Failed validating 'enum' in schema['properties']['testing']['items']: #     {'enum': ['this', 'is', 'a', 'test']} # # On instance['testing'][3]: #     'bad'

One problem with this is that if your schema spans multiple files and you use "$ref" to reference the other files then those other files will need to be JSON, I think. But there are probably ways around that. In my own project, I'm playing with specifying the schema using JSON files whilst the instances are YAML.

145

answered Sep 18 '22 17:09

Jack Kelly

I find Cerberus to be very reliable with great documentation and straightforward to use.

Here is a basic implementation example:

my_yaml.yaml:

name: 'my_name' date: 2017-10-01 metrics:     percentage:     value: 87     trend: stable

Defining the validation schema in schema.py:

{     'name': {         'required': True,         'type': 'string'     },     'date': {         'required': True,         'type': 'date'     },     'metrics': {         'required': True,         'type': 'dict',         'schema': {             'percentage': {                 'required': True,                 'type': 'dict',                 'schema': {                     'value': {                         'required': True,                         'type': 'number',                         'min': 0,                         'max': 100                     },                     'trend': {                         'type': 'string',                         'nullable': True,                         'regex': '^(?i)(down|equal|up)$'                     }                 }             }         }     } }

Using the PyYaml to load a yaml document:

import yaml def load_doc():     with open('./my_yaml.yaml', 'r') as stream:         try:             return yaml.load(stream)         except yaml.YAMLError as exception:             raise exception  ## Now, validating the yaml file is straightforward: from cerberus import Validator schema = eval(open('./schema.py', 'r').read())     v = Validator(schema)     doc = load_doc()     print(v.validate(doc, schema))     print(v.errors)

Keep in mind that Cerberus is an agnostic data validation tool, which means that it can support formats other than YAML, such as JSON, XML and so on.

answered Sep 17 '22 17:09

Menelaos Kotsollaris

Related questions
                            
                                Visual Studio Code pylint: Unable to import 'protorpc'
                            
                                Playing mp3 song on python
                            
                                finding first day of the month in python
                            
                                Pairwise circular Python 'for' loop
                            
                                Is there any way to use pythonappend with SWIG's new builtin feature?
                            
                                Infinite integer in Python
                            
                                How to replicate tee behavior in Python when using subprocess?
                            
                                Python: self.__class__ vs. type(self) [duplicate]
                            
                                Vim automatically removes indentation on Python comments [duplicate]
                            
                                TypeError: 'tuple' object does not support item assignment when swapping values
                            
                                dict.fromkeys all point to same list
                            
                                Log output of multiprocessing.Process
                            
                                What is python-dev package used for
                            
                                Why does Python allow out-of-range slice indexes for sequences?
                            
                                Why do list comprehensions write to the loop variable, but generators don't? [duplicate]
                            
                                XML parsing - ElementTree vs SAX and DOM
                            
                                flask-login: can't understand how it works
                            
                                Extracting an information from web page by machine learning
                            
                                store return value of a Python script in a bash script
                            
                                what is --use-feature=2020-resolver? error message with jupyter installation on ubuntu

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Validating a yaml document in python

Tags:

python

validation

yaml

Jon

People also ask

2 Answers

Jack Kelly

Menelaos Kotsollaris

Recent Activity

Donate For Us