What's the Pythonic way to report nonfatal errors in a parser?

Tags:

A parser I created reads recorded chess games from a file. The API is used like this:

import chess.pgn

pgn_file = open("games.pgn")

first_game = chess.pgn.read_game(pgn_file)
second_game = chess.pgn.read_game(pgn_file)
# ...

Sometimes illegal moves (or other problems) are encountered. What is a good Pythonic way to handle them?

Raising exceptions as soon as the error is encountered. However, this makes every problem fatal, in that execution stops. Often, there is still useful data that has been parsed and could be returned. Also, you can not simply continue parsing the next data set, because we are still in the middle of some half-read data.
Accumulating exceptions and raising them at the end of the game. This makes the error fatal again, but at least you can catch it and continue parsing the next game.

Introduce an optional argument like this:

game = chess.pgn.read_game(pgn_file, parser_info)
if parser_info.error:
   # This appears to be quite verbose.
   # Now you can at least make the best of the sucessfully parsed parts.
   # ...

Are some of these or other methods used in the wild?

816

asked Oct 14 '14 09:10

Niklas

1 Answers

The most Pythonic way is the logging module. It has been mentioned in comments but unfortunately without stressing this hard enough. There are many reasons it's preferable to warnings:

Warnings module is intended to report warnings about potential code issues, not bad user data.
First reason is actually enough. :-)
Logging module provides adjustable message severity: not only warnings, but anything from debug messages to critical errors can be reported.
You can fully control output of logging module. Messages can be filtered by their source, contents and severity, formatted in any way you wish, sent to different output targets (console, pipes, files, memory etc)...
Logging module separates actual error/warning/message reporting and output: your code can generate messages of appropriate type and doesn't have to bother how they're presented to end user.
Logging module is the de-facto standard for Python code. Everyone everywhere is using it. So if your code is using it, combining it with 3rd party code (which is likely using logging too) will be a breeze. Well, maybe something stronger than breeze, but definitely not a category 5 hurricane. :-)

A basic use case for logging module would look like:

import logging
logger = logging.getLogger(__name__) # module-level logger

# (tons of code)
logger.warning('illegal move: %s in file %s', move, file_name)
# (more tons of code)

This will print messages like:

WARNING:chess_parser:illegal move: a2-b7 in file parties.pgn

(assuming your module is named chess_parser.py)

The most important thing is that you don't need to do anything else in your parser module. You declare that you're using logging system, you're using a logger with a specific name (same as your parser module name in this example) and you're sending warning-level messages to it. Your module doesn't have to know how these messages are processed, formatted and reported to user. Or if they're reported at all. For example, you can configure logging module (usually at the very start of your program) to use a different format and dump it to file:

logging.basicConfig(filename = 'parser.log', format = '%(name)s [%(levelname)s] %(message)s')

And suddenly, without any changes to your module code, your warning messages are saved to a file with a different format instead of being printed to screen:

chess_parser [WARNING] illegal move: a2-b7 in file parties.pgn

Or you can suppress warnings if you wish:

logging.basicConfig(level = logging.ERROR)

And your module's warnings will be ignored completely, while any ERROR or higher-level messages from your module will still be processed.

140

answered Nov 16 '22 04:11

Lav

Related questions
                            
                                Assign value to multiple slices in numpy
                            
                                Replace fieldnames when using DictReader
                            
                                how to use Google Shortener API with Python
                            
                                Process data, much larger than physical memory, in chunks
                            
                                Text-Replace in docx and save the changed file with python-docx
                            
                                Choosing random integers except for a particular number for python?
                            
                                Python bottle vs uwsgi/bottle vs nginx/uwsgi/bottle
                            
                                best way to add additional fields to django-rest-framework ModelViewSet when create
                            
                                Run separate processes in parallel - Python
                            
                                Bisect a Python List and finding the Index
                            
                                Pytest init setup for few modules
                            
                                Reverse diagonal on numpy python
                            
                                Concise Ruby hash equivalent of Python dict.get()
                            
                                Python local vs global variables
                            
                                Get a header with Python and convert in JSON (requests - urllib2 - json)
                            
                                How to create a modal window in pyqt?
                            
                                How can I check whether a URL is valid using `urlparse`?
                            
                                Sorting XML in python etree
                            
                                Rotate a 2D image around specified origin in Python
                            
                                Python Multiprocessing: Only one process is running

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What's the Pythonic way to report nonfatal errors in a parser?

Tags:

python

exception-handling

error-handling

warnings

Niklas

People also ask

1 Answers

Lav

Recent Activity

Donate For Us