Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to control error handling and synchronization in Antlr 4 / c#

I'm using Antlr 4 with c# target. Here is a subset of my grammar:

/*
 * Parser Rules
 */
text : term+  EOF;
term : a1 a2 a3;
a1: ....
...
...

I want to accept valid data blocks as (term)s, when error exists I want to search for the next valid term and print out the whole text which caused the error for user to analyze manually.

How to synchronize input to the next valid term? and How to get the ignored text?

like image 456
Angie Avatar asked Mar 22 '23 14:03

Angie


2 Answers

You will need to create your own implementation of IAntlrErrorStrategy for this, and then set the Parser.ErrorHandler property to an instance of your error strategy. The documentation for the Java versions of the ANTLRErrorStrategy interface and default implementation DefaultErrorStrategy may provide useful information for implementing an error strategy, but I must warn you going in that creating a custom error strategy is an advanced feature with limited documentation. It's expected that the implementer is already an expert in ANTLR 4's implementation of the Adaptive LL(*) parsing algorithm (we're talking researcher-level understanding).

like image 175
Sam Harwell Avatar answered Apr 26 '23 19:04

Sam Harwell


For the first question (How to synchronize input to the next valid term?) I found some useful information that led me to acceptable solution.

Antlr generates next subcode for previous grammar:

public TextContext text() {
    TextContext _localctx = new TextContext(_ctx, State);
    EnterRule(_localctx, 0, RULE_text);
    int _la;
    try {
        EnterOuterAlt(_localctx, 1);
        State = 49;
        _errHandler.Sync(this);
        _la = _input.La(1);
        do {
            State = 48; term();
            State = 51;
            _errHandler.Sync(this);
            _la = _input.La(1);
        } while ( _la==KEYWORD );
        State = 53; Match(EOF);
    }
    catch (RecognitionException re) {
        _localctx.exception = re;
        _errHandler.ReportError(this, re);
        _errHandler.Recover(this, re);
    }
    finally {
        ExitRule();
    }
    return _localctx;
}

The call _errHandler.Sync(this); makes the parser advances through the input stream in an attempt to find next valid turn (as a result of "term+" component). To stop parser from sync in other subrules accept "term" rule", I Extended DefaultErrorStrategy Class as next:

public class MyErrorStrategy : Antlr4.Runtime.DefaultErrorStrategy
{
    public EventErrorStrategy() : base()
    { }

    public override void Sync(Antlr4.Runtime.Parser recognizer)
    {
        if(recognizer.Context is Dict.TextAnalyzer.DictionaryParser.TextContext)
            base.Sync(recognizer);
    }
}

then provided it to the parser:

parser.ErrorHandler = new MyErrorStrategy();
like image 24
Angie Avatar answered Apr 26 '23 18:04

Angie