Building a Generic Parser for Converting a Text File to a Data Structure in C#

Tags:

I have a definition for a SPAN file (http://www.cme-ch.com/span/spanl300.htm) that i'd like to use in constructing a parser to parse the string data into an in memory collection class (or even using lazy evalution with the yield keyword.)

All parsing techniques and libraries i've seen apply to constructing parse trees for implementing languages; i'd simply like to know of any good techniques to parse into a data structure, similar to how XML is parsed into an XMLDocument in the .net framework, but using the rules defined by SPAN.

679

asked Aug 18 '10 19:08

Pierreten

1 Answers

SPAN appears to be a bunch of record types, each record with a lot of detail.

It should be straightforward to define a classic grammar that covers all of the records (as nonterminals), in terms of any subrecords (as nonterminals) and terminal data types representing the various datatypes defined by SPAN. There might be a lot of nonterminals, but that just makes for a big grammar, but not a complicated one.

Most programming languages have a small set of terminal tokens that can generally appear anywhere. The truth is that grammars define expectations of what can appear next (called "first" and "follow" sets in the LR parser literature), including a very limited set of terminals. A SPAN grammar would not be different; each "parse state" of a parser implies a limited set of terminals that come next, and one organize a parser to take advantage of this. (I've built L(AL)R parsers, and one could easily use the "current" state to determine the subset of terminals that could happen next). So, a SPAN parser could determine just the small set of tokens that might occur next in each state, and use that to pick of the characaters comprising those next tokens (they must form disjoint sets!).

An easy way to implement this is with a recursive descent parser.

So I claim that all that parsing machinery would be just fine for parsing SPAN, with some bit of custom work possibly to pick up the tokens.

Parsing actions for conventional parsers build trees, but its just as easy to populate fields of a data structure.

answered Sep 28 '22 10:09

Ira Baxter

Related questions
                            
                                Should I use FxCop and why?
                            
                                Most flexibilities rule engine for .NET [closed]
                            
                                DDD - Entity state transition
                            
                                Why did .NET's EnableDecompression default value change between 2.0 and 3.0?
                            
                                Single 32-bit MSI with 32/64-bit drivers
                            
                                Why is there no ObservableKeyedCollection<TKey, TValue> in the .NET Framework?
                            
                                .NET: How Do I Create File Icon Overlays
                            
                                How to open associated files in the same instance of an application
                            
                                Connecting .NET to Common Lisp
                            
                                programmatically get BPM of a wave or MP3 from .Net
                            
                                .NET coupled with MATLAB or R?
                            
                                Can I encrypt web.config with a custom protection provider who's assembly is not in the GAC?
                            
                                Execute code in another users context
                            
                                Call .NET Webservice with Android
                            
                                .NET C# MouseEnter listener on a Control WITH Scrollbar
                            
                                Is it possible to link Web.config transform with publishing profile?
                            
                                LINQ on a DataTable IN a CLR Stored Procedure
                            
                                Compare two object from the same class with tons of fields
                            
                                I want the Task to handle any exceptions that are thrown, but am finding it difficult to stop them from reaching the parent

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Building a Generic Parser for Converting a Text File to a Data Structure in C#

Tags:

.net

data-structures

parsing

c#-3.0

Pierreten

People also ask

1 Answers

Ira Baxter

Recent Activity

Donate For Us