
Parse Wiktionary

Is there a .NET library for parsing pages I've retrieved through the MediaWiki API? A standard MediaWiki parser that just returns titles and the content as plain data would be fine, but I would rather have one specifically suited to Wiktionary, one that could tell me the part of speech of a word and give me all of its definitions.

I would prefer not to write my own parser for this. Any suggestions?

user0 asked Dec 05 '11 23:12


1 Answer

The dbnary project provides parsed information from Wiktionary in RDF form.

If you want something processed even further, I provide SQLite and TEI files generated from the dbnary data as part of my WikDict project at download.wikdict.com.

This does not really answer the question about .NET libraries, but I'm sure you'll easily find libraries to read XML (TEI), SQLite, or RDF.
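As a rough illustration of how little code reading one of the SQLite files takes, here is a minimal sketch in Python (not .NET; the same queries should work from any SQLite binding). It deliberately assumes nothing about the table layout of the downloaded file: it only asks SQLite itself which tables and columns exist. The function name `describe_sqlite` and the file name in the usage note are made up for this example.

```python
import sqlite3


def describe_sqlite(path):
    """Return {table_name: [column names]} for every table in a SQLite file.

    Nothing here is specific to WikDict or dbnary; the schema is discovered
    at runtime from SQLite's own catalog.
    """
    conn = sqlite3.connect(path)
    try:
        # sqlite_master is SQLite's built-in catalog of schema objects.
        tables = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master "
                "WHERE type = 'table' ORDER BY name"
            )
        ]
        schema = {}
        for table in tables:
            # Quote the identifier so unusual table names are handled safely.
            quoted = '"' + table.replace('"', '""') + '"'
            # PRAGMA table_info yields one row per column; index 1 is the name.
            columns = conn.execute(f"PRAGMA table_info({quoted})").fetchall()
            schema[table] = [col[1] for col in columns]
        return schema
    finally:
        conn.close()
```

Calling `describe_sqlite("some-downloaded-file.sqlite3")` (a placeholder file name) returns a dictionary mapping each table to its columns, which is enough to decide what to query next.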

Karl Bartel answered Sep 30 '22 19:09