
Where can I obtain an English dictionary with structured data? [closed]

I would like to download an English dictionary -- not just a word list -- in a structured format such as TXT, XML, or SQL.

Specifically, I need phonetic pronunciation and parts of speech (definition is not required).

Surprisingly, I can't find this online anywhere. Wiktionary is available for download, but it is only the MediaWiki articles themselves. Crawling all articles and extracting the phonetics and parts of speech would be a huge exercise.

Is this available anywhere? I don't mind paying.

Edit: a few people have asked what I would like to do. My immediate need is just curiosity, for example "what are the most common two-syllable verbs?". Eventually my hope would be a tool that helps you find available domain names, and does so by pairing the correct parts of speech, with bonus points for phonetic matches.

Note: cross-posted on English Language and Usage.

Portman asked Sep 25 '10 15:09



1 Answer

Go to http://www.speech.cs.cmu.edu/cgi-bin/cmudict and you will find the download page for the pronunciation dictionary at https://cmusphinx.svn.sourceforge.net/svnroot/cmusphinx/trunk/cmudict/

The latest version is currently cmudict.0.7a.

This is what I am currently using to implement the syllable counter for http://www.haikuvillage.com. It's in Ruby and I'd be happy to open source it for you if that helps.
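For anyone who wants to try the same idea, here is a minimal Python sketch of a syllable counter over the cmudict plain-text layout. It is not the Ruby code mentioned above. It assumes each entry is a word followed by space-separated ARPAbet phonemes, that vowel phonemes end in a stress digit (0, 1, or 2), that comment lines start with `;;;`, and that alternate pronunciations are marked with a numeric suffix such as `READ(1)`. The sample lines and the function names `parse_cmudict` and `syllable_count` are illustrative only. Note that cmudict carries pronunciations but no parts of speech, so that half of the question would need another source.

```python
import re

# Illustrative sample in the cmudict plain-text layout (not the full file).
SAMPLE = """\
;;; comment lines start with three semicolons
HAIKU  HH AY1 K UW0
VILLAGE  V IH1 L AH0 JH
READ  R EH1 D
READ(1)  R IY1 D
"""

# Matches the "(1)", "(2)", ... suffix used for alternate pronunciations.
_VARIANT = re.compile(r"\(\d+\)$")


def parse_cmudict(lines):
    """Map each lowercased word to a list of pronunciations (phoneme lists)."""
    entries = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";;;"):
            continue  # skip blanks and comments
        word, *phones = line.split()
        word = _VARIANT.sub("", word).lower()
        entries.setdefault(word, []).append(phones)
    return entries


def syllable_count(phones):
    """Count vowel phonemes, i.e. the ones that carry a stress digit."""
    return sum(1 for p in phones if p[-1].isdigit())


entries = parse_cmudict(SAMPLE.splitlines())
counts = {
    word: [syllable_count(p) for p in prons]
    for word, prons in entries.items()
}
print(counts)
```

To run it against the real dictionary, pass the lines of the downloaded file to `parse_cmudict` instead of `SAMPLE.splitlines()`. Words with several pronunciations get one count per variant, which matters when the variants differ in syllable count.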

matthuhiggins answered Sep 29 '22 19:09