I would like to download an English dictionary -- not just a word list -- in a structured format such as TXT, XML, or SQL.
Specifically, I need phonetic pronunciation and parts of speech (definition is not required).
Surprisingly, I can't find this online anywhere. Wiktionary is available for download, but it is only the MediaWiki articles themselves. Crawling all articles and extracting the phonetics and parts of speech would be a huge exercise.
Is this available anywhere? I don't mind paying.
Edit: a few people have asked what I would like to do. My immediate need is just curiosity, for example "what are the most common two-syllable verbs?". Eventually my hope is to build a tool that helps you find available domain names by pairing the correct parts of speech, with bonus points for phonetic matches.
Note: cross-posted on English Language and Usage.
Go to http://www.speech.cs.cmu.edu/cgi-bin/cmudict and you will find the download page for the pronunciation dictionary at https://cmusphinx.svn.sourceforge.net/svnroot/cmusphinx/trunk/cmudict/
The latest version is currently cmudict.0.7a.
This is what I am currently using to implement the syllable counter for http://www.haikuvillage.com. It's in Ruby and I'd be happy to open source it for you if that helps.
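For anyone who wants to try this before I publish the Ruby version, here is a minimal Python sketch of the same idea. It assumes the plain-text cmudict layout (one entry per line, the word followed by ARPAbet phonemes, vowels carrying a stress digit 0-2, comment lines starting with `;;;`, alternate pronunciations marked like `READ(1)`). The function names and the inline `SAMPLE` are mine, for illustration only; this is not the code running on the site. Note that cmudict gives pronunciations only, so parts of speech would have to come from another source.

```python
import re
from collections import defaultdict

# Tiny inline sample in the assumed cmudict layout, so the sketch runs
# without downloading the real file.
SAMPLE = """\
;;; comment lines start with three semicolons
DICTIONARY  D IH1 K SH AH0 N EH2 R IY0
HAIKU  HH AY1 K UW0
READ  R EH1 D
READ(1)  R IY1 D
"""


def parse_cmudict(lines):
    """Map each lowercase word to a list of pronunciations (phoneme lists)."""
    entries = defaultdict(list)
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";;;"):
            continue  # skip blanks and comments
        head, *phonemes = line.split()
        # Alternate pronunciations look like WORD(1); fold them into WORD.
        word = re.sub(r"\(\d+\)$", "", head).lower()
        entries[word].append(phonemes)
    return dict(entries)


def count_syllables(phonemes):
    """Each vowel phoneme ends in a stress digit, so count those."""
    return sum(1 for p in phonemes if p[-1].isdigit())


def syllable_counts(entries, word):
    """Return the sorted distinct syllable counts known for a word."""
    return sorted({count_syllables(p) for p in entries.get(word.lower(), [])})


if __name__ == "__main__":
    entries = parse_cmudict(SAMPLE.splitlines())
    for w in ("dictionary", "haiku", "read"):
        print(w, syllable_counts(entries, w))
```

To run it against the real dictionary, pass the opened file to `parse_cmudict` instead of `SAMPLE.splitlines()`. Words missing from the dictionary return an empty list, so a fallback heuristic is still needed for unknown words.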