Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Server-side software for translating languages? [closed]

I am searching for a server-side application (not a service, we need to host this ourselves) that can take a given string and translate it to another language. Open-source, paid, doesn't matter.

Can anyone provide some recommendations?

like image 444
tambler Avatar asked Nov 06 '22 04:11

tambler


1 Answers

Open Source Translation Packages

Here are some state-of-the-art open-source machine translation packages:

  • cdec (C++)
  • Joshua (Java)
  • Moses (C++)
  • Phrasal (Java) - soon to be released

Training Data

To use each of the packages above, you'll need training data. If you're translating between many European languages you can use Phillip Koehn's Europarl parallel corpus. If you're interested in a European Union (EU) language that's not in the Europarl parallel corpus, you can gather the data by crawling the proceedings of the European parliament. All the EU proceedings are translated into each of the EU languages and made available for free online, which makes them a very good source of machine translation training data.

You can get training data for non-European languages from the Linguistics Data Consortium (LDC) catalog (e.g., Chinese-to-English).

like image 69
dmcer Avatar answered Nov 09 '22 17:11

dmcer