We're looking for an open source Machine Translation Engine that could be incorporated into our localization workflow. We're looking at the options below:
Among these, Moses has the widest community support and has been tried out by many localization companies and researchers. We are actually leaning towards a Java-based engine since our applications are all in Java. Have any of you used either Joshua or Phrasal as part of your workflow. Could you please share your experiences with them? Or, is Moses way too far ahead of these in terms of the features it provides and ease of integration.
And, we require that the engine supports:
- Domain-specific training (i.e. it should maintain separate phrase tables for each domain that the input data belongs).
- Incremental training (i.e. avoiding having to retrain the model from scratch every time we wish to use some new training data).
- Parallelizing the translation process.