Skip to content

Latest commit

 

History

History
17 lines (12 loc) · 447 Bytes

README.md

File metadata and controls

17 lines (12 loc) · 447 Bytes

python+bash scripts for training a transliterator using a list of transliteration pairs.

dependencies:

  • m2m-aligner
  • python 2.7 (+ modules: argparser)
  • cdec decoder

disclaimer:

  • scripts are still under development and are not stable.

features:

  • reranking
  • many-to-many character alignments
  • multiple reference support

for details about how it works, refer to (Ammar et al. 2012) http://www.cs.cmu.edu/~wammar/pubs/translit-acl12.pdf