Transliteration system using pair HMM with weighted FSTs

Nabende, Peter

Transliteration system using pair HMM with weighted FSTs

Files

nabende-peter-ict-bookres.pdf (113.18 KB)

Date

2009-08-07

Authors

Nabende, Peter

Publisher

World Scientific Publishing Co Pte Ltd

Abstract

This paper presents a transliteration system based on pair Hidden Markov Model (pair HMM) training and weighted Finite State Transducer (WFST)techniques. Parameters used by WFSTs for transliteration generation are learned from a pair HMM. Parameters from pair HMM training on English-Russian data sets are found to give better transliteration quality than parameters trained for WFSTs for corresponding structures. Training a pair HMM on English vowel bigrams and standard bigrams for Cyrillic Romanization and using a few transformation rules on generated Russian transliterations to test for context improves the system's transliteration quality.

Description

This paper presents a transliteration system based on pair Hidden Markov Model (pair HMM) training and weighted Finite State Transducer (WFST)techniques.

Keywords

Machine transliteration, Pair hidden markov model, Weighted finite state transducers

Citation

Nabende, P. (2009). Transliteration system using pair HMM with weighted FSTs. In Proceedings of the 2009 named entities workshop: shared task on machine transliteration, ACL-IJCNLP, Singapore. Pages 100-103.

URI

http://hdl.handle.net/10570/380

Collections

School of Computing and Informatics Technology (CIT) Collection

Full item page