dc.contributor.author: Tiedemann, Jörg
dc.contributor.author: Nabende, Peter
dc.date.accessioned: 2012-02-03T16:50:15Z
dc.date.available: 2012-02-03T16:50:15Z
dc.date.issued: 2009-08-03
dc.identifier.citation: Tiedemann, J. & Nabende, P. (2009). Translating transliterations. In Kizza, J. M., Lynch, K., Ravi, N., Aisbett, J., & Phoha V. (eds.), Special topics in computing and ICT research: strengthening the role of ICT in development, Fountain Publishers, pages 97-108 [en_US]
dc.identifier.isbn: 978-9970-02-738-5
dc.identifier.uri: http://hdl.handle.net/10570/381
dc.description: Book Chapter [en_US]
dc.description.abstract: Translating new entity names is important for improving performance in Natural Language Processing (NLP) applications such as Machine Translation (MT) and Cross Language Information Retrieval (CLIR). Usually, transliteration is used to obtain phonetic equivalents in a target language for a given source language word. However, transliteration across different writing systems often results in different representations for a given source language entity name. In this paper, we address the problem of automatically translating transliterated entity names that originally come from a different writing system. These entity names are often spelled differently in languages using the same writing system. We train and evaluate various models based on finite state technology and Statistical Machine Translation (SMT) for a character-based translation of the transliterated entity names. In particular, we evaluate the models for translation of Russian person names between Dutch and English, and between English and French. From our experiments, the SMT models perform best with consistent improvements compared to a baseline method of copying strings. [en_US]
dc.description.sponsorship: Nuffic [en_US]
dc.language.iso: en [en_US]
dc.publisher: Fountain Publishers, Kampala. [en_US]
dc.subject: Machine transliteration [en_US]
dc.subject: Machine translation [en_US]
dc.subject: Weighted finite state transducers [en_US]
dc.subject: Phrase-based statistical machine translation [en_US]
dc.subject: Character-based machine translation [en_US]
dc.title: Translating transliterations [en_US]
dc.type: Book Chapter [en_US]

