Title: | Bilingual Termbank Creation via Log-Likelihood Comparison and Phrase-Based Statistical Machine Translation |
Import to your calendar: |
---|---|---|
Conferència | ||
Presenter: | Andy Way, Dublin City University, Irlanda | |
Venue: | *ATENCIÓ: NOVA AULA*: Aula 16, Politècnica IV | |
Date&time: | 10:30 12/09/2014 | |
Estimated duration: | 1:00 hora | |
Contact person: | Forcada Zubizarreta, Mikel L. ( ) | |
Abstract: | Bilingual termbanks are important for many natural language processing (NLP) applications, especially in translation workflows in industrial settings. In this paper, we apply a log-likelihood comparison method to extract monolingual terminology from the source and target sides of a parallel corpus. Then, using a Phrase-Based Statistical Machine Translation model, we create a bilingual terminology with the extracted monolingual term lists. We manually evaluate our novel terminology extraction model on English-to Spanish and English-to-Hindi data sets, and observe excellent performance for all domains. Furthermore, we report the performance of our monolingual terminology extraction model comparing with a number of the state-of-the-art terminology extraction models on the English-to-Hindi datasets. |
[ Close ]