| Title: | Bilingual Termbank Creation via Log-Likelihood Comparison and Phrase-Based Statistical Machine Translation |
Import to your calendar:
|
|---|---|---|
| Conferència | ||
| Presenter: | Andy Way, Dublin City University, Irlanda | |
| Venue: | *ATENCIÓ: NOVA AULA*: Aula 16, Politècnica IV | |
| Date&time: | 10:30 12/09/2014 | |
| Estimated duration: | 1:00 hora | |
| Contact person: | Forcada Zubizarreta, Mikel L. (mlf ua.es) | |
| Abstract: | Bilingual termbanks are important for many natural language processing (NLP) applications, especially in translation workflows in industrial settings. In this paper, we apply a log-likelihood comparison method to extract monolingual terminology from the source and target sides of a parallel corpus. Then, using a Phrase-Based Statistical Machine Translation model, we create a bilingual terminology with the extracted monolingual term lists. We manually evaluate our novel terminology extraction model on English-to Spanish and English-to-Hindi data sets, and observe excellent performance for all domains. Furthermore, we report the performance of our monolingual terminology extraction model comparing with a number of the state-of-the-art terminology extraction models on the English-to-Hindi datasets. | |
[ Close ]
![[CSV]](/img/csv_file.32x32.png)
ua.es)