Department of Software and Computing Systems

Lecture

Title:Bilingual Termbank Creation via Log-Likelihood Comparison and Phrase-Based Statistical Machine Translation Import to your calendar:
[CSV]
Conferència
Presenter:Andy Way, Dublin City University, Irlanda
Venue:*ATENCIÓ: NOVA AULA*: Aula 16, Politècnica IV
Date&time:10:30 12/09/2014
Estimated duration:1:00 hora
Contact person:

Forcada Zubizarreta, Mikel L. ( )
Abstract:
Bilingual termbanks are important for many natural language processing (NLP)
applications, especially in translation workflows in industrial settings. In
this paper, we apply a log-likelihood comparison method to extract monolingual
terminology from the source and target sides of a parallel
corpus. Then, using a Phrase-Based Statistical Machine
Translation model, we create a bilingual terminology with
the extracted monolingual term lists. We manually evaluate
our novel terminology extraction model on English-to
Spanish and English-to-Hindi data sets, and observe
excellent performance for all domains. Furthermore, we
report the performance of our monolingual terminology extraction model
comparing with a number of the state-of-the-art terminology extraction models
on the English-to-Hindi datasets.

[ Close ]